SYSTEM all green source leboncoin.fr queue 12,841 pages p99 latency 214ms dataflirt.com · scraper/leboncoin-fr
RUN · 84 active pipelines · leboncoin.fr live

French property data,
at warehouse scale.

We extract property sales, rentals, DPE ratings, and seller metadata from Leboncoin. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Listings extracted
842K /day
Price updates
3.1M /24h
Agency records
42K /run
Active pipelines
84
Uptime
99.94%
Data Dictionary

Every field we extract from leboncoin.fr

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Property Listings objects from leboncoin.fr. All fields typed and schema-versioned.

ad_idtitlecategory_namepriceroomssquare_metersproperty_typedpe_ratingges_ratingdescriptionurlimage_urlspublication_datecitydepartmentregionpostal_code
property_listings
● 200 OK
"ad_id": "2418593021",
"title": "Appartement 3 pièces 65 m²",
"price": 345000.0,
"rooms": 3,
"square_meters": 65.0,
"dpe_rating": "C",
"city": "Lyon",
"postal_code": "69003"
# ad_idtitlecategory_namepriceroomssquare_meters
1
2
3

Complete list of extractable fields for Seller Intelligence objects from leboncoin.fr. All fields typed and schema-versioned.

seller_idseller_nameis_prosiretactive_listingsjoined_dateresponse_ratephone_numberseller_urlagency_website
seller_intelligence
● 200 OK
"seller_id": "849201",
"seller_name": "Agence Stéphane Plaza",
"is_pro": true,
"siret": "48129384700012",
"active_listings": 42,
"joined_date": "2018-04-12",
"phone_number": "+33478123456"
# seller_idseller_nameis_prosiretactive_listingsjoined_date
1
2
3

Complete list of extractable fields for Pricing & Valuation objects from leboncoin.fr. All fields typed and schema-versioned.

ad_idcurrent_priceoriginal_priceprice_drop_pctprice_per_sqmcity_average_sqmfees_includedfees_amountcurrencytimestamp
pricing_& valuation
● 200 OK
"ad_id": "2418593021",
"current_price": 345000.0,
"original_price": 360000.0,
"price_drop_pct": 4.1,
"price_per_sqm": 5307.69,
"fees_included": true,
"currency": "EUR"
# ad_idcurrent_priceoriginal_priceprice_drop_pctprice_per_sqmcity_average_sqm
1
2
3

Complete list of extractable fields for Energy & Diagnostics objects from leboncoin.fr. All fields typed and schema-versioned.

ad_iddpe_scoredpe_valueges_scoreges_valueconstruction_yearheating_typeenergy_cost_estimate_minenergy_cost_estimate_maxdiagnostic_date
energy_& diagnostics
● 200 OK
"ad_id": "2418593021",
"dpe_score": "C",
"dpe_value": 145,
"ges_score": "A",
"ges_value": 4,
"heating_type": "Individuel électrique",
"energy_cost_estimate_min": 850
# ad_iddpe_scoredpe_valueges_scoreges_valueconstruction_year
1
2
3

Complete list of extractable fields for Search Results objects from leboncoin.fr. All fields typed and schema-versioned.

keywordcategoryregiondepartmentcitypositionad_idtitlepriceis_propromoted_adscraped_at
search_results
● 200 OK
"keyword": "appartement avec balcon",
"city": "Bordeaux",
"position": 1,
"ad_id": "2418593021",
"price": 345000.0,
"is_pro": true,
"promoted_ad": false,
"scraped_at": "2026-05-12T09:14:33Z"
# keywordcategoryregiondepartmentcityposition
1
2
3

Capabilities

Everything you need from Leboncoin — structured and scaled

Our Leboncoin scraper handles every layer of the platform: property listings, DPE diagnostics, seller intelligence, and geolocation data — with Datadome circumvention and session management built in.

Full Property Extraction

Title, description, price, rooms, square metres, property type, and high-resolution image URLs scraped at the listing level.

Energy Diagnostics (DPE/GES)

Capture exact DPE and GES letter grades, numerical consumption values, and estimated annual energy costs for regulatory compliance modelling.

Pro vs Particulier Filtering

Distinguish between private sellers and real estate agencies. Extract SIRET numbers and agency metadata for professional listings.

Granular Geolocation

Extract region, department, city, and postal code data to map listings accurately across the French territory.

Price Drop Tracking

Monitor listing duration and capture original versus current pricing to calculate price drop percentages over time.

Contact Information

Extract revealed phone numbers for both private sellers and agencies via automated GraphQL API interaction.

Search Result Pagination

Iterate through thousands of search result pages based on complex filters like price ranges, property types, and specific departments.

Datadome Circumvention

Bypass Leboncoin's aggressive Datadome anti-bot protection using residential proxies and TLS fingerprint spoofing.

Change Detection

Run continuous pipelines with hash-based diffing to track new listings, sold properties, and price adjustments without redundant data.

// engagement pipeline

From search parameters to warehouse record

Brief in. Clean data out.

Define Scope
d 0

Provide search URLs, department codes, or specific agency profiles. We design the extraction schema together.

Pipeline Build
d 2–4

We configure Scrapy / Playwright crawlers, proxy rotation, session management, and Datadome bypass for leboncoin.fr.

Validation & QA
d 4–6

Schema validation, null-rate checks, location normalisation, and sample records before full launch.

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our Leboncoin pipeline handles the hard parts

Leboncoin employs strict Datadome protection and complex API structures. Here is how we maintain reliable extraction.

pipeline-monitor · leboncoin.fr · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Anti-bot layer
Datadome bypass via TLS fingerprinting

Leboncoin uses Datadome to block automated traffic. Our infrastructure uses French residential proxies combined with strict TLS fingerprint spoofing, realistic HTTP/2 headers, and automated CAPTCHA solving to maintain high success rates.

API extraction
Direct GraphQL interception

Rather than scraping fragile HTML, our crawlers intercept Leboncoin's internal GraphQL API payloads. This ensures cleaner data extraction, captures hidden metadata, and reduces pipeline breakage when the frontend UI changes.

Phone number access
Automated interaction flows

Phone numbers on Leboncoin are masked by default. Our Playwright instances automate the click-to-reveal interactions, managing the necessary session tokens to extract contact details without triggering rate limits.

Data normalisation
Cleaning user-generated content

Private seller listings often contain messy, unstructured data. We apply post-processing layers to normalise property types, extract actual square footage from text blocks, and standardise DPE formatting before delivery.

Change detection
Only re-scrape what's changed

For large regional sweeps, we maintain a hash index of last-seen values per listing. Subsequent runs only push diffs — capturing price drops or delistings — reducing compute cost and downstream processing load.

Applications

Who uses Leboncoin data — and how

Teams across industries use leboncoin.fr data to build competitive products and smarter operations.

01
PropTech Valuation Models

Automated valuation models (AVMs) ingest historical pricing, DPE ratings, and time-on-market metrics to estimate property values accurately.

02
Real Estate Agency Prospecting

Agencies monitor private seller listings (particuliers) in their territory to identify potential mandates and contact leads directly.

03
Market Liquidity Analysis

Institutional investors track listing volume and average time-on-market across different departments to gauge regional real estate liquidity.

04
Investment Yield Calculation

Investors cross-reference sale prices with rental listings in the same postal code to calculate gross rental yields programmatically.

05
Energy Renovation Targeting

Renovation companies target properties with F or G energy ratings (passoires thermiques) to offer compliance upgrades to sellers.

06
Urban Planning Research

Municipalities and researchers analyse housing supply, rental costs, and property types to inform local housing policies.

Why DataFlirt

"Leboncoin holds the highest volume of real estate liquidity data in France. Accessing it programmatically requires bypassing aggressive anti-bot layers."

Extracting data from Leboncoin requires defeating Datadome protection, managing complex GraphQL API payloads, and normalising inconsistent user-generated inputs. DataFlirt handles the proxy rotation, session management, and schema validation so your team can focus on building valuation models and market analysis tools.

Technical Spec

Leboncoin scraper — technical capabilities

Everything supported by our leboncoin.fr scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

Datadome bypass
Automated evasion using French residential proxies and TLS fingerprinting
Supported
GraphQL API extraction
Direct interception of backend payloads for structured metadata
Supported
Pro vs Private filtering
Boolean flags and SIRET extraction for professional accounts
Supported
DPE/GES extraction
Capture both letter grades and numerical consumption estimates
Supported
Historical price tracking
Monitor listings over time to capture price drops and duration
Supported
Phone number extraction
Automated interaction to reveal masked contact numbers
Supported
Change detection (diffs)
Hash-based diff: only emit records with changed fields since last run
Supported
Messaging system automation
Sending automated messages to sellers via the Leboncoin inbox
Partial
Saved searches extraction
Accessing user-specific saved searches and alerts behind login
Partial
Infrastructure

Infrastructure powering the Leboncoin pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Datadome Circumvention

We maintain pools of French residential ISP proxies combined with advanced TLS fingerprint spoofing to bypass Datadome blocks consistently.

GraphQL Payload Engineering

Our crawlers intercept and parse Leboncoin's internal GraphQL requests, extracting structured JSON directly rather than relying on fragile DOM selectors.

Cloud-Native Orchestration

Pipelines run on AWS Lambda (burst) and ECS (sustained). Airflow handles scheduling, dependency management, and SLA alerting. All state stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested — schema versioned per run
CSV
Flat file with typed columns — Excel/Sheets compatible
XLS
Legacy Excel format for direct business user consumption
Parquet
Columnar format for BigQuery, Snowflake, Athena
AWS S3
Direct bucket delivery — compatible with any data lake
Webhook
HTTP POST per record for real-time downstream processing
API
REST endpoint to query your extracted Leboncoin dataset
BigQuery
Streamed directly into your dataset with schema auto-detect
Snowflake
Stage + COPY INTO workflow — incremental or full-replace
Postgres
Upsert into your existing schema with conflict resolution
S3
Direct bucket delivery — compatible with any data lake
// faq

Common questions.

About leboncoin.fr scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Leboncoin legal?

Scraping publicly available real estate listings is generally permissible for analytical purposes. DataFlirt targets only public, non-authenticated property data. We do not extract personal user accounts or circumvent authentication walls. Clients should review Leboncoin's ToS and consult legal counsel for specific use cases.

How do you bypass Datadome?

We use French residential ISP proxies, full Playwright browser sessions with realistic TLS fingerprints, and request timing modelled on human behaviour. We monitor for CAPTCHA challenges and trigger automated solver queues immediately.

Can you extract phone numbers?

Yes. We automate the interaction required to reveal masked phone numbers on listings. This data is extracted and appended to the final payload for both private and professional sellers.

Do you capture DPE and GES ratings?

Yes. We extract the exact letter grades for both DPE and GES, along with the numerical energy consumption estimates and the diagnostic date where provided.

How fast can you scrape a department?

Depending on the total listing volume, a full sweep of a French department typically completes within 2 to 4 hours. We can configure continuous diffing pipelines to run at higher frequencies for specific postal codes.

Can you filter by PRO vs Particulier?

Yes. Our schema includes a boolean flag distinguishing professional agencies from private sellers, allowing you to filter the dataset easily in your warehouse.

What is the minimum viable engagement?

Our smallest packages start at a defined regional scope (e.g., specific departments or cities) with weekly delivery. For national coverage or real-time tracking, we price based on volume and delivery frequency. Contact us with your use case for a scoped quote.

$ dataflirt scope --new-project --source=leboncoin.fr ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off extraction of a specific department or a continuous national feed of real estate listings — we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →