
French real estate data, extracted at scale.

We extract property listings, agency details, DPE energy ratings, and historical pricing from Seloger. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your defined schedule.

Properties extracted: 1.4M /week
Price updates: 342K /24h
Agency records: 41K /run
Active pipelines: 84
Uptime: 99.94%

Data Dictionary

Every field we extract from seloger.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Property Listings objects from seloger.com. All fields typed and schema-versioned.

Fields: listing_id, property_type, transaction_type, price, currency, area_sqm, rooms, bedrooms, location_city, location_postal, dpe_rating, ges_rating, agency_id, description, url

Sample record (property_listings, 200 OK):
{
  "listing_id": "194827491",
  "property_type": "Apartment",
  "transaction_type": "Sale",
  "price": 450000.0,
  "area_sqm": 65.5,
  "rooms": 3,
  "location_city": "Paris",
  "location_postal": "75011",
  "dpe_rating": "D",
  "ges_rating": "B"
}

Complete list of extractable fields for Agency Data objects from seloger.com. All fields typed and schema-versioned.

Fields: agency_id, name, address, postal_code, city, phone_number, website, active_listings_count, agency_fees_pct, siret

Sample record (agency_data, 200 OK):
{
  "agency_id": "847291",
  "name": "Century 21 Bastille",
  "address": "14 Rue de la Roquette",
  "postal_code": "75011",
  "city": "Paris",
  "active_listings_count": 42,
  "agency_fees_pct": 5.0,
  "siret": "39482910200014"
}

Complete list of extractable fields for Pricing & Market objects from seloger.com. All fields typed and schema-versioned.

Fields: listing_id, current_price, original_price, price_drop_pct, days_on_market, price_per_sqm, estimated_rental_yield, property_tax, hoa_fees

Sample record (Pricing & Market, 200 OK):
{
  "listing_id": "194827491",
  "current_price": 450000.0,
  "original_price": 465000.0,
  "price_drop_pct": 3.2,
  "days_on_market": 45,
  "price_per_sqm": 6870.2,
  "property_tax": 950.0,
  "hoa_fees": 120.0
}
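
The derived fields in the Pricing & Market sample can be reproduced from the raw values. The sketch below is an illustration rather than the production logic: it assumes the drop is measured against the original asking price, that values are rounded to one decimal, and that days on market is counted from a first-seen date. The function names and the example dates are hypothetical.

```python
from datetime import date


def price_drop_pct(original_price: float, current_price: float) -> float:
    """Percentage reduction relative to the original asking price."""
    if original_price <= 0:
        raise ValueError("original_price must be positive")
    return round((original_price - current_price) / original_price * 100, 1)


def price_per_sqm(price: float, area_sqm: float) -> float:
    """Asking price divided by floor area in square metres."""
    if area_sqm <= 0:
        raise ValueError("area_sqm must be positive")
    return round(price / area_sqm, 1)


def days_on_market(first_seen: date, as_of: date) -> int:
    """Whole days between the first observation and the reference date."""
    return (as_of - first_seen).days


# Values taken from the sample records for listing 194827491.
print(price_drop_pct(465000.0, 450000.0))  # 3.2
print(price_per_sqm(450000.0, 65.5))       # 6870.2
```

With the sample values, both results match the price_drop_pct and price_per_sqm shown in the record.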

Complete list of extractable fields for Building & Amenities objects from seloger.com. All fields typed and schema-versioned.

Fields: listing_id, year_built, floor_number, total_floors, has_elevator, has_balcony, parking_spaces, heating_type, internet_type

Sample record (Building & Amenities, 200 OK):
{
  "listing_id": "194827491",
  "year_built": 1975,
  "floor_number": 4,
  "total_floors": 6,
  "has_elevator": true,
  "has_balcony": false,
  "parking_spaces": 1,
  "heating_type": "Collective electric"
}

Complete list of extractable fields for Search Results objects from seloger.com. All fields typed and schema-versioned.

Fields: query_location, query_type, page_number, position, listing_id, is_promoted, new_build_flag, scraped_at, listing_url

Sample record (search_results, 200 OK):
{
  "query_location": "Lyon",
  "query_type": "Buy",
  "page_number": 1,
  "position": 3,
  "listing_id": "203948172",
  "is_promoted": true,
  "new_build_flag": false,
  "scraped_at": "2026-08-14T10:22:00Z"
}

Capabilities

Deep extraction of the French real estate market

Seloger protects its data with aggressive Datadome integration. We bypass these protections to extract clean property metadata, energy certifications, and agency details at scale.

Comprehensive Property Metadata

Extract price, square metres, room counts, floor levels, and detailed textual descriptions for apartments, houses, and commercial spaces.

DPE & GES Extraction

Capture the official Energy Performance Certificate (DPE) and Greenhouse Gas emission (GES) ratings critical for French real estate compliance.

Price Drop Tracking

Monitor listings over time to detect price reductions, calculating the exact percentage drop and tracking days on market.

Agency Intelligence

Map properties to specific real estate agencies. Extract SIRET numbers, contact details, and calculate total portfolio value per agency.

Precise Geolocation

Extract latitude and longitude coordinates hidden in the frontend state for precise mapping and spatial analysis.

New Build Detection

Isolate VEFA (Vente en l'État Futur d'Achèvement, i.e. off-plan sales) projects, including delivery dates and eligible tax-reduction schemes such as Pinel.

Financial Breakdown

Split a fee-inclusive (FAI) asking price into the net seller price and the agency fees. Extract monthly HOA charges and estimated annual property taxes.

Media Extraction

Retrieve high-resolution image URLs, floor plan links, and virtual tour endpoints associated with each listing.

Continuous Market Sync

Run scheduled diffs on specific postal codes or cities, hourly where needed, to capture new listings soon after publication.

// engagement pipeline

From target coordinates to structured database

Brief in. Clean data out.

Define Scope (day 0)

Provide postal codes, cities, or property types. We design the extraction schema covering required fields and update frequency.

Pipeline Build (days 2–4)

We configure Playwright crawlers, implement Datadome bypass logic, and map the Seloger GraphQL API responses.

Validation & QA (days 4–6)

Schema validation, null-rate checks on critical fields like DPE, and price anomaly detection before full production launch.

Delivery (ongoing)

JSON, CSV, or Parquet pushed to your S3 bucket or BigQuery dataset, or sent via webhook, on your defined cadence.
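
The Validation & QA step above can be pictured with a small sketch. It is a simplified illustration under stated assumptions, not the production checks: the list of critical fields, the median-based outlier rule, and the factor of 5 are all hypothetical choices.

```python
from statistics import median

# Assumption: which fields count as "critical" is a per-project decision.
CRITICAL_FIELDS = ("price", "area_sqm", "dpe_rating")


def null_rates(records, fields=CRITICAL_FIELDS):
    """Share of records in which each field is missing or None."""
    total = len(records)
    if total == 0:
        return {field: 0.0 for field in fields}
    return {
        field: sum(1 for r in records if r.get(field) is None) / total
        for field in fields
    }


def price_outliers(records, factor=5.0):
    """IDs whose price per sqm is `factor` times above or below the batch median."""
    per_sqm = {
        r["listing_id"]: r["price"] / r["area_sqm"]
        for r in records
        if r.get("price") and r.get("area_sqm")
    }
    if not per_sqm:
        return []
    mid = median(per_sqm.values())
    return [
        listing_id
        for listing_id, value in per_sqm.items()
        if value > mid * factor or value < mid / factor
    ]
```

A batch would be held back from delivery when a null rate exceeds an agreed threshold or when flagged listings need review, for example a price entered with two extra zeros.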

Under the hood

Handling Seloger's technical barriers

Extracting data from Seloger requires defeating enterprise-grade bot protection and decoding complex frontend state. We handle the infrastructure so you can focus on the data.

pipeline-monitor · seloger.com · live (active)
Identity rotation: TLS fingerprint randomised, user-agent rotated, IP pool residential, challenges blocked 0
Page coverage: 48,291 pages queued (running)
Pipeline health: 99.9% uptime, 142ms p99 latency, 0.3% null rate, 2 alerts

Anti-bot evasion
Datadome bypass via TLS fingerprinting

Seloger relies on Datadome for bot mitigation. We use French residential proxies combined with custom TLS fingerprinting and Playwright stealth plugins to obtain valid Datadome clearance cookies, in most cases without triggering a CAPTCHA.

API interception
GraphQL endpoint decoding

Rather than scraping the DOM, our pipelines intercept the underlying GraphQL queries used by Seloger's frontend. This provides cleaner data, exposes hidden fields like precise coordinates, and reduces bandwidth overhead.

Pagination handling
Map-based spatial crawling

Seloger limits standard search pagination to 100 pages. For dense areas like Paris, we use spatial crawling: the map is divided into progressively smaller bounding boxes until each box's results fit within that limit, so listings beyond page 100 are not missed.
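
The bounding-box subdivision described above can be sketched as a recursive quadrant split. This is a minimal illustration, not the production crawler: `count_listings` is a hypothetical callback that reports how many results a box returns, and `max_results` has to be derived from the 100-page cap and the page size, which is not stated here.

```python
from typing import Callable, List, NamedTuple


class BBox(NamedTuple):
    south: float
    west: float
    north: float
    east: float


def split(box: BBox) -> List[BBox]:
    """Cut a box into four equal quadrants."""
    mid_lat = (box.south + box.north) / 2
    mid_lon = (box.west + box.east) / 2
    return [
        BBox(box.south, box.west, mid_lat, mid_lon),
        BBox(box.south, mid_lon, mid_lat, box.east),
        BBox(mid_lat, box.west, box.north, mid_lon),
        BBox(mid_lat, mid_lon, box.north, box.east),
    ]


def plan_crawl(
    box: BBox,
    count_listings: Callable[[BBox], int],
    max_results: int,
    max_depth: int = 12,
) -> List[BBox]:
    """Return boxes small enough that each stays under the result cap."""
    # Stop when the box fits, or when the depth guard is reached.
    if count_listings(box) <= max_results or max_depth == 0:
        return [box]
    tiles: List[BBox] = []
    for quadrant in split(box):
        tiles.extend(plan_crawl(quadrant, count_listings, max_results, max_depth - 1))
    return tiles
```

Listings that sit on a shared edge may appear in two neighbouring boxes, so results would still need de-duplicating by listing_id after the crawl.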

State extraction
Next.js hydration parsing

Critical property metadata is often embedded within the Next.js hydration state (__NEXT_DATA__). We parse this JSON blob directly to extract agency SIRETs and DPE scores that may not render immediately in the viewport.
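
As a rough sketch of the hydration parsing described above, the JSON blob can be pulled out of the page with the standard library alone. The `__NEXT_DATA__` script id and the `props.pageProps` nesting are Next.js conventions; any key paths below that level, such as the ones in the usage note, are hypothetical and would have to be read off a real page.

```python
import json
from html.parser import HTMLParser
from typing import Any, Optional


class _NextDataParser(HTMLParser):
    """Collects the text inside <script id="__NEXT_DATA__">."""

    def __init__(self) -> None:
        super().__init__()
        self._inside = False
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("id") == "__NEXT_DATA__":
            self._inside = True

    def handle_endtag(self, tag):
        if tag == "script":
            self._inside = False

    def handle_data(self, data):
        if self._inside:
            self.chunks.append(data)


def extract_next_data(html: str) -> Optional[dict]:
    """Return the parsed hydration state, or None if the tag is absent."""
    parser = _NextDataParser()
    parser.feed(html)
    parser.close()
    raw = "".join(parser.chunks)
    return json.loads(raw) if raw else None


def dig(obj: Any, *path: str, default: Any = None) -> Any:
    """Walk nested dicts safely, returning `default` when a key is missing."""
    for key in path:
        if isinstance(obj, dict) and key in obj:
            obj = obj[key]
        else:
            return default
    return obj
```

A call such as `dig(state, "props", "pageProps", "listing", "agency", "siret")` then returns the value or None without raising when the page layout changes.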

Diff processing
Efficient market updates

We hash property records to detect changes in price or status. Your downstream systems receive only the modified records, preventing storage bloat and reducing processing costs for daily market monitoring.
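
The hashing approach described above might look roughly like the sketch below. It assumes SHA-256 over a canonical JSON encoding of the tracked fields and a stored map of listing_id to fingerprint from the previous run; the `status` field name and the function names are hypothetical.

```python
import hashlib
import json
from typing import Dict, List, Tuple

# Assumption: only these fields should trigger a downstream update.
TRACKED_FIELDS = ("price", "status")


def fingerprint(record: dict, fields=TRACKED_FIELDS) -> str:
    """Stable hash of the tracked fields, independent of key order."""
    subset = {field: record.get(field) for field in fields}
    canonical = json.dumps(subset, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def diff_snapshot(
    previous: Dict[str, str], records: List[dict]
) -> Tuple[List[dict], Dict[str, str]]:
    """Return (new or changed records, fingerprints to store for the next run)."""
    current: Dict[str, str] = {}
    changed: List[dict] = []
    for record in records:
        digest = fingerprint(record)
        current[record["listing_id"]] = digest
        if previous.get(record["listing_id"]) != digest:
            changed.append(record)
    return changed, current
```

On the first run every record counts as changed; on later runs only listings whose price or status moved are emitted. Listings that disappear from the source would need a separate check against the previous map.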

Applications

Who uses Seloger data

Teams across industries use seloger.com data to build competitive products and smarter operations.

01 PropTech Valuation Models

Automated valuation models (AVMs) consume price-per-square-metre data and historical asking-price trends to price new properties accurately.

02 Real Estate Investment

Funds and individual investors scan the market for high-yield rental opportunities or properties requiring renovation to improve DPE ratings.

03 Agency Competitor Analysis

Brokerages monitor competitor portfolios, track average days on market, and identify agencies losing market share in specific districts.

04 Urban Planning & Research

Municipalities and researchers analyse housing supply, energy efficiency distribution (DPE), and rent inflation across different regions.

05 Lead Generation

B2B service providers target real estate agencies based on their active listing volume and specific property types.

06 Mortgage & Insurance Profiling

Financial institutions assess regional property values and building risks to adjust lending criteria and insurance premiums.

Why DataFlirt

"Seloger contains the definitive dataset for French real estate, but accessing it requires bypassing aggressive Datadome protection and reverse engineering complex GraphQL queries."

Most engineering teams fail at scraping Seloger because Datadome blocks standard HTTP clients immediately. DataFlirt handles the proxy rotation, TLS fingerprinting, and CAPTCHA solving required to maintain a stable pipeline. Your team gets structured property records without managing infrastructure.

Technical Spec

Seloger scraper technical specifications

Everything supported by our seloger.com scraper: rendered SPA elements, rate-limit evasion, and more. Features that need an authenticated session are only partially supported.

Datadome bypass (Supported): Automated clearance cookie generation using residential IPs and stealth browsers
GraphQL interception (Supported): Direct extraction from Seloger's undocumented backend APIs
DPE / GES parsing (Supported): Extraction of official energy and greenhouse gas ratings
Geo-coordinate extraction (Supported): Latitude and longitude data extracted from map state
High-res image URLs (Supported): Complete array of property photos without watermarks where available
Agency SIRET extraction (Supported): Corporate identification numbers for agency mapping
Historical price drops (Supported): Detection of price changes based on listing ID tracking
Direct messaging to agency brokers (Partial): Automated submission of contact forms requires user interaction and validation
User saved searches and alerts (Partial): Requires authenticated user sessions and account credentials

Infrastructure

Infrastructure built for scale

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

Scrapy, Playwright, Python 3.12, Redis, PostgreSQL, Apache Airflow, AWS Lambda, S3, CloudWatch, 2Captcha, CapSolver, Residential Proxies, Docker, Kubernetes, Grafana, Prometheus, Datadome Bypass Modules

Anti-Bot Infrastructure

We maintain custom Playwright stealth configurations and TLS fingerprinting tools specifically tuned to bypass Datadome's behavioural analysis.

French Residential Proxies

Seloger strictly geo-blocks non-French traffic. We utilise exclusive pools of French ISP proxies to ensure high success rates and low latency.

GraphQL Parsing Engine

Our pipelines map Seloger's GraphQL schema directly to our normalisation layer, ensuring schema stability even when the visual DOM changes.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON: Nested structures ideal for NoSQL databases
CSV: Flat files for immediate analyst use
XLS: Excel format for business stakeholders
Parquet: Columnar storage for efficient querying
AWS S3: Direct bucket delivery to your cloud storage, compatible with any data lake
Webhook: Real-time POST requests for new listings
API: On-demand programmatic access to scraped records
BigQuery: Direct streaming inserts into your data warehouse
// faq

Common questions.

About seloger.com scraping, legality, and pipeline operations.

How do you handle Seloger's Datadome protection?

We use a combination of French residential proxies, custom TLS fingerprinting, and Playwright stealth plugins. This approach generates valid behavioural metrics that Datadome accepts, allowing us to extract data without triggering blocks.

Can you extract exact geographical coordinates?

Yes. While Seloger obfuscates exact addresses on the frontend, the underlying map state and GraphQL responses often contain precise latitude and longitude coordinates which we extract and normalise.

How frequently can you update the property data?

For targeted postal codes or specific cities, we can run hourly diff pipelines. For full national coverage, we recommend a daily or weekly cadence to manage infrastructure costs effectively.

Do you extract DPE and GES energy ratings?

Yes. Energy Performance Certificates (DPE) and Greenhouse Gas (GES) ratings are mandatory in France, and we extract them as structured fields whenever the listing publishes them.

Can we track price changes on properties?

Yes. We use the unique listing IDs to track properties over time. Our change-detection system logs any price reductions, calculating the percentage drop and the total days the property has been on the market.

Is it legal to scrape French real estate data?

Extracting publicly available factual data, such as property prices and floor areas, is generally permissible. DataFlirt does not extract personal user data or bypass authentication walls. Clients must ensure their specific use cases comply with French law and the GDPR.

$ dataflirt scope --new-project --source=seloger.com ready

Tell us what to extract. We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Stop fighting Datadome blocks. Tell us the regions and property types you need, and we will deliver clean, structured Seloger data directly to your infrastructure.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h