SYSTEM all green source hepsiemlak.com queue 12,941 pages p99 latency 218ms dataflirt.com · scraper/hepsiemlak-com
RUN · 64 active pipelines · hepsiemlak.com live

Hepsiemlak data,
at warehouse scale.

We extract residential, commercial, and land listings, agent intelligence, and Emlak Endeksi valuation data from Hepsiemlak. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Postgres on your cadence.

Properties extracted
84,210 /day
Price updates
315K /24h
Agent records
18,450 /run
Active pipelines
64
Uptime
99.94%
Data Dictionary

Every field we extract from hepsiemlak.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Property Listings objects from hepsiemlak.com. All fields typed and schema-versioned.

listing_idtitlepricecurrencycitydistrictneighborhoodproperty_typeroom_countgross_sqmnet_sqmbuilding_agefloorheating_typedate_listedagent_id
property_listings
● 200 OK
"listing_id": "12345678",
"title": "Kadikoy Moda 3+1 Deniz Manzarali Daire",
"price": 14500000,
"currency": "TRY",
"city": "Istanbul",
"district": "Kadikoy",
"room_count": "3+1",
"gross_sqm": 145,
"building_age": "5-10",
"heating_type": "Kombi (Dogalgaz)"
# listing_idtitlepricecurrencycitydistrict
1
2
3

Complete list of extractable fields for Pricing & Valuation objects from hepsiemlak.com. All fields typed and schema-versioned.

listing_idcurrent_priceoriginal_priceprice_drop_pctprice_per_sqmemlak_endeksi_valueestimated_rentroi_yearsdeposit_amountdues_amountprice_history
pricing_& valuation
● 200 OK
"listing_id": "12345678",
"current_price": 14500000,
"original_price": 15000000,
"price_drop_pct": 3.33,
"price_per_sqm": 100000,
"dues_amount": 1500,
"emlak_endeksi_value": 14250000,
"roi_years": 18
# listing_idcurrent_priceoriginal_priceprice_drop_pctprice_per_sqmemlak_endeksi_value
1
2
3

Complete list of extractable fields for Location & Map objects from hepsiemlak.com. All fields typed and schema-versioned.

listing_idlatitudelongitudecitydistrictneighborhoodstreetmap_zoomnearby_transitnearby_schoolsdistance_to_center
location_& map
● 200 OK
"listing_id": "12345678",
"latitude": 40.9801,
"longitude": 29.0234,
"city": "Istanbul",
"district": "Kadikoy",
"neighborhood": "Caferaga",
"nearby_transit": "['Moda Tram', 'Kadikoy Metro']",
"distance_to_center": "1.2 km"
# listing_idlatitudelongitudecitydistrictneighborhood
1
2
3

Complete list of extractable fields for Agent & Agency objects from hepsiemlak.com. All fields typed and schema-versioned.

agent_idagent_nameagency_nameagency_idphone_numbermobile_numberactive_listings_counttotal_salesjoined_dateoffice_addressagency_logo_url
agent_& agency
● 200 OK
"agent_id": "A98765",
"agent_name": "Ahmet Yilmaz",
"agency_name": "Moda Emlak Danismanlik",
"agency_id": "AG1234",
"mobile_number": "+905551234567",
"active_listings_count": 42,
"joined_date": "2018-04-12",
"office_address": "Caferaga Mah. Moda Cad. No:15 Kadikoy / Istanbul"
# agent_idagent_nameagency_nameagency_idphone_numbermobile_number
1
2
3

Complete list of extractable fields for Features & Amenities objects from hepsiemlak.com. All fields typed and schema-versioned.

listing_idinterior_featuresexterior_featuresneighborhood_featurestransportationfacadeviewbuilding_featuresparking_typesecurity_features
features_& amenities
● 200 OK
"listing_id": "12345678",
"interior_features": "['Balkon', 'Ebeveyn Banyosu', 'Parke Zemin', 'Klima']",
"exterior_features": "['Asansor', 'Isi Yalitim', 'Su Deposu']",
"view": "['Deniz', 'Sehir']",
"parking_type": "Kapali Otopark",
"security_features": "['7/24 Guvenlik', 'Kamera Sistemi']"
# listing_idinterior_featuresexterior_featuresneighborhood_featurestransportationfacade
1
2
3

Capabilities

Complete Turkish property intelligence

Our Hepsiemlak scraper handles dynamic map searches, heavily paginated listing categories, agent directories, and regional index data with built-in proxy rotation and anti-bot circumvention.

Residential & Commercial

Extract flats, villas, offices, and retail spaces with category-specific attributes like room counts, gross/net square metres, and heating types.

Land & Plot Details

Capture zoning status, floor area ratio (KAKS/TAKS), and parcel numbers for land listings across all Turkish provinces.

Real-Time Price Tracking

Monitor asking prices, rent drops, and per-square-metre valuations across regions. Timestamped per crawl.

Emlak Endeksi Extraction

Scrape regional property index data, average listing durations, and historical pricing trends by district and neighborhood.

Agent & Agency Portfolios

Map the Turkish real estate broker network, tracking listing volumes, contact details, and market share per agency.

Geolocation Data

Extract precise latitude/longitude coordinates and structural boundary metadata directly from internal map APIs.

Amenity & Feature Parsing

Normalise unstructured interior, exterior, and building features into queryable arrays for advanced filtering.

Media Metadata

Capture high-resolution image URLs, floor plans, and flags for 3D virtual tour availability.

Historical Change Detection

Track property time-on-market, status changes, and price adjustments over time using hash-based state diffing.

Regional Aggregation

Run broad searches across specific cities like Istanbul, Ankara, or Izmir at scale without hitting pagination caps.

// engagement pipeline

From search query to warehouse record

Brief in. Clean data out.

Define Scope
d 0

Provide target cities, property types, or agency IDs. We design the extraction schema together.

Pipeline Build
d 2–4

We configure Scrapy / Playwright crawlers, proxy rotation, and session management for hepsiemlak.com.

Validation & QA
d 4–6

Schema validation, null-rate checks, price-outlier detection, and sample listings before full launch.

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Postgres on agreed cadence.

Under the hood

Bypassing real estate portal protections

Hepsiemlak limits pagination depth, monitors request velocity, and obfuscates agent contact details. Here is how we maintain pipeline stability.

pipeline-monitor · hepsiemlak.com · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Geographic proxy targeting
Turkish residential IPs

We route requests through Turkish residential ISP proxies to bypass regional blocks and WAF rules that target data centre IP ranges.

Pagination limit circumvention
Micro-neighborhood polygon searches

Hepsiemlak caps search results at 50 pages. We programmatically sub-divide broad city searches into granular neighborhood polygons and price brackets to extract every listing without hitting the cap.

Phone number rendering
JavaScript click simulation

Agent contact numbers are obfuscated on listing pages. We execute JavaScript via Playwright to simulate user clicks, exposing and capturing the underlying phone numbers.

Map API interception
Direct XHR parsing

Instead of scraping DOM elements for location data, we intercept XHR requests to the internal map API, extracting precise latitude/longitude coordinates from the JSON payloads.

Change detection
Hash-based diffing

For daily market sweeps, we maintain a hash index of last-seen values per listing. Subsequent runs only push new listings, sold properties, or price updates, reducing downstream load.

Applications

Who uses Hepsiemlak data — and how

Teams across industries use hepsiemlak.com data to build competitive products and smarter operations.

01
Property Valuation Models

PropTech firms build automated valuation models (AVMs) using historical listing data and Emlak Endeksi trends.

02
Investment Analysis

Real estate funds identify undervalued assets and high-yield rental opportunities across Istanbul and Ankara.

03
Competitor Intelligence

Agencies monitor rival broker portfolios, listing durations, and market share per district.

04
Urban Planning & Research

Consultancies track urban expansion, gentrification trends, and housing supply metrics.

05
Lead Generation

B2B service providers extract agency contact details to pitch CRM software, photography, or staging services.

06
Market Trend Reporting

Media and analysts aggregate price-per-square-metre data to publish regional real estate reports.

Why DataFlirt

"Hepsiemlak contains the most comprehensive record of Turkish real estate dynamics, but extracting it requires navigating aggressive pagination limits and dynamic map APIs."

Building a reliable pipeline for Turkish property data means handling complex location hierarchies, bypassing strict request rate limits, and rendering JavaScript to expose contact details. DataFlirt absorbs this infrastructure overhead so your analysts can focus on market trends, not crawler maintenance.

Technical Spec

Hepsiemlak scraper — technical capabilities

Everything supported by our hepsiemlak.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

JavaScript rendering
Full Playwright sessions for map interactions and contact number exposure
Supported
Residential proxy rotation
Turkish ISP-grade IPs to bypass Cloudflare and regional restrictions
Supported
Deep pagination bypass
Filter injection to extract listings beyond the standard 50-page cap
Supported
Map API extraction
Direct parsing of JSON payloads from internal mapping endpoints
Supported
Emlak Endeksi historicals
Extraction of regional valuation charts and trend data
Supported
Agent portfolio mapping
Cross-referencing agency IDs to build complete broker profiles
Supported
Change detection (diffs)
Hash-based diff to emit only changed listings since last run
Supported
Webhook delivery
HTTP POST per record for real-time alerting on new listings
Supported
User saved searches
Access to authenticated user saved search alerts and favorite lists
Partial
Private messaging data
Extraction of direct messages sent to agents via the platform
Partial
Infrastructure

Infrastructure powering the Hepsiemlak pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Scrapy + Playwright Stack

Scrapy handles crawl orchestration and deduplication. Playwright handles JavaScript rendering, cookie sessions, and map interactions.

Residential Proxy Infrastructure

We maintain pools of Turkish residential ISP proxies. Rotation happens per-request to avoid WAF blocks and rate limits.

Cloud-Native Orchestration

Pipelines run on AWS Lambda and ECS. Airflow handles scheduling, dependency management, and SLA alerting. State stored in Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested — schema versioned per run
CSV
Flat file with typed columns — Excel/Sheets compatible
Parquet
Columnar format for BigQuery, Snowflake, Athena
AWS S3
Direct bucket delivery — compatible with any data lake
Webhook
HTTP POST per record for real-time downstream processing
API
REST endpoints for on-demand query access to extracted data
XLS
Legacy Excel format for offline analyst workflows
PostgreSQL
Direct upsert into your existing relational schema
S3
Direct bucket delivery — compatible with any data lake
// faq

Common questions.

About hepsiemlak.com scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Hepsiemlak legal?

Scraping public real estate listings is generally permissible. We extract only public property and agency data, avoiding authenticated user areas.

How do you bypass the 50-page limit on searches?

We programmatically subdivide broad searches into smaller geographic polygons or apply granular price/size filters to ensure every listing is captured without hitting pagination caps.

Can you extract agent phone numbers?

Yes. We use Playwright to simulate the click interactions required to reveal obfuscated phone numbers on listing pages.

Do you capture Emlak Endeksi data?

Yes, we extract the regional property index data, including historical price trends and average marketing durations for specific districts.

How fresh is the data?

Pipeline cadences are configurable. We can run daily full-market sweeps or hourly updates for specific high-velocity districts in Istanbul.

Can you track price drops over time?

Yes. By maintaining state across pipeline runs, we calculate price deltas and flag listings that have been discounted since their original publish date.

What is the minimum viable engagement?

Our smallest packages cover specific Turkish provinces or property types with weekly delivery. Contact us for a scoped quote.

$ dataflirt scope --new-project --source=hepsiemlak.com ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a daily feed of Istanbul rental listings or a historical database of Turkish land prices — we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →