SYSTEM all green source property24.com queue 14,892 pages p99 latency 187ms dataflirt.com · scraper/property24-com
RUN | 41 active pipelines | property24.com live

Property24 data,
at warehouse scale.

We extract residential and commercial listings, historical sales data, erf sizes, and agent intelligence from Property24. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Listings extracted
312K /day
Price updates
45K /24h
Agent profiles
18K /run
Active pipelines
41
Uptime
99.98%
Data Dictionary

Every field we extract from property24.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Residential Listings objects from property24.com. All fields typed and schema-versioned.

p24_idtitlepricebedroomsbathroomsgarageserf_sizefloor_sizedescriptionaddresssuburbcityprovinceimagesagent_idlisted_date
residential_listings
● 200 OK
"p24_id": "113498214",
"title": "3 Bedroom House in Sandton",
"price": 3450000.0,
"bedrooms": 3,
"bathrooms": 2.5,
"garages": 2,
"erf_size": "1200 sqm",
"suburb": "Sandton",
"city": "Johannesburg",
"listed_date": "2023-10-14T08:30:00Z"
# p24_idtitlepricebedroomsbathroomsgarages
1
2
3

Complete list of extractable fields for Commercial Listings objects from property24.com. All fields typed and schema-versioned.

p24_idtitlepriceprice_per_sqmproperty_typegross_lettable_areazoningaddresssuburbcityprovinceagent_idlisted_date
commercial_listings
● 200 OK
"p24_id": "113498215",
"title": "Office Space in Century City",
"price": 45000.0,
"price_per_sqm": 150.0,
"property_type": "Office",
"gross_lettable_area": "300 sqm",
"suburb": "Century City",
"city": "Cape Town",
"listed_date": "2023-10-12T09:15:00Z"
# p24_idtitlepriceprice_per_sqmproperty_typegross_lettable_area
1
2
3

Complete list of extractable fields for Agent Profiles objects from property24.com. All fields typed and schema-versioned.

agent_idnameagencyphone_numberemailactive_listings_countareas_coveredprofile_urlphoto_url
agent_profiles
● 200 OK
"agent_id": "AG78901",
"name": "Jane Doe",
"agency": "Pam Golding Properties",
"phone_number": "+27 82 123 4567",
"active_listings_count": 24,
"areas_covered": "['Sandton', 'Bryanston', 'Morningside']",
"profile_url": "https://www.property24.com/estate-agents/jane-doe/78901"
# agent_idnameagencyphone_numberemailactive_listings_count
1
2
3

Complete list of extractable fields for Historical Sales objects from property24.com. All fields typed and schema-versioned.

transfer_datepriceaddresserf_numberproperty_typebuyer_nameseller_nametitle_deed_number
historical_sales
● 200 OK
"transfer_date": "2022-05-18",
"price": 2850000.0,
"address": "14 Oak Avenue, Rondebosch",
"erf_number": "12345",
"property_type": "Freehold",
"title_deed_number": "T12345/2022",
"buyer_name": "Private Person"
# transfer_datepriceaddresserf_numberproperty_typebuyer_name
1
2
3

Complete list of extractable fields for Suburb Trends objects from property24.com. All fields typed and schema-versioned.

suburbcityprovinceaverage_asking_priceaverage_sale_pricesales_volume_yoydemographic_dataactive_listings_count
suburb_trends
● 200 OK
"suburb": "Rondebosch",
"city": "Cape Town",
"average_asking_price": 3200000.0,
"average_sale_price": 2950000.0,
"sales_volume_yoy": 145,
"active_listings_count": 87,
"demographic_data": "Available"
# suburbcityprovinceaverage_asking_priceaverage_sale_pricesales_volume_yoy
1
2
3

Capabilities

Complete Property24 intelligence - structured for analysis

Our Property24 scraper navigates map-based searches, dynamic phone reveals, and pagination limits to extract every listing, agent profile, and historical sale record.

Residential & Commercial Data

Extract houses, apartments, farms, and commercial spaces with property-specific metadata and dimensional attributes.

Agent & Agency Intelligence

Capture agent names, contact details, agency affiliations, and active portfolio sizes across all regions.

Historical Property Values

Extract past transfer dates, sale prices, and erf numbers from the Property Values section to track appreciation.

Spatial & POI Extraction

Capture latitude, longitude, and nearby points of interest like schools and hospitals for spatial analysis.

Price Tracking & Reductions

Monitor asking price changes, listing durations, and delist events across suburbs to gauge market velocity.

Erf & Floor Metrics

Extract structured dimensions for land size, floor area, and zoning categories for accurate valuations.

High-Resolution Images

Scrape full-resolution image URLs for property galleries, floor plans, and virtual tour links.

Suburb Trends Aggregation

Aggregate demographic data, average asking prices, and sales volumes by neighbourhood.

Scheduled Diffs

Run daily diffs to identify new listings, sold properties, and price reductions without processing full payloads.

// engagement pipeline

From search parameters to warehouse record

Brief in. Clean data out.

Define Scope
d 0

Provide target suburbs, property types, or agency profiles. We design the extraction schema together.

Pipeline Build
d 2–4

We configure Scrapy / Playwright crawlers, proxy rotation, session management, and CAPTCHA handling for property24.com.

Validation & QA
d 4–6

Schema validation, null-rate checks, price-outlier detection, and sample validation before full launch.

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our Property24 pipeline handles the hard parts

Property portals deploy aggressive anti-scraping measures to protect their inventory. Here is how we maintain reliable extraction pipelines.

pipeline-monitor · property24.com · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Pagination limits
Algorithmic grid subdivision for total coverage

Property24 limits search results to a fixed number of pages. We bypass this by programmatically sub-dividing geographic queries using map bounding boxes to ensure total coverage of dense metropolitan areas.

Dynamic phone reveals
JavaScript execution for contact details

Agent contact numbers require a click event to reveal. We use Playwright to simulate user interaction and extract the unmasked numbers without triggering bot detection heuristics.

Anti-bot layer
South African residential proxies

Property portals use Cloudflare and custom WAFs to block datacenter IPs. Our crawlers use South African residential proxies and realistic browser fingerprints to bypass geographic and reputation-based blocks.

Schema stability
Unified extraction across agency templates

Listing layouts vary significantly between standard listings and premium agency templates. Our selectors use multiple fallback chains to normalise data across all template variations.

Change detection
Delta extraction for large catalogues

For national property feeds, we maintain a hash index of last-seen values per listing. Subsequent runs only push diffs, reducing downstream processing load and storage costs.

Applications

Who uses Property24 data - and how

Teams across industries use property24.com data to build competitive products and smarter operations.

01
Real Estate Agencies

Monitor competitor listings, agent performance, and market share across specific suburbs to optimise pricing strategies.

02
PropTech Startups

Feed raw listing data and historical sales into automated valuation models and investment algorithms.

03
Property Developers

Analyse suburb trends, average asking prices, and time-on-market to identify lucrative development zones.

04
Financial Institutions

Use historical transfer data and current asking prices to calibrate mortgage risk models and collateral valuations.

05
Retail & Logistics

Analyse commercial property availability and pricing for site selection and expansion planning.

06
Market Researchers

Track macro-economic indicators through housing market velocity, price reductions, and inventory levels.

Why DataFlirt

"Property24 holds the definitive record of the South African real estate market, but extracting it requires navigating complex map grids and dynamic rendering."

Most teams underestimate the investment required: reliable Property24 scraping requires South African residential proxies, full JavaScript rendering for contact reveals, and geographic query subdivision to bypass pagination limits. DataFlirt absorbs that complexity so your engineers can focus on the analysis, not the infrastructure.

Technical Spec

Property24 scraper - technical capabilities

Everything supported by our property24.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

JavaScript rendering
Full Playwright sessions for phone reveals and map interactions
Supported
CAPTCHA bypass
Automated 2Captcha + CapSolver integration
Supported
ZA Residential proxies
ISP-grade residential IPs from South Africa
Supported
Bounding box subdivision
Algorithmic grid search to bypass 50-page limits
Supported
Historical sales extraction
Property Values section data extraction including transfer dates
Supported
Premium listing normalisation
Unified schema across standard and premium agency templates
Supported
Change detection (diffs)
Hash-based diff: only emit records with changed fields
Supported
User saved searches
Access to private user saved search criteria and alerts
Partial
Direct agent messages
Extraction of private messages sent via the portal
Partial
Infrastructure

Infrastructure powering the Property24 pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheusPostGIS
Scrapy + Playwright Stack

Scrapy handles crawl orchestration, deduplication, and retry logic. Playwright handles JavaScript rendering, cookie sessions, and interaction flows.

Residential Proxy Infrastructure

We maintain pools of residential ISP proxies across South Africa. Rotation happens per-request with sticky sessions where required.

Cloud-Native Orchestration

Pipelines run on AWS Lambda and ECS. Airflow handles scheduling, dependency management, and SLA alerting. All state stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested
CSV
Flat file with typed columns
XLS
Excel compatible format for analysts
Parquet
Columnar format for BigQuery, Snowflake, Athena
AWS S3
Direct bucket delivery
Webhook
HTTP POST per record for real-time downstream processing
API
REST endpoint for on-demand queries
PostgreSQL
Upsert into your existing schema with conflict resolution
S3
Direct bucket delivery — compatible with any data lake
// faq

Common questions.

About property24.com scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Property24 legal?

Scraping publicly available information is generally permissible under applicable law. DataFlirt targets only public, non-authenticated property and agent data. We do not extract personal user data or circumvent authentication walls.

How do you bypass Property24's pagination limits?

Property24 restricts search results to a maximum number of pages. We bypass this by programmatically sub-dividing large geographic areas into smaller map bounding boxes, ensuring total extraction coverage.

Can you extract agent phone numbers?

Yes. Phone numbers on Property24 are often masked and require a click to reveal. We use Playwright to execute the necessary JavaScript and extract the unmasked numbers.

Do you extract historical sales data?

Yes, we can extract historical transfer dates, sale prices, and erf numbers from the Property Values and Trends sections.

How fresh is the data?

Daily pipelines capture new listings, price changes, and delisted properties within a 24-hour window. Real-time monitoring can be configured for specific high-value suburbs.

Do you support commercial properties?

Yes. Our schema handles both residential and commercial listings, normalising fields like gross lettable area and price per square metre.

Can I request a sample dataset before committing?

Absolutely. We provide a sample run of up to 500 listings or a specific suburb as part of the pre-engagement scoping process to validate schema fit and data quality.

$ dataflirt scope --new-project --source=property24.com ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a daily feed of active listings or a historical dump of property values across South Africa - we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →