
Singapore property data,
at warehouse scale.

We extract HDB resale flats, private condos, landed properties, agent directories, and rental data from PropertyGuru. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Listings extracted: 214K /run
Price updates: 18.5K /24h
Agent profiles: 34K /run
Active pipelines: 17
Uptime: 99.98%
Data Dictionary

Every field we extract from propertyguru.com.sg

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Sale Listings objects from propertyguru.com.sg. All fields typed and schema-versioned.

Fields: listing_id, title, price_sgd, price_psf, district, address, property_type, beds, baths, area_sqft, tenure, developer, agent_id, mrt_distance_m, scraped_at
sale_listings · 200 OK
{
  "listing_id": "24918341",
  "title": "Marina Bay Residences",
  "price_sgd": 3200000,
  "price_psf": 2854,
  "district": "D01 Boat Quay / Raffles Place",
  "property_type": "Condominium",
  "beds": 3,
  "area_sqft": 1121,
  "tenure": "99-year Leasehold"
}

Complete list of extractable fields for Rental Listings objects from propertyguru.com.sg. All fields typed and schema-versioned.

Fields: listing_id, title, monthly_rent_sgd, deposit_sgd, availability_date, furnishing, beds, baths, area_sqft, property_type, district, agent_id, scraped_at
rental_listings · 200 OK
{
  "listing_id": "23948112",
  "title": "The Sail @ Marina Bay",
  "monthly_rent_sgd": 6500,
  "furnishing": "Fully Furnished",
  "beds": 2,
  "baths": 2,
  "area_sqft": 883,
  "availability_date": "2026-08-01",
  "district": "D01 Boat Quay / Raffles Place"
}

Complete list of extractable fields for Agent Profiles objects from propertyguru.com.sg. All fields typed and schema-versioned.

Fields: agent_id, name, agency_name, cea_reg_no, phone_number, active_sale_listings, active_rent_listings, rating, review_count, joined_date, profile_url, scraped_at
agent_profiles · 200 OK
{
  "agent_id": "84719",
  "name": "Jane Doe",
  "agency_name": "PropNex Realty Pte Ltd",
  "cea_reg_no": "R123456A",
  "phone_number": "+65 9123 4567",
  "active_sale_listings": 42,
  "active_rent_listings": 18,
  "rating": 4.9,
  "review_count": 128
}

Complete list of extractable fields for Condo Projects objects from propertyguru.com.sg. All fields typed and schema-versioned.

Fields: project_id, name, district, developer, top_year, total_units, tenure, facilities, avg_psf, lowest_price_sgd, highest_price_sgd, scraped_at
condo_projects · 200 OK
{
  "project_id": "P3921",
  "name": "Reflections at Keppel Bay",
  "district": "D04 Harbourfront / Telok Blangah",
  "developer": "Keppel Bay Pte Ltd",
  "top_year": 2011,
  "total_units": 1129,
  "tenure": "99-year Leasehold",
  "avg_psf": 1950
}

Complete list of extractable fields for Transaction History objects from propertyguru.com.sg. All fields typed and schema-versioned.

Fields: transaction_id, project_name, block_number, floor_level, area_sqft, price_sgd, price_psf, date_sold, property_type, tenure, scraped_at
transaction_history · 200 OK
{
  "transaction_id": "T918237",
  "project_name": "Pinnacle @ Duxton",
  "block_number": "1G",
  "floor_level": "41-45",
  "area_sqft": 1141,
  "price_sgd": 1350000,
  "price_psf": 1183,
  "date_sold": "2026-01-15"
}

Capabilities

Singapore real estate data, structured and ready

Our PropertyGuru scraper covers both the residential and commercial market: HDB resale flats, private condos, landed properties, commercial units, and agent directories.

HDB & Condo Data

Extract comprehensive details for public and private housing, including block numbers, floor levels, lease balances, and TOP dates.

PSF & Pricing Trends

Capture asking prices, historical transaction prices, and per-square-foot metrics to track district-level valuation changes.
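As a rough illustration of district-level PSF tracking, the sketch below computes a median price per square foot per district from listing records. It assumes the `price_sgd`, `area_sqft`, and `district` fields from the data dictionary; the helper names and the sample rows are hypothetical, and the short district codes are used only to keep the example compact.

```python
from collections import defaultdict
from statistics import median


def price_psf(price_sgd: int, area_sqft: int) -> float:
    """Price per square foot for a single listing."""
    return price_sgd / area_sqft


def median_psf_by_district(listings: list[dict]) -> dict[str, float]:
    """Group listings by district and return the median PSF for each."""
    buckets: dict[str, list[float]] = defaultdict(list)
    for row in listings:
        # Skip rows where price or area is missing or zero.
        if row.get("price_sgd") and row.get("area_sqft"):
            buckets[row["district"]].append(
                price_psf(row["price_sgd"], row["area_sqft"])
            )
    return {district: round(median(values), 2) for district, values in buckets.items()}


# Hypothetical sample rows, not real listings.
sample = [
    {"district": "D01", "price_sgd": 3_200_000, "area_sqft": 1121},
    {"district": "D01", "price_sgd": 2_000_000, "area_sqft": 1000},
    {"district": "D04", "price_sgd": 1_950_000, "area_sqft": 1000},
]
psf_by_district = median_psf_by_district(sample)
```

Running the same function over two scrape dates and comparing the results gives a simple view of district-level valuation changes.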

Agent CEA Verification

Extract agent names, phone numbers, agency affiliations, and CEA registration numbers across the entire broker directory.

Geo-Spatial & MRT Data

Capture proximity to MRT stations, primary schools, supermarkets, and expressways for every listing.

Floor Plans & Media

Extract high-resolution image URLs, virtual tour links, and floor plan images associated with listings and new launches.

Historical Transactions

Scrape URA-matched transaction data displayed on property pages to build accurate valuation models.

Commercial Real Estate

Extract data for offices, retail spaces, shophouses, and industrial properties with commercial-specific zoning fields.

Cross-Border Support

Support for PropertyGuru Malaysia and other regional domains using the same normalised data schema.

Scheduled Diffs

Run daily or weekly pipelines that only output new listings, price drops, or status changes, saving processing time.
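One way a diff-only run could work is to fingerprint each record and compare it with the previous run. The sketch below is a minimal illustration, not a description of the production pipeline: the choice of `TRACKED_FIELDS`, the function names, and the sample records are all assumptions.

```python
import hashlib
import json

# Hypothetical choice of fields whose changes count as a modification.
TRACKED_FIELDS = ("title", "price_sgd", "tenure")


def fingerprint(record: dict) -> str:
    """Stable SHA-256 hash over the tracked fields of one record."""
    payload = json.dumps({k: record.get(k) for k in TRACKED_FIELDS}, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def diff_runs(previous: dict[str, str], current: list[dict]) -> dict[str, list[str]]:
    """Compare stored fingerprints with the current run, keyed by listing_id."""
    seen: dict[str, str] = {}
    changes: dict[str, list[str]] = {"new": [], "modified": [], "delisted": []}
    for record in current:
        listing_id = record["listing_id"]
        seen[listing_id] = fingerprint(record)
        if listing_id not in previous:
            changes["new"].append(listing_id)
        elif previous[listing_id] != seen[listing_id]:
            changes["modified"].append(listing_id)
    # Anything present last run but absent now is treated as delisted.
    changes["delisted"] = sorted(set(previous) - set(seen))
    return changes


# Hypothetical records for two consecutive runs.
yesterday = [
    {"listing_id": "A1", "title": "Unit A", "price_sgd": 3_200_000},
    {"listing_id": "B2", "title": "Unit B", "price_sgd": 1_500_000},
]
today = [
    {"listing_id": "A1", "title": "Unit A", "price_sgd": 3_100_000},  # price drop
    {"listing_id": "C3", "title": "Unit C", "price_sgd": 900_000},    # new listing
]
previous = {r["listing_id"]: fingerprint(r) for r in yesterday}
changes = diff_runs(previous, today)
```

Only the fingerprints need to be stored between runs, which keeps the comparison cheap even for large inventories.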

// engagement pipeline

From search parameters to warehouse record

Brief in. Clean data out.

Define Scope
Day 0

Provide target districts, property types, or agent lists. We design the extraction schema together.

Pipeline Build
Days 2–4

We configure Scrapy and Playwright crawlers, handle Cloudflare Turnstile, and manage request limits for propertyguru.com.sg.

Validation & QA
Days 4–6

Schema validation, null-rate checks, and data normalisation before full launch.
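To make the QA step concrete, here is a minimal sketch of a type check and a null-rate check over a batch of sale listings. The expected types follow the sample record in the data dictionary; the function names and the sample batch are assumptions for illustration only.

```python
# Expected Python types for a subset of sale_listings fields (assumed from the sample record).
EXPECTED_TYPES = {"listing_id": str, "price_sgd": int, "beds": int, "area_sqft": int}


def type_errors(record: dict) -> list[str]:
    """Return a message for every non-null field whose type is unexpected."""
    errors = []
    for field, expected in EXPECTED_TYPES.items():
        value = record.get(field)
        if value is not None and not isinstance(value, expected):
            errors.append(
                f"{field}: expected {expected.__name__}, got {type(value).__name__}"
            )
    return errors


def null_rates(records: list[dict]) -> dict[str, float]:
    """Fraction of records where each expected field is missing or null."""
    total = len(records)
    if total == 0:
        return {}
    return {
        field: sum(1 for r in records if r.get(field) is None) / total
        for field in EXPECTED_TYPES
    }


# Hypothetical batch: one record has no bedroom count.
batch = [
    {"listing_id": "A1", "price_sgd": 3_200_000, "beds": 3, "area_sqft": 1121},
    {"listing_id": "B2", "price_sgd": 1_500_000, "beds": None, "area_sqft": 900},
    {"listing_id": "C3", "price_sgd": 900_000, "beds": 2, "area_sqft": 700},
    {"listing_id": "D4", "price_sgd": 2_100_000, "beds": 4, "area_sqft": 1400},
]
rates = null_rates(batch)
bad = type_errors({"listing_id": "E5", "price_sgd": "3.2M", "beds": 3, "area_sqft": 1000})
```

A pipeline could refuse to publish a batch when any null rate exceeds an agreed threshold or when `type_errors` returns anything.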

Delivery
Ongoing

JSON, CSV, or Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

Bypassing PropertyGuru anti-bot systems

PropertyGuru protects its listing database with Cloudflare and aggressive rate limiting. Here is how our infrastructure maintains continuous extraction.

pipeline-monitor · propertyguru.com.sg · live · active
// fingerprinting
Identity rotation
TLS fingerprint: randomised
User-agent: rotated
IP pool: residential
Challenges blocked: 0
// pagination
Page coverage
48,291 pages queued, running
// observability
Pipeline health
Uptime: 99.9%
p99 latency: 142ms
Null rate: 0.3%
Alerts: 2
WAF Evasion
Cloudflare Turnstile bypass

PropertyGuru uses Cloudflare to block automated traffic. We use Playwright with stealth plugins, realistic TLS fingerprints, and CAPTCHA solvers to bypass Turnstile challenges without triggering blocks.

Dynamic Content
Map hydration and lazy loading

Proximity data and map pins load dynamically via API calls after the initial page render. Our crawlers intercept these XHR responses directly to extract precise geo-coordinates and POI distances.
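The general idea of reading a background JSON response instead of parsing rendered map pins can be sketched with Playwright's `expect_response`. Everything site-specific here is an assumption: the `/nearby` URL fragment, the `pois` payload shape, and the function names are hypothetical placeholders, not PropertyGuru's actual API. Playwright is imported lazily so the parsing helpers can be used without it.

```python
def looks_like_poi_response(url: str, content_type: str) -> bool:
    """Match JSON responses from a (hypothetical) nearby-places endpoint."""
    return "/nearby" in url and "application/json" in content_type.lower()


def extract_pois(payload: dict) -> list[dict]:
    """Flatten an assumed {"pois": [...]} payload into typed rows."""
    return [
        {
            "name": poi.get("name"),
            "category": poi.get("category"),
            "lat": float(poi["lat"]),
            "lng": float(poi["lng"]),
            "distance_m": poi.get("distance_m"),
        }
        for poi in payload.get("pois", [])
        if "lat" in poi and "lng" in poi  # drop entries without coordinates
    ]


def capture_pois(listing_url: str) -> list[dict]:
    """Load a page and capture the first matching background JSON response."""
    from playwright.sync_api import sync_playwright  # third-party dependency

    with sync_playwright() as p:
        browser = p.chromium.launch()
        page = browser.new_page()
        # Wait for the first response that satisfies the predicate while navigating.
        with page.expect_response(
            lambda r: looks_like_poi_response(r.url, r.headers.get("content-type", ""))
        ) as response_info:
            page.goto(listing_url)
        payload = response_info.value.json()
        browser.close()
    return extract_pois(payload)


# Hypothetical payload used to exercise the parser without a browser.
sample_payload = {
    "pois": [
        {"name": "Example MRT", "category": "mrt", "lat": "1.2800", "lng": 103.85, "distance_m": 350},
        {"name": "No coordinates", "category": "school"},
    ]
}
pois = extract_pois(sample_payload)
```

Keeping the parsing separate from the browser code means the payload handling can be tested against saved responses.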

Rate Limiting
Phone number reveal management

Agent phone numbers require a click to reveal, triggering an API call that is strictly rate-limited per IP. We distribute these requests across a large pool of Singapore residential proxies to extract contact details at scale.

Pagination Limits
Deep crawling beyond 50 pages

Search results are capped at 50 pages. We split each search into granular queries by district, price bracket, and property type, so that every query returns fewer results than the cap and the combined result sets cover the active listing inventory.
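Query splitting of this kind can be sketched as a recursive bisection over a price range: keep halving a bracket until the number of matching results fits under the cap. The 50-page cap comes from the text above; the results-per-page figure, the `count_results` callback, and the uniform fake counter are assumptions made for the example.

```python
from typing import Callable

PAGE_CAP = 50                 # search pages available per query (from the text)
RESULTS_PER_PAGE = 20         # assumption for illustration only
MAX_RESULTS = PAGE_CAP * RESULTS_PER_PAGE


def split_price_range(
    lo: int, hi: int, count_results: Callable[[int, int], int]
) -> list[tuple[int, int]]:
    """Return half-open price brackets [lo, hi) that each fit under the cap."""
    # Stop when the bracket fits, or when it cannot be narrowed any further.
    if count_results(lo, hi) <= MAX_RESULTS or hi - lo <= 1:
        return [(lo, hi)]
    mid = (lo + hi) // 2
    return split_price_range(lo, mid, count_results) + split_price_range(
        mid, hi, count_results
    )


def fake_count(lo: int, hi: int) -> int:
    """Stand-in for a real result counter: one listing per SGD 1,000 of range."""
    return (hi - lo) // 1000


brackets = split_price_range(0, 4_000_000, fake_count)
```

In practice the same split would be applied within each district and property type, and a bracket that still exceeds the cap at minimum width would need a further dimension to split on.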

Data Normalisation
Standardising HDB and Condo fields

Public and private properties have different metadata structures. We normalise these into a unified schema, so that leasehold durations, block numbers, and developer details land in the same typed columns regardless of property type.
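A unified schema of this kind can be illustrated with a single dataclass that both HDB and condo records map into. The raw key names (`blk`, `developer`), the `UnifiedListing` shape, and the sample records below are hypothetical; only the tenure string format follows the sample records in the data dictionary.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class UnifiedListing:
    listing_id: str
    property_type: str
    block_number: Optional[str]   # typically present for HDB
    developer: Optional[str]      # typically present for condos
    tenure: str
    lease_years: Optional[int]    # None when no leading year count is found


def parse_lease_years(tenure: str) -> Optional[int]:
    """Extract the lease length from strings such as '99-year Leasehold'."""
    head = tenure.split("-", 1)[0]
    return int(head) if head.isdigit() else None


def normalise(raw: dict) -> UnifiedListing:
    """Map a raw HDB or condo record (assumed key names) into the unified shape."""
    return UnifiedListing(
        listing_id=str(raw["listing_id"]),
        property_type=raw["property_type"],
        block_number=raw.get("blk") or raw.get("block_number"),
        developer=raw.get("developer"),
        tenure=raw["tenure"],
        lease_years=parse_lease_years(raw["tenure"]),
    )


# Hypothetical raw records with different source structures.
hdb_raw = {"listing_id": 1001, "property_type": "HDB", "blk": "1G", "tenure": "99-year Leasehold"}
condo_raw = {
    "listing_id": "2002",
    "property_type": "Condominium",
    "developer": "Example Developer Pte Ltd",
    "tenure": "Freehold",
}
hdb, condo = normalise(hdb_raw), normalise(condo_raw)
```

Fields that do not apply to a property type stay `None` rather than being dropped, so queries across property types see a stable set of columns.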

Applications

Who uses PropertyGuru data and how

Teams across industries use propertyguru.com.sg data to build competitive products and smarter operations.

01
PropTech Valuations

Automated valuation models (AVMs) ingest asking prices and transaction history to generate instant property estimates.

02
Agency Competitor Analysis

Real estate agencies track competitor listings, agent performance, and market share across different districts.

03
REIT Yield Modelling

Institutional investors correlate rental asking prices with sale prices to calculate gross rental yields by project.
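Gross rental yield is annual rent divided by purchase price. The sketch below shows the arithmetic using the `monthly_rent_sgd` and `price_sgd` field names from the data dictionary; pairing a rent of SGD 6,500 with a price of SGD 3,200,000 is purely illustrative, since the two sample records above describe different projects.

```python
def gross_rental_yield(monthly_rent_sgd: float, price_sgd: float) -> float:
    """Annual asking rent as a fraction of asking price (before costs and vacancy)."""
    return monthly_rent_sgd * 12 / price_sgd


# Illustrative inputs only: 6,500 * 12 = 78,000; 78,000 / 3,200,000 = 0.024375
example_yield = gross_rental_yield(6500, 3_200_000)
example_label = f"{example_yield:.2%}"
```

For project-level figures, the same function would typically be applied to a median rent and a median price for comparable units within the project.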

04
Mortgage Lead Generation

Banks and brokers monitor new listings and TOP dates to target buyers and sellers with financing products.

05
Urban Planning

Researchers analyse property density, price heatmaps, and MRT proximity to inform infrastructure development.

06
Expat Relocation Services

Relocation firms aggregate rental availability and pricing to advise corporate clients on housing budgets.

Why DataFlirt

"PropertyGuru holds the definitive dataset for Singapore real estate, but turning web pages into queryable market intelligence requires resilient infrastructure."

Extracting data at scale from PropertyGuru means solving Cloudflare Turnstile, handling dynamic map hydration, and managing rate limits on phone number reveals. We build and maintain the extraction layer so your analysts can focus on yield modelling and valuation algorithms rather than proxy rotation.

Technical Spec

PropertyGuru scraper technical capabilities

Everything supported by our propertyguru.com.sg scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

Cloudflare bypass · Automated Turnstile resolution using Playwright stealth and CapSolver · Supported
Residential proxy rotation · ISP-grade residential IPs from Singapore pools to prevent geo-blocking · Supported
Phone number extraction · Automated interaction with the 'reveal phone' API endpoint · Supported
Floor plan images · Extraction of high-resolution floor plan image URLs · Supported
Historical transactions · Capture of URA transaction data embedded in condo project pages · Supported
Search pagination bypass · Algorithmic query splitting to bypass the 50-page search limit · Supported
Change detection (diffs) · Hash-based diff to only emit new, modified, or delisted properties · Supported
Cross-region support · Support for PropertyGuru Malaysia and Indonesia domains · Supported
Agent Lead Inbox · Access to private messages sent by buyers to agents · Partial
Owner Contact Details (Hidden) · Direct owner phone numbers not published on the public listing · Partial
Infrastructure

Infrastructure powering the PropertyGuru pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

Scrapy, Playwright, Python 3.12, Redis, PostgreSQL, Apache Airflow, AWS Lambda, S3, CloudWatch, 2Captcha, CapSolver, Residential Proxies, Docker, Kubernetes, Grafana, Prometheus
Scrapy and Playwright Stack

Scrapy handles crawl orchestration and deduplication. Playwright handles JavaScript rendering, Cloudflare challenges, and interaction flows like phone number reveals.

Singapore Proxy Infrastructure

We maintain pools of residential ISP proxies localised to Singapore. Rotation happens per-request to avoid rate limits on API endpoints.

Cloud-Native Orchestration

Pipelines run on AWS Lambda and ECS. Airflow handles scheduling and SLA alerting. All state is stored in managed PostgreSQL.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested array format
CSV
Flat file with typed columns for Excel or Sheets
Parquet
Columnar format for BigQuery, Snowflake, Athena
S3
Direct bucket delivery compatible with any data lake
Webhook
HTTP POST per record for real-time downstream processing
API
REST endpoint to query your extracted datasets
PostgreSQL
Upsert into your existing schema with conflict resolution
Snowflake
Stage and COPY INTO workflow for enterprise warehouses
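The upsert-with-conflict-resolution pattern mentioned for PostgreSQL can be sketched as follows. To stay runnable without a database server, the example uses Python's built-in `sqlite3` (SQLite 3.24 or later), whose `ON CONFLICT ... DO UPDATE` clause follows the PostgreSQL form; the table layout, the "newest `scraped_at` wins" rule, and the sample rows are assumptions.

```python
import sqlite3

# In-memory SQLite stands in for the client's PostgreSQL database.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sale_listings ("
    "listing_id TEXT PRIMARY KEY, price_sgd INTEGER, scraped_at TEXT)"
)

# Insert new rows; on a key collision, update only if the incoming row is newer.
UPSERT_SQL = """
INSERT INTO sale_listings (listing_id, price_sgd, scraped_at)
VALUES (:listing_id, :price_sgd, :scraped_at)
ON CONFLICT (listing_id) DO UPDATE SET
    price_sgd = excluded.price_sgd,
    scraped_at = excluded.scraped_at
WHERE excluded.scraped_at > sale_listings.scraped_at
"""


def upsert(rows: list[dict]) -> None:
    """Apply a batch of rows in one transaction."""
    with conn:
        conn.executemany(UPSERT_SQL, rows)


# Hypothetical rows for the same listing, arriving out of order.
upsert([{"listing_id": "A1", "price_sgd": 3_200_000, "scraped_at": "2026-01-02T00:00:00Z"}])
upsert([{"listing_id": "A1", "price_sgd": 3_300_000, "scraped_at": "2026-01-01T00:00:00Z"}])  # stale, ignored
upsert([{"listing_id": "A1", "price_sgd": 3_100_000, "scraped_at": "2026-01-03T00:00:00Z"}])  # newer, applied
final_rows = conn.execute("SELECT listing_id, price_sgd FROM sale_listings").fetchall()
```

ISO 8601 timestamps in a single time zone compare correctly as strings, which is why the `WHERE` clause can use a plain comparison here.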
// faq

Common questions.

About propertyguru.com.sg scraping, legality, and pipeline operations.

Is scraping PropertyGuru legal?

Scraping publicly available information is generally permissible. DataFlirt targets only public, non-authenticated real estate listings and agent profiles. We do not extract personal data beyond what agents publish for business contact. Clients should review PropertyGuru terms of service and consult legal counsel for specific use cases.

How do you handle Cloudflare Turnstile?

We use Playwright with stealth configurations, realistic browser fingerprints, and automated solvers to pass Turnstile challenges seamlessly without manual intervention.

Can you extract agent phone numbers?

Yes. Phone numbers require an interaction to reveal. Our crawlers simulate this interaction and capture the resulting API response, spreading requests across IPs so that each one stays under the per-IP rate limit.

How frequently is the data updated?

We can configure pipelines to run daily, weekly, or continuously. For continuous pipelines, we track new listings and price changes, delivering updates within hours.

Do you normalise HDB and Condo data?

Yes. Public HDB flats and private condominiums have different data structures on the site. We map these into a unified schema so you can query across all property types easily.

Can you bypass the 50-page search limit?

Yes. PropertyGuru limits search results to 50 pages. We bypass this by generating highly granular search queries based on micro-districts and price brackets to extract the full inventory.

Can I request a sample dataset?

Yes. We provide a sample run of up to 1,000 listings as part of the pre-engagement scoping process so you can validate schema fit and data quality.

$ dataflirt scope --new-project --source=propertyguru.com.sg ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two weeks. Whether you need a daily sync of all HDB resale listings or a continuous feed of private condo transactions, we build and operate the pipeline.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h