SYSTEM all green source toctoc.com queue 14,203 pages p99 latency 184ms dataflirt.com · scraper/toctoc-com
RUN · 41 active pipelines · toctoc.com live

Toctoc property data,
at warehouse scale.

We extract property listings, UF pricing histories, new developments, and agency intelligence from Toctoc. Delivered as clean JSON, CSV, or Parquet to S3 or BigQuery on your cadence.

Properties extracted
182K /run
Price updates
41K /24h
Agencies mapped
1,492
Active pipelines
41
Uptime
99.98%
Data Dictionary

Every field we extract from toctoc.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Listings objects from toctoc.com. All fields typed and schema-versioned.

property_idlisting_typeproperty_typetitledescriptionprice_ufprice_clpgastos_comunesbedroomsbathroomssurface_totalsurface_usefulcomunaregioncoordinatesimages
listings
● 200 OK
"property_id": "TOC-84921",
"listing_type": "sale",
"price_uf": 4500.0,
"price_clp": 168000000,
"bedrooms": 3,
"bathrooms": 2,
"surface_total": 120,
"comuna": "Providencia"
# property_idlisting_typeproperty_typetitledescriptionprice_uf
1
2
3

Complete list of extractable fields for New Developments objects from toctoc.com. All fields typed and schema-versioned.

project_idnamedeveloperstatusdelivery_dateprice_from_ufprice_to_ufunits_availableaddresscomunaamenitiesmodelscoordinateswebsite
new_developments
● 200 OK
"project_id": "PRJ-992",
"name": "Edificio Los Leones",
"developer": "Inmobiliaria Actual",
"status": "en blanco",
"price_from_uf": 3200,
"comuna": "Providencia",
"units_available": 45
# project_idnamedeveloperstatusdelivery_dateprice_from_uf
1
2
3

Complete list of extractable fields for Agency Data objects from toctoc.com. All fields typed and schema-versioned.

agency_idagency_namebroker_namephone_numberwhatsappemailactive_listings_countproperties_soldratingoffice_addresslogo_urlregistration_date
agency_data
● 200 OK
"agency_id": "AG-104",
"agency_name": "Engel & Volkers Chile",
"active_listings_count": 312,
"phone_number": "+56912345678",
"whatsapp": true,
"comuna": "Vitacura"
# agency_idagency_namebroker_namephone_numberwhatsappemail
1
2
3

Complete list of extractable fields for Pricing objects from toctoc.com. All fields typed and schema-versioned.

property_idcurrent_price_ufcurrent_price_clporiginal_price_ufprice_drop_pctdays_on_marketgastos_comunes_clpcontributions_clprent_yield_estimateprice_per_sqm_uf
pricing
● 200 OK
"property_id": "TOC-84921",
"current_price_uf": 4500.0,
"price_drop_pct": 5.2,
"days_on_market": 42,
"gastos_comunes_clp": 120000,
"price_per_sqm_uf": 45.5
# property_idcurrent_price_ufcurrent_price_clporiginal_price_ufprice_drop_pctdays_on_market
1
2
3

Complete list of extractable fields for Features objects from toctoc.com. All fields typed and schema-versioned.

property_idorientationfloor_numberparking_spotsstorage_unitsheating_typeyear_builtlatitudelongitudenear_metronear_schoolsneighborhood_score
features
● 200 OK
"property_id": "TOC-84921",
"orientation": "Nororiente",
"parking_spots": 1,
"storage_units": 1,
"year_built": 2018,
"near_metro": true,
"latitude": -33.42628
# property_idorientationfloor_numberparking_spotsstorage_unitsheating_type
1
2
3

Capabilities

Extract the entire Chilean property graph

Our Toctoc scraper handles heavy single-page application rendering, bypasses regional rate limits, and extracts clean UF pricing logic directly into your warehouse.

UF & CLP Pricing Extraction

Extract dual pricing structures. Track UF fluctuations, historical price drops, and normalise currency values across historical datasets.

New Development Tracking

Monitor 'en blanco', 'en verde', and 'entrega inmediata' project statuses. Capture developer details and unit availability.

Geolocation & Polygon Data

Extract exact latitude and longitude coordinates and polygon boundaries for spatial analysis directly from intercepted map APIs.

Agency Intelligence

Map broker portfolios, active listing counts, and contact information to understand agency market share per comuna.

Gastos Comunes & Taxes

Capture HOA fees and property tax estimates (contribuciones) for accurate yield modeling and investment analysis.

Property Amenities

Extract structured arrays for parking, storage (bodegas), heating types, and building facilities.

Historical Transaction Data

Access historical registry data surfaced by Toctoc to feed automated valuation models and appraisal algorithms.

Media Extraction

Capture high-resolution gallery URLs, floorplan images, and metadata for 360-degree virtual tours.

Scheduled Diffing

Run daily pipelines that only extract new listings and status changes to reduce compute and downstream processing load.

// engagement pipeline

From target comunas to warehouse tables

Brief in. Clean data out.

Define Scope
d 0

Provide target comunas, property types, or developer names. We design the extraction schema together.

Pipeline Build
d 2–4

We configure Playwright crawlers, proxy rotation, session management, and rate-limit handling for toctoc.com.

Validation & QA
d 4–6

Schema validation, null-rate checks, price-outlier detection, and coordinate verification before full launch.

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our Toctoc pipeline handles the hard parts

Chilean real estate portals employ strict rate limiting and dynamic rendering. Here is how we extract data reliably at scale.

pipeline-monitor · toctoc.com · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Anti-bot layer
Residential IP routing

Chilean ISP proxies bypass regional blocks and aggressive rate limits applied to datacenter IPs.

JavaScript rendering
SPA hydration

Toctoc uses heavy client-side rendering for maps and dynamic pricing. We execute full Playwright sessions to hydrate the DOM.

Coordinate extraction
Map API interception

We intercept background XHR requests to capture precise lat/long coordinates not exposed in the static HTML.

UF conversion
Currency normalisation

We extract both UF and CLP values, normalising historical prices against daily inflation indexes for accurate time-series analysis.

Change detection
Status diffing

We hash listing states to detect when a property moves from 'available' to 'reserved' or 'sold' without re-scraping the full payload.

Applications

Who uses Toctoc data

Teams across industries use toctoc.com data to build competitive products and smarter operations.

01
Property Valuation Models

Feed AVMs with historical transaction data, asking prices, and time-on-market metrics across comunas.

02
Yield & Investment Analysis

Calculate cap rates using rental asking prices versus sale prices and HOA fees.

03
Developer Competitor Tracking

Monitor competing 'en verde' projects, pricing tiers, and absorption rates.

04
Agency Market Share

Track broker listing volumes and exclusive mandates to map agency dominance.

05
Urban Planning & GIS

Overlay property coordinates with infrastructure data to model neighborhood gentrification.

06
Buyer Lead Generation

Identify stale listings and price drops to target motivated sellers and agencies.

Why DataFlirt

"Toctoc holds the definitive graph of Chilean real estate, but accessing that data programmatically requires navigating heavy SPA rendering and aggressive rate limits."

Extracting accurate property data requires more than a simple HTTP client. You need Chilean residential proxies, full JavaScript execution to hydrate map coordinates, and reliable diffing logic to track UF price changes. DataFlirt handles the extraction so your team can focus on valuation models.

Technical Spec

Toctoc scraper — technical capabilities

Everything supported by our toctoc.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

JavaScript rendering
Full Playwright sessions for SPA content
Supported
Residential proxy rotation
Chilean ISP IPs rotated to avoid rate limits
Supported
Map coordinate extraction
Lat/long extracted from XHR interception
Supported
UF/CLP price tracking
Dual currency extraction and normalisation
Supported
Historical transaction data
Public registry data surfaced on listings
Supported
Project status tracking
En blanco, en verde, entrega inmediata
Supported
Change detection
Hash-based diffs for status updates
Supported
Broker contact details
Phone and email extraction
Supported
User saved searches
Requires user account credentials
Partial
Private messaging history
Broker-to-buyer chat logs
Partial
Infrastructure

Infrastructure powering the Toctoc pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Scrapy + Playwright Stack

Scrapy handles crawl orchestration. Playwright handles SPA rendering and map hydration.

LatAm Proxy Infrastructure

We maintain pools of residential ISP proxies across Chile to bypass regional blocks.

Cloud-Native Orchestration

Pipelines run on AWS ECS. Airflow handles scheduling. All state stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested
CSV
Flat file with typed columns
XLS
Excel compatible export
Parquet
Columnar format for analytics
AWS S3
Direct bucket delivery
Webhook
HTTP POST per record
API
REST endpoints for querying
BigQuery
Streamed directly into your dataset
Snowflake
Stage and copy workflow
PostgreSQL
Upsert into existing schema
S3
Direct bucket delivery — compatible with any data lake
// faq

Common questions.

About toctoc.com scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Toctoc legal?

Public property data is generally permissible to scrape. We target non-authenticated listings and do not extract private user data.

How do you handle Toctoc's map rendering?

We intercept XHR requests during Playwright sessions to capture the exact JSON payloads containing polygon and coordinate data.

Can you extract both UF and CLP prices?

Yes. We capture the listed UF price, the CLP equivalent, and track historical price drops over time.

How fresh is the data?

We can run daily delta pipelines to capture new listings, price changes, and properties marked as sold within 24 hours.

Do you extract new development projects?

Yes. We track 'proyectos' including delivery dates, unit availability, and developer information.

How do you handle pagination limits?

We segment the crawl space by granular geographical filters (comunas, barrios) and price brackets to ensure deep extraction without hitting search limits.

$ dataflirt scope --new-project --source=toctoc.com ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off dump of Santiago properties or a continuous feed of UF price changes across Chile — we scope, build, and operate the pipeline.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →