We extract property listings, UF pricing histories, new developments, and agency intelligence from Toctoc. Delivered as clean JSON, CSV, or Parquet to S3 or BigQuery on your cadence.
Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.
Complete list of extractable fields for Listings objects from toctoc.com. All fields typed and schema-versioned.
"property_id": "TOC-84921", "listing_type": "sale", "price_uf": 4500.0, "price_clp": 168000000, "bedrooms": 3, "bathrooms": 2, "surface_total": 120, "comuna": "Providencia"
| # | property_id | listing_type | property_type | title | description | price_uf |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for New Developments objects from toctoc.com. All fields typed and schema-versioned.
"project_id": "PRJ-992", "name": "Edificio Los Leones", "developer": "Inmobiliaria Actual", "status": "en blanco", "price_from_uf": 3200, "comuna": "Providencia", "units_available": 45
| # | project_id | name | developer | status | delivery_date | price_from_uf |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Agency Data objects from toctoc.com. All fields typed and schema-versioned.
"agency_id": "AG-104", "agency_name": "Engel & Volkers Chile", "active_listings_count": 312, "phone_number": "+56912345678", "whatsapp": true, "comuna": "Vitacura"
| # | agency_id | agency_name | broker_name | phone_number | ||
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Pricing objects from toctoc.com. All fields typed and schema-versioned.
"property_id": "TOC-84921", "current_price_uf": 4500.0, "price_drop_pct": 5.2, "days_on_market": 42, "gastos_comunes_clp": 120000, "price_per_sqm_uf": 45.5
| # | property_id | current_price_uf | current_price_clp | original_price_uf | price_drop_pct | days_on_market |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Features objects from toctoc.com. All fields typed and schema-versioned.
"property_id": "TOC-84921", "orientation": "Nororiente", "parking_spots": 1, "storage_units": 1, "year_built": 2018, "near_metro": true, "latitude": -33.42628
| # | property_id | orientation | floor_number | parking_spots | storage_units | heating_type |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Our Toctoc scraper handles heavy single-page application rendering, bypasses regional rate limits, and extracts clean UF pricing logic directly into your warehouse.
Extract dual pricing structures. Track UF fluctuations, historical price drops, and normalise currency values across historical datasets.
Monitor 'en blanco', 'en verde', and 'entrega inmediata' project statuses. Capture developer details and unit availability.
Extract exact latitude and longitude coordinates and polygon boundaries for spatial analysis directly from intercepted map APIs.
Map broker portfolios, active listing counts, and contact information to understand agency market share per comuna.
Capture HOA fees and property tax estimates (contribuciones) for accurate yield modeling and investment analysis.
Extract structured arrays for parking, storage (bodegas), heating types, and building facilities.
Access historical registry data surfaced by Toctoc to feed automated valuation models and appraisal algorithms.
Capture high-resolution gallery URLs, floorplan images, and metadata for 360-degree virtual tours.
Run daily pipelines that only extract new listings and status changes to reduce compute and downstream processing load.
Brief in. Clean data out.
Provide target comunas, property types, or developer names. We design the extraction schema together.
We configure Playwright crawlers, proxy rotation, session management, and rate-limit handling for toctoc.com.
Schema validation, null-rate checks, price-outlier detection, and coordinate verification before full launch.
JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.
Chilean real estate portals employ strict rate limiting and dynamic rendering. Here is how we extract data reliably at scale.
Chilean ISP proxies bypass regional blocks and aggressive rate limits applied to datacenter IPs.
Toctoc uses heavy client-side rendering for maps and dynamic pricing. We execute full Playwright sessions to hydrate the DOM.
We intercept background XHR requests to capture precise lat/long coordinates not exposed in the static HTML.
We extract both UF and CLP values, normalising historical prices against daily inflation indexes for accurate time-series analysis.
We hash listing states to detect when a property moves from 'available' to 'reserved' or 'sold' without re-scraping the full payload.
Feed AVMs with historical transaction data, asking prices, and time-on-market metrics across comunas.
Calculate cap rates using rental asking prices versus sale prices and HOA fees.
Monitor competing 'en verde' projects, pricing tiers, and absorption rates.
Track broker listing volumes and exclusive mandates to map agency dominance.
Overlay property coordinates with infrastructure data to model neighborhood gentrification.
Identify stale listings and price drops to target motivated sellers and agencies.
"Toctoc holds the definitive graph of Chilean real estate, but accessing that data programmatically requires navigating heavy SPA rendering and aggressive rate limits."
Extracting accurate property data requires more than a simple HTTP client. You need Chilean residential proxies, full JavaScript execution to hydrate map coordinates, and reliable diffing logic to track UF price changes. DataFlirt handles the extraction so your team can focus on valuation models.
Everything supported by our toctoc.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.
Open-source tooling on proven cloud infra — no vendor lock-in, full observability.
Scrapy handles crawl orchestration. Playwright handles SPA rendering and map hydration.
We maintain pools of residential ISP proxies across Chile to bypass regional blocks.
Pipelines run on AWS ECS. Airflow handles scheduling. All state stored in managed Postgres.
Data delivered to where your team already works — no new tooling required.
About toctoc.com scraping, legality, and pipeline operations.
Ask us directly →Public property data is generally permissible to scrape. We target non-authenticated listings and do not extract private user data.
We intercept XHR requests during Playwright sessions to capture the exact JSON payloads containing polygon and coordinate data.
Yes. We capture the listed UF price, the CLP equivalent, and track historical price drops over time.
We can run daily delta pipelines to capture new listings, price changes, and properties marked as sold within 24 hours.
Yes. We track 'proyectos' including delivery dates, unit availability, and developer information.
We segment the crawl space by granular geographical filters (comunas, barrios) and price brackets to ensure deep extraction without hitting search limits.
20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off dump of Santiago properties or a continuous feed of UF price changes across Chile — we scope, build, and operate the pipeline.