Metrocuadrado data,
at warehouse scale.

We extract property listings, price histories, strata details, and agency portfolios from Metrocuadrado. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Properties extracted: 138K per run
Price updates: 12.4K per 24h
New developments: 1,842 active
Active pipelines: 31
Uptime: 99.94%

Data Dictionary

Every field we extract from metrocuadrado.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Property Listings objects from metrocuadrado.com. All fields typed and schema-versioned.

property_id, title, property_type, transaction_type, price, admin_fee, city, neighborhood, stratum, area_built, area_private, bedrooms, bathrooms, parking_spaces, property_age, description, url

Sample Property Listings record:
"property_id": "MTC-849201",
"title": "Apartamento en Venta en Chico Norte",
"property_type": "Apartamento",
"transaction_type": "Venta",
"price": 850000000.0,
"admin_fee": 650000.0,
"city": "Bogotá",
"neighborhood": "Chicó Norte",
"stratum": 6,
"area_built": 112.5,
"bedrooms": 3

Complete list of extractable fields for New Developments objects from metrocuadrado.com. All fields typed and schema-versioned.

project_id, project_name, developer_name, status, delivery_date, min_price, max_price, total_units, available_units, city, sector, amenities, sales_room_address

Sample New Developments record:
"project_id": "PRJ-9921",
"project_name": "Torre Central Park",
"developer_name": "Constructora Bolivar",
"status": "En Planos",
"delivery_date": "2026-11-01",
"min_price": 420000000.0,
"city": "Medellín",
"sector": "El Poblado",
"available_units": 45

Complete list of extractable fields for Pricing & Valuation objects from metrocuadrado.com. All fields typed and schema-versioned.

property_id, current_price, original_price, price_per_sqm, admin_fee, discount_pct, days_on_market, price_history, last_updated, currency

Sample Pricing & Valuation record:
"property_id": "MTC-849201",
"current_price": 850000000.0,
"original_price": 890000000.0,
"price_per_sqm": 7555555.55,
"admin_fee": 650000.0,
"discount_pct": 4.5,
"days_on_market": 42,
"last_updated": "2026-05-12T14:30:00Z"
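
The derived fields in this sample can be roughly reproduced from the raw values. The sketch below assumes that `price_per_sqm` is `current_price` divided by `area_built` (112.5 in the Property Listings sample) and that `discount_pct` is the percentage drop from `original_price`. The pipeline's actual formulas and rounding are not documented here, and both helper names are hypothetical.

```python
def price_per_sqm(current_price: float, area_built: float) -> float:
    """Price divided by built area (assumed definition)."""
    if area_built <= 0:
        raise ValueError("area_built must be positive")
    return current_price / area_built


def discount_pct(original_price: float, current_price: float) -> float:
    """Percentage drop from the original asking price (assumed definition)."""
    if original_price <= 0:
        raise ValueError("original_price must be positive")
    return (original_price - current_price) / original_price * 100


# Values taken from the sample records on this page.
ppsqm = price_per_sqm(850_000_000.0, 112.5)
disc = discount_pct(890_000_000.0, 850_000_000.0)
print(round(ppsqm, 2), round(disc, 1))
```

Under these assumptions the results agree with the sample values to within rounding.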

Complete list of extractable fields for Agency & Broker objects from metrocuadrado.com. All fields typed and schema-versioned.

agency_id, agency_name, contact_phone, whatsapp_number, active_listings_count, properties_for_sale, properties_for_rent, average_listing_price, profile_url

Sample Agency & Broker record:
"agency_id": "AGN-442",
"agency_name": "Inmobiliaria Bogota Premium",
"contact_phone": "+573104567890",
"whatsapp_number": "+573104567890",
"active_listings_count": 124,
"properties_for_sale": 89,
"properties_for_rent": 35,
"profile_url": "https://www.metrocuadrado.com/inmobiliaria/bogota-premium"

Complete list of extractable fields for Location & Amenities objects from metrocuadrado.com. All fields typed and schema-versioned.

property_id, latitude, longitude, address, neighborhood, city, department, building_amenities, neighborhood_amenities, nearby_transit, nearby_parks

Sample Location & Amenities record:
"property_id": "MTC-849201",
"latitude": 4.6758,
"longitude": -74.0483,
"neighborhood": "Chicó Norte",
"city": "Bogotá",
"department": "Cundinamarca",
"building_amenities": "['Gimnasio', 'Piscina', 'Seguridad 24/7']",
"nearby_transit": "['Estación Transmilenio Calle 100']"

Capabilities

Colombian real estate data — structured and normalised

Our Metrocuadrado scraper handles map-based pagination, dynamic pricing widgets, and contact detail rendering — circumventing bot protections to deliver clean property records.

Full Property Extraction

Extract title, description, built area, private area, bedrooms, bathrooms, parking spaces, and property age across all residential and commercial listing types.

Price & Admin Fee Tracking

Capture sale price, rental rates, and administration fees (valor administración) to calculate true monthly carrying costs.

Strata & Zoning Data

Extract the socioeconomic stratum (estrato 1-6) critical for utility cost estimation and property valuation models in Colombia.

New Development Tracking

Monitor 'Proyectos Nuevos' including developer details, delivery timelines, available units, and price ranges for off-plan investments.

Agency Intelligence

Extract broker names, agency affiliations, and active portfolio sizes to identify top-performing real estate agents by sector.

Map Coordinate Extraction

Capture exact latitude and longitude coordinates embedded in listing map widgets for precise spatial analysis.

Historical Market Days

Track when listings first appeared and when they are delisted to calculate average days on market (DOM) by neighborhood.

Contact Obfuscation Bypass

Execute JavaScript to render gated WhatsApp numbers and contact forms typically hidden behind user interaction walls.

Scheduled Diffing

Run daily or weekly pipelines that only emit new listings, price drops, or status changes, minimising redundant data processing.
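
One common way to emit only changed records is to hash each record and compare it with the hash stored from the previous run. The sketch below illustrates that idea and is not the production implementation. The `scraped_at` field and the function names are hypothetical.

```python
import hashlib
import json


def record_hash(record: dict, ignore: tuple[str, ...] = ("scraped_at",)) -> str:
    """Stable SHA-256 over the record's fields, skipping volatile ones."""
    stable = {k: v for k, v in record.items() if k not in ignore}
    payload = json.dumps(stable, sort_keys=True, ensure_ascii=False)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def diff_run(previous: dict[str, str], current: list[dict], key: str = "property_id"):
    """Return (to_emit, new_hashes); only new or changed records are emitted."""
    to_emit, new_hashes = [], {}
    for record in current:
        digest = record_hash(record)
        new_hashes[record[key]] = digest
        if previous.get(record[key]) != digest:
            to_emit.append(record)
    return to_emit, new_hashes


run_1 = [{"property_id": "MTC-849201", "price": 890000000.0, "scraped_at": "d1"}]
run_2 = [{"property_id": "MTC-849201", "price": 850000000.0, "scraped_at": "d2"}]
run_3 = [{"property_id": "MTC-849201", "price": 850000000.0, "scraped_at": "d3"}]

emitted_1, hashes = diff_run({}, run_1)      # new listing
emitted_2, hashes = diff_run(hashes, run_2)  # price drop
emitted_3, hashes = diff_run(hashes, run_3)  # nothing changed
```

Leaving the scrape timestamp out of the hash keeps an unchanged listing from looking modified on every run.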

// engagement pipeline

From search parameters to warehouse record

Brief in. Clean data out.

Define Scope
Day 0

Provide target cities, neighborhoods, property types, or agency profiles. We design the extraction schema together.

Pipeline Build
Days 2–4

We configure Scrapy / Playwright crawlers, proxy rotation, session management, and CAPTCHA handling for metrocuadrado.com.

Validation & QA
Days 4–6

Schema validation, null-rate checks, price-outlier detection, and coordinate verification before full launch.

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.
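
The checks named in the Validation & QA step could look roughly like the sketch below. The required-field list, the bounding box for Colombia, and the IQR-based outlier rule are all assumptions made for illustration, not the pipeline's documented thresholds.

```python
import statistics

REQUIRED = ("property_id", "price", "city", "stratum")
# Rough bounding box for Colombia including island territories (assumed values).
LAT_RANGE = (-4.5, 13.5)
LON_RANGE = (-82.0, -66.5)


def validate_record(record: dict) -> list[str]:
    """Return a list of problems; an empty list means the record passes."""
    problems = [f"missing {f}" for f in REQUIRED if record.get(f) is None]
    stratum = record.get("stratum")
    if stratum is not None and stratum not in range(1, 7):
        problems.append("stratum outside 1-6")
    lat, lon = record.get("latitude"), record.get("longitude")
    if lat is not None and lon is not None:
        in_lat = LAT_RANGE[0] <= lat <= LAT_RANGE[1]
        in_lon = LON_RANGE[0] <= lon <= LON_RANGE[1]
        if not (in_lat and in_lon):
            problems.append("coordinates outside Colombia")
    return problems


def null_rate(records: list[dict], field: str) -> float:
    """Share of records where the field is missing or None."""
    if not records:
        return 0.0
    return sum(r.get(field) is None for r in records) / len(records)


def price_outliers(prices: list[float], k: float = 1.5) -> list[float]:
    """Flag prices outside the Tukey fences (quartiles -/+ k * IQR)."""
    q1, _, q3 = statistics.quantiles(prices, n=4)
    low, high = q1 - k * (q3 - q1), q3 + k * (q3 - q1)
    return [p for p in prices if p < low or p > high]


good = {"property_id": "MTC-849201", "price": 850000000.0, "city": "Bogotá",
        "stratum": 6, "latitude": 4.6758, "longitude": -74.0483}
bad = dict(good, stratum=9, latitude=40.4, longitude=-3.7)
prices = [400e6, 420e6, 450e6, 480e6, 500e6, 850e6, 9000e6]
```

In practice outlier fences would be computed per city and property type, since a single national distribution mixes very different markets.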

Under the hood

How our Metrocuadrado pipeline handles the hard parts

Real estate portals use aggressive rate limiting and dynamic rendering to protect listing data. Here is how we maintain reliable extraction.

pipeline-monitor · metrocuadrado.com · live ● active
// fingerprinting
Identity rotation
TLS fingerprint: randomised
User-agent: rotated
IP pool: residential
Challenges blocked: 0
// pagination
Page coverage
48,291 pages queued (running)
// observability
Pipeline health
Uptime: 99.9%
p99 latency: 142ms
Null rate: 0.3%
Alerts: 2

Pagination limits
Map-based search grid traversal

Metrocuadrado limits standard search pagination to a fixed number of results. We bypass this by programmatically dividing city maps into bounding box grids, extracting complete datasets without hitting pagination ceilings.
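
The grid idea can be sketched as a recursive subdivision: split a bounding box into quarters until every tile reports fewer results than the pagination ceiling. In this sketch `count_in` is a hypothetical callback standing in for "how many listings does the portal report for this box"; the cap, the box coordinates, and the synthetic points are assumptions for the demo.

```python
from typing import Callable

BBox = tuple[float, float, float, float]  # (south, west, north, east)


def split_bbox(bbox: BBox, rows: int = 2, cols: int = 2) -> list[BBox]:
    """Partition a box into rows x cols tiles that share exact edges."""
    south, west, north, east = bbox
    lat_edges = [south + r * (north - south) / rows for r in range(rows)] + [north]
    lon_edges = [west + c * (east - west) / cols for c in range(cols)] + [east]
    return [
        (lat_edges[r], lon_edges[c], lat_edges[r + 1], lon_edges[c + 1])
        for r in range(rows)
        for c in range(cols)
    ]


def tiles_under_cap(bbox: BBox, count_in: Callable[[BBox], int],
                    cap: int, max_depth: int = 8) -> list[BBox]:
    """Quarter a box recursively until each tile holds at most `cap` results."""
    n = count_in(bbox)
    if n == 0:
        return []  # nothing to fetch in an empty tile
    if n <= cap or max_depth == 0:
        return [bbox]
    tiles: list[BBox] = []
    for child in split_bbox(bbox):
        tiles.extend(tiles_under_cap(child, count_in, cap, max_depth - 1))
    return tiles


# Synthetic listings along a diagonal, used only to exercise the function.
points = [(4.5 + 0.003 * i, -74.2 + 0.002 * i) for i in range(100)]


def count_in(bbox: BBox) -> int:
    south, west, north, east = bbox
    # Half-open intervals so a point on a shared edge is counted once.
    return sum(south <= lat < north and west <= lon < east for lat, lon in points)


tiles = tiles_under_cap((4.4, -74.3, 4.9, -73.9), count_in, cap=10)
```

Dense areas end up with many small tiles and sparse areas with a few large ones, which keeps the number of search requests proportional to listing density.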

Contact masking
JavaScript execution for WhatsApp rendering

Broker phone numbers and WhatsApp links are dynamically injected via JavaScript upon user click. Our Playwright instances simulate these interactions to capture the unmasked contact data for agency analysis.

Anti-bot layer
Colombian residential proxies

Requests originating outside Latin America often face severe rate limits or CAPTCHA walls. We route traffic through Colombian residential ISP proxies to mimic local buyer traffic patterns and maintain high throughput.

Data standardisation
Normalised amenity and area fields

Broker input is notoriously messy. We normalise text fields, separate built area from private area, and map unstructured amenity descriptions into clean boolean arrays for immediate database ingestion.
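
As an illustration of the amenity mapping, the sketch below turns free text or a stringified list (as in the `building_amenities` sample) into boolean flags. The keyword vocabulary and flag names are hypothetical, and a real mapping would be far larger.

```python
import ast
import unicodedata

# Hypothetical keyword -> flag mapping, for illustration only.
AMENITY_KEYWORDS = {
    "gimnasio": "has_gym",
    "piscina": "has_pool",
    "seguridad": "has_security",
    "vigilancia": "has_security",
    "ascensor": "has_elevator",
}


def fold(text: str) -> str:
    """Lowercase and strip accents so accented and plain spellings match."""
    decomposed = unicodedata.normalize("NFKD", text.lower())
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def parse_amenities(raw) -> list[str]:
    """Accept a list, a stringified list, or comma-separated free text."""
    if isinstance(raw, list):
        return [str(x) for x in raw]
    text = (raw or "").strip()
    if text.startswith("["):
        try:
            parsed = ast.literal_eval(text)
            if isinstance(parsed, list):
                return [str(x) for x in parsed]
        except (ValueError, SyntaxError):
            pass  # fall through and treat it as free text
    return [p.strip() for p in text.replace(" y ", ",").split(",") if p.strip()]


def amenity_flags(raw) -> dict[str, bool]:
    """Map raw amenity input to one boolean per known flag."""
    flags = {flag: False for flag in AMENITY_KEYWORDS.values()}
    for item in parse_amenities(raw):
        folded = fold(item)
        for keyword, flag in AMENITY_KEYWORDS.items():
            if keyword in folded:
                flags[flag] = True
    return flags


from_sample = amenity_flags("['Gimnasio', 'Piscina', 'Seguridad 24/7']")
from_text = amenity_flags("Piscina climatizada, ASCENSOR y vigilancia privada")
```

Every record gets the same set of keys, so the flags load into fixed boolean columns without per-row schema handling.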

State tracking
Listing lifecycle monitoring

We maintain state across pipeline runs to detect when a property price changes or when it is removed from the market, providing accurate signals for market velocity rather than just static snapshots.
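
A minimal sketch of cross-run state tracking follows, assuming state is a dictionary keyed by property ID and that days on market is measured from first to last sighting. Both the state layout and the event names are assumptions made for illustration.

```python
from datetime import date


def apply_run(state: dict, listings: dict[str, float], run_date: date) -> list[tuple]:
    """Update per-listing state in place and return lifecycle events.

    state maps property_id -> {"first_seen", "last_seen", "price", "active"}.
    listings maps property_id -> price observed in this run.
    """
    events = []
    for pid, price in listings.items():
        entry = state.get(pid)
        if entry is None:
            state[pid] = {"first_seen": run_date, "last_seen": run_date,
                          "price": price, "active": True}
            events.append(("new", pid, price))
            continue
        if price != entry["price"]:
            events.append(("price_change", pid, entry["price"], price))
        entry.update(last_seen=run_date, price=price, active=True)
    for pid, entry in state.items():
        if entry["active"] and pid not in listings:
            entry["active"] = False
            days_on_market = (entry["last_seen"] - entry["first_seen"]).days
            events.append(("delisted", pid, days_on_market))
    return events


state: dict = {}
e1 = apply_run(state, {"MTC-849201": 890000000.0}, date(2026, 3, 31))
e2 = apply_run(state, {"MTC-849201": 850000000.0}, date(2026, 5, 12))
e3 = apply_run(state, {}, date(2026, 5, 13))
```

A listing missing from a single run may only reflect a failed page fetch, so a production version would likely wait for several consecutive misses before emitting a delisting.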

Applications

Who uses Metrocuadrado data — and how

Teams across industries use metrocuadrado.com data to build competitive products and smarter operations.

01
Valuation Models (AVM)

PropTech firms ingest strata, area, and pricing data to train automated valuation models for the Colombian market.

02
Market Analytics

Real estate funds track price-per-square-meter trends across Bogotá and Medellín neighborhoods to time acquisitions.

03
Investment Sourcing

Investors monitor price drops and days-on-market metrics to identify distressed assets or highly motivated sellers.

04
Agency Benchmarking

Brokerages analyse competitor portfolios, average listing prices, and market share by sector to optimise their sales strategies.

05
Urban Planning

Consultancies correlate new development concentration with stratum changes to forecast urban density and infrastructure needs.

06
PropTech Aggregation

Mortgage brokers and insurance providers use listing data to generate pre-approved financial products based on property value.

Why DataFlirt

"Metrocuadrado holds the most comprehensive record of Colombian property values — but extracting accurate historical pricing requires continuous, resilient pipeline infrastructure."

Most teams underestimate the complexity of scraping Colombian real estate portals. Handling map-based dynamic loading, contact obfuscation, and inconsistent agent formatting requires dedicated engineering. DataFlirt absorbs that complexity so your analysts can focus on market trends — not DOM changes.

Technical Spec

Metrocuadrado scraper — technical capabilities

Everything supported by our metrocuadrado.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

Capability · Detail · Status
JavaScript rendering · Full Playwright sessions, required for map widgets and contact unmasking · Supported
CAPTCHA bypass · Automated 2Captcha + CapSolver integration for DataDome/Cloudflare walls · Supported
Colombian residential IPs · ISP-grade residential IPs from CO pools, rotated per request · Supported
Historical price tracking · Price changes captured per run; historical time-series available · Supported
Map coordinate extraction · Lat/long pairs extracted from listing map data payloads · Supported
WhatsApp number rendering · Automated click-to-reveal execution for broker contact details · Supported
Change detection (diffs) · Hash-based diff that emits only records with changed fields since the last run · Supported
Webhook delivery · HTTP POST per record or batch, useful for real-time alerting · Supported
Agent private dashboard metrics · Gated lead generation data and internal broker analytics · Partial
Saved searches / User favorites · Consumer account-specific saved properties and search alerts · Partial

Infrastructure

Infrastructure powering the Metrocuadrado pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

Scrapy, Playwright, Python 3.12, Redis, PostgreSQL, Apache Airflow, AWS Lambda, S3, CloudWatch, 2Captcha, CapSolver, Residential Proxies, Docker, Kubernetes, Grafana, Prometheus

Scrapy + Playwright Stack

Scrapy handles crawl orchestration, deduplication, and retry logic. Playwright handles JavaScript rendering, cookie sessions, and interaction flows. The two are combined via the scrapy-playwright download handler.
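
For orientation, scrapy-playwright is wired into a Scrapy project through its download handler and the asyncio reactor. The fragment below shows the standard settings from the library's documentation. The browser type and retry count are assumed choices for this sketch, not documented pipeline values.

```python
# settings.py (fragment): route requests through Playwright via scrapy-playwright.
DOWNLOAD_HANDLERS = {
    "http": "scrapy_playwright.handler.ScrapyPlaywrightDownloadHandler",
    "https": "scrapy_playwright.handler.ScrapyPlaywrightDownloadHandler",
}
# scrapy-playwright requires the asyncio-based Twisted reactor.
TWISTED_REACTOR = "twisted.internet.asyncioreactor.AsyncioSelectorReactor"

# Assumed choices for this sketch.
PLAYWRIGHT_BROWSER_TYPE = "chromium"
RETRY_TIMES = 3
```

Individual requests then opt in to browser rendering with `meta={"playwright": True}`, so plain HTTP fetches stay on Scrapy's regular downloader.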

Latam Proxy Infrastructure

We maintain pools of residential ISP proxies specifically targeting Colombian regions. Rotation happens per-request with sticky sessions where required to prevent geo-blocking.

Cloud-Native Orchestration

Pipelines run on AWS Lambda (burst) and ECS (sustained). Airflow handles scheduling, dependency management, and SLA alerting. All state stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON · Newline-delimited or nested, schema versioned per run
CSV · Flat file with typed columns, Excel/Sheets compatible
XLS · Excel format for non-technical analyst teams
Parquet · Columnar format for BigQuery, Snowflake, Athena
AWS S3 · Direct bucket delivery, compatible with any data lake
Webhook · HTTP POST per record for real-time downstream processing
API · REST endpoints to query your extracted Metrocuadrado dataset
PostgreSQL · Upsert into your existing schema with conflict resolution
// faq

Common questions.

About metrocuadrado.com scraping, legality, and pipeline operations.

Is scraping Metrocuadrado legal?

Scraping publicly available real estate listings is generally permissible. DataFlirt targets only public, non-authenticated property and agency data. We do not extract personal user data or circumvent authentication walls. Clients should review portal ToS and consult legal counsel for specific commercial use cases.

How do you handle pagination limits on large city searches?

Metrocuadrado restricts standard list pagination. We circumvent this by programmatically dividing city maps into smaller coordinate bounding boxes, extracting all properties within each grid square to ensure 100% coverage without hitting result ceilings.

Can you extract WhatsApp numbers and broker details?

Yes. While Metrocuadrado masks these details behind click events, our Playwright instances simulate user interaction to render and extract the underlying phone numbers and WhatsApp links for active listings.

Do you capture strata (estrato) and administration fees?

Yes. We extract the stratum (1-6) and the exact administration fee (valor administración), which are critical fields for accurate property valuation and carrying cost calculations in Colombia.

How fresh is the data?

We can configure pipelines to run daily, weekly, or monthly. A full sweep of a major city like Bogotá typically completes within 8-12 hours. We use change-detection to only push updated or new records to your warehouse.

Can you track historical price changes?

Yes. Every pipeline run produces timestamped snapshots. We maintain a time-series record per property ID, allowing you to track price drops, price increases, and total days on market.

What is the minimum viable engagement?

Our smallest packages start at a defined geographic scope (e.g., all listings in specific Bogotá zones) with weekly delivery. Contact us with your target volume and frequency for a scoped quote.

$ dataflirt scope --new-project --source=metrocuadrado.com ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off Bogotá market dump or continuous price-monitoring across Colombia — we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h