SYSTEM all green source fotocasa.es queue 12,491 pages p99 latency 218ms dataflirt.com · scraper/fotocasa-es
RUN - 31 active pipelines - fotocasa.es live

Fotocasa data,
at warehouse scale.

We extract property listings, price variations, energy ratings, and agency portfolios from Fotocasa. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Properties extracted
341K /day
Price updates
42K /24h
Agency profiles
18K /run
Active pipelines
31
Uptime
99.94%
Data Dictionary

Every field we extract from fotocasa.es

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Residential Sales objects from fotocasa.es. All fields typed and schema-versioned.

property_idtitlepricecurrencyroomsbathroomssurface_m2floorelevatorenergy_ratinglocationagency_idimage_urlsscraped_at
residential_sales
● 200 OK
"property_id": "164920184",
"title": "Piso en venta en Eixample",
"price": 450000.0,
"currency": "EUR",
"rooms": 3,
"bathrooms": 2,
"surface_m2": 110,
"elevator": true,
"energy_rating": "E"
# property_idtitlepricecurrencyroomsbathrooms
1
2
3

Complete list of extractable fields for Rental Listings objects from fotocasa.es. All fields typed and schema-versioned.

property_idtitlemonthly_rentdeposit_requiredavailable_fromfurnishedpet_friendlyroomssurface_m2agency_nameurlscraped_at
rental_listings
● 200 OK
"property_id": "173829102",
"title": "Alquiler de piso en Madrid Centro",
"monthly_rent": 1200.0,
"furnished": true,
"pet_friendly": false,
"rooms": 2,
"surface_m2": 75,
"agency_name": "Inmobiliaria Centro",
"scraped_at": "2026-05-12T09:14:00Z"
# property_idtitlemonthly_rentdeposit_requiredavailable_fromfurnished
1
2
3

Complete list of extractable fields for Agency Portfolios objects from fotocasa.es. All fields typed and schema-versioned.

agency_idnameaddressphonetotal_propertiessales_countrentals_countlogo_urlratingjoined_date
agency_portfolios
● 200 OK
"agency_id": "AG-9482",
"name": "Fincas Barcelona",
"address": "Carrer de Balmes, 12, Barcelona",
"total_properties": 142,
"sales_count": 98,
"rentals_count": 44,
"rating": 4.2,
"joined_date": "2018-03-14"
# agency_idnameaddressphonetotal_propertiessales_count
1
2
3

Complete list of extractable fields for New Developments objects from fotocasa.es. All fields typed and schema-versioned.

promotion_idnamedevelopercompletion_datetotal_unitsavailable_unitsmin_pricemax_pricelocation_polygonamenities
new_developments
● 200 OK
"promotion_id": "PR-10293",
"name": "Residencial Mar Bella",
"developer": "Metrovacesa",
"completion_date": "2027-Q2",
"available_units": 14,
"min_price": 320000.0,
"max_price": 550000.0,
"amenities": "['pool', 'gym', 'garage']"
# promotion_idnamedevelopercompletion_datetotal_unitsavailable_units
1
2
3

Complete list of extractable fields for Market Trends objects from fotocasa.es. All fields typed and schema-versioned.

property_idcurrent_priceoriginal_priceprice_drop_pctdays_on_marketprice_per_m2neighborhooddistrictcityupdated_at
market_trends
● 200 OK
"property_id": "164920184",
"current_price": 450000.0,
"original_price": 475000.0,
"price_drop_pct": 5.2,
"days_on_market": 42,
"price_per_m2": 4090.9,
"neighborhood": "Dreta de l'Eixample",
"city": "Barcelona"
# property_idcurrent_priceoriginal_priceprice_drop_pctdays_on_marketprice_per_m2
1
2
3

Capabilities

Everything you need from Fotocasa - nothing you don't

Our Fotocasa scraper handles every layer of the platform: property details, dynamic map data, agency listings, and pricing trends - with anti-bot circumvention built in.

Full Property Extraction

Title, description, dimensions, rooms, floor, elevator status, and every metadata field Fotocasa surfaces.

Price History Tracking

Capture original listing price, current price, and calculate price drops over time across active listings.

Agency Intelligence

Agency name, contact details, total portfolio size, and performance metrics for every listing.

Geolocation & Map Data

Extract precise latitude and longitude coordinates for properties, along with neighborhood and district tags.

Energy Certificate Mining

Capture energy consumption and emission ratings required for Spanish real estate compliance.

Commercial & Land Listings

Extract data for offices, industrial spaces, retail units, and plots of land across all Spanish provinces.

New Development Tracking

Monitor new construction projects, developer details, completion dates, and unit availability.

Image & Floorplan Extraction

Collect high-resolution property images and floorplan URLs for computer vision analysis.

Scheduled Pipeline Modes

Run continuous pipelines at daily or weekly cadences with change-detection diffing.

// engagement pipeline

From province list to warehouse record

Brief in. Clean data out.

Define Scope
d 0

Provide target provinces, property types, or agency IDs. We design the extraction schema together.

Pipeline Build
d 2–4

We configure Scrapy / Playwright crawlers, proxy rotation, session management, and CAPTCHA handling for fotocasa.es.

Validation & QA
d 4–6

Schema validation, null-rate checks, and geographical coverage verification before full launch.

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our Fotocasa pipeline handles the hard parts

Fotocasa employs strict scraping countermeasures. Here is how we stay resilient.

pipeline-monitor · fotocasa.es · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Anti-bot layer
Datadome bypass with residential IPs

Fotocasa relies heavily on advanced bot protection. Our crawlers use Spanish residential ISP proxies with realistic browser fingerprints and full cookie session management to bypass these blocks.

JavaScript rendering
Full Playwright execution for dynamic content

Property details and map coordinates are heavily JavaScript-rendered. We run full Playwright browser sessions to trigger lazy-loads and capture data that headless HTTP clients miss entirely.

Schema stability
Resilient selectors with fallback chains

Fotocasa updates its frontend frequently. Our selector strategy uses multiple fallback chains per field, including structured data extraction, so a layout change does not break your data pipeline.

Change detection
Only re-scrape what has changed

For large property catalogues, we maintain a hash index of last-seen values per field. Subsequent runs only push diffs, reducing compute cost and downstream processing load.

Monitoring & alerting
24/7 pipeline health with anomaly detection

Every run emits structured logs to our observability stack. We alert on null-rate spikes, coverage drops, and block rates, responding before you notice.

Applications

Who uses Fotocasa data - and how

Teams across industries use fotocasa.es data to build competitive products and smarter operations.

01
Property Valuation Models (AVM)

PropTech companies train automated valuation models using historical pricing, surface area, and location data.

02
Investment Analysis

Real estate funds identify high-yield rental zones by comparing sale prices against monthly rental rates per district.

03
Agency Competitor Tracking

Brokerages monitor competitor portfolios, time-on-market metrics, and price reduction strategies across regions.

04
Urban Planning Research

Analysts track new developments and residential density shifts to forecast infrastructure requirements.

05
PropTech Aggregation

Listing aggregators normalise Fotocasa data to provide unified market views for end consumers.

06
Macro-economic Indicators

Financial institutions use real estate listing volumes and price fluctuations as leading indicators for economic health.

Why DataFlirt

"Fotocasa holds the definitive pulse of the Spanish real estate market - but extracting property data at scale requires bypassing enterprise-grade bot protection."

Most teams underestimate the investment required: reliable Fotocasa scraping requires Spanish residential proxies, full JavaScript rendering, CAPTCHA solving, and anomaly monitoring. DataFlirt absorbs that complexity so your engineers can focus on the analysis.

Technical Spec

Fotocasa scraper - technical capabilities

Everything supported by our fotocasa.es scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

JavaScript rendering
Full Playwright sessions required for map data and dynamic content
Supported
CAPTCHA bypass
Automated solver integration for strict bot protection
Supported
Residential proxy rotation
ISP-grade residential IPs from ES pools rotated per request
Supported
Geolocation extraction
Precise coordinates and regional tagging per property
Supported
Price drop history
Track original vs current price to calculate discounts
Supported
Agency contact extraction
Capture public agency phone numbers and addresses
Supported
Change detection (diffs)
Hash-based diff to emit only records with changed fields
Supported
Webhook delivery
HTTP POST per record or batch for real-time workflows
Supported
User saved searches
Private user account data and saved property lists
Partial
Private messaging
Direct communication channels between users and agencies
Partial
Infrastructure

Infrastructure powering the Fotocasa pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Scrapy + Playwright Stack

Scrapy handles crawl orchestration, deduplication, and retry logic. Playwright handles JavaScript rendering, cookie sessions, and interaction flows.

Residential Proxy Infrastructure

We maintain pools of Spanish residential ISP proxies. Rotation happens per-request with sticky sessions where required to maintain state.

Cloud-Native Orchestration

Pipelines run on AWS Lambda and ECS. Airflow handles scheduling, dependency management, and SLA alerting. All state stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested format
CSV
Flat file with typed columns
XLS
Excel compatible format for business teams
Parquet
Columnar format for data warehouses
AWS S3
Direct bucket delivery
Webhook
HTTP POST per record
API
REST endpoint for on-demand queries
PostgreSQL
Direct database upserts
Snowflake
Stage and COPY INTO workflow
BigQuery
Streamed directly into your dataset
S3
Direct bucket delivery — compatible with any data lake
// faq

Common questions.

About fotocasa.es scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Fotocasa legal?

Scraping publicly available information is generally permissible. DataFlirt targets only public, non-authenticated property and agency data. We do not extract personal user data or circumvent authentication walls.

How do you handle Fotocasa's anti-bot systems?

We use Spanish residential ISP proxies, full Playwright browser sessions with realistic fingerprints, and request timing modelled on human behaviour to bypass bot detection.

How fresh is the data?

Full catalogue refreshes at daily cadence complete within a 6-12 hour window depending on the target province size. Real-time pipelines can achieve lower latency for specific target sets.

Can you track price drops over time?

Yes. Every pipeline run produces timestamped snapshots. We maintain a time-series record per property to track price reductions from the original listing date.

What is the minimum viable engagement?

Our smallest packages start at a defined province list with weekly delivery. For national coverage, we price based on volume and delivery frequency.

Can I request a sample dataset before committing?

Yes. We provide a sample run of up to 500 properties as part of the pre-engagement scoping process so you can validate schema fit and data quality.

$ dataflirt scope --new-project --source=fotocasa.es ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off province dump or a continuous price-monitoring feed across Spain - we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →