SYSTEM all green source farfetch.com queue 18,492 pages p99 latency 184ms dataflirt.com · scraper/farfetch-com
RUN · 42 active pipelines · farfetch.com live

Farfetch luxury data,
at warehouse scale.

We extract designer collections, geo-specific pricing, boutique inventory, and material specifications from Farfetch. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Products extracted
312K /day
Price updates
1.2M /24h
Boutique records
8,400 /run
Active pipelines
42
Uptime
99.94%
Data Dictionary

Every field we extract from farfetch.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Product Catalogue objects from farfetch.com. All fields typed and schema-versioned.

product_iddesignertitlecategorysub_categoryseasoncolorbase_pricecurrencyavailable_sizessold_out_sizesmaterialscare_instructionsmade_inboutique_idpositively_consciousimage_urlsproduct_url
product_catalogue
● 200 OK
"product_id": "18492015",
"designer": "Bottega Veneta",
"title": "Intrecciato leather cross-body bag",
"color": "Parakeet Green",
"base_price": 2500.0,
"currency": "USD",
"available_sizes": "['One Size']",
"made_in": "Italy",
"positively_conscious": false
# product_iddesignertitlecategorysub_categoryseason
1
2
3

Complete list of extractable fields for Pricing & Duties objects from farfetch.com. All fields typed and schema-versioned.

product_iddesignerregionbase_pricediscount_pctdiscount_absfinal_pricecurrencyimport_duties_includedestimated_taxshipping_tierboutique_idprice_timestamp
pricing_& duties
● 200 OK
"product_id": "18492015",
"region": "US",
"base_price": 2500.0,
"discount_pct": 0,
"final_price": 2500.0,
"currency": "USD",
"import_duties_included": true,
"price_timestamp": "2026-08-14T10:15:00Z"
# product_iddesignerregionbase_pricediscount_pctdiscount_abs
1
2
3

Complete list of extractable fields for Boutique & Sourcing objects from farfetch.com. All fields typed and schema-versioned.

boutique_idboutique_namelocation_citylocation_countryratingproduct_countships_toreturn_policyshipping_speedcustoms_handling
boutique_& sourcing
● 200 OK
"boutique_id": "9281",
"boutique_name": "Browns",
"location_city": "London",
"location_country": "UK",
"product_count": 4150,
"return_policy": "14 days",
"customs_handling": "DDP"
# boutique_idboutique_namelocation_citylocation_countryratingproduct_count
1
2
3

Capabilities

Complete visibility into luxury fashion inventory

Our Farfetch scraper handles regional pricing variations, boutique-level inventory, and complex sizing matrices — with geo-targeted proxies to capture accurate local market data.

Designer & Brand Catalogues

Extract full collections across men's, women's, and kidswear. Capture titles, seasons, colourways, and detailed product descriptions.

Geo-Targeted Pricing

Capture local currency pricing, import duties, and regional tax variations using country-specific residential IP proxies.

Boutique-Level Sourcing

Track which physical boutiques hold specific inventory. Extract boutique names, locations, and shipping policies.

Sizing Matrices

Extract available and out-of-stock sizes across IT, FR, UK, and US standards. Monitor stock depth indicators.

Material & Composition

Parse fabric details, exact material percentages, care instructions, and manufacturing origin countries.

Pre-Owned & Archive Data

Track vintage pieces, condition grading, and POSITIVELY CONSCIOUS sustainability labels across the catalogue.

High-Resolution Imagery

Extract uncompressed image URLs for product galleries, model shots, and detail views.

// engagement pipeline

From designer list to warehouse record

Brief in. Clean data out.

Define Scope
d 0

Provide designer names, category URLs, or target regions. We design the extraction schema together.

Pipeline Build
d 2–4

We configure Scrapy / Playwright crawlers, proxy rotation, session management, and CAPTCHA handling for farfetch.com.

Validation & QA
d 4–6

Schema validation, null-rate checks, price-outlier detection, and size-mapping verification before full launch.

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our Farfetch pipeline handles the hard parts

Luxury e-commerce platforms deploy aggressive anti-bot measures to protect their catalogues. Here's how we ensure reliable extraction.

pipeline-monitor · farfetch.com · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Anti-bot layer
Residential proxy rotation + fingerprint spoofing

Farfetch uses advanced bot mitigation to block automated traffic. Our crawlers use residential ISP proxies with realistic browser fingerprints, randomised request timing, and full cookie session management to blend in with human shoppers.

Geo-pricing
Accurate regional tax and duty extraction

Farfetch dynamically alters prices, duties, and taxes based on the user's IP address. We route requests through specific country proxy pools (e.g., US, UK, UAE) to capture exact local pricing and DDP (Delivered Duty Paid) variables.

Sizing matrices
Normalised size availability tracking

Fashion sizing is notoriously complex. We map and normalise size availability across different regional standards (IT, FR, US) and capture low-stock warnings to provide a clear picture of inventory depth.

Infinite scroll
Full catalogue pagination

Designer pages rely on heavy JavaScript rendering and infinite scroll mechanics. We run full Playwright browser sessions to trigger lazy-loaded items, ensuring no products are missed in the extraction.

Change detection
Only re-scrape what's changed

For large boutique catalogues, we maintain a hash index of last-seen values per product. Subsequent runs only push diffs — reducing compute cost and downstream processing load. You get a clean changelog of price drops and new stock.

Applications

Who uses Farfetch data — and how

Teams across industries use farfetch.com data to build competitive products and smarter operations.

01
Competitor Price Monitoring

Luxury retailers track Farfetch pricing to adjust their own markups, monitor discount strategies, and manage regional price parity.

02
Grey Market Detection

Fashion houses audit boutique listings to identify unauthorised sellers, MAP violations, and cross-border arbitrage.

03
Trend Forecasting

Merchandisers analyse colorways, materials, and category velocity to predict upcoming seasonal trends and consumer preferences.

04
Boutique Aggregation

Marketplaces and aggregators normalise Farfetch inventory data to augment their own catalogues and drop-shipping networks.

05
AI Fashion Models

Machine learning teams use structured product descriptions, material compositions, and high-res imagery to train visual search and recommendation engines.

06
Assortment Planning

Buyers monitor brand representation, size availability curves, and out-of-stock rates to optimise their own seasonal procurement.

Why DataFlirt

"Farfetch aggregates inventory from hundreds of global boutiques — tracking this fragmented supply chain requires precise, geo-targeted extraction at scale."

Extracting luxury fashion data involves navigating complex sizing charts, regional tax calculations, and aggressive bot mitigation. DataFlirt manages the proxy rotation and session handling, delivering structured catalogue data so your merchandising teams can focus on analysis — not infrastructure.

Technical Spec

Farfetch scraper — technical capabilities

Everything supported by our farfetch.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

Geo-targeted pricing
Extract prices, taxes, and duties specific to local markets using regional IPs
Supported
Boutique inventory mapping
Link specific products to physical boutique locations and shipping policies
Supported
Size availability
Track in-stock and out-of-stock states across all size variants
Supported
High-res image extraction
Capture original, uncompressed image URLs from the Farfetch CDN
Supported
Material composition parsing
Extract structured fabric percentages and care instructions
Supported
Pre-owned condition grading
Capture vintage status and specific condition notes for archive pieces
Supported
Import duty calculation
Identify whether import duties are included in the base price
Supported
User purchase history
Historical orders and return data tied to specific user accounts
Partial
Access status / loyalty tiers
Private client pricing and early access drops requiring authentication
Partial
Infrastructure

Infrastructure powering the Farfetch pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Scrapy + Playwright Stack

Scrapy handles crawl orchestration and deduplication. Playwright handles JavaScript rendering, infinite scroll, and interaction flows for complex designer pages.

Geo-Targeted Proxy Infrastructure

We maintain pools of residential ISP proxies across target regions (US, EU, APAC). Rotation happens per-request to bypass bot protection and capture accurate local pricing.

Cloud-Native Orchestration

Pipelines run on AWS Lambda (burst) and Kubernetes (sustained). Airflow handles scheduling and dependency management. All state stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested — schema versioned per run
CSV
Flat file with typed columns — Excel/Sheets compatible
Parquet
Columnar format for BigQuery, Snowflake, Athena
S3
Direct bucket delivery — compatible with any data lake
Webhook
HTTP POST per record for real-time downstream processing
BigQuery
Streamed directly into your dataset with schema auto-detect
Snowflake
Stage + COPY INTO workflow — incremental or full-replace
// faq

Common questions.

About farfetch.com scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Farfetch legal?

Scraping publicly available information from Farfetch is generally permissible under applicable law. DataFlirt targets only public, non-authenticated product, pricing, and boutique data. We do not extract personal data or circumvent authentication walls. Clients should review Farfetch's ToS and consult legal counsel for specific use cases.

How do you handle regional pricing and duties?

Farfetch alters prices based on IP geolocation. We configure pipelines to route through specific residential proxy pools (e.g., US, UK, EU) to capture accurate local currency pricing, DDP flags, and regional tax variations.

Can you track inventory at the boutique level?

Yes. We extract the boutique ID, name, and location associated with each product listing, allowing you to map inventory distribution across Farfetch's global network of physical partners.

How fresh is the data?

For targeted designer or category monitoring, we can configure hourly pipelines to catch sneaker drops and price adjustments. Full catalogue refreshes typically run at a daily or weekly cadence depending on volume.

Do you extract high-resolution product images?

Yes. We extract the direct CDN URLs for all high-resolution imagery, including primary shots, alternate angles, and detail views. We deliver the URLs in the payload for your systems to ingest.

What is the minimum viable engagement?

Our minimum engagement typically starts with a defined set of designers or categories (e.g., top 50 luxury brands) with weekly delivery. Contact us with your target scope for a precise quote.

Can I request a sample dataset before committing?

Absolutely. We provide a sample run of up to 500 products from a specific designer or category as part of the pre-engagement scoping process — so you can validate schema fit and data quality.

$ dataflirt scope --new-project --source=farfetch.com ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a targeted designer monitor or a continuous feed of global boutique inventory — we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →