
Free People data,
at warehouse scale.

We extract product catalogues, FP Movement inventory, size-level stock availability, pricing, and customer reviews from Free People. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Products extracted: 42.1K /run
Stock updates: 185K /day
Reviews parsed: 312K /month
Active pipelines: 41
Uptime: 99.94%

Data Dictionary

Every field we extract from freepeople.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Product Catalogue objects from freepeople.com. All fields typed and schema-versioned.

product_id, sku, title, brand, category, sub_category, description, fabric_composition, care_instructions, style_number, available_colours, available_sizes, primary_image_url, gallery_image_urls, video_url, scraped_at
product_catalogue
● 200 OK
{
  "product_id": "82349102",
  "sku": "OB123456",
  "title": "Oasis Midi Dress",
  "brand": "Free People",
  "category": "Dresses",
  "fabric_composition": "100% Cotton",
  "style_number": "F22-DR-091",
  "available_colours": ["Black", "Ivory", "Terracotta"]
}
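
A consumer of the Product Catalogue feed might load each record into a typed structure before querying it. The sketch below is a minimal illustration under stated assumptions: `ProductRecord` and `parse_product` are hypothetical names, not part of the delivered schema, and only the fields shown in the sample record are modelled. It accepts `available_colours` either as a JSON array or as a stringified list.

```python
import ast
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class ProductRecord:
    """Hypothetical typed view of one product_catalogue record."""
    product_id: str
    sku: str
    title: str
    brand: str
    category: str
    fabric_composition: str
    style_number: str
    available_colours: tuple[str, ...]


def parse_product(raw: str) -> ProductRecord:
    """Parse one JSON record into a ProductRecord."""
    data = json.loads(raw)
    colours = data["available_colours"]
    if isinstance(colours, str):
        # Assumption: a string value holds a Python-style list literal.
        colours = ast.literal_eval(colours)
    return ProductRecord(
        product_id=data["product_id"],
        sku=data["sku"],
        title=data["title"],
        brand=data["brand"],
        category=data["category"],
        fabric_composition=data["fabric_composition"],
        style_number=data["style_number"],
        available_colours=tuple(colours),
    )


sample = json.dumps({
    "product_id": "82349102",
    "sku": "OB123456",
    "title": "Oasis Midi Dress",
    "brand": "Free People",
    "category": "Dresses",
    "fabric_composition": "100% Cotton",
    "style_number": "F22-DR-091",
    "available_colours": "['Black', 'Ivory', 'Terracotta']",
})
record = parse_product(sample)
print(record.title, record.available_colours)
```

Freezing the dataclass keeps parsed records hashable and safe to share between pipeline stages.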

Complete list of extractable fields for Inventory & Pricing objects from freepeople.com. All fields typed and schema-versioned.

sku, variant_id, colour, size, full_price, sale_price, currency, discount_pct, in_stock, low_stock_warning, markdown_status, region, price_timestamp
inventory_and_pricing
● 200 OK
{
  "sku": "OB123456",
  "variant_id": "V987654",
  "colour": "Terracotta",
  "size": "M",
  "full_price": 128.0,
  "sale_price": 89.95,
  "currency": "USD",
  "in_stock": true,
  "low_stock_warning": true
}
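
The field list includes `discount_pct`, which the sample record does not show. One plausible derivation, assumed here rather than confirmed by the source, is the percentage reduction from `full_price` to `sale_price`. The helper name and the one-decimal rounding are illustrative choices.

```python
def discount_pct(full_price: float, sale_price: float) -> float:
    """Percentage reduction from full price to sale price, to one decimal."""
    if full_price <= 0:
        raise ValueError("full_price must be positive")
    return round((full_price - sale_price) / full_price * 100, 1)


# Prices taken from the sample Inventory & Pricing record.
print(discount_pct(128.0, 89.95))  # 29.7
```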

Complete list of extractable fields for Reviews & Fit Data objects from freepeople.com. All fields typed and schema-versioned.

review_id, sku, rating, review_title, review_body, fit_index, quality_rating, user_nickname, verified_buyer, date_posted, helpful_votes, size_purchased
reviews_and_fit_data
● 200 OK
{
  "review_id": "REV-992831",
  "sku": "OB123456",
  "rating": 4.5,
  "fit_index": "True to Size",
  "quality_rating": 5.0,
  "verified_buyer": true,
  "helpful_votes": 12,
  "size_purchased": "M"
}
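
Because `fit_index` is delivered as a structured label rather than free text, it can be tallied per SKU without any text mining. The following is a minimal sketch: `fit_summary` is a hypothetical helper, and the "Runs Small" label is an assumed value alongside the "True to Size" label from the sample record.

```python
from collections import Counter, defaultdict


def fit_summary(reviews):
    """Count fit_index labels per SKU."""
    summary = defaultdict(Counter)
    for review in reviews:
        summary[review["sku"]][review["fit_index"]] += 1
    return {sku: dict(counts) for sku, counts in summary.items()}


reviews = [
    {"sku": "OB123456", "fit_index": "True to Size"},
    {"sku": "OB123456", "fit_index": "Runs Small"},
    {"sku": "OB123456", "fit_index": "True to Size"},
]
print(fit_summary(reviews))
```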

Capabilities

Everything you need from Free People — nothing you don't

Our Free People scraper handles the entire URBN platform architecture: complex variant matrices, dynamic stock hydration, high-res media extraction, and the review corpus — with JavaScript rendering and anti-bot circumvention built in.

Full Product Extraction

Title, description, fabric composition, care instructions, and style numbers — scraped at the base product level.

FP Movement Tracking

Isolate and extract specific activewear lines, including performance specifications and activity tags.

Variant Matrix Mapping

Map parent products to every possible colour and size combination, maintaining relational integrity.

Size-Level Inventory

Monitor stock status and low-stock warnings for specific size and colour variants.

Pricing & Markdowns

Capture full retail price, current sale price, and promotional discounts — timestamped per crawl.

Fit & Quality Reviews

Extract star ratings, detailed text reviews, and specific indices for fit (runs small/large) and quality.

High-Res Media URLs

Extract primary images, full gallery arrays, and video asset URLs for visual analysis or AI training.

// engagement pipeline

From SKU list to warehouse record

Brief in. Clean data out.

Define Scope (day 0)

Provide category URLs, specific SKUs, or search terms. We design the extraction schema together.

Pipeline Build (days 2–4)

We configure Scrapy / Playwright crawlers, proxy rotation, session management, and CAPTCHA handling for freepeople.com.

Validation & QA (days 4–6)

Schema validation, null-rate checks, price-outlier detection, and a review of sample variants before full launch.
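
Two of these checks can be sketched with the standard library alone. This is an illustrative example, not the production QA suite: the function names are hypothetical, and Tukey's interquartile-range fences are one common outlier rule, assumed here because the source does not say which method is used.

```python
from statistics import quantiles


def null_rate(records, field):
    """Fraction of records where `field` is missing or None."""
    if not records:
        return 0.0
    missing = sum(1 for record in records if record.get(field) is None)
    return missing / len(records)


def price_outliers(prices, k=1.5):
    """Return prices outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = quantiles(prices, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [price for price in prices if price < low or price > high]


records = [{"sale_price": 89.95}, {"sale_price": None}, {}, {"sale_price": 10.0}]
prices = [88.0, 98.0, 108.0, 118.0, 128.0, 138.0, 148.0, 1280.0]
print(null_rate(records, "sale_price"))
print(price_outliers(prices))
```

A price such as 1280.0 in a category that otherwise sits near 100 usually points to a parsing slip, for example a dropped decimal point, rather than a real listing.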

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our Free People pipeline handles the hard parts

Modern apparel sites rely on heavy front-end frameworks and aggressive anti-bot systems. Here is how we maintain stable extraction.

pipeline-monitor · freepeople.com · live · active
// fingerprinting
Identity rotation
TLS fingerprint: randomised
User-agent: rotated
IP pool: residential
Challenges blocked: 0
// pagination
Page coverage
48,291 pages queued (running)
// observability
Pipeline health
Uptime: 99.9%
p99 latency: 142ms
Null rate: 0.3%
Alerts: 2
Dynamic hydration
React/Next.js state extraction

Free People loads variant pricing and stock data via background API calls after the initial page load. We intercept these XHR requests and execute full Playwright sessions to ensure accurate, size-level inventory capture.
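
The interception step described above could look roughly like the sketch below, which uses Playwright's synchronous API. The URL fragments in `HYPOTHETICAL_API_HINTS` are assumptions for illustration, since the actual background endpoints are not documented here, and both function names are hypothetical.

```python
from urllib.parse import urlparse

# Assumption: variant data arrives as JSON from paths containing one of these
# fragments. Replace them with the endpoints observed in the browser.
HYPOTHETICAL_API_HINTS = ("/api/", "/inventory", "/availability")


def looks_like_variant_api(url: str, content_type: str) -> bool:
    """Heuristic filter for background JSON calls worth capturing."""
    path = urlparse(url).path.lower()
    is_json = "application/json" in content_type.lower()
    return is_json and any(hint in path for hint in HYPOTHETICAL_API_HINTS)


def capture_variant_payloads(product_url: str) -> list[dict]:
    """Load a page in Playwright and collect matching JSON responses."""
    # Imported lazily so the filter above works without Playwright installed.
    from playwright.sync_api import sync_playwright

    responses = []
    payloads = []
    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        page = browser.new_page()
        # Record every response; bodies are read once navigation settles.
        page.on("response", responses.append)
        page.goto(product_url, wait_until="networkidle")
        for response in responses:
            content_type = response.headers.get("content-type", "")
            if looks_like_variant_api(response.url, content_type):
                try:
                    payloads.append({"url": response.url, "data": response.json()})
                except Exception:
                    # Body unavailable or not valid JSON; skip it.
                    continue
        browser.close()
    return payloads
```

Keeping the filter separate from the browser session makes the matching rule testable without launching a browser.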

Anti-bot layer
Bypassing perimeter defenses

URBN brands use sophisticated bot mitigation. Our crawlers route traffic through US-based residential ISP proxies with realistic TLS and browser fingerprints, which reduces the likelihood of IP bans and CAPTCHA walls.

Variant complexity
Multi-dimensional matrix normalisation

Apparel data is rarely flat. We map the complex relationships between parent styles, colourways, and sizes into a normalised relational schema, ensuring you can query stock by specific SKUs.
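
As a rough illustration of the expansion, a parent style can be flattened into one row per colour and size combination, each keyed back to its parent. The `expand_variants` helper and the composite `variant_key` are hypothetical; real `variant_id` values come from the site itself.

```python
from itertools import product


def expand_variants(parent):
    """Yield one row per colour and size combination, keyed to the parent."""
    combos = product(parent["available_colours"], parent["available_sizes"])
    for colour, size in combos:
        yield {
            "product_id": parent["product_id"],
            "colour": colour,
            "size": size,
            # Hypothetical composite key used only for this illustration.
            "variant_key": f'{parent["product_id"]}:{colour}:{size}',
        }


parent = {
    "product_id": "82349102",
    "available_colours": ["Black", "Ivory", "Terracotta"],
    "available_sizes": ["S", "M"],
}
rows = list(expand_variants(parent))
print(len(rows), rows[0])
```

Three colours and two sizes yield six variant rows, each of which can then carry its own stock and price fields.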

Geolocation
Region-specific pricing

Free People alters currency and pricing based on the visitor's IP address. We enforce strict regional proxy targeting (e.g., US, UK) to ensure the pricing data matches your target market.
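
A simple guard against mismatched regional data is to check each record's `currency` against the region the crawl was routed through. The mapping below is an assumption for illustration: the source names US and UK regions and USD, GBP, and EUR pricing, while the "EU" key and the helper name are hypothetical.

```python
# Assumed mapping from proxy region to the currency expected in that market.
EXPECTED_CURRENCY = {"US": "USD", "UK": "GBP", "EU": "EUR"}


def currency_matches_region(record, region):
    """Return True when the record's currency fits the crawl region."""
    if region not in EXPECTED_CURRENCY:
        raise KeyError(f"unknown region: {region}")
    return record.get("currency") == EXPECTED_CURRENCY[region]


record = {"sku": "OB123456", "sale_price": 89.95, "currency": "USD"}
print(currency_matches_region(record, "US"))  # True
print(currency_matches_region(record, "UK"))  # False
```

A failed check suggests the request left through the wrong exit region, so the record should be re-crawled rather than delivered.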

Delta updates
Efficient change detection

Monitoring stock levels across thousands of SKUs generates massive data volume. We maintain a hash index of previous states and only emit records when price or inventory status changes.
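
Hash-based change detection of this kind can be sketched in a few lines. This is a minimal example under stated assumptions: the tracked field set is a guess based on the Inventory & Pricing schema, the function names are hypothetical, and an in-memory dict stands in for whatever persistent store holds the hash index.

```python
import hashlib
import json

# Assumed set of fields whose changes should trigger a new record.
TRACKED_FIELDS = ("full_price", "sale_price", "in_stock", "low_stock_warning")


def state_hash(record):
    """Stable digest of the tracked fields of one variant record."""
    tracked = {field: record.get(field) for field in TRACKED_FIELDS}
    payload = json.dumps(tracked, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def emit_changes(records, index):
    """Return records whose tracked state changed; update the index in place."""
    changed = []
    for record in records:
        key = record["variant_id"]
        digest = state_hash(record)
        if index.get(key) != digest:
            index[key] = digest
            changed.append(record)
    return changed


index = {}
snapshot = {"variant_id": "V987654", "full_price": 128.0, "sale_price": 89.95,
            "in_stock": True, "low_stock_warning": True}
first = emit_changes([snapshot], index)    # new variant, emitted
second = emit_changes([snapshot], index)   # unchanged, suppressed
third = emit_changes([{**snapshot, "in_stock": False}], index)  # changed, emitted
print(len(first), len(second), len(third))  # 1 0 1
```

Serialising with sorted keys keeps the digest stable regardless of field order in the scraped record.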

Applications

Who uses Free People data — and how

Teams across industries use freepeople.com data to build competitive products and smarter operations.

01
Competitor Price Monitoring

Fashion retailers track Free People's pricing tiers and promotional cadence to adjust their own markdown strategies.

02
Trend & Assortment Analysis

Merchandisers analyse category depth, colour prevalence, and fabric choices to inform upcoming seasonal collections.

03
Inventory & Markdown Strategy

Analysts monitor size-level stock depletion rates to reverse-engineer sales velocity and optimal markdown timing.

04
AI Fashion Models

Computer vision teams ingest high-res gallery images alongside detailed fabric descriptions to train apparel recognition models.

05
Fabric & Sustainability Tracking

Researchers extract material compositions to track the adoption of organic cotton and sustainable fibres in mainstream bohemian fashion.

06
Sentiment Analysis

Product teams mine customer reviews for fit indices (runs small/large) and quality complaints to improve their own manufacturing tolerances.

Why DataFlirt

"Free People's catalogue is a goldmine for bohemian fashion trends and fabric compositions — but extracting accurate size-level stock requires bypassing aggressive bot mitigation."

Apparel scraping is notoriously difficult due to complex variant matrices and dynamic inventory hydration. DataFlirt handles the JavaScript execution, residential proxy rotation, and schema normalisation required to turn freepeople.com into a clean, queryable database — so your analysts can focus on markdown strategies, not DOM parsing.

Technical Spec

Free People scraper — technical capabilities

Capabilities of our freepeople.com scraper, covering rendered SPA elements, variant data, and rate-limit handling. Gated, authenticated areas are only partially supported.

Capability | Details | Status
JavaScript rendering | Full Playwright sessions, required for variant hydration and dynamic stock loading | Supported
Variant matrix mapping | Normalised relationships between parent styles, colours, and sizes | Supported
FP Movement category | Dedicated extraction for activewear performance specifications | Supported
Size-level stock status | Accurate in-stock and low-stock indicators per specific variant | Supported
Fit & quality metrics | Extraction of structured review indices alongside raw text | Supported
High-res image extraction | Capture of all primary, gallery, and detail image URLs | Supported
Geolocation pricing | Targeted proxies to extract USD, GBP, or EUR pricing accurately | Supported
Change detection (diffs) | Hash-based diff: only emit records with changed inventory or price | Supported
User wishlists / FP Community | Gated personal data requiring authenticated user accounts | Partial
Checkout & shipping logic | Dynamic cart calculation and address-specific shipping rates | Partial
Infrastructure

Infrastructure powering the apparel pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

Scrapy, Playwright, Python 3.12, Redis, PostgreSQL, Apache Airflow, AWS Lambda, S3, CloudWatch, 2Captcha, CapSolver, Residential Proxies, Docker, Kubernetes, Grafana, Prometheus
Scrapy + Playwright Stack

Scrapy handles crawl orchestration, deduplication, and retry logic. Playwright handles JavaScript rendering, cookie sessions, and XHR interception. The two are combined via the scrapy-playwright download handler.
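
For readers unfamiliar with scrapy-playwright, the integration is configured in the Scrapy project settings. The fragment below is a generic sketch of that setup, not the pipeline's actual configuration; `playwright_meta` is a hypothetical convenience helper.

```python
# settings.py (sketch): route HTTP(S) downloads through scrapy-playwright.
DOWNLOAD_HANDLERS = {
    "http": "scrapy_playwright.handler.ScrapyPlaywrightDownloadHandler",
    "https": "scrapy_playwright.handler.ScrapyPlaywrightDownloadHandler",
}
# scrapy-playwright requires the asyncio-based Twisted reactor.
TWISTED_REACTOR = "twisted.internet.asyncioreactor.AsyncioSelectorReactor"
PLAYWRIGHT_BROWSER_TYPE = "chromium"


def playwright_meta(render: bool = True) -> dict:
    """Request meta that opts a single request into browser rendering."""
    return {"playwright": render}
```

In a spider, a request would then be issued as `scrapy.Request(url, meta=playwright_meta())`. Requests without the `playwright` meta key go through Scrapy's regular downloader, which keeps browser rendering limited to the pages that need it.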

Residential Proxy Infrastructure

We maintain pools of residential ISP proxies across US and EU regions. Rotation happens per-request with sticky sessions where required, preventing regional pricing anomalies.

Cloud-Native Orchestration

Pipelines run on AWS Lambda (burst) and ECS (sustained). Airflow handles scheduling, dependency management, and SLA alerting. All state stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON: Newline-delimited or nested, schema versioned per run
CSV: Flat file with typed columns, Excel/Sheets compatible
Parquet: Columnar format for BigQuery, Snowflake, Athena
S3: Direct bucket delivery, compatible with any data lake
Webhook: HTTP POST per record for real-time downstream processing
BigQuery: Streamed directly into your dataset with schema auto-detect
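
To make the difference between the newline-delimited JSON and flat CSV outputs concrete, here is a small standard-library sketch. The helper names are hypothetical and the column order is an illustrative choice. Parquet is omitted because it needs a third-party library.

```python
import csv
import io
import json


def to_ndjson(records):
    """Serialise records as newline-delimited JSON, one object per line."""
    return "".join(json.dumps(record, sort_keys=True) + "\n" for record in records)


def to_csv(records, columns):
    """Serialise records as a flat CSV with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=columns, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


records = [{"sku": "OB123456", "size": "M", "sale_price": 89.95}]
print(to_ndjson(records), end="")
print(to_csv(records, ["sku", "size", "sale_price"]), end="")
```

Newline-delimited JSON keeps nested fields such as image arrays intact, while the CSV form suits spreadsheet users who only need flat columns.
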
// faq

Common questions.

About freepeople.com scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Free People legal?

Scraping publicly available information from freepeople.com is generally permissible under applicable law. DataFlirt targets only public, non-authenticated product, pricing, and review data. We do not extract personal user data or circumvent authentication walls.

How do you handle size and colour variants?

We map the entire variant matrix. A single parent product (e.g., a dress) is expanded into individual records for every colour and size combination, capturing specific stock status and pricing for each.

Can you track pricing in different regions?

Yes. Free People uses geolocation to alter pricing. We configure the pipeline with specific regional residential proxies (e.g., US for USD, UK for GBP) to ensure you receive the correct localised data.

How frequently can you update inventory data?

For targeted SKU lists, we can run hourly pipelines to monitor fast-moving stock. Full catalogue refreshes are typically executed on a daily cadence.

Do you provide historical pricing data?

We begin tracking history from the moment your pipeline is commissioned. We maintain a time-series record of price changes and markdowns for every variant moving forward.

What is the minimum viable engagement?

Our smallest packages start at a defined category or SKU list (typically 5,000+ items) with weekly delivery. For full-site extraction or continuous monitoring, we price based on compute volume.

$ dataflirt scope --new-project --source=freepeople.com ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off catalogue dump or continuous inventory monitoring across all categories, we scope, build, and operate the pipeline.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h