
RUN · 41 active pipelines · freepeople.com live

Free People data,
at warehouse scale.

We extract product catalogues, FP Movement inventory, size-level stock availability, pricing, and customer reviews from Free People. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Get data from freepeople.com → See how it works

Products extracted

42.1K /run

Stock updates

185K /day

Reviews parsed

312K /month

Active pipelines

41

Uptime

99.94%

◆ Free People Apparel Data ◆ FP Movement Activewear ◆ Size & Fit Details ◆ Stock Availability ◆ Fabric & Care Specs ◆ Colour Variations ◆ Customer Reviews ◆ Pricing & Markdown Tracking ◆ Category Hierarchies ◆ High-Res Image URLs ◆ Managed Pipeline ◆ S3 / BigQuery Delivery ◆ Bengaluru HQ ◆ Enterprise SLA

Data Dictionary

Every field we extract from freepeople.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Product Catalogue objects from freepeople.com. All fields typed and schema-versioned.

product_id, sku, title, brand, category, sub_category, description, fabric_composition, care_instructions, style_number, available_colours, available_sizes, primary_image_url, gallery_image_urls, video_url, scraped_at

{
  "product_id": "82349102",
  "sku": "OB123456",
  "title": "Oasis Midi Dress",
  "brand": "Free People",
  "category": "Dresses",
  "fabric_composition": "100% Cotton",
  "style_number": "F22-DR-091",
  "available_colours": ["Black", "Ivory", "Terracotta"]
}


Complete list of extractable fields for Inventory & Pricing objects from freepeople.com. All fields typed and schema-versioned.

sku, variant_id, colour, size, full_price, sale_price, currency, discount_pct, in_stock, low_stock_warning, markdown_status, region, price_timestamp

{
  "sku": "OB123456",
  "variant_id": "V987654",
  "colour": "Terracotta",
  "size": "M",
  "full_price": 128.0,
  "sale_price": 89.95,
  "currency": "USD",
  "in_stock": true,
  "low_stock_warning": true
}

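The Inventory & Pricing field list includes discount_pct alongside full_price and sale_price. A minimal sketch of how that value could be derived from the sample record above, assuming it is the percentage reduction from full price rounded to one decimal place. The rounding rule and the function name are illustrative assumptions, not a documented part of the schema.

```python
from typing import Optional


def discount_pct(full_price: float, sale_price: Optional[float]) -> float:
    """Percentage reduction from full price; 0.0 when there is no sale price."""
    if sale_price is None or full_price <= 0:
        return 0.0
    return round((full_price - sale_price) / full_price * 100, 1)


# Values taken from the sample Inventory & Pricing record.
record = {"sku": "OB123456", "full_price": 128.0, "sale_price": 89.95}
record["discount_pct"] = discount_pct(record["full_price"], record["sale_price"])
print(record["discount_pct"])  # 29.7
```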

Complete list of extractable fields for Reviews & Fit Data objects from freepeople.com. All fields typed and schema-versioned.

review_id, sku, rating, review_title, review_body, fit_index, quality_rating, user_nickname, verified_buyer, date_posted, helpful_votes, size_purchased

{
  "review_id": "REV-992831",
  "sku": "OB123456",
  "rating": 4.5,
  "fit_index": "True to Size",
  "quality_rating": 5.0,
  "verified_buyer": true,
  "helpful_votes": 12,
  "size_purchased": "M"
}


Capabilities

Everything you need from Free People — nothing you don't

Our Free People scraper handles the entire URBN platform architecture: complex variant matrices, dynamic stock hydration, high-res media extraction, and the review corpus — with JavaScript rendering and anti-bot circumvention built in.

Full Product Extraction

Title, description, fabric composition, care instructions, and style numbers — scraped at the base product level.

FP Movement Tracking

Isolate and extract specific activewear lines, including performance specifications and activity tags.

Variant Matrix Mapping

Map parent products to every possible colour and size combination, maintaining relational integrity.

Size-Level Inventory

Monitor stock status and low-stock warnings for specific size and colour variants.

Pricing & Markdowns

Capture full retail price, current sale price, and promotional discounts — timestamped per crawl.

Fit & Quality Reviews

Extract star ratings, detailed text reviews, and specific indices for fit (runs small/large) and quality.

High-Res Media URLs

Extract primary images, full gallery arrays, and video asset URLs for visual analysis or AI training.

// engagement pipeline

From SKU list to warehouse record

Brief in. Clean data out.

Define Scope

Day 0

Provide category URLs, specific SKUs, or search terms. We design the extraction schema together.

Pipeline Build

Days 2–4

We configure Scrapy / Playwright crawlers, proxy rotation, session management, and CAPTCHA handling for freepeople.com.

Validation & QA

Days 4–6

Schema validation, null-rate checks, price-outlier detection, and spot checks on sample variants before full launch.

Delivery

ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.
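The null-rate and price-outlier checks named in the Validation & QA step can be sketched in a few lines. This is an illustration under stated assumptions, not the production checks: the function names, the median-absolute-deviation rule, and the threshold of 5 are all hypothetical choices.

```python
from statistics import median


def null_rates(records, fields):
    """Share of records in which each field is missing or None."""
    total = len(records)
    return {
        f: (sum(1 for r in records if r.get(f) is None) / total if total else 0.0)
        for f in fields
    }


def price_outliers(records, field="full_price", threshold=5.0):
    """Flag records whose price sits far from the median (MAD-based rule)."""
    prices = [r[field] for r in records if r.get(field) is not None]
    if not prices:
        return []
    med = median(prices)
    mad = median(abs(p - med) for p in prices)
    if mad == 0:
        # No spread to measure against, so nothing is flagged.
        return []
    return [
        r for r in records
        if r.get(field) is not None and abs(r[field] - med) / mad > threshold
    ]


# Made-up sample: the last price looks like a misplaced decimal point.
sample = [
    {"sku": "A", "full_price": 128.0, "sale_price": 89.95},
    {"sku": "B", "full_price": 98.0, "sale_price": None},
    {"sku": "C", "full_price": 148.0, "sale_price": None},
    {"sku": "D", "full_price": 118.0, "sale_price": 99.0},
    {"sku": "E", "full_price": 12800.0, "sale_price": None},
]
print(null_rates(sample, ["full_price", "sale_price"]))
print([r["sku"] for r in price_outliers(sample)])
```

A median-based rule is used here because a single extreme price would distort a mean and standard deviation computed over the same batch.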

Under the hood

How our Free People pipeline handles the hard parts

Modern apparel sites rely on heavy front-end frameworks and aggressive anti-bot systems. Here is how we maintain stable extraction.

// fingerprinting

Identity rotation

TLS fingerprint: randomised

User-agent: rotated

IP pool: residential

Challenges blocked: 0

// pagination

Page coverage

48,291 pages queued · running

// observability

Pipeline health

99.9% uptime

142ms p99 latency

0.3% null rate

alerts

Dynamic hydration

React/Next.js state extraction

Free People loads variant pricing and stock data via background API calls after the initial page load. We intercept these XHR requests and execute full Playwright sessions to ensure accurate, size-level inventory capture.
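A rough sketch of the interception step, written as a plain response handler so it can be read without a browser running. The URL pattern is a placeholder assumption: the real background endpoints would have to be identified in browser dev tools. The handler relies only on a `url` attribute and a `json()` method, which is what Playwright's sync `Response` object provides.

```python
import re

# Hypothetical pattern for the background inventory/pricing calls.
VARIANT_API = re.compile(r"/api/.*(inventory|price)")


def make_collector(sink, pattern=VARIANT_API):
    """Build a response handler that keeps JSON bodies from matching URLs."""

    def on_response(response):
        if not pattern.search(response.url):
            return
        try:
            sink.append({"url": response.url, "payload": response.json()})
        except Exception:
            # Non-JSON or empty body: skip it rather than fail the crawl.
            pass

    return on_response
```

With Playwright's sync API, a handler like this would be registered via `page.on("response", make_collector(captured))` before calling `page.goto(...)`, so that responses fired during hydration are captured.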

Anti-bot layer

Bypassing perimeter defenses

URBN brands utilise sophisticated bot mitigation. Our crawlers route traffic through US-based residential ISP proxies with realistic TLS and browser fingerprints, preventing IP bans and CAPTCHA walls.

Variant complexity

Multi-dimensional matrix normalisation

Apparel data is rarely flat. We map the complex relationships between parent styles, colourways, and sizes into a normalised relational schema, ensuring you can query stock by specific SKUs.
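As an illustration of the parent-to-variant expansion, here is a minimal sketch that turns one parent product into a flat record per colour and size pair. The `stock_by_variant` lookup and the rule that an unlisted combination counts as out of stock are assumptions made for the example.

```python
from itertools import product


def expand_variants(parent, stock_by_variant):
    """Yield one flat record per colour/size pair of a parent product."""
    pairs = product(parent["available_colours"], parent["available_sizes"])
    for colour, size in pairs:
        yield {
            "product_id": parent["product_id"],
            "colour": colour,
            "size": size,
            # Assumption: a combination absent from the lookup is out of stock.
            "in_stock": stock_by_variant.get((colour, size), False),
        }


parent = {
    "product_id": "82349102",
    "available_colours": ["Black", "Ivory", "Terracotta"],
    "available_sizes": ["S", "M", "L"],
}
stock = {("Terracotta", "M"): True, ("Black", "S"): True}
rows = list(expand_variants(parent, stock))
print(len(rows))  # 9
```

Keeping product_id on every row is what lets the flat records be joined back to the parent style.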

Geolocation

Region-specific pricing

Free People alters currency and pricing based on the visitor's IP address. We enforce strict regional proxy targeting (e.g., US, UK) to ensure the pricing data matches your target market.
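One way to confirm that regional targeting worked is to compare the currency on each scraped record with the currency expected for the proxy region. The sketch below assumes a region-to-currency mapping built from the US and UK examples given here; the mapping and function name are illustrative.

```python
# Assumed expectations, based on the US/USD and UK/GBP examples on this page.
EXPECTED_CURRENCY = {"US": "USD", "UK": "GBP"}


def currency_mismatches(records):
    """Return records whose currency does not match the proxy region used."""
    bad = []
    for r in records:
        expected = EXPECTED_CURRENCY.get(r["region"])
        # Regions without a configured expectation are not judged.
        if expected is not None and r["currency"] != expected:
            bad.append(r)
    return bad


scraped = [
    {"sku": "OB123456", "region": "US", "currency": "USD"},
    {"sku": "OB123456", "region": "UK", "currency": "USD"},  # proxy leaked
]
print(currency_mismatches(scraped))
```

A mismatch usually means the request left through a proxy in the wrong region, so the price should be discarded rather than converted.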

Delta updates

Efficient change detection

Monitoring stock levels across thousands of SKUs generates massive data volume. We maintain a hash index of previous states and only emit records when price or inventory status changes.
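The hash-index idea can be sketched as follows. The tracked field names come from the Inventory & Pricing field list; the choice of SHA-256 over a sorted JSON dump, and the in-memory dict standing in for the persistent index, are assumptions for illustration.

```python
import hashlib
import json

# Only changes in these fields should trigger a new record.
TRACKED = ("full_price", "sale_price", "in_stock", "low_stock_warning")


def state_hash(record):
    """Stable digest of only the fields whose changes we care about."""
    tracked = {k: record.get(k) for k in TRACKED}
    blob = json.dumps(tracked, sort_keys=True).encode("utf-8")
    return hashlib.sha256(blob).hexdigest()


def emit_changes(records, index):
    """Return changed records and update the variant_id -> hash index in place."""
    changed = []
    for r in records:
        h = state_hash(r)
        if index.get(r["variant_id"]) != h:
            index[r["variant_id"]] = h
            changed.append(r)
    return changed


index = {}
crawl = [{"variant_id": "V987654", "full_price": 128.0, "sale_price": 89.95,
          "in_stock": True, "low_stock_warning": True}]
first = emit_changes(crawl, index)   # new variant, so it is emitted
second = emit_changes(crawl, index)  # unchanged, so nothing is emitted
```

Hashing only the tracked fields means a change to an untracked field, such as the scrape timestamp, does not produce a spurious update.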

Applications

Who uses Free People data — and how

Teams across industries use freepeople.com data to build competitive products and smarter operations.

Competitor Price Monitoring

Fashion retailers track Free People's pricing tiers and promotional cadence to adjust their own markdown strategies.

Trend & Assortment Analysis

Merchandisers analyse category depth, colour prevalence, and fabric choices to inform upcoming seasonal collections.

Inventory & Markdown Strategy

Analysts monitor size-level stock depletion rates to reverse-engineer sales velocity and optimal markdown timing.

AI Fashion Models

Computer vision teams ingest high-res gallery images alongside detailed fabric descriptions to train apparel recognition models.

Fabric & Sustainability Tracking

Researchers extract material compositions to track the adoption of organic cotton and sustainable fibres in mainstream bohemian fashion.

Sentiment Analysis

Product teams mine customer reviews for fit indices (runs small/large) and quality complaints to improve their own manufacturing tolerances.

Why DataFlirt

"Free People's catalogue is a goldmine for bohemian fashion trends and fabric compositions — but extracting accurate size-level stock requires bypassing aggressive bot mitigation."

Apparel scraping is notoriously difficult due to complex variant matrices and dynamic inventory hydration. DataFlirt handles the JavaScript execution, residential proxy rotation, and schema normalisation required to turn freepeople.com into a clean, queryable database — so your analysts can focus on markdown strategies, not DOM parsing.

Technical Spec

Free People scraper — technical capabilities

Everything supported by our freepeople.com scraper: rendered SPA elements, rate-limit handling, and more. Gated areas behind authentication are covered only partially.

JavaScript rendering

Full Playwright sessions — required for variant hydration and dynamic stock loading

Supported

Variant matrix mapping

Normalised relationships between parent styles, colours, and sizes

Supported

FP Movement category

Dedicated extraction for activewear performance specifications

Supported

Size-level stock status

Accurate in-stock and low-stock indicators per specific variant

Supported

Fit & quality metrics

Extraction of structured review indices alongside raw text

Supported

High-res image extraction

Capture of all primary, gallery, and detail image URLs

Supported

Geolocation pricing

Targeted proxies to extract USD, GBP, or EUR pricing accurately

Supported

Change detection (diffs)

Hash-based diff: only emit records with changed inventory or price

Supported

User wishlists / FP Community

Gated personal data requiring authenticated user accounts

Partial

Checkout & shipping logic

Dynamic cart calculation and address-specific shipping rates

Partial

Infrastructure

Infrastructure powering the apparel pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

Scrapy · Playwright · Python 3.12 · Redis · PostgreSQL · Apache Airflow · AWS Lambda · S3 · CloudWatch · 2Captcha · CapSolver · Residential Proxies · Docker · Kubernetes · Grafana · Prometheus

Scrapy + Playwright Stack

Scrapy handles crawl orchestration, deduplication, and retry logic. Playwright handles JavaScript rendering, cookie sessions, and XHR interception. Combined via scrapy-playwright middleware.

Residential Proxy Infrastructure

We maintain pools of residential ISP proxies across US and EU regions. Rotation happens per-request with sticky sessions where required, preventing regional pricing anomalies.
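Per-request rotation with optional sticky sessions could look roughly like the sketch below. The class name, the round-robin policy, and the proxy URLs are hypothetical; a real pool would also need health checks and expiry of pinned sessions.

```python
from itertools import cycle


class ProxyRotator:
    """Round-robin proxies per request, pinning one when a session key is given."""

    def __init__(self, proxies):
        self._pool = cycle(proxies)
        self._sticky = {}

    def pick(self, session_key=None):
        if session_key is None:
            # Stateless request: take the next proxy in the rotation.
            return next(self._pool)
        if session_key not in self._sticky:
            # First request of a session: pin the next proxy to it.
            self._sticky[session_key] = next(self._pool)
        return self._sticky[session_key]


# Placeholder proxy addresses for illustration only.
rotator = ProxyRotator([
    "http://us-1.proxy.example:8000",
    "http://us-2.proxy.example:8000",
])
a = rotator.pick()
b = rotator.pick()
pinned_first = rotator.pick(session_key="cart-42")
pinned_again = rotator.pick(session_key="cart-42")
```

Pinning matters for multi-request flows such as a variant page followed by its stock calls, where switching IP mid-session can change the region and therefore the price shown.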

Cloud-Native Orchestration

Pipelines run on AWS Lambda (burst) and ECS (sustained). Airflow handles scheduling, dependency management, and SLA alerting. All state stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON

Newline-delimited or nested — schema versioned per run

CSV

Flat file with typed columns — Excel/Sheets compatible

Parquet

Columnar format for BigQuery, Snowflake, Athena

S3

Direct bucket delivery, compatible with any data lake

Webhook

HTTP POST per record for real-time downstream processing

BigQuery

Streamed directly into your dataset with schema auto-detect
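To make the newline-delimited JSON option concrete, here is a small sketch that writes one object per line and stamps each with a schema version. The `schema_version` field name and the version string are assumptions for the example, not the documented output format.

```python
import json


def to_ndjson(records, schema_version):
    """Serialise records as newline-delimited JSON, one object per line."""
    return "".join(
        json.dumps({"schema_version": schema_version, **r}, sort_keys=True) + "\n"
        for r in records
    )


payload = to_ndjson(
    [
        {"sku": "OB123456", "size": "M", "in_stock": True},
        {"sku": "OB123456", "size": "L", "in_stock": False},
    ],
    schema_version="1.0",  # hypothetical version label
)
print(payload, end="")
```

Because each line is a complete JSON object, a consumer can stream the file and parse it line by line instead of loading the whole document.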

// faq

Common questions.

About freepeople.com scraping, legality, and pipeline operations.

Ask us directly →

Is scraping Free People legal?

Scraping publicly available information from freepeople.com is generally permissible under applicable law. DataFlirt targets only public, non-authenticated product, pricing, and review data. We do not extract personal user data or circumvent authentication walls.

How do you handle size and colour variants?

We map the entire variant matrix. A single parent product (e.g., a dress) is expanded into individual records for every colour and size combination, capturing specific stock status and pricing for each.

Can you track pricing in different regions?

Yes. Free People uses geolocation to alter pricing. We configure the pipeline with specific regional residential proxies (e.g., US for USD, UK for GBP) to ensure you receive the correct localised data.

How frequently can you update inventory data?

For targeted SKU lists, we can run hourly pipelines to monitor fast-moving stock. Full catalogue refreshes are typically executed on a daily cadence.

Do you provide historical pricing data?

We begin tracking history from the moment your pipeline is commissioned. We maintain a time-series record of price changes and markdowns for every variant moving forward.

What is the minimum viable engagement?

Our smallest packages start at a defined category or SKU list (typically 5,000+ items) with weekly delivery. For full-site extraction or continuous monitoring, we price based on compute volume.

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two weeks. Whether you need a one-off catalogue dump or continuous inventory monitoring across all categories, we scope, build, and operate the pipeline.

Start a freepeople.com pipeline → View pricing

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
