We extract product catalogues, FP Movement inventory, size-level stock availability, pricing, and customer reviews from Free People. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.
Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.
Complete list of extractable fields for Product Catalogue objects from freepeople.com. All fields typed and schema-versioned.
"product_id": "82349102", "sku": "OB123456", "title": "Oasis Midi Dress", "brand": "Free People", "category": "Dresses", "fabric_composition": "100% Cotton", "style_number": "F22-DR-091", "available_colours": "['Black', 'Ivory', 'Terracotta']"
| # | product_id | sku | title | brand | category | sub_category |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Inventory & Pricing objects from freepeople.com. All fields typed and schema-versioned.
"sku": "OB123456", "variant_id": "V987654", "colour": "Terracotta", "size": "M", "full_price": 128.0, "sale_price": 89.95, "currency": "USD", "in_stock": true, "low_stock_warning": true
| # | sku | variant_id | colour | size | full_price | sale_price |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Reviews & Fit Data objects from freepeople.com. All fields typed and schema-versioned.
"review_id": "REV-992831", "sku": "OB123456", "rating": 4.5, "fit_index": "True to Size", "quality_rating": 5.0, "verified_buyer": true, "helpful_votes": 12, "size_purchased": "M"
| # | review_id | sku | rating | review_title | review_body | fit_index |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Our Free People scraper handles the entire URBN platform architecture: complex variant matrices, dynamic stock hydration, high-res media extraction, and the review corpus — with JavaScript rendering and anti-bot circumvention built in.
Title, description, fabric composition, care instructions, and style numbers — scraped at the base product level.
Isolate and extract specific activewear lines, including performance specifications and activity tags.
Map parent products to every possible colour and size combination, maintaining relational integrity.
Monitor stock status and low-stock warnings for specific size and colour variants.
Capture full retail price, current sale price, and promotional discounts — timestamped per crawl.
Extract star ratings, detailed text reviews, and specific indices for fit (runs small/large) and quality.
Extract primary images, full gallery arrays, and video asset URLs for visual analysis or AI training.
Brief in. Clean data out.
Provide category URLs, specific SKUs, or search terms. We design the extraction schema together.
We configure Scrapy / Playwright crawlers, proxy rotation, session management, and CAPTCHA handling for freepeople.com.
Schema validation, null-rate checks, price-outlier detection, and sample variants before full launch.
JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.
Modern apparel sites rely on heavy front-end frameworks and aggressive anti-bot systems. Here is how we maintain stable extraction.
Free People loads variant pricing and stock data via background API calls after the initial page load. We intercept these XHR requests and execute full Playwright sessions to ensure accurate, size-level inventory capture.
URBN brands utilise sophisticated bot mitigation. Our crawlers route traffic through US-based residential ISP proxies with realistic TLS and browser fingerprints, preventing IP bans and CAPTCHA walls.
Apparel data is rarely flat. We map the complex relationships between parent styles, colourways, and sizes into a normalised relational schema, ensuring you can query stock by specific SKUs.
Free People alters currency and pricing based on the visitor's IP address. We enforce strict regional proxy targeting (e.g., US, UK) to ensure the pricing data matches your target market.
Monitoring stock levels across thousands of SKUs generates massive data volume. We maintain a hash index of previous states and only emit records when price or inventory status changes.
Fashion retailers track Free People's pricing tiers and promotional cadence to adjust their own markdown strategies.
Merchandisers analyse category depth, colour prevalence, and fabric choices to inform upcoming seasonal collections.
Analysts monitor size-level stock depletion rates to reverse-engineer sales velocity and optimal markdown timing.
Computer vision teams ingest high-res gallery images alongside detailed fabric descriptions to train apparel recognition models.
Researchers extract material compositions to track the adoption of organic cotton and sustainable fibres in mainstream bohemian fashion.
Product teams mine customer reviews for fit indices (runs small/large) and quality complaints to improve their own manufacturing tolerances.
"Free People's catalogue is a goldmine for bohemian fashion trends and fabric compositions — but extracting accurate size-level stock requires bypassing aggressive bot mitigation."
Apparel scraping is notoriously difficult due to complex variant matrices and dynamic inventory hydration. DataFlirt handles the JavaScript execution, residential proxy rotation, and schema normalisation required to turn freepeople.com into a clean, queryable database — so your analysts can focus on markdown strategies, not DOM parsing.
Everything supported by our freepeople.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.
Open-source tooling on proven cloud infra — no vendor lock-in, full observability.
Scrapy handles crawl orchestration, deduplication, and retry logic. Playwright handles JavaScript rendering, cookie sessions, and XHR interception. Combined via scrapy-playwright middleware.
We maintain pools of residential ISP proxies across US and EU regions. Rotation happens per-request with sticky sessions where required, preventing regional pricing anomalies.
Pipelines run on AWS Lambda (burst) and ECS (sustained). Airflow handles scheduling, dependency management, and SLA alerting. All state stored in managed Postgres.
Data delivered to where your team already works — no new tooling required.
About freepeople.com scraping, legality, and pipeline operations.
Ask us directly →Scraping publicly available information from freepeople.com is generally permissible under applicable law. DataFlirt targets only public, non-authenticated product, pricing, and review data. We do not extract personal user data or circumvent authentication walls.
We map the entire variant matrix. A single parent product (e.g., a dress) is expanded into individual records for every colour and size combination, capturing specific stock status and pricing for each.
Yes. Free People uses geolocation to alter pricing. We configure the pipeline with specific regional residential proxies (e.g., US for USD, UK for GBP) to ensure you receive the correct localised data.
For targeted SKU lists, we can run hourly pipelines to monitor fast-moving stock. Full catalogue refreshes are typically executed on a daily cadence.
We begin tracking history from the moment your pipeline is commissioned. We maintain a time-series record of price changes and markdowns for every variant moving forward.
Our smallest packages start at a defined category or SKU list (typically 5,000+ items) with weekly delivery. For full-site extraction or continuous monitoring, we price based on compute volume.
20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off catalogue dump or continuous inventory monitoring across all categories — we scope, build, and operate the pipeline. Tell us what you need.