SYSTEM all green source potterybarn.com queue 12,419 pages p99 latency 184ms dataflirt.com · scraper/potterybarn-com
RUN · 42 active pipelines · potterybarn.com live

Pottery Barn data,
at warehouse scale.

We extract product catalogues, fabric variations, pricing tiers, room scenes, and stock availability from Pottery Barn. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Products extracted
142K /day
Fabric variations
1.2M /run
Price updates
345K /24h
Active pipelines
42
Uptime
99.98%
Data Dictionary

Every field we extract from potterybarn.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Product Listings objects from potterybarn.com. All fields typed and schema-versioned.

skutitlecollectioncategorybase_pricecurrencydimensionsweightmaterialscare_instructionsoverview_textmain_image_url
product_listings
● 200 OK
"sku": "7483921",
"title": "Carmel Square Arm Upholstered Sofa",
"collection": "Carmel",
"category": "Furniture > Sofas",
"base_price": 1499.0,
"currency": "USD",
"dimensions": "84" w x 40" d x 34" h",
"weight": "145 lbs"
# skutitlecollectioncategorybase_pricecurrency
1
2
3

Complete list of extractable fields for Fabric & Finishes objects from potterybarn.com. All fields typed and schema-versioned.

skuparent_skufabric_gradecolourmaterialbase_pricesurchargefinal_pricelead_time_weeksswatch_image_url
fabric_& finishes
● 200 OK
"sku": "7483921-A",
"parent_sku": "7483921",
"fabric_grade": "Grade C",
"colour": "Performance Everydaylinen, Oatmeal",
"material": "Linen Blend",
"final_price": 1799.0,
"lead_time_weeks": 8
# skuparent_skufabric_gradecolourmaterialbase_price
1
2
3

Complete list of extractable fields for Room Scenes objects from potterybarn.com. All fields typed and schema-versioned.

scene_idtitledesignerstylecategorymain_image_urltagged_skustotal_pricedescriptionscraped_at
room_scenes
● 200 OK
"scene_id": "RS-9482",
"title": "Modern Coastal Living Room",
"designer": "In-house",
"style": "Coastal",
"category": "Living Room",
"tagged_skus": "['7483921', '8392011', '2938471']",
"total_price": 4250.0
# scene_idtitledesignerstylecategorymain_image_url
1
2
3

Complete list of extractable fields for Stock & Shipping objects from potterybarn.com. All fields typed and schema-versioned.

skuzip_codein_stockstock_statusdelivery_estimate_mindelivery_estimate_maxwhite_glove_eligibleshipping_surchargereturnablescraped_at
stock_& shipping
● 200 OK
"sku": "7483921",
"zip_code": "90210",
"in_stock": true,
"stock_status": "Made to Order",
"delivery_estimate_min": "2026-07-10",
"white_glove_eligible": true,
"returnable": false
# skuzip_codein_stockstock_statusdelivery_estimate_mindelivery_estimate_max
1
2
3

Complete list of extractable fields for Category Taxonomies objects from potterybarn.com. All fields typed and schema-versioned.

category_idnamebreadcrumbparent_categorylevelproduct_counturldescriptionscraped_at
category_taxonomies
● 200 OK
"category_id": "cat-sofas-sectionals",
"name": "Sofas & Sectionals",
"breadcrumb": "Furniture > Living Room Furniture > Sofas & Sectionals",
"parent_category": "Living Room Furniture",
"level": 3,
"product_count": 412,
"url": "/shop/furniture/living-room-furniture/sofas-sectionals/"
# category_idnamebreadcrumbparent_categorylevelproduct_count
1
2
3

Capabilities

Everything you need from Pottery Barn — nothing you do not

Our Pottery Barn scraper handles every layer of the platform: product catalogues, dynamic fabric pricing, room scene deconstruction, and localised inventory — with JavaScript rendering, session management, and anti-bot circumvention built in.

Full Product Catalogues

Title, description, dimensions, materials, and care instructions extracted for every piece of furniture and decor.

Complex Variation Mapping

Extract pricing across hundreds of fabric grades, leathers, and wood finishes per base product.

Room Scene Deconstruction

Parse Shop the Room pages to extract individual SKUs and styling context from lifestyle imagery.

Dynamic Pricing Capture

Track base prices, promotional discounts, and material-specific surcharges accurately.

Stock & Lead Time Polling

Monitor availability and estimated delivery windows across regional zip codes.

High-Resolution Imagery

Extract main product images, swatch textures, and lifestyle photography URLs in maximum resolution.

B2B Trade Pricing

Capture designer trade program discounts and bulk order pricing tiers where publicly available.

Category & Taxonomy Scraping

Map the full site hierarchy from main departments down to specific decor sub-categories.

Scheduled + Streaming Modes

Run one-off bulk exports or configure continuous pipelines at hourly, daily, or real-time cadences.

// engagement pipeline

From SKU list to warehouse record

Brief in. Clean data out.

Define Scope
d 0

Provide category URLs, product lines, or search terms. We design the extraction schema together.

Pipeline Build
d 2–4

We configure Scrapy / Playwright crawlers, proxy rotation, session management, and CAPTCHA handling for potterybarn.com.

Validation & QA
d 4–6

Schema validation, null-rate checks, price-outlier detection, and sample variations before full launch.

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our Pottery Barn pipeline handles the hard parts

Furniture retail sites rely on heavy front-end state for product configuration. Here is how we extract accurate data without breaking the pipeline.

pipeline-monitor · potterybarn.com · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Complex state machines
Full Playwright execution for fabric configuration

Pottery Barn uses heavy React state for fabric and finish configuration. We execute full Playwright sessions to trigger state changes and capture dynamic pricing per variation.

Anti-bot layer
Residential proxy rotation + fingerprint spoofing

Retailers deploy aggressive anti-bot middleware. We use residential ISP proxies with realistic browser fingerprints and full cookie session management to maintain access.

Schema stability
Resilient selectors with fallback chains

DOM structures change during seasonal promotions. We use multiple fallback chains per field, including JSON-LD structured data and internal API interception.

Change detection
Only re-scrape what has changed

For large furniture catalogues, we maintain a hash index of last-seen values per field. Subsequent runs only push diffs, reducing compute cost and downstream load.

Monitoring & alerting
24/7 pipeline health with anomaly detection

Every run emits structured logs to our observability stack. We alert on null-rate spikes, price outliers, and coverage drops before you notice.

Applications

Who uses Pottery Barn data — and how

Teams across industries use potterybarn.com data to build competitive products and smarter operations.

01
Competitor Price Monitoring

Home goods retailers track Pottery Barn pricing, fabric surcharges, and promotional calendars to adjust their own merchandising.

02
Trend & Assortment Analysis

Merchandisers analyse category depth, material trends, and colour palettes across seasonal collections.

03
Interior Design Aggregation

Design platforms ingest product specifications, dimensions, and 3D models for virtual staging applications.

04
Supply Chain Intelligence

Logistics teams monitor lead times and backorder status across furniture categories to gauge macroeconomic supply chain health.

05
AI Training Data

Computer vision teams use high-resolution room scenes and tagged SKUs to train spatial recognition and style matching models.

06
MAP & Brand Monitoring

Vendors and textile suppliers audit product descriptions to ensure accurate material representation and trademark compliance.

Why DataFlirt

"Pottery Barn holds the blueprint for premium home furnishings, but extracting fabric-level pricing requires navigating thousands of dynamic state changes per product."

Most teams underestimate the investment required: reliable Pottery Barn scraping requires residential proxies, full JavaScript rendering for fabric configuration, daily selector maintenance, and anomaly monitoring. DataFlirt absorbs that complexity so your engineers can focus on the analysis, not the infrastructure.

Technical Spec

Pottery Barn scraper — technical capabilities

Everything supported by our potterybarn.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

JavaScript rendering
Full Playwright sessions — required for fabric configuration and dynamic pricing
Supported
CAPTCHA bypass
Automated 2Captcha + CapSolver integration
Supported
Residential proxy rotation
ISP-grade residential IPs from US pools
Supported
Fabric & Finish mapping
Parent to child SKU relationships with all option combinations
Supported
ZIP-code localised stock
Delivery estimates based on specific regional zip codes
Supported
Room scene SKU extraction
Parsing Shop the Room tagged products
Supported
High-res image extraction
Capturing uncompressed image URLs and swatches
Supported
Change detection (diffs)
Hash-based diff: only emit records with changed fields since last run
Supported
User account order history
Requires authenticated session credentials
Partial
Key Rewards program point balances
Gated behind individual user authentication
Partial
Infrastructure

Infrastructure powering the Pottery Barn pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Scrapy + Playwright Stack

Scrapy handles crawl orchestration. Playwright handles JavaScript rendering, cookie sessions, and interaction flows for complex fabric configuration.

Residential Proxy Infrastructure

We maintain pools of residential ISP proxies across US regions. Rotation happens per-request with sticky sessions to maintain cart and zip-code state.

Cloud-Native Orchestration

Pipelines run on AWS Lambda and ECS. Airflow handles scheduling, dependency management, and SLA alerting. All state stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested
CSV
Flat file with typed columns
XLS
Excel compatible format for business teams
Parquet
Columnar format for BigQuery, Snowflake
AWS S3
Direct bucket delivery
Webhook
HTTP POST per record
API
REST endpoint for on-demand querying
PostgreSQL
Upsert into your existing schema
S3
Direct bucket delivery — compatible with any data lake
// faq

Common questions.

About potterybarn.com scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Pottery Barn legal?

Scraping publicly available information is generally permissible under applicable law. DataFlirt targets only public, non-authenticated product, pricing, and inventory data. We do not extract personal data or circumvent authentication walls.

How do you handle fabric and finish variations?

We use Playwright to interact with the configuration UI, triggering state changes to capture price and SKU updates for every fabric grade and wood finish.

Can you extract localised delivery times?

Yes. We can inject target ZIP codes into the session to extract regional stock availability and estimated white-glove delivery dates.

Do you extract Shop the Room data?

Yes, we parse the interactive room scenes to extract all tagged SKUs, mapping lifestyle imagery to individual product records.

How fresh is the data?

Full catalogue refreshes typically run weekly or daily. Targeted price monitoring on specific SKUs can run at hourly cadences.

What is the minimum viable engagement?

Our smallest packages start at a defined category scope with weekly delivery. We price based on volume and delivery frequency.

Can I request a sample dataset before committing?

Absolutely. We provide a sample run of up to 500 SKUs as part of the pre-engagement scoping process so you can validate schema fit.

$ dataflirt scope --new-project --source=potterybarn.com ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off product catalogue dump or a continuous price-monitoring feed — we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →