SYSTEM all green source 1stdibs.com queue 12,845 pages p99 latency 218ms dataflirt.com · scraper/1stdibs-com
RUN · 34 active pipelines · 1stdibs.com live

1Stdibs data,
at warehouse scale.

We extract vintage furniture listings, art provenance, dealer profiles, and pricing signals from 1Stdibs. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Listings extracted
145K /day
Dealer profiles
12.4K /run
Image assets mapped
890K /24h
Active pipelines
34
Uptime
99.94%
Data Dictionary

Every field we extract from 1stdibs.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Product Listings objects from 1stdibs.com. All fields typed and schema-versioned.

item_idtitlecreatorperiodmaterialsdimensionsconditionpricecurrencycategorysub_categoryreference_numberlocationimage_urls
product_listings
● 200 OK
"item_id": "F-129485",
"title": "Mid-Century Modern Walnut Credenza",
"creator": "George Nelson",
"period": "1950s",
"price": 8500.0,
"currency": "USD",
"condition": "Excellent - Minor wear consistent with age and history"
# item_idtitlecreatorperiodmaterialsdimensions
1
2
3

Complete list of extractable fields for Dealer Intelligence objects from 1stdibs.com. All fields typed and schema-versioned.

dealer_iddealer_namestorefront_urllocation_citylocation_countryratingresponse_timemember_sinceactive_listings_countshipping_policiesbio
dealer_intelligence
● 200 OK
"dealer_id": "D-9382",
"dealer_name": "Galerie Moderne",
"location_city": "Paris",
"location_country": "France",
"member_since": "2014",
"active_listings_count": 142,
"response_time": "Within 24 hours"
# dealer_iddealer_namestorefront_urllocation_citylocation_countryrating
1
2
3

Complete list of extractable fields for Art & Provenance objects from 1stdibs.com. All fields typed and schema-versioned.

item_idtitleartistmediumsignaturecreation_yearprovenance_historyframe_dimensionsauthentication_statusexhibition_history
art_& provenance
● 200 OK
"artist": "Pablo Picasso",
"medium": "Lithograph on paper",
"signature": "Hand-signed lower right",
"creation_year": 1962,
"authentication_status": "Certificate of Authenticity included",
"frame_dimensions": "24 x 36 in"
# item_idtitleartistmediumsignaturecreation_year
1
2
3

Complete list of extractable fields for Pricing & Offers objects from 1stdibs.com. All fields typed and schema-versioned.

item_idlisted_pricecurrencyshipping_costships_frommake_offer_availablediscount_percentageprice_drop_historytax_informationprice_timestamp
pricing_& offers
● 200 OK
"item_id": "F-129485",
"listed_price": 8500.0,
"make_offer_available": true,
"ships_from": "New York, NY",
"shipping_cost": 450.0,
"price_timestamp": "2023-10-24T08:12:00Z",
"currency": "USD"
# item_idlisted_pricecurrencyshipping_costships_frommake_offer_available
1
2
3

Complete list of extractable fields for Search & Taxonomy objects from 1stdibs.com. All fields typed and schema-versioned.

keywordcategory_pathstyle_filterperiod_filterpositionitem_idtitlepricedealer_namescraped_at
search_& taxonomy
● 200 OK
"keyword": "mid century credenza",
"category_path": "Furniture > Storage > Credenzas",
"style_filter": "Mid-Century Modern",
"position": 3,
"item_id": "F-129485",
"scraped_at": "2023-10-24T08:14:22Z"
# keywordcategory_pathstyle_filterperiod_filterpositionitem_id
1
2
3

Capabilities

Everything you need from 1Stdibs

Our 1Stdibs scraper handles every layer of the platform: dealer storefronts, vintage inventory, provenance metadata, and pricing signals. We bypass strict perimeter defences to deliver structured JSON.

Full Listing Extraction

Title, creator, dimensions, materials, period, and condition notes scraped across all furniture and art categories.

Dealer Intelligence

Storefront URLs, active inventory size, geographic location, and response times for every seller on the platform.

Provenance & Authentication

Extract signature details, certificates of authenticity, and historical provenance text for fine art listings.

High-Res Asset Mapping

Parse Next.js state data to extract uncompressed, high-resolution source URLs for imagery.

Pricing & Offer Signals

Capture listed price, currency, shipping estimates, and the availability of Make an Offer functionality.

Category Taxonomy

Deep scrape of styles, periods, and makers to map the entire 1Stdibs classification system.

Shipping & Location Data

Origin cities, countries, and shipping cost estimates normalised across global dealer locations.

Multi-Region Support

Handle geo-located pricing and availability based on specific target markets and currencies.

Scheduled & Streaming Modes

Run one-off bulk exports or configure continuous pipelines at daily cadences with change-detection diffing.

Anti-Bot Circumvention

Handle Datadome and PerimeterX blocks using residential proxies and TLS fingerprinting.

// engagement pipeline

From search query to warehouse record

Brief in. Clean data out.

Define Scope
d 0

Provide dealer URLs, category paths, creator names, or search keywords. We design the extraction schema together.

Pipeline Build
d 2–4

We configure Scrapy crawlers, proxy rotation, session management, and CAPTCHA handling for 1stdibs.com.

Validation & QA
d 4–6

Schema validation, null-rate checks, and sample extraction before full launch.

Delivery
ongoing

JSON, CSV, or Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our 1Stdibs pipeline handles the hard parts

Luxury marketplaces invest heavily in scraping detection. Here is how we stay resilient and why teams choose managed infrastructure over DIY.

pipeline-monitor · 1stdibs.com · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Anti-bot layer
Residential proxy rotation and fingerprint spoofing

1Stdibs uses strict Web Application Firewalls. Our crawlers use residential ISP proxies with realistic browser fingerprints, randomised request timing, and full cookie session management trained on real user behaviour patterns.

JavaScript rendering
Full Playwright execution for SPA content

The platform relies heavily on React and Next.js hydration. We run full Playwright browser sessions with JavaScript execution and lazy-load triggering to capture data that headless HTTP clients miss entirely.

Schema stability
Resilient selectors with fallback chains

DOM structures change frequently. Our selector strategy uses multiple fallback chains per field, including JSON state extraction, so a layout change does not break your data pipeline overnight.

High-res extraction
Parsing application state for raw assets

Instead of scraping compressed viewport images, we parse the underlying JSON application state to extract the original, uncompressed image URLs for art and furniture listings.

Change detection
Only re-scrape what has changed

For large dealer catalogues, we maintain a hash index of last-seen values per field. Subsequent runs only push diffs, reducing compute cost and downstream processing load.

Applications

Who uses 1Stdibs data and how

Teams across industries use 1stdibs.com data to build competitive products and smarter operations.

01
Market Pricing Analysis

Appraisers and auction houses track vintage market values and pricing floors for specific creators and periods.

02
Competitor Intelligence

High-end dealers monitor rival inventory, pricing strategies, and active listing counts.

03
Interior Design Procurement

Design firms aggregate specific styles, materials, and periods for large commercial or residential projects.

04
AI Training Data

Machine learning teams train visual recognition models on verified period furniture and fine art imagery.

05
Trend Forecasting

Analysts identify rising demand and inventory scarcity for specific designers or mid-century eras.

06
Investment Due Diligence

Funds and analysts track liquidity and price stability for blue-chip fine art and collectible design.

Why DataFlirt

"1Stdibs holds the definitive digital record of vintage furniture and fine art pricing, but querying market value requires bypassing strict perimeter defences."

Most teams underestimate the investment required: reliable 1Stdibs scraping requires residential proxies, full JavaScript rendering for Next.js hydration, CAPTCHA handling, and daily selector maintenance. DataFlirt absorbs that complexity so your engineers can focus on the analysis, not the infrastructure.

Technical Spec

1Stdibs scraper technical capabilities

Everything supported by our 1stdibs.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

JavaScript rendering
Full Playwright sessions required for dynamic content and Next.js hydration
Supported
CAPTCHA bypass
Automated solver integration for WAF challenges
Supported
Residential proxy rotation
ISP-grade residential IPs rotated per request to avoid IP bans
Supported
High-res image URL extraction
Direct extraction of uncompressed asset URLs from application state
Supported
Dealer inventory mapping
Pagination across all active listings for targeted dealer profiles
Supported
Change detection (diffs)
Hash-based diff to only emit records with changed fields since last run
Supported
Webhook delivery
HTTP POST per record or batch
Supported
Accepted offer prices
Final negotiated sale prices are private and not exposed to the public DOM
Partial
Buyer message history
Direct messages with dealers require authenticated buyer credentials
Partial
Infrastructure

Infrastructure powering the 1Stdibs pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Scrapy + Playwright Stack

Scrapy handles crawl orchestration and retry logic. Playwright handles JavaScript rendering and interaction flows. Combined via scrapy-playwright middleware.

Residential Proxy Infrastructure

We maintain pools of residential ISP proxies. Rotation happens per-request with sticky sessions where required. IP score monitoring prevents blacklisted pool contamination.

Cloud-Native Orchestration

Pipelines run on AWS Lambda and ECS. Airflow handles scheduling, dependency management, and SLA alerting. All state stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested array formatting
CSV
Flat file with typed columns
XLS
Excel compatible format for analyst teams
Parquet
Columnar format for BigQuery, Snowflake, Athena
AWS S3
Direct bucket delivery compatible with any data lake
Webhook
HTTP POST per record for real-time downstream processing
API
REST endpoints for on-demand record retrieval
BigQuery
Streamed directly into your dataset with schema auto-detect
PostgreSQL
Upsert into your existing schema with conflict resolution
Snowflake
Stage and COPY INTO workflow
S3
Direct bucket delivery — compatible with any data lake
// faq

Common questions.

About 1stdibs.com scraping, legality, and pipeline operations.

Ask us directly →
Is scraping 1Stdibs legal?

Scraping publicly available information from 1Stdibs is generally permissible under applicable law, reinforced by the hiQ v. LinkedIn ruling. DataFlirt targets only public, non-authenticated listing, dealer, and pricing data. We do not extract personal buyer data or circumvent authentication walls.

How do you handle 1Stdibs anti-bot systems?

We use residential ISP proxies, full Playwright browser sessions with realistic TLS fingerprints, and request timing modelled on human behaviour. We monitor for WAF blocks in real time and trigger pool rotation automatically.

Can you extract high-resolution image URLs?

Yes. We bypass the compressed viewport images loaded in the browser and extract the uncompressed, high-resolution asset URLs directly from the application state data.

Do you track Make an Offer availability?

Yes. We capture whether a listing accepts offers alongside the stated list price, providing deeper signals on pricing flexibility.

How fresh is the data?

Full catalogue refreshes at daily cadence complete within a 6 to 12 hour window depending on category size. Targeted dealer inventory can be scraped at hourly intervals.

Can I scrape specific creators or periods?

Yes. Pipelines can be scoped to specific designers, art movements, material types, or geographic dealer locations based on your requirements.

What is the minimum viable engagement?

Our smallest packages start at a defined category or dealer list with weekly delivery. For full platform extraction, we price based on compute volume and delivery frequency.

$ dataflirt scope --new-project --source=1stdibs.com ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off catalogue dump or a continuous price-monitoring feed across dealer inventories, we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →