We extract product listings, pricing signals, flash deal windows, Shopee Mall shop intelligence, reviews, sold counts, and live commerce data from Shopee. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.
Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.
Complete list of extractable fields for Product Listings objects from shopee.com. All fields typed and schema-versioned.
"item_id": "24718291930", "title": "SOMETHINC Niacinamide + Moisture Barrier Serum 20ml", "brand": "SOMETHINC", "price": 89000, "currency": "IDR", "discount_pct": 30, "shopee_mall_badge": true, "sold_count": 82400, "rating": 4.9, "rating_count": 14820, "in_stock": true
| # | item_id | shop_id | title | brand | category | sub_category |
|---|---|---|---|---|---|---|
Complete list of extractable fields for Pricing & Flash Deals objects from shopee.com. All fields typed and schema-versioned.
"item_id": "24718291930", "price": 89000, "price_before_discount": 127000, "discount_pct": 30, "flash_sale_price": 71200, "flash_sale_end": "2026-05-12T22:00:00+07:00", "flash_sale_stock": 200, "flash_sale_sold": 183, "shopee_coins_cashback": 89, "price_timestamp": "2026-05-12T14:00:00Z"
| # | item_id | price | price_before_discount | discount_pct | flash_sale_price | flash_sale_start |
|---|---|---|---|---|---|---|
Complete list of extractable fields for Reviews & Ratings objects from shopee.com. All fields typed and schema-versioned.
"review_id": "spe_rv_77391040", "item_id": "24718291930", "star_rating": 5, "comment": "Pemakaian 2 minggu sudah terlihat hasilnya, kulit lebih cerah!", "sku_reviewed": "20ml", "helpful_votes": 221, "shop_reply": true, "review_date": "2026-04-28"
| # | review_id | item_id | shop_id | reviewer_name | reviewer_level | star_rating |
|---|---|---|---|---|---|---|
Complete list of extractable fields for Shop Intelligence objects from shopee.com. All fields typed and schema-versioned.
"shop_id": "somethinc-id-official", "shop_name": "SOMETHINC Official Store", "shopee_mall_official": true, "shop_rating": 4.94, "response_rate": 99, "ship_on_time_rate": 98, "follower_count": 1840000, "active_listings_count": 287
| # | shop_id | shop_name | shop_url | shopee_mall_official | preferred_seller | shop_rating |
|---|---|---|---|---|---|---|
Complete list of extractable fields for Search & Rankings objects from shopee.com. All fields typed and schema-versioned.
"keyword": "niacinamide serum", "country": "ID", "position": 1, "item_id": "24718291930", "sponsored": false, "shopee_mall_badge": true, "flash_sale_badge": true, "scraped_at": "2026-05-12T14:00:18Z"
| # | keyword | country | position | item_id | title | shop_id |
|---|---|---|---|---|---|---|
Shopee operates across six Southeast Asian markets plus Taiwan, with flash deals, Coins cashback, live commerce, and Shopee Mall creating pricing complexity that raw scrapes miss. Our pipeline handles all of it, from flash sale stock depth to sold count velocity.
Title, brand, category, images, SKU variants, sold count, liked count, and every metadata field Shopee surfaces — at item level with full variation mapping across all markets.
Capture flash sale price, countdown window, total stock, remaining stock, and units sold during flash period — updated at elevated cadence during active deal windows.
Shopee Coins cashback percentage, voucher codes, bundle deal pricing, and free-shipping eligibility — the promotion stack that makes Shopee's effective price materially different from the listed price.
Shopee displays cumulative sold counts prominently. We capture sold_count and liked_count per item — making Shopee one of the few platforms where demand velocity is a directly observable metric.
Shop rating, response rate, on-time shipping rate, follower count, Shopee Mall official status, Preferred Seller badge, and Shopee Choice flag — per shop across all markets.
Full review text, star ratings, SKU reviewed, helpful votes, shop replies, and review images and video thumbnails — paginated across all review pages in local language.
shopee.co.id, shopee.co.th, shopee.com.my, shopee.com.ph, shopee.sg, shopee.vn, and shopee.tw — all from a unified pipeline with market-level tagging and currency normalisation.
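Market-level tagging can be sketched as a lookup from storefront domain to country and currency code. This is a minimal illustration, not our production code: the function name `tag_market` and the output field names are assumptions, and currency conversion is left out because it depends on the exchange-rate source you choose.

```python
from typing import Optional
from urllib.parse import urlparse

# Storefront domain -> (ISO 3166 country code, ISO 4217 currency code).
MARKETS = {
    "shopee.co.id": ("ID", "IDR"),
    "shopee.co.th": ("TH", "THB"),
    "shopee.com.my": ("MY", "MYR"),
    "shopee.com.ph": ("PH", "PHP"),
    "shopee.sg": ("SG", "SGD"),
    "shopee.vn": ("VN", "VND"),
    "shopee.tw": ("TW", "TWD"),
}


def tag_market(url: str) -> Optional[dict]:
    """Return market tags for a storefront URL, or None for unknown hosts.

    Hypothetical helper: the name and the returned keys are illustrative.
    """
    host = (urlparse(url).hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    market = MARKETS.get(host)
    if market is None:
        return None
    country, currency = market
    return {"domain": host, "country": country, "currency": currency}
```

Attaching these tags to every record at extraction time keeps downstream joins and currency handling independent of the URL format.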
Capture live stream view counts, product links featured in streams, and Shopee Video engagement metrics — emerging demand signals for social commerce intelligence.
Run daily pipelines or elevate cadence during 9.9, 10.10, 11.11, 12.12, and mid-month Shopee campaigns — with pre/during/post snapshots for price and rank analysis.
Brief in. Clean data out.
Provide item ID lists, category URLs, keyword sets, shop IDs, or country-market filters. We design the extraction schema together — including which markets, price fields, and deal-tracking cadences you need.
We configure Scrapy / Playwright crawlers with per-country residential proxies, sold count capture, flash deal monitoring logic, and Coins cashback calculation for Shopee.
Schema validation, sold count cross-verification, price-outlier checks, flash deal logic testing, and sample reviews before full launch.
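Two of the checks above, schema validation and price-outlier detection, can be sketched as follows. The field subset comes from the sample product record; the function names, the median baseline, and the 50% tolerance are assumptions chosen for illustration.

```python
from statistics import median

# Assumed subset of the product listing schema, based on the sample record.
EXPECTED_TYPES = {
    "item_id": str,
    "title": str,
    "price": int,
    "currency": str,
    "sold_count": int,
    "rating": (int, float),
    "in_stock": bool,
}


def schema_errors(record: dict) -> list[str]:
    """List the fields that are missing, null, or of an unexpected type."""
    errors = []
    for field, expected in EXPECTED_TYPES.items():
        value = record.get(field)
        if value is None:
            errors.append(f"{field}: missing or null")
        # bool is a subclass of int, so reject it explicitly for numeric fields.
        elif (isinstance(value, bool) and expected is not bool) or not isinstance(value, expected):
            errors.append(f"{field}: unexpected type {type(value).__name__}")
    return errors


def is_price_outlier(price: int, recent_prices: list[int], tolerance: float = 0.5) -> bool:
    """Flag a price that deviates from the median of recent prices by more than `tolerance`."""
    if not recent_prices:
        return False
    baseline = median(recent_prices)
    return baseline > 0 and abs(price - baseline) / baseline > tolerance
```

A record that fails either check would typically be held back for review rather than dropped silently, so that genuine price cuts during campaigns are not mistaken for extraction errors.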
JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.
Shopee's seven-market footprint, API-driven architecture, and flash-deal complexity require a pipeline built specifically for it — not adapted from a single-market B2C scraper.
Shopee's web frontend communicates with an internal JSON API that returns product, pricing, and review data in structured form. Our Playwright sessions intercept these API calls — giving us reliable, schema-stable data that doesn't break when Shopee updates its frontend CSS or React component tree.
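The intercept idea can be sketched with Playwright's `page.on("response", ...)` event. This is a simplified illustration under stated assumptions: the `/api/` URL marker is a placeholder rather than a documented Shopee path, and the helper names are hypothetical. Playwright is imported lazily so the filtering logic can be exercised without a browser.

```python
from typing import Any, Callable

# Assumption: placeholder marker, not a documented Shopee endpoint path.
API_MARKER = "/api/"


def is_api_json(url: str, content_type: str) -> bool:
    """True when a response looks like a JSON payload from the internal API."""
    return API_MARKER in url and "application/json" in content_type.lower()


def make_collector(sink: list[dict[str, Any]]) -> Callable[[Any], None]:
    """Build a handler suitable for Playwright's page.on("response", handler)."""
    def on_response(response: Any) -> None:
        content_type = response.headers.get("content-type", "")
        if not is_api_json(response.url, content_type):
            return
        try:
            sink.append({"url": response.url, "body": response.json()})
        except Exception:
            # Body unavailable or not valid JSON: skip it rather than fail the crawl.
            pass
    return on_response


def capture(url: str) -> list[dict[str, Any]]:
    """Load a page and collect matching JSON responses (requires playwright)."""
    from playwright.sync_api import sync_playwright  # lazy import

    captured: list[dict[str, Any]] = []
    with sync_playwright() as p:
        browser = p.chromium.launch()
        page = browser.new_page()
        page.on("response", make_collector(captured))
        page.goto(url, wait_until="networkidle")
        browser.close()
    return captured
```

Because the handler reads structured JSON rather than rendered markup, changes to CSS classes or component layout do not affect the parsing step.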
Flash deals on Shopee move fast. We capture flash_sale_stock (total allocated) and flash_sale_sold (units sold during the deal) at each crawl — giving you sell-through rate over time, not just a snapshot of the final cleared state.
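Sell-through rate follows directly from the two captured fields: units sold divided by units allocated. A minimal sketch, assuming one snapshot per crawl; the `FlashSnapshot` class and function names are illustrative, and timestamps are assumed to share one ISO 8601 format and offset so that they sort correctly as strings.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class FlashSnapshot:
    captured_at: str        # ISO 8601 timestamp of the crawl
    flash_sale_stock: int   # total units allocated to the deal
    flash_sale_sold: int    # units sold so far during the deal


def sell_through(snapshot: FlashSnapshot) -> Optional[float]:
    """Fraction of allocated stock sold, or None when no stock is allocated."""
    if snapshot.flash_sale_stock <= 0:
        return None
    return snapshot.flash_sale_sold / snapshot.flash_sale_stock


def sell_through_series(snapshots: list[FlashSnapshot]) -> list[tuple[str, Optional[float]]]:
    """Sell-through rate per crawl, ordered by capture time."""
    ordered = sorted(snapshots, key=lambda s: s.captured_at)
    return [(s.captured_at, sell_through(s)) for s in ordered]
```

For the sample record above, with `flash_sale_stock` of 200 and `flash_sale_sold` of 183, the rate is 0.915, or 91.5% of the allocation.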
Shopee's anti-bot detection is country-specific. We maintain dedicated residential ISP proxy pools for ID, TH, MY, PH, SG, VN, and TW — routing each request through a country-appropriate proxy with matching browser locale and timezone settings.
Shopee surfaces cumulative sold counts publicly on product pages. We capture sold_count per item per run — and because we run on a consistent schedule, the delta between runs gives you a sold velocity time-series: a demand signal unavailable on most other platforms.
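The delta calculation can be sketched as below. The function name and the units-per-hour output are assumptions; a falling sold count is reported as `None` rather than a negative velocity, since a cumulative counter should not decrease. Timestamps are expected with an explicit offset such as `+00:00`, because `datetime.fromisoformat` only accepts a trailing `Z` from Python 3.11 onward.

```python
from datetime import datetime
from typing import Optional


def sold_velocity(
    observations: list[tuple[str, int]],
) -> list[tuple[str, Optional[float]]]:
    """Units sold per hour between consecutive runs.

    `observations` holds (ISO 8601 timestamp, cumulative sold_count) pairs
    in any order. Each result is keyed by the later timestamp of its interval.
    """
    ordered = sorted((datetime.fromisoformat(ts), count) for ts, count in observations)
    result: list[tuple[str, Optional[float]]] = []
    for (t0, n0), (t1, n1) in zip(ordered, ordered[1:]):
        hours = (t1 - t0).total_seconds() / 3600
        delta = n1 - n0
        if hours <= 0 or delta < 0:
            # Duplicate timestamp or a regressing counter: flag, do not guess.
            result.append((t1.isoformat(), None))
        else:
            result.append((t1.isoformat(), delta / hours))
    return result
```

Normalising by elapsed time keeps the series comparable when cadence changes, for example when runs move from daily to hourly during a campaign.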
Every run emits structured logs to our observability stack. We alert on sold count regressions, flash deal logic failures, null-rate spikes, and schema drift — and respond before you notice.
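A null-rate spike check of the kind described here can be sketched as a comparison between the current run and a baseline. The function names and the 5-percentage-point threshold are assumptions for illustration, not our actual alerting rules.

```python
def null_rates(records: list[dict], fields: list[str]) -> dict[str, float]:
    """Share of records in which each field is missing or null."""
    total = len(records)
    if total == 0:
        return {field: 0.0 for field in fields}
    return {
        field: sum(1 for record in records if record.get(field) is None) / total
        for field in fields
    }


def null_rate_spikes(
    current: dict[str, float],
    baseline: dict[str, float],
    max_increase: float = 0.05,
) -> list[str]:
    """Fields whose null rate rose by more than `max_increase` over the baseline."""
    return sorted(
        field
        for field, rate in current.items()
        if rate - baseline.get(field, 0.0) > max_increase
    )
```

Comparing against a baseline rather than a fixed ceiling matters because some fields, such as `brand`, are legitimately empty for a stable share of listings.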
Brands and analysts track sold count velocity per item over time — using Shopee's publicly visible demand signal as a near-real-time proxy for consumer purchase behaviour across SEA.
Competitive intelligence teams monitor flash deal prices, sell-through rates, and stock allocation during 9.9, 11.11, and 12.12 campaigns — to benchmark and respond to competitor promotional strategies.
Brands monitor Shopee Mall and third-party seller listings for MAP violations, counterfeit products, and unauthorised resellers — across all seven Shopee markets simultaneously.
Consumer goods companies track category pricing, brand ranking, and new product launch velocity across all SEA markets — to inform pan-regional distribution and pricing strategy.
ML teams use Shopee's multilingual review corpus — spanning Bahasa Indonesia, Thai, Vietnamese, Filipino, and Malay — to train sentiment models and regional NLP classifiers.
Analysts track Shopee's category growth, Shopee Mall penetration, and live commerce adoption across markets as indicators of Sea Limited's eCommerce strategy execution.
"Shopee publicly shows sold counts — making it one of the only major platforms where demand velocity is directly observable. That data is only valuable if you're capturing it consistently over time."
Reliable Shopee intelligence requires per-country proxy pools, API intercept for stable data extraction, flash deal stock-depth monitoring, and multi-market schema normalisation. DataFlirt operates unified Shopee pipelines across all seven markets — delivering sold count time-series, flash deal analytics, and campaign snapshots on your cadence.
Everything supported by our shopee.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.
Open-source tooling on proven cloud infra — no vendor lock-in, full observability.
Scrapy handles crawl orchestration and retry logic. Playwright drives JavaScript rendering and Shopee's internal API intercept. The API intercept layer provides structured, schema-stable product and pricing data regardless of frontend updates.
We maintain dedicated residential ISP proxy pools for all seven Shopee markets. Each request is routed through a country-matched proxy with the appropriate locale, timezone, and browser fingerprint settings.
Pipelines run on AWS Lambda (burst) and ECS (sustained). Airflow handles scheduling, campaign-calendar alignment, and SLA alerting. All state stored in managed Postgres.
Data delivered to where your team already works — no new tooling required.
About shopee.com scraping, legality, and pipeline operations.
Scraping publicly available information from Shopee is generally permissible under applicable law across Southeast Asia and Taiwan. US decisions such as hiQ v. LinkedIn are often cited in support of scraping public data, although they are not binding in these jurisdictions. DataFlirt targets only public, non-authenticated product, pricing, review, and sold count data. We do not extract personal data or circumvent authentication walls. We recommend clients review Shopee's ToS independently and consult legal counsel for specific use cases.
We support all seven Shopee storefronts: shopee.co.id (Indonesia), shopee.co.th (Thailand), shopee.com.my (Malaysia), shopee.com.ph (Philippines), shopee.sg (Singapore), shopee.vn (Vietnam), and shopee.tw (Taiwan) — delivered via a unified, market-normalised schema with a country tag per record.
Yes. We capture sold_count per item on every pipeline run. Because runs happen at a consistent cadence, the delta between consecutive runs gives you a demand velocity time-series — showing how many units sold in each period. This is one of Shopee's most distinctive data signals.
Yes. During flash deal windows, we capture flash_sale_price, flash_sale_stock (total allocated), and flash_sale_sold (units sold so far) at elevated cadence — giving you sell-through rate over time, not just the final state after the deal clears.
We use per-country residential ISP proxies and full Playwright browser sessions with country-appropriate locale and fingerprint settings. We read data from intercepted responses of Shopee's internal JSON API, which is a more stable source than rendered HTML and less sensitive to bot detection. We monitor block rates in real time and rotate pools automatically.
Yes. We provide a sample run of up to 500 items per market — including pricing, sold count, flash deal, and shop data — as part of pre-engagement scoping, so you can validate schema fit and data quality before signing any contract.
20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need sold count time-series across all seven Shopee markets, a flash deal monitoring feed, or a multilingual review corpus, we scope, build, and operate the pipeline. Tell us what you need.