SYSTEM all green source hometown.in queue 12,405 pages p99 latency 214ms dataflirt.com · scraper/hometown-in
RUN - 14 active pipelines - hometown.in live

Hometown data,
structured for scale.

We extract furniture listings, pricing signals, material specifications, and stock availability from Hometown. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake.

Products extracted
42,190 /run
Price updates
18,400 /24h
Pincode checks
145K /day
Active pipelines
14
Uptime
99.98%
Data Dictionary

Every field we extract from hometown.in

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Furniture Listings objects from hometown.in. All fields typed and schema-versioned.

skutitlecategorysub_categorypricemrpdiscount_pctmaterialfinishdimensionsweightassembly_required
furniture_listings
● 200 OK
"sku": "HT-BED-0042",
"title": "Winston Engineered Wood Queen Bed",
"category": "Furniture",
"price": 18499.0,
"mrp": 25999.0,
"discount_pct": 28
# skutitlecategorysub_categorypricemrp
1
2
3

Complete list of extractable fields for Pricing & Offers objects from hometown.in. All fields typed and schema-versioned.

skucurrent_pricemrpdiscount_absdiscount_pctbank_offersemi_starting_pricecoupon_eligiblestock_status
pricing_& offers
● 200 OK
"sku": "HT-BED-0042",
"current_price": 18499.0,
"mrp": 25999.0,
"discount_abs": 7500.0,
"discount_pct": 28,
"stock_status": "In Stock",
"emi_starting_price": 894.0
# skucurrent_pricemrpdiscount_absdiscount_pctbank_offers
1
2
3

Complete list of extractable fields for Product Specifications objects from hometown.in. All fields typed and schema-versioned.

skuprimary_materialsecondary_materialcolourstylewarranty_summarycare_instructionscountry_of_originbrand
product_specifications
● 200 OK
"sku": "HT-BED-0042",
"primary_material": "Engineered Wood",
"colour": "Walnut",
"style": "Contemporary",
"warranty_summary": "12 Months Manufacturer Warranty",
"country_of_origin": "India"
# skuprimary_materialsecondary_materialcolourstylewarranty_summary
1
2
3

Complete list of extractable fields for Pincode Delivery objects from hometown.in. All fields typed and schema-versioned.

skupincodedelivery_availableestimated_daysdelivery_chargeinstallation_availableinstallation_chargecod_available
pincode_delivery
● 200 OK
"sku": "HT-BED-0042",
"pincode": "560034",
"delivery_available": true,
"estimated_days": 5,
"delivery_charge": 0.0,
"cod_available": false
# skupincodedelivery_availableestimated_daysdelivery_chargeinstallation_available
1
2
3

Complete list of extractable fields for Media & Assets objects from hometown.in. All fields typed and schema-versioned.

skuprimary_image_urlgallery_urlsvideo_url360_view_availablemanual_pdf_urlroom_shot_urlswatch_urls
media_& assets
● 200 OK
"sku": "HT-BED-0042",
"primary_image_url": "https://hometown.in/images/ht-bed-0042-main.jpg",
"gallery_urls": "['https://hometown.in/images/ht-bed-0042-side.jpg', 'https://hometown.in/images/ht-bed-0042-detail.jpg']",
"360_view_available": false,
"manual_pdf_url": "https://hometown.in/docs/ht-bed-0042-assembly.pdf",
"swatch_urls": "[]"
# skuprimary_image_urlgallery_urlsvideo_url360_view_availablemanual_pdf_url
1
2
3

Capabilities

Extract the entire Hometown catalogue

Our Hometown scraper captures all product metadata, pricing changes, and stock availability across categories. We manage the proxy rotation and session states required to extract accurate data at scale.

Furniture Specifications

Extract dimensions, primary material, finish, weight, and assembly requirements for every piece of furniture.

Pricing & Discount Tracking

Capture current price, MRP, absolute discount, percentage drop, and EMI starting prices.

Pincode Availability

Simulate delivery checks across multiple Indian pincodes to map stock availability and estimated delivery timelines.

Variant Mapping

Link parent products to child variants based on colour, fabric, or size selections.

High-Res Media Extraction

Capture primary images, gallery URLs, lifestyle room shots, and assembly manual PDFs.

Category Taxonomy

Map the full category tree from top-level departments down to specific sub-categories and filters.

Warranty & Care Data

Extract warranty summaries, care instructions, and brand details directly from the product description.

Stock Status Monitoring

Track out-of-stock flags and low-stock warnings to monitor competitor inventory levels.

Delta Exports

Receive only updated records. We hash previous runs and emit only rows where pricing or stock has changed.

// engagement pipeline

From category URL to structured table

Brief in. Clean data out.

Define Scope
d 0

Provide categories, search terms, or specific SKUs. We design the extraction schema together.

Pipeline Build
d 2–4

We configure Scrapy crawlers, proxy rotation, session management, and pincode simulation for hometown.in.

Validation & QA
d 4–6

Schema validation, null-rate checks, and price-outlier detection before full launch.

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How we extract Hometown data reliably

Furniture retail sites rely on heavy JavaScript for variant selection and pincode validation. We handle the technical overhead so you receive clean tables.

pipeline-monitor · hometown.in · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Pincode APIs
Simulated delivery checks

Stock and delivery times on Hometown depend on the user's pincode. We inject specific pincodes into the session state via Playwright to extract accurate availability data for your target regions.

Variant selectors
Handling multi-dimensional options

Furniture often has multiple variants for wood finish or fabric colour. We map the JSON state behind the frontend to extract all variant combinations without executing unnecessary page loads.

Unstructured text
Standardising specifications

Dimensions and materials are often buried in HTML tables or bullet points. Our parsers clean and normalise this data into strict typed columns like length_cm, width_cm, and primary_material.

Proxy rotation
Avoiding rate limits

We route requests through Indian residential proxies to prevent IP blocking during full catalogue crawls, ensuring consistent extraction without triggering firewall rules.

Incremental updates
Efficient price tracking

For daily price monitoring, we maintain a hash of the catalogue and only export records where pricing, discounts, or stock status have changed.

Applications

Who uses Hometown data

Teams across industries use hometown.in data to build competitive products and smarter operations.

01
Competitor Price Tracking

Furniture retailers monitor Hometown's pricing, discount strategies, and EMI offers to adjust their own pricing models.

02
Market Research

D2C brands analyse category depth, material trends, and popular finishes to inform product development and sourcing.

03
Catalogue Gap Analysis

Marketplaces compare Hometown's inventory against their own to identify missing brands, styles, or price segments.

04
Supply Chain Monitoring

Analysts track out-of-stock rates across different pincodes to map supply chain efficiency and regional demand.

05
Interior Design Aggregation

Design platforms ingest furniture catalogues to populate 3D planning tools and client recommendation engines.

06
Inflation Tracking

Economic researchers track price changes in consumer durables and furniture over time to measure retail inflation.

Why DataFlirt

"Furniture retail requires precise dimensional and material data. Scraping Hometown means standardising unstructured text into queryable specifications."

Extracting furniture catalogues involves handling multi-dimensional variants, pincode-dependent stock states, and deeply nested taxonomy trees. DataFlirt manages the residential proxy rotation and JavaScript execution required to capture this data reliably, delivering clean schemas directly to your data warehouse.

Technical Spec

Hometown scraper capabilities

Everything supported by our hometown.in scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

JavaScript rendering
Playwright sessions for dynamic pricing and variant rendering
Supported
Pincode simulation
Inject target pincodes to verify regional stock and delivery estimates
Supported
Variant mapping
Link parent products to child variants based on colour and finish
Supported
Image extraction
Capture high-resolution gallery images and lifestyle shots
Supported
Category pagination
Traverse entire category trees to extract all listed SKUs
Supported
Change detection
Hash-based diffing to emit only updated prices or stock status
Supported
Webhook delivery
HTTP POST per record for real-time downstream processing
Supported
User cart data
Requires active user session and cart state
Partial
Order history
Gated behind OTP authentication
Partial
Infrastructure

Infrastructure powering the pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Scrapy + Playwright Stack

Scrapy handles crawl orchestration and retry logic. Playwright handles JavaScript rendering, cookie sessions, and pincode injection flows.

Residential Proxy Infrastructure

We maintain pools of residential ISP proxies across India. Rotation happens per-request to prevent IP blocks during large catalogue crawls.

Cloud-Native Orchestration

Pipelines run on AWS Lambda and ECS. Airflow handles scheduling and SLA alerting. All state is stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested arrays
CSV
Flat file with typed columns
XLS
Excel compatible export for analysts
Parquet
Columnar format for data warehouses
AWS S3
Direct bucket delivery
Webhook
HTTP POST per record
API
REST endpoint to query completed runs
BigQuery
Streamed directly into your dataset
S3
Direct bucket delivery — compatible with any data lake
// faq

Common questions.

About hometown.in scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Hometown legal?

Scraping publicly available information from Hometown is generally permissible. DataFlirt targets only public product, pricing, and specification data. We do not extract personal data or circumvent authentication walls.

Can you check stock for specific pincodes?

Yes. We can simulate delivery availability and estimated timelines for a predefined list of Indian pincodes during the crawl.

How do you handle furniture variants?

We extract all available variants (e.g., different wood finishes or fabric colours) for a product and map them to the parent SKU, capturing price differences between variants.

How fresh is the data?

Full catalogue refreshes typically run daily or weekly depending on your requirements. Delta runs targeting specific high-velocity categories can be configured for higher frequency.

Do you extract images and manuals?

Yes. We extract URLs for primary images, gallery shots, lifestyle images, and assembly manuals (PDFs). We deliver the URLs; you can download the assets directly.

What is the minimum viable engagement?

Our smallest packages start at a defined category list with weekly delivery. For full catalogue extraction, we price based on volume and delivery frequency.

Can I request a sample dataset?

Yes. We provide a sample run of up to 500 products as part of the pre-engagement scoping process so you can validate schema fit and field completeness.

$ dataflirt scope --new-project --source=hometown.in ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off catalogue dump or continuous price monitoring across categories, we build and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →