SYSTEM all green source auction.com queue 14,892 properties p99 latency 184ms dataflirt.com · scraper/auction-com
RUN · 31 active pipelines · auction.com live

Distressed property data,
at warehouse scale.

We extract foreclosure listings, REO properties, auction schedules, bidding histories, and property metadata from Auction.com. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Properties extracted
142K /week
Auction updates
38.4K /day
Bidding records
95K /run
Active pipelines
31
Uptime
99.94%
Data Dictionary

Every field we extract from auction.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Property Listings objects from auction.com. All fields typed and schema-versioned.

apnproperty_idaddresscitystatezip_codecountyproperty_typebedsbathssqftlot_size_acresyear_builtoccupancy_statusimage_urls
property_listings
● 200 OK
"apn": "042-192-04",
"property_id": "2948174",
"address": "1492 Willow Drive",
"city": "Atlanta",
"state": "GA",
"zip_code": "30303",
"beds": 3,
"baths": 2,
"sqft": 1850,
"occupancy_status": "Occupied"
# apnproperty_idaddresscitystatezip_code
1
2
3

Complete list of extractable fields for Auction Details objects from auction.com. All fields typed and schema-versioned.

auction_idproperty_idauction_typeauction_date_startauction_date_endvenue_typevenue_addressstarting_bidcurrent_bidbidding_statusreserve_metdeposit_requiredrun_numberitem_number
auction_details
● 200 OK
"auction_id": "A-492193",
"auction_type": "Foreclosure",
"auction_date_start": "2026-08-14T09:00:00Z",
"venue_type": "Online",
"starting_bid": 150000.0,
"current_bid": 165000.0,
"bidding_status": "Active",
"reserve_met": false,
"deposit_required": 5000.0
# auction_idproperty_idauction_typeauction_date_startauction_date_endvenue_type
1
2
3

Complete list of extractable fields for Financial & Valuation objects from auction.com. All fields typed and schema-versioned.

property_idestimated_debtestimated_valueafter_repair_valuetax_assessed_valuetax_yearbuyers_premium_pctbuyers_premium_minearnest_money_depositliens_statushoa_feescurrency
financial_& valuation
● 200 OK
"property_id": "2948174",
"estimated_debt": 210500.0,
"estimated_value": 275000.0,
"tax_assessed_value": 240000.0,
"tax_year": 2025,
"buyers_premium_pct": 5.0,
"buyers_premium_min": 2500.0,
"earnest_money_deposit": 5000.0,
"currency": "USD"
# property_idestimated_debtestimated_valueafter_repair_valuetax_assessed_valuetax_year
1
2
3

Complete list of extractable fields for Foreclosure Metadata objects from auction.com. All fields typed and schema-versioned.

property_idcase_numbertrustee_nametrustee_sale_numberdefault_amountnotice_dateforeclosure_stageloan_typerecording_dateplaintiff_name
foreclosure_metadata
● 200 OK
"property_id": "2948174",
"case_number": "2025-CV-04921",
"trustee_name": "Quality Loan Service Corp",
"trustee_sale_number": "GA-25-8941",
"default_amount": 18420.5,
"notice_date": "2026-06-12",
"foreclosure_stage": "Notice of Sale",
"plaintiff_name": "Wells Fargo Bank"
# property_idcase_numbertrustee_nametrustee_sale_numberdefault_amountnotice_date
1
2
3

Complete list of extractable fields for Search Results & Map objects from auction.com. All fields typed and schema-versioned.

search_querycounty_filterlatitudelongitudeproperty_idpositionlisting_urlthumbnail_urlsaved_countis_featuredscraped_at
search_results & map
● 200 OK
"search_query": "Fulton County, GA",
"latitude": 33.749,
"longitude": -84.388,
"property_id": "2948174",
"position": 12,
"is_featured": true,
"saved_count": 48,
"scraped_at": "2026-07-21T14:32:01Z"
# search_querycounty_filterlatitudelongitudeproperty_idposition
1
2
3

Capabilities

Complete distressed property intelligence

Our Auction.com scraper extracts deep property metadata, financial estimates, and live auction schedules, bypassing anti-bot systems to deliver structured real estate data directly to your warehouse.

Foreclosure & REO Listings

Extract property characteristics including beds, baths, square footage, year built, lot size, and property type across all asset classes.

Auction Schedule Tracking

Monitor start dates, end dates, postponements, and cancellations for online, in-person, and courthouse step auctions.

Financial & Valuation Data

Capture starting bids, estimated debt, tax assessed values, buyer premiums, and earnest money deposit requirements.

Occupancy & Title Status

Track occupancy flags (Occupied, Vacant) and title clearance status to accurately model eviction risk and holding costs.

Map-Based Search Scraping

Iterate through geographic bounds to capture all available inventory in a target county, MSA, or custom polygon.

Venue & Bidding Intelligence

Extract specific venue addresses for live auctions, online bidding portal links, and real-time bid increments where available.

APN & Parcel Mapping

Capture Assessor's Parcel Numbers (APN) and county tax IDs to join Auction.com listings with your internal county recorder datasets.

Trustee & Legal Metadata

Extract case numbers, trustee sale numbers, default amounts, and plaintiff names directly from the notice of sale records.

Scheduled + Streaming Modes

Run daily refreshes for upcoming auctions or configure high-frequency pipelines to monitor bid changes during active events.

// engagement pipeline

From target counties to warehouse records

Brief in. Clean data out.

Define Scope
d 0

Provide target states, counties, or zip codes. We design the extraction schema covering REO, foreclosure, or short sale listings.

Pipeline Build
d 2–4

We configure Scrapy / Playwright crawlers, coordinate map pagination, and implement anti-bot circumvention for auction.com.

Validation & QA
d 4–6

Schema validation, null-rate checks, APN format verification, and data normalisation before full launch.

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our pipeline handles real estate data complexity

Auction.com relies on dynamic map loading and aggressive bot protection. Here is how we maintain reliable extraction.

pipeline-monitor · auction.com · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Anti-bot layer
Residential proxy rotation + fingerprint spoofing

Real estate platforms employ strict WAFs and bot mitigation. Our crawlers use US-based residential ISP proxies with realistic browser fingerprints and full cookie session management to blend in with human buyer traffic.

Map interface rendering
Playwright execution for spatial queries

Property discovery relies heavily on interactive maps and XHR requests. We run full Playwright browser sessions to trigger map pan/zoom events, ensuring we capture all pins in dense urban markets.

Dynamic bidding hydration
XHR interception for auction state

Current bids and auction statuses update dynamically via background requests. We intercept these API calls directly to extract precise, real-time numerical data rather than relying on brittle DOM parsing.

Change detection
Only re-scrape what has changed

Auction dates are frequently postponed or cancelled. We maintain a hash index of last-seen values per property. Subsequent runs only push diffs, reducing downstream processing load and highlighting critical status changes.

Monitoring & alerting
24/7 pipeline health with anomaly detection

Every run emits structured logs to our observability stack. We alert on null-rate spikes, missing APNs, or sudden drops in inventory counts, resolving issues before they impact your investment models.

Applications

Who uses Auction.com data — and how

Teams across industries use auction.com data to build competitive products and smarter operations.

01
Investment Property Sourcing

REITs and private equity firms monitor specific zip codes for high-yield REO and foreclosure assets meeting strict buy-box criteria.

02
Market Trend Analysis

Real estate analysts track foreclosure volumes, default amounts, and auction cancellation rates as leading indicators of local market distress.

03
Automated Underwriting

Acquisition teams feed starting bids, estimated debt, and property characteristics directly into algorithmic underwriting models to calculate maximum allowable offers.

04
Competitor Intelligence

Institutional buyers monitor bidding activity and final sale metrics to understand competitor pricing strategies and market saturation.

05
Title & Lien Research

Title companies extract case numbers, trustee names, and plaintiff details to preemptively build title reports for upcoming inventory.

06
Flipping & Rehab Modeling

Fix-and-flip investors correlate occupancy status, year built, and estimated debt to model holding costs and renovation budgets.

Why DataFlirt

"Auction.com holds the definitive inventory of distressed US real estate, but tracking changing auction dates and bid states requires continuous, automated extraction."

Most teams underestimate the investment required: reliable Auction.com scraping demands residential proxies, map-based coordinate handling, and resilient anti-bot circumvention. DataFlirt absorbs that complexity so your analysts can focus on underwriting properties — not maintaining infrastructure.

Technical Spec

Auction.com scraper — technical capabilities

Everything supported by our auction.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

JavaScript rendering
Full Playwright sessions — required for map loading, image galleries, and dynamic bids
Supported
CAPTCHA bypass
Automated solver integration with fallback to manual queue
Supported
Residential proxy rotation
ISP-grade residential IPs from US pools — rotated per request
Supported
Coordinate/Map-based scraping
Extract properties via bounding box or radial geographic queries
Supported
APN resolution
Capture Assessor's Parcel Numbers for county record joins
Supported
Live bid tracking
High-frequency extraction of active auction current bids
Supported
Change detection (diffs)
Hash-based diff: only emit records with changed fields since last run
Supported
Webhook delivery
HTTP POST per record or batch — useful for real-time bid alerts
Supported
Title report PDF downloads
Gated documents require authenticated user sessions and payment
Partial
Bidding account execution
Automated bid placement requires KYC verification and violates terms
Partial
Infrastructure

Infrastructure powering the real estate pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Scrapy + Playwright Stack

Scrapy handles crawl orchestration, deduplication, and retry logic. Playwright handles JavaScript rendering, map interactions, and dynamic XHR interception.

Residential Proxy Infrastructure

We maintain pools of US-based residential ISP proxies. Rotation happens per-request with sticky sessions where required to prevent bot detection flags.

Cloud-Native Orchestration

Pipelines run on AWS Lambda and ECS. Airflow handles scheduling, dependency management, and SLA alerting. All state stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested — schema versioned per run
CSV
Flat file with typed columns — Excel/Sheets compatible
Parquet
Columnar format for BigQuery, Snowflake, Athena
S3
Direct bucket delivery — compatible with any data lake
BigQuery
Streamed directly into your dataset with schema auto-detect
Webhook
HTTP POST per record for real-time downstream processing
Postgres
Upsert into your existing schema with conflict resolution
Snowflake
Stage + COPY INTO workflow — incremental or full-replace
// faq

Common questions.

About auction.com scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Auction.com legal?

Scraping publicly available property details and auction schedules is generally permissible under applicable law, reinforced by rulings like hiQ v. LinkedIn. DataFlirt extracts only public, non-authenticated real estate data. We do not bypass login walls to download gated title documents or place bids. Clients should review platform terms and consult legal counsel for specific use cases.

How do you handle map-based search results?

We use Playwright to interact with the map interface, generating bounding box coordinates for target counties or MSAs. The crawler intercepts the backend API responses that populate the map pins, ensuring 100% coverage of inventory within the defined geographic area.

Can you track auction date changes and postponements?

Yes. Foreclosure auctions are frequently postponed. Our change detection system compares current auction dates against the previous run's state, emitting diff records so you can update your internal schedules without processing the entire dataset again.

How fresh is the data?

For general inventory monitoring, we typically run daily refreshes. For active auctions, we can configure high-frequency pipelines to capture current bid increments, though this requires custom scoping to manage proxy load and bot detection limits.

Do you extract APNs and tax IDs?

Yes. We extract Assessor's Parcel Numbers (APNs) and county-specific tax IDs wherever they are surfaced on the property detail page, allowing you to join the auction data with your internal county recorder datasets.

What is the minimum viable engagement?

Our smallest packages start at a defined county or state list with daily delivery. For nationwide coverage or custom schema requirements, we price based on geographic volume and extraction frequency.

Can I request a sample dataset before committing?

Yes. We provide a sample run covering a specific county or set of zip codes during the scoping process. This allows your underwriting team to validate field completeness, APN formats, and data quality before signing a contract.

$ dataflirt scope --new-project --source=auction.com ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a daily feed of local foreclosures or nationwide REO tracking — we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →