SYSTEM all green source healthmarkets.com queue 12,841 ZIP codes p99 latency 312ms dataflirt.com · scraper/healthmarkets-com
RUN - 42 active pipelines - healthmarkets.com live

Insurance plan data,
normalised at scale.

We extract health insurance plans, Medicare Advantage details, supplemental coverage, and agent directories from HealthMarkets. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake.

Plans extracted
142K /day
Premium updates
38.4K /24h
Agent profiles
18,920 /run
ZIP codes mapped
41,200
Uptime
99.98%
Data Dictionary

Every field we extract from healthmarkets.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Health Insurance Plans objects from healthmarkets.com. All fields typed and schema-versioned.

plan_idcarrier_nameplan_namemetal_tiernetwork_typemonthly_premiumdeductibleout_of_pocket_maxcopay_primarycopay_specialistcoinsurancezip_code
health_insurance plans
● 200 OK
"plan_id": "HM-ACA-84920",
"carrier_name": "Blue Cross Blue Shield",
"plan_name": "Silver Standard HMO",
"metal_tier": "Silver",
"network_type": "HMO",
"monthly_premium": 412.5,
"deductible": 2500.0,
"zip_code": "75001"
# plan_idcarrier_nameplan_namemetal_tiernetwork_typemonthly_premium
1
2
3

Complete list of extractable fields for Medicare Plans objects from healthmarkets.com. All fields typed and schema-versioned.

medicare_idcarrierplan_typestar_ratingmonthly_premiumhealth_deductibledrug_deductiblemax_out_of_pocketincludes_part_dzip_codecounty
medicare_plans
● 200 OK
"medicare_id": "H1036-042-0",
"carrier": "Humana",
"plan_type": "HMO",
"star_rating": 4.5,
"monthly_premium": 0.0,
"health_deductible": 0.0,
"includes_part_d": true,
"zip_code": "33101"
# medicare_idcarrierplan_typestar_ratingmonthly_premiumhealth_deductible
1
2
3

Complete list of extractable fields for Agent Directory objects from healthmarkets.com. All fields typed and schema-versioned.

agent_idfull_nameagency_namephone_numberemailaddresscitystatezip_codelicenseslanguages_spokenyears_experience
agent_directory
● 200 OK
"agent_id": "AGT-99482",
"full_name": "Sarah Jenkins",
"agency_name": "Jenkins Insurance Group",
"phone_number": "555-019-2834",
"city": "Austin",
"state": "TX",
"languages_spoken": "['English', 'Spanish']",
"years_experience": 12
# agent_idfull_nameagency_namephone_numberemailaddress
1
2
3

Complete list of extractable fields for Supplemental Plans objects from healthmarkets.com. All fields typed and schema-versioned.

plan_idcategorycarrierplan_namemonthly_premiumbenefit_amountwaiting_periodage_restrictionsstate_availabilityexclusions
supplemental_plans
● 200 OK
"plan_id": "SUP-DNT-441",
"category": "Dental",
"carrier": "Delta Dental",
"plan_name": "Premium PPO",
"monthly_premium": 45.0,
"benefit_amount": 2000.0,
"waiting_period": "6 months",
"state_availability": "['TX', 'FL', 'CA']"
# plan_idcategorycarrierplan_namemonthly_premiumbenefit_amount
1
2
3

Complete list of extractable fields for Carrier Information objects from healthmarkets.com. All fields typed and schema-versioned.

carrier_idcarrier_nameam_best_ratingstates_activetotal_plans_offeredcustomer_service_numberwebsiteplan_categoriesnetwork_size
carrier_information
● 200 OK
"carrier_id": "CAR-092",
"carrier_name": "UnitedHealthcare",
"am_best_rating": "A+",
"total_plans_offered": 1420,
"customer_service_number": "800-555-0199",
"plan_categories": "['Medicare', 'ACA', 'Dental', 'Vision']",
"network_size": "National"
# carrier_idcarrier_nameam_best_ratingstates_activetotal_plans_offeredcustomer_service_number
1
2
3

Capabilities

Extract the entire health insurance marketplace

Our HealthMarkets scraper navigates complex location-based session states, rendering dynamic plan grids and extracting precise premium and deductible data across thousands of ZIP codes.

Medicare Advantage Extraction

Extract plan details, Star Ratings, Part D inclusion, and out-of-pocket maximums for Medicare plans across all available counties.

ACA Plan Normalisation

Capture metal tiers, network types, premiums, and copay structures for Affordable Care Act plans, normalised into a unified schema.

ZIP Code Iteration

Automated session management to iterate through 41,000+ US ZIP codes, capturing location-specific pricing and plan availability.

Premium & Deductible Mapping

Extract precise financial data including monthly premiums, individual and family deductibles, and coinsurance percentages.

Agent Directory Scraping

Compile contact information, state licenses, language capabilities, and agency affiliations for local insurance agents.

FitScore Capture

Extract HealthMarkets proprietary FitScore ratings to understand how plans are ranked and recommended to users.

Supplemental Coverage

Scrape dental, vision, life, and critical illness plans, including benefit amounts and waiting periods.

Carrier Intelligence

Map which carriers are actively writing policies in specific geographic regions and track their plan portfolios.

Scheduled Updates

Run recurring pipelines during open enrollment periods to track premium adjustments and new plan introductions.

// engagement pipeline

From ZIP code list to warehouse record

Brief in. Clean data out.

Define Scope
d 0

Provide target ZIP codes, plan categories, or agent locations. We design the extraction schema together.

Pipeline Build
d 2–4

We configure Scrapy and Playwright crawlers, managing location cookies and session states for healthmarkets.com.

Validation & QA
d 4–6

Schema validation, null-rate checks, and premium outlier detection before full launch.

Delivery
ongoing

JSON, CSV, or Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our pipeline handles location-based rendering

HealthMarkets requires strict session management and location context to display accurate pricing. Here is how we maintain pipeline stability.

pipeline-monitor · healthmarkets.com · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Location context
ZIP code session management

Insurance pricing is hyper-local. Our crawlers manage isolated cookie jars and session states for each ZIP code, ensuring that the premiums extracted match the exact geographic location requested without session bleed.

Dynamic rendering
Playwright for SPA plan grids

HealthMarkets uses JavaScript-heavy single-page application architectures to render plan comparisons. We use Playwright to execute the JavaScript, hydrate the plan grids, and extract the data from the DOM.

Anti-bot layer
US residential proxies

We route all requests through US-based residential ISP proxies to match expected geographic traffic patterns and avoid datacenter IP bans.

Schema stability
Fallback selectors for plan variants

Different plan types (Medicare vs ACA) render with different DOM structures. We use fallback chains and structured data extraction to ensure consistent field capture across all plan categories.

Change detection
Premium diffing during open enrollment

We maintain hash indexes of plan details. During critical periods like open enrollment, we can run high-frequency checks and only emit records when premiums or deductibles change.

Applications

Who uses HealthMarkets data

Teams across industries use healthmarkets.com data to build competitive products and smarter operations.

01
Competitive Pricing Analysis

Insurance carriers monitor competitor premiums, deductibles, and network types across specific ZIP codes to optimise their own pricing strategies.

02
Market Expansion Strategy

Healthcare organisations analyse plan availability and carrier dominance in target counties before entering new markets.

03
Broker & Agency Recruitment

Field marketing organisations extract agent directories to identify and recruit licensed brokers in specific geographic regions.

04
Medicare Plan Benchmarking

Actuaries track Medicare Advantage Star Ratings, Part D inclusions, and zero-premium plan availability at a national scale.

05
Insurtech Product Development

Startups use normalised plan data to build proprietary recommendation engines and alternative coverage comparison tools.

06
Actuarial Market Research

Analysts track out-of-pocket maximum trends and copay structures across metal tiers to model healthcare cost trajectories.

Why DataFlirt

"HealthMarkets aggregates thousands of insurance policies across carriers, but comparing premiums at a national scale requires a structured data pipeline."

Extracting insurance data requires precise location spoofing, session management for ZIP codes, and handling complex JavaScript rendering for plan grids. DataFlirt manages the infrastructure so your actuarial and product teams can focus on analysis.

Technical Spec

HealthMarkets scraper technical specifications

Everything supported by our healthmarkets.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

JavaScript rendering
Playwright sessions required for dynamic plan grids and FitScore rendering
Supported
Residential proxy rotation
US-based ISP proxies to bypass geographic blocking and rate limits
Supported
ZIP code session management
Isolated cookie jars per location to prevent pricing bleed
Supported
Multi-state coverage
Extraction across all 50 states and thousands of counties
Supported
Medicare & ACA plans
Support for both primary health insurance categories
Supported
Agent license extraction
Capture state licenses and appointments from agent profiles
Supported
Change detection (diffs)
Hash-based diffing to track premium changes over time
Supported
Personal Health Information (PHI)
We do not extract or process any protected health information
Partial
Gated enrollment portals
Data behind authenticated login walls requiring SSN or DOB
Partial
Infrastructure

Infrastructure powering the extraction

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Scrapy + Playwright Stack

Scrapy handles ZIP code iteration and retry logic. Playwright manages the complex DOM rendering and location-specific session cookies required by HealthMarkets.

US Proxy Infrastructure

We maintain dedicated pools of US residential proxies. Rotation happens per ZIP code iteration to ensure location spoofing matches the requested geographic target.

Cloud-Native Orchestration

Pipelines run on AWS ECS. Airflow handles scheduling, dependency management, and SLA alerting. All extracted plan schemas are validated before delivery.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Nested structures ideal for plan details and copay arrays
CSV
Flat files for actuarial analysis in Excel
Parquet
Columnar format for BigQuery and Snowflake
AWS S3
Direct bucket delivery on pipeline completion
Webhook
HTTP POST per record for real-time processing
API
REST endpoints to query extracted plan data
BigQuery
Streamed directly into your dataset
XLS
Excel format for business teams
S3
Direct bucket delivery — compatible with any data lake
// faq

Common questions.

About healthmarkets.com scraping, legality, and pipeline operations.

Ask us directly →
Is scraping HealthMarkets legal?

Scraping publicly available insurance plan data and agent directories is generally permissible. DataFlirt extracts only public, non-authenticated data. We do not extract Personal Health Information (PHI) or access gated enrollment portals. Clients should consult legal counsel for their specific use cases.

How do you handle location-based pricing?

HealthMarkets relies on session cookies and location context to display accurate premiums. We use Playwright to manage isolated browser contexts for each target ZIP code, ensuring that the pricing extracted strictly matches the requested geographic area.

Can you extract Medicare Star Ratings?

Yes. We extract Medicare Advantage Star Ratings, Part D inclusions, health deductibles, and out-of-pocket maximums for all available plans in a given county.

How fresh is the premium data?

Pipelines can be configured to run daily, weekly, or monthly. During open enrollment periods, we can increase frequency to capture rapid plan introductions and premium adjustments.

Do you extract agent contact information?

Yes. We extract public agent profiles, including phone numbers, agency names, state licenses, and languages spoken, from the local agent directory.

What is the minimum viable engagement?

Our minimum engagement typically starts with a defined list of target ZIP codes or counties with weekly delivery. We price based on geographic coverage and delivery frequency.

$ dataflirt scope --new-project --source=healthmarkets.com ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need premium tracking across 500 counties or a complete national agent directory, we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →