We extract provider directories, clinic locations, network affiliations, Medicare plan structures, and drug formularies from Aetna. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake.
Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.
Complete list of extractable fields for Providers objects from aetna.com. All fields typed and schema-versioned.
"npi": "1829304958", "first_name": "Sarah", "last_name": "Chen", "speciality": "Cardiology", "accepting_new_patients": true, "telehealth_offered": true, "gender": "Female"
| # | npi | first_name | last_name | speciality | sub_speciality | gender |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Facilities objects from aetna.com. All fields typed and schema-versioned.
"facility_id": "F-93847", "facility_name": "Mercy General Hospital", "facility_type": "Acute Care Hospital", "city": "Austin", "state": "TX", "zip_code": "78701", "network_status": "In-Network"
| # | facility_id | facility_name | facility_type | address_line_1 | city | state |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Network Plans objects from aetna.com. All fields typed and schema-versioned.
"plan_id": "AET-TX-2026-HMO", "plan_name": "Aetna Value Network HMO", "plan_type": "HMO", "metal_tier": "Silver", "deductible_individual": 2500.0, "pcp_required": true, "referral_required": true
| # | plan_id | plan_name | plan_type | state | county | metal_tier |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Formulary Drugs objects from aetna.com. All fields typed and schema-versioned.
"ndc_code": "00069-1530-68", "drug_name": "Lisinopril 10mg", "tier_level": "Tier 1", "prior_authorization_required": false, "step_therapy_required": false, "generic_name": "Lisinopril", "plan_id": "AET-TX-2026-HMO"
| # | ndc_code | drug_name | generic_name | brand_name | tier_level | prior_authorization_required |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Clinical Policy objects from aetna.com. All fields typed and schema-versioned.
"cpb_number": "0016", "title": "Back Pain - Invasive Procedures", "last_review_date": "2025-11-12", "status": "Active", "cpt_codes_covered": "['22513', '22514']", "icd10_codes_covered": "['M54.50']"
| # | cpb_number | title | last_review_date | effective_date | status | cpt_codes_covered |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Our infrastructure navigates Aetna's complex directory state, extracts provider NPIs, and maps network affiliations across millions of records. We handle the session management and pagination limits.
Extract NPI, names, speciality, languages spoken, and board certifications across all Aetna networks.
Map which specific HMO, PPO, and Medicare Advantage plans a provider or facility accepts.
Capture clinic addresses, phone numbers, and facility types. Normalised into clean location records.
Extract plan premiums, deductibles, out-of-pocket maximums, and coverage tiers for state-specific plans.
Track drug tier placements, prior authorization requirements, and step therapy rules across Aetna formularies.
Scrape Aetna CPBs to extract covered CPT and ICD-10 codes, effective dates, and policy summaries.
Navigate Aetna's complex React-based search forms, handling session tokens and multi-step inputs automatically.
Track when a provider drops out of a network or when a drug shifts tiers. Receive only the diffs.
Execute parallel extraction pipelines across all 50 states to build a national provider database.
Brief in. Clean data out.
Provide target zip codes, network names, or NPI lists. We design the extraction schema together.
We configure Playwright crawlers, state management, and geo-targeted proxies for aetna.com.
Schema validation, null-rate checks, and NPI checksum validation before full launch.
JSON, CSV, or Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.
Health insurance directories are built to prevent bulk extraction. Here is how we bypass Aetna's limits.
Aetna's directory requires multi-step session state to view results. We maintain active Playwright contexts that handle cookie negotiation and token generation required to access provider details.
Aetna caps search results to a few hundred providers per query. We implement automated radius chunking, dividing large geographic areas into micro-grids to ensure 100% extraction coverage.
Provider data is inherently messy. We clean and normalise addresses, split combined speciality strings, and validate NPI formats before the data reaches your warehouse.
Network results change based on the user's IP location. We map search queries to state-specific residential proxies to ensure we see the correct local network data.
Aggressive extraction triggers IP bans. We manage request velocity and distribute queries across thousands of residential IPs to maintain pipeline stability.
Healthcare consultants map Aetna's coverage gaps and provider density to evaluate network adequacy against regulatory standards.
Health systems and credentialing teams check if their doctors are actively listed in-network and verify directory accuracy.
Rival payers compare Aetna's network size, facility affiliations, and Medicare plan structures against their own offerings.
Pharma companies track drug tier placements and prior authorization requirements to optimise market access strategies.
Digital health platforms use network data to direct patients to in-network virtual care providers, reducing out-of-pocket costs.
ML teams use Clinical Policy Bulletins and provider metadata to train medical LLMs and claims adjudication models.
"Aetna's provider directory contains the ground truth for millions of patient routing decisions, but extracting it requires navigating heavy session-state applications."
Health insurance directories are notoriously difficult to scrape. They rely on complex JavaScript frameworks, session tokens, and aggressive pagination limits. DataFlirt manages the browser automation and proxy rotation required to extract clean, normalised NPI and facility records at scale.
Everything supported by our aetna.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.
Open-source tooling on proven cloud infra — no vendor lock-in, full observability.
Aetna's directory requires heavy browser automation. We use Playwright to manage the complex state transitions and token generation needed to access provider records.
Network data varies by location. We route requests through state-specific residential proxies to ensure accurate extraction of local HMO and PPO directories.
National directory extraction requires running thousands of parallel zip code queries. Airflow manages the grid search orchestration and dependency tracking.
Data delivered to where your team already works — no new tooling required.
About aetna.com scraping, legality, and pipeline operations.
Ask us directly →Scraping publicly available provider directories is generally permissible. DataFlirt extracts only public, non-authenticated provider and network data. We do not extract PHI or bypass HIPAA-compliant member portals.
Aetna limits search results per query. We bypass this by implementing automated radius chunking, generating overlapping zip code grids to extract the entire provider population without hitting the cap.
Yes. National Provider Identifiers are exposed in the directory details. We extract and validate NPIs to ensure accurate mapping to your internal provider databases.
Yes. We extract network affiliations and plan details for Aetna's Medicare Advantage (MAPD) offerings, including state-specific variations.
We recommend weekly or monthly cadences for full directory refreshes, depending on your scope. Differential updates can be run more frequently for targeted networks.
No. Explanation of Benefits, member claims, and negotiated rates tied to specific employer groups require authentication and are out of scope for our public data pipelines.
20-minute scoping call. Pilot dataset within the week. Production within two. Need Aetna's national directory, state-specific Medicare networks, or formulary updates? We build and maintain the infrastructure. Tell us your scope.