SYSTEM all green source cigna.com queue 12,948 queries p99 latency 845ms dataflirt.com · scraper/cigna-com
RUN · 42 active pipelines · cigna.com live

Cigna network data,
at warehouse scale.

We extract provider directories, NPI records, facility networks, Medicare plan details, and prescription formularies from Cigna. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Providers extracted
1.2M /month
Formulary updates
84K /run
Facilities mapped
39K /run
Active pipelines
42
Uptime
99.94%
Data Dictionary

Every field we extract from cigna.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Healthcare Providers objects from cigna.com. All fields typed and schema-versioned.

npifirst_namelast_namespecialtysub_specialtyboard_certifiedgenderaccepting_new_patientslanguagestelehealth_offeredcontact_phoneaddress_line_1citystatezip_code
healthcare_providers
● 200 OK
"npi": "1982736450",
"first_name": "Sarah",
"last_name": "Jenkins",
"specialty": "Cardiology",
"board_certified": true,
"accepting_new_patients": true,
"telehealth_offered": false,
"city": "Atlanta"
# npifirst_namelast_namespecialtysub_specialtyboard_certified
1
2
3

Complete list of extractable fields for Facilities & Hospitals objects from cigna.com. All fields typed and schema-versioned.

facility_idfacility_namefacility_typenetwork_statusaddresscitystatezip_codephoneemergency_servicestotal_bedsaccreditationaffiliationsratingplans_accepted
facilities_& hospitals
● 200 OK
"facility_id": "FAC-99281",
"facility_name": "Mercy General Hospital",
"facility_type": "Acute Care",
"network_status": "In-Network",
"emergency_services": true,
"total_beds": 450,
"city": "Phoenix",
"rating": 4.2
# facility_idfacility_namefacility_typenetwork_statusaddresscity
1
2
3

Complete list of extractable fields for Accepted Plans objects from cigna.com. All fields typed and schema-versioned.

plan_idplan_nameplan_typenetwork_namestate_availabilitymetal_tieris_medicaredeductiblesout_of_pocket_maxcopay_primarycopay_specialistreferral_requiredformulary_id
accepted_plans
● 200 OK
"plan_id": "OAP-2026",
"plan_name": "Open Access Plus",
"plan_type": "PPO",
"network_name": "Cigna OAP",
"metal_tier": "Gold",
"is_medicare": false,
"copay_primary": 25.0,
"referral_required": false
# plan_idplan_nameplan_typenetwork_namestate_availabilitymetal_tier
1
2
3

Complete list of extractable fields for Drug Formularies objects from cigna.com. All fields typed and schema-versioned.

ndc_codedrug_namegeneric_namebrand_namedosage_formroutetier_levelprior_authorization_requiredstep_therapyquantity_limitplan_idcoverage_status
drug_formularies
● 200 OK
"ndc_code": "00069-3060-30",
"drug_name": "Lipitor 20mg",
"tier_level": "Tier 3",
"prior_authorization_required": true,
"step_therapy": false,
"quantity_limit": "30 per 30 days",
"coverage_status": "Covered"
# ndc_codedrug_namegeneric_namebrand_namedosage_formroute
1
2
3

Complete list of extractable fields for Office Locations objects from cigna.com. All fields typed and schema-versioned.

location_idpractice_namenpi_listaddresscitystatezip_codephonefaxhours_mondayhours_tuesdaywheelchair_accessibleparking_availablepublic_transit_access
office_locations
● 200 OK
"location_id": "LOC-4451",
"practice_name": "Atlanta Heart Specialists",
"city": "Atlanta",
"state": "GA",
"zip_code": "30308",
"wheelchair_accessible": true,
"hours_monday": "08:00-17:00",
"phone": "404-555-0199"
# location_idpractice_namenpi_listaddresscitystate
1
2
3

Capabilities

Everything you need from Cigna directories

Our Cigna scraper handles the platform complexity: geographic search tokens, network selection logic, and strict rate limits. Built with session management and anti-bot circumvention.

Provider Directory Extraction

Extract NPI, specialty, board certification, languages spoken, and contact details for individual practitioners across all networks.

Facility & Hospital Data

Capture hospital affiliations, bed counts, emergency service capabilities, and accreditation status for in-network facilities.

Formulary & Drug Tier Scraping

Map NDC codes to coverage tiers, capturing prior authorisation requirements and step therapy rules across specific plans.

Plan & Network Mapping

Correlate providers with accepted coverage networks including OAP, PPO, HMO, and Medicare Advantage plans.

Geographic Search Automation

Iterate across US ZIP codes using algorithmic grid searches to ensure complete national coverage without data overlap.

Telehealth & Accessibility

Identify virtual care availability, wheelchair access, and public transit proximity for specific clinic locations.

Change Detection

Track provider additions and drops from specific networks over time using hash-based diffing on directory records.

Anti-Bot Circumvention

Bypass healthcare portal rate limits using US-based residential proxies and human-like request timing patterns.

Scheduled Updates

Configure continuous pipelines at weekly or monthly cadences to maintain accurate master data records.

// engagement pipeline

From geographic coordinates to warehouse record

Brief in. Clean data out.

Define Scope
d 0

Provide ZIP code radii, NPI lists, or target plan names. We design the extraction schema together.

Pipeline Build
d 2–4

We configure Scrapy crawlers, proxy rotation, session management, and geographic iteration logic for cigna.com.

Validation & QA
d 4–6

Schema validation, NPI format checks, null-rate monitoring, and sample directory exports before full launch.

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our Cigna pipeline handles the hard parts

Healthcare directories use complex search state and strict rate limits. Here is how we maintain extraction reliability at scale.

pipeline-monitor · cigna.com · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Complex search state
Handling React state tokens for geographic searches

Cigna search interfaces rely on complex session tokens and client-side state. We execute full Playwright browser sessions to handle geographic search initialisation and capture the underlying API payloads.

Geographic coverage
Algorithmic ZIP code radius iteration

To extract a national directory, searching a single city fails due to result truncation. We deploy a mathematical grid of ZIP codes and search radii to ensure 100% geographic coverage without missing rural providers.

Rate limits
US residential proxies and request throttling

Healthcare portals aggressively block datacenter IPs. Our crawlers route traffic exclusively through US-based residential ISP proxies, mimicking legitimate patient search behaviour and avoiding IP bans.

Data normalisation
Standardising messy healthcare records

Provider addresses, specialty codes, and clinic names are often inconsistent. We apply post-extraction normalisation pipelines to format NPIs, standardise street addresses, and clean specialty categorisations.

Change detection
Only re-scrape what has changed

For large directories, we maintain a hash index of last-seen values per NPI. Subsequent runs only push diffs, reducing compute cost and downstream processing load in your warehouse.

Applications

Who uses Cigna data and how

Teams across industries use cigna.com data to build competitive products and smarter operations.

01
Network Adequacy Analysis

Insurers and regulators benchmark Cigna geographic coverage to ensure compliance with network adequacy standards.

02
Provider Master Data Management

Health systems update internal NPI, specialty, and contact records by cross-referencing payer directories.

03
Competitive Intelligence

Rival payers analyse Cigna Medicare Advantage network density and formulary tier placements to inform product strategy.

04
Pharma Market Access

Pharmaceutical companies track formulary tier placement and step therapy requirements for specific NDC codes.

05
Telehealth Routing

Digital health platforms map in-network specialists to optimise patient routing and reduce out-of-pocket costs.

06
Referral Management

Care coordinators identify in-network facilities and specialists for out-of-state patients requiring complex care.

Why DataFlirt

"Cigna provider directories represent a critical dataset in US healthcare, but extracting them requires navigating aggressive rate limits and complex geographic search states."

Most teams underestimate the investment required: reliable Cigna extraction demands US-based residential proxies, session state management, geographic grid algorithms, and daily selector maintenance. DataFlirt absorbs that complexity so your engineers can focus on the analysis, not the infrastructure.

Technical Spec

Cigna scraper technical capabilities

Everything supported by our cigna.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

Provider directory search
Extract individual practitioners, specialties, and contact details
Supported
Drug formulary tiers
Map NDC codes to coverage tiers and authorisation rules
Supported
Medicare Advantage plans
Extract specific network details for Medicare populations
Supported
US residential proxy routing
Bypass geographic and datacenter IP blocks using ISP proxies
Supported
NPI validation
Format checking and validation for National Provider Identifiers
Supported
Geographic radius iteration
Grid-based search algorithms for complete national coverage
Supported
Change detection (diffs)
Hash-based diffs to identify provider network additions and drops
Supported
Machine-readable files (MRF)
Parsing large JSON files for Transparency in Coverage mandates
Supported
Member claims data (PHI)
Protected health information and individual claims history
Partial
myCigna authenticated portal
Data behind member login walls requiring user credentials
Partial
Individual negotiated rates
Patient-specific contracted rates requiring authentication
Partial
Infrastructure

Infrastructure powering the Cigna pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Scrapy + Playwright Stack

Scrapy handles crawl orchestration and retry logic. Playwright handles JavaScript rendering, state tokens, and complex search interactions on Cigna portal interfaces.

US-Targeted Proxy Infrastructure

We maintain pools of US residential ISP proxies. Rotation happens per-request with sticky sessions to mimic legitimate patient search behaviour and avoid rate limits.

Cloud-Native Orchestration

Pipelines run on AWS Lambda and ECS. Airflow handles scheduling, geographic grid iteration, and SLA alerting. All state stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested array structures
CSV
Flat file with typed columns for direct analysis
XLS
Excel compatible exports for business teams
Parquet
Columnar format optimised for analytical warehouses
AWS S3
Direct bucket delivery compatible with data lakes
Webhook
HTTP POST per record for real-time processing
API
REST endpoints to query your extracted datasets
BigQuery
Streamed directly into your dataset with schema auto-detect
Snowflake
Stage and COPY INTO workflow for incremental updates
PostgreSQL
Upsert into your existing schema with conflict resolution
S3
Direct bucket delivery — compatible with any data lake
// faq

Common questions.

About cigna.com scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Cigna legal?

Scraping publicly available directory information is generally permissible under applicable US law. DataFlirt targets only public, non-authenticated provider and formulary data. We do not extract Protected Health Information (PHI), circumvent authentication walls, or violate HIPAA. Clients should review Cigna terms of service and consult legal counsel.

How do you handle Cigna rate limits?

We use US-based residential ISP proxies, full Playwright browser sessions, and request timing modelled on human behaviour. We monitor for blocking in real time and trigger IP rotation automatically.

Can you extract data by specific plan networks?

Yes. We can configure the pipeline to target specific networks such as Open Access Plus (OAP), HMO, PPO, or specific Medicare Advantage plans.

How do you ensure complete geographic coverage?

We use an algorithmic grid search across US ZIP codes. By calculating overlapping radii, we ensure the crawler captures all providers in both dense urban centres and rural areas without missing records.

Do you extract Medicare Advantage data?

Yes. We extract full plan details, network composition, and provider participation specific to Medicare Advantage offerings.

How fresh is the directory data?

We typically configure weekly or monthly refreshes for healthcare directories. The exact cadence depends on your specific data requirements and warehouse ingestion limits.

Can you parse Transparency in Coverage MRFs?

Yes. We have infrastructure designed to download, parse, and flatten the multi-gigabyte Machine-Readable Files (MRFs) mandated by the Transparency in Coverage rule.

Do you scrape patient portals like myCigna?

No. We do not bypass authentication walls or interact with any systems that house Protected Health Information (PHI).

$ dataflirt scope --new-project --source=cigna.com ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a national provider directory dump or continuous formulary monitoring, we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →