SYSTEM all green source jameda.de queue 12,409 profiles p99 latency 184ms dataflirt.com · scraper/jameda-de
RUN · 42 active pipelines · jameda.de live

German healthcare data,
at warehouse scale.

We extract physician profiles, clinic locations, insurance acceptance, patient reviews, and calendar availability from Jameda. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Doctors extracted
284K /run
Reviews captured
3.1M /run
Calendar slots
840K /24h
Active pipelines
42
Uptime
99.94%
Data Dictionary

Every field we extract from jameda.de

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Doctor Profiles objects from jameda.de. All fields typed and schema-versioned.

doctor_idnametitlespecialtysub_specialtiesprofile_urlimage_urlabout_textqualificationslanguagesmembershipsawardstotal_reviewsaverage_rating
doctor_profiles
● 200 OK
"doctor_id": "dr-med-max-mustermann-123",
"name": "Dr. med. Max Mustermann",
"title": "Dr. med.",
"specialty": "Orthopädie",
"total_reviews": 142,
"average_rating": 4.8,
"languages": "['Deutsch', 'Englisch']"
# doctor_idnametitlespecialtysub_specialtiesprofile_url
1
2
3

Complete list of extractable fields for Clinic & Location Data objects from jameda.de. All fields typed and schema-versioned.

location_iddoctor_idclinic_namestreetzip_codecitystatelatitudelongitudephonewebsitewheelchair_accessibleparking_availablepublic_transport
clinic_& location data
● 200 OK
"clinic_name": "Orthopädie Zentrum München",
"street": "Marienplatz 1",
"zip_code": "80331",
"city": "München",
"latitude": 48.137154,
"longitude": 11.576124,
"wheelchair_accessible": true
# location_iddoctor_idclinic_namestreetzip_codecity
1
2
3

Complete list of extractable fields for Patient Reviews objects from jameda.de. All fields typed and schema-versioned.

review_iddoctor_idrating_overallrating_treatmentrating_educationrating_trustrating_wait_timerating_friendlinessreview_titlereview_textreview_datepatient_insurance_typepatient_age_group
patient_reviews
● 200 OK
"review_id": "rev-987654321",
"rating_overall": 5.0,
"rating_wait_time": 4.5,
"review_text": "Sehr kompetenter Arzt, nimmt sich viel Zeit für die Patienten.",
"review_date": "2026-03-15",
"patient_insurance_type": "Gesetzlich versichert",
"patient_age_group": "30-50"
# review_iddoctor_idrating_overallrating_treatmentrating_educationrating_trust
1
2
3

Complete list of extractable fields for Appointment Availability objects from jameda.de. All fields typed and schema-versioned.

doctor_idlocation_idvisit_typeinsurance_acceptednext_available_dateavailable_slotsslot_timesis_video_consultationnew_patients_acceptedbooking_url
appointment_availability
● 200 OK
"doctor_id": "dr-med-max-mustermann-123",
"visit_type": "Erstuntersuchung",
"insurance_accepted": "['Gesetzlich', 'Privat']",
"next_available_date": "2026-05-20",
"available_slots": 4,
"is_video_consultation": false,
"new_patients_accepted": true
# doctor_idlocation_idvisit_typeinsurance_acceptednext_available_dateavailable_slots
1
2
3

Complete list of extractable fields for Services & Pricing objects from jameda.de. All fields typed and schema-versioned.

doctor_idservice_nameservice_categorydescriptionprice_minprice_maxcurrencycovered_by_public_insurancecovered_by_private_insuranceduration_minutes
services_& pricing
● 200 OK
"service_name": "Stoßwellentherapie",
"service_category": "IGeL-Leistungen",
"price_min": 85.0,
"currency": "EUR",
"covered_by_public_insurance": false,
"covered_by_private_insurance": true,
"duration_minutes": 20
# doctor_idservice_nameservice_categorydescriptionprice_minprice_max
1
2
3

Capabilities

Deep healthcare directory extraction

Our Jameda scraper navigates complex directory structures, premium vs standard profile layouts, and dynamic booking calendars to extract structured physician data while maintaining strict GDPR compliance.

Full Physician Profiles

Extract names, titles, qualifications, languages, specialties, and sub-specialties across both free and premium Jameda profiles.

Granular Review Extraction

Capture overall ratings, wait times, friendliness scores, and full review text. Paginate through all historical patient feedback.

Calendar & Availability Polling

Extract next available slots, video consultation flags, and available appointment times by interacting with the dynamic booking widgets.

Insurance Mapping

Determine public vs private insurance acceptance per doctor and per specific medical service offered.

Location & Facility Data

Extract clinic addresses, geographic coordinates, wheelchair accessibility, and public transport options.

Treatment Catalogues

Scrape specific procedures offered, including IGeL services, with pricing and duration details if listed by the physician.

Regional Search Scraping

Iterate through German zip codes (PLZ) and cities to ensure complete coverage of specific medical specialties.

JavaScript Rendering

Full Playwright execution to hydrate booking calendars and reveal obfuscated contact details.

Anti-Bot Circumvention

Handle Docplanner's Cloudflare and Datadome protections using German residential proxy rotation and fingerprinting.

Change Detection

Only emit updates when a doctor changes clinic, updates availability, or receives a new patient review.

// engagement pipeline

From PLZ list to warehouse record

Brief in. Clean data out.

Define Scope
d 0

Provide target cities, zip codes, medical specialties, or specific doctor URLs. We design the extraction schema together.

Pipeline Build
d 2–4

We configure Playwright crawlers, German residential proxies, and interaction flows for calendar widgets.

Validation & QA
d 4–6

Schema validation, null-rate checks on reviews, and calendar parsing tests before full launch.

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our Jameda pipeline handles the hard parts

Healthcare directories use strict rate limiting and complex JavaScript to protect their databases. Here is how we maintain stable extraction.

pipeline-monitor · jameda.de · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Anti-bot layer
German residential IPs to bypass regional blocks

Jameda and the broader Docplanner network employ strict rate limits and Datadome challenges. We use premium German residential proxies to distribute requests naturally and avoid geographic blocking.

JavaScript rendering
Playwright sessions for dynamic widgets

Booking calendars and paginated reviews are heavily JavaScript-dependent. We run full browser sessions to simulate user clicks, ensuring all available appointment slots are hydrated and captured.

GDPR Compliance
Strict filtering of personal data

We only extract public directory data and anonymised patient reviews. Our pipelines are configured to drop any fields that might contain patient PII, ensuring your datasets remain fully compliant.

Calendar state management
Interaction scripts for appointment polling

Extracting availability requires clicking through calendar months and selecting insurance types. Our automated flows handle these interactions reliably to map out future availability.

Schema stability
Normalising premium vs standard profiles

Jameda displays premium doctor profiles differently from basic listings. Our selector strategy uses fallback chains to normalise data across all profile types into a single, clean schema.

Applications

Who uses Jameda data — and how

Teams across industries use jameda.de data to build competitive products and smarter operations.

01
Competitor Analysis

Healthcare networks track competitor clinic locations, doctor counts, and specialties across specific German regions.

02
Reputation Management

Agencies monitor patient reviews and wait time ratings across specific medical groups to improve patient satisfaction.

03
Market Research

Pharma and medtech companies map physician specialties, languages spoken, and treatment focus areas to optimise sales territories.

04
Lead Generation

Medical software vendors identify doctors still using basic booking systems or lacking online appointment availability.

05
Patient Access Studies

Researchers analyse wait times, review sentiment, and appointment availability across different German states.

06
Insurance Network Mapping

Insurers audit public vs private acceptance rates across specialties to identify gaps in coverage networks.

Why DataFlirt

"Jameda holds the most comprehensive map of German healthcare availability, but extracting calendar data requires complex browser automation."

Scraping healthcare directories is technically demanding due to strict rate limits, regional IP blocking, and heavy JavaScript reliance for booking widgets. DataFlirt manages the proxy rotation, browser sessions, and schema maintenance so your data engineering team receives clean, structured records.

Technical Spec

Jameda scraper — technical capabilities

Everything supported by our jameda.de scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

JavaScript rendering
Playwright execution required for calendar widgets and obfuscated contact details
Supported
German residential proxies
ISP-grade residential IPs from DE pools to bypass geo-blocking
Supported
Review pagination
Extracting all historical reviews across multiple pages
Supported
Premium vs Standard profiles
Normalised schema across varying profile layouts
Supported
Search by Zip/Radius
Automated geographic traversal using PLZ and radius parameters
Supported
Change detection
Hash-based diffs to identify new reviews or changed availability
Supported
Historical rating tracking
Time-series data for doctor scores and review counts
Supported
Patient PII extraction
Names or contact details of reviewers (GDPR restriction)
Partial
Logged-in appointment booking
Automated booking actions requiring patient accounts
Partial
Private messaging extraction
Direct doctor-patient communications via the platform
Partial
Infrastructure

Infrastructure powering the Jameda pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Scrapy + Playwright Stack

Scrapy handles crawl orchestration and deduplication, while Playwright manages JavaScript execution for booking calendars and dynamic content.

Localised Proxy Infrastructure

We maintain pools of German residential ISP proxies to avoid regional blocks and bypass Datadome anti-bot protections.

Cloud-Native Orchestration

Pipelines run on AWS ECS with Airflow handling scheduling, dependency management, and SLA alerting. State is stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested — schema versioned per run
CSV
Flat file with typed columns — Excel/Sheets compatible
XLS
Excel format for direct business analyst consumption
Parquet
Columnar format for BigQuery, Snowflake, Athena
AWS S3
Direct bucket delivery — compatible with any data lake
Webhook
HTTP POST per record for real-time downstream processing
API
REST endpoint to query your extracted datasets
PostgreSQL
Upsert into your existing schema with conflict resolution
Snowflake
Stage + COPY INTO workflow — incremental or full-replace
S3
Direct bucket delivery — compatible with any data lake
// faq

Common questions.

About jameda.de scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Jameda legal?

Extracting public business profiles and anonymised reviews is generally permissible. We strictly avoid scraping patient PII to maintain GDPR compliance. Clients should review Jameda's ToS and consult legal counsel for specific use cases.

How do you handle calendar extraction?

We use Playwright to simulate user clicks on the booking widget, extracting available dates and times for both public and private insurance patients.

Can you target specific German regions?

Yes, we can configure the pipeline to iterate through specific PLZ (zip codes) or cities, focusing on exact medical specialties.

How frequently can you update availability data?

Calendar data can be polled daily or hourly depending on the scope of the target doctor list and the required geographic coverage.

Do you extract premium profile data?

Yes, our schema normalises data across both free and premium Jameda profiles, capturing all available fields including extended treatment catalogues.

How do you bypass Docplanner anti-bot protections?

We utilise German residential ISP proxies, realistic browser fingerprinting, and automated CAPTCHA solvers to maintain high success rates against Datadome and Cloudflare.

$ dataflirt scope --new-project --source=jameda.de ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a full directory export of German physicians or daily monitoring of appointment availability — we scope, build, and operate the pipeline.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →