We extract company profiles, executive contacts, product classifications, and financial indicators from Kompass. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.
Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.
Complete list of extractable fields for Company Profiles objects from kompass.com. All fields typed and schema-versioned.
"company_id": "FR1234567", "name": "TechCorp Solutions SAS", "country": "France", "city": "Paris", "year_established": 1998, "legal_form": "SAS", "phone": "+33 1 23 45 67 89"
| # | company_id | name | country | city | address | phone |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Executive Contacts objects from kompass.com. All fields typed and schema-versioned.
"contact_id": "CNT-98765", "company_id": "FR1234567", "first_name": "Jean", "last_name": "Dupont", "job_title": "Chief Technology Officer", "department": "IT", "management_level": "C-Level"
| # | contact_id | company_id | first_name | last_name | job_title | department |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Products & Services objects from kompass.com. All fields typed and schema-versioned.
"company_id": "FR1234567", "category_name": "Software Development Services", "kompass_code": "85210", "is_service_provider": true, "is_producer": false, "is_distributor": false
| # | company_id | category_name | kompass_code | description | is_producer | is_distributor |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Financial & Activity Data objects from kompass.com. All fields typed and schema-versioned.
"company_id": "FR1234567", "turnover_range": "10M - 50M EUR", "employee_count": "250-499", "export_regions": "['Europe', 'North America']", "capital_amount": "500000 EUR", "fiscal_year": 2024
| # | company_id | turnover_range | employee_count | import_regions | export_regions | bank_name |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Search & Category Results objects from kompass.com. All fields typed and schema-versioned.
"keyword": "industrial valves", "position": 1, "company_id": "DE7654321", "company_name": "ValveTech GmbH", "location": "Berlin, Germany", "verified_badge": true
| # | keyword | category_path | position | company_id | company_name | location |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Our Kompass scraper handles complex category hierarchies, geographic pagination, and multilingual directories to deliver unified B2B company data.
Extract business names, registration numbers, addresses, contact details, and descriptions across 70 countries.
Capture decision makers, job titles, and management hierarchies associated with each company profile.
Map businesses to specific Kompass product and service codes, identifying them as producers, distributors, or service providers.
Scrape directory data across different regional subdomains to capture localised business information.
Extract turnover ranges, employee headcount brackets, and registered capital figures where publicly available.
Identify active trading regions and countries for companies engaged in international commerce.
Link parent companies to their regional branches and subsidiaries using internal directory references.
Run monthly or quarterly updates to track changes in executive personnel, address relocations, or new product classifications.
Normalise inconsistent address formats and telephone numbers into a strict, queryable JSON or CSV schema.
Brief in. Clean data out.
Provide target countries, Kompass classification codes, or keyword sets. We design the extraction schema together.
We configure Scrapy crawlers, regional proxy rotation, session management, and CAPTCHA handling for kompass.com.
Schema validation, null-rate checks, and sample data delivery before full launch.
JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.
Kompass restricts access through pagination limits, IP tracking, and CAPTCHAs. We manage the infrastructure required for deep extraction.
Directory search results often truncate after a specific number of pages. We bypass this by recursively querying sub-categories, postal codes, and employee size filters to extract the full underlying dataset.
High-volume requests from datacenter IPs trigger immediate blocks. We route traffic through residential ISP proxies matching the target region, rotating IPs to distribute the request load.
B2B directories contain varied formats for phone numbers, addresses, and legal entities. Our pipeline applies regular expressions and standardisation logic to ensure clean, warehouse-ready data.
Every run emits structured logs to our observability stack. We alert on null-rate spikes, schema drift, and coverage drops, ensuring reliable data delivery.
Directory sites frequently deploy CAPTCHAs during sustained navigation. We integrate automated solving services to maintain pipeline throughput without manual intervention.
Sales teams extract targeted lists of companies and executives based on specific industry codes and geographic regions.
Procurement departments map alternative suppliers by filtering manufacturers and distributors within specific Kompass product categories.
Analysts assess industry concentration, company size distributions, and geographic clusters for specific B2B sectors.
RevOps teams update stale CRM records with current addresses, phone numbers, and executive contacts from the directory.
Strategy teams track competitor branch expansions, product category additions, and export market activity.
Enterprise data teams cross-reference internal vendor lists against Kompass profiles to validate legal entities and registration numbers.
"B2B directories hold critical firmographic data, but extracting complete category hierarchies requires navigating complex pagination and strict anti-bot measures."
Most teams struggle with directory scraping because results are artificially truncated and heavily monitored for automated access. DataFlirt manages the proxy rotation, deep-search querying, and schema normalisation so your data engineering team receives clean firmographics without the operational overhead.
Everything supported by our kompass.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.
Open-source tooling on proven cloud infra — no vendor lock-in, full observability.
Scrapy handles high-concurrency crawl orchestration, request deduplication, and retry logic for deep directory traversal.
We maintain pools of residential ISP proxies mapped to target directory regions. Rotation happens per-request to avoid rate limits.
Pipelines run on AWS Lambda and ECS. Airflow handles scheduling, dependency management, and SLA alerting. All state stored in managed Postgres.
Data delivered to where your team already works — no new tooling required.
About kompass.com scraping, legality, and pipeline operations.
Ask us directly →Scraping publicly available firmographic information is generally permissible. DataFlirt extracts only public company profiles and directory listings. We do not circumvent authentication walls to access premium paid data. Clients should review Kompass terms of service and consult legal counsel for specific commercial use cases.
Directory searches often limit results to a few thousand records per query. We bypass this by systematically applying granular filters, such as postal codes, specific employee size brackets, and sub-categories, ensuring we capture the entire underlying dataset.
We extract contact information that is publicly visible on the company profile page. Direct executive email addresses are typically gated behind Kompass premium subscriptions and are not included in public scraping pipelines.
Yes. We can target specific Kompass regional subdomains and apply geographic filters to extract companies registered only in your target markets.
Our pipeline includes a normalisation layer that standardises telephone numbers, formats addresses into constituent parts, and maps varying legal entity types to a consistent schema.
For B2B directories, clients typically request monthly or quarterly full-refreshes to capture new registrations and updated executive contacts. One-off historical extractions are also supported.
20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off extraction of specific industry sectors or a continuous feed of firmographic updates, we scope, build, and operate the pipeline. Tell us what you need.