We extract company profiles, DUNS numbers, corporate hierarchies, and executive data from Dun & Bradstreet. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.
Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.
Complete list of extractable fields for Company Overview objects from dnb.com. All fields typed and schema-versioned.
"duns_number": "00-123-4567", "company_name": "Acme Manufacturing Ltd", "legal_name": "Acme Manufacturing Limited", "city": "London", "country": "United Kingdom", "year_founded": 1985, "website_url": "https://www.acmemfg.co.uk"
| # | duns_number | company_name | legal_name | address_line_1 | city | state |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Firmographics objects from dnb.com. All fields typed and schema-versioned.
"duns_number": "00-123-4567", "revenue_estimated_usd": 45000000.0, "employee_count": 250, "industry_primary": "Manufacturing", "naics_code": "332710", "sic_code": "3599", "company_type": "Private"
| # | duns_number | revenue_estimated_usd | employee_count | industry_primary | naics_code | naics_description |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Executives objects from dnb.com. All fields typed and schema-versioned.
"duns_number": "00-123-4567", "full_name": "Jane Doe", "job_title": "Chief Executive Officer", "department": "Executive", "management_level": "C-Level", "board_member": true
| # | duns_number | executive_id | full_name | job_title | department | management_level |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Corporate Hierarchy objects from dnb.com. All fields typed and schema-versioned.
"duns_number": "00-123-4567", "parent_duns": "00-987-6543", "parent_name": "Acme Global Holdings", "ultimate_parent_duns": "00-987-6543", "hierarchy_level": "Subsidiary", "subsidiary_count": 2
| # | duns_number | company_name | parent_duns | parent_name | ultimate_parent_duns | ultimate_parent_name |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Location Data objects from dnb.com. All fields typed and schema-versioned.
"duns_number": "00-123-4567", "location_type": "Headquarters", "city": "London", "country_iso": "GB", "latitude": 51.5074, "longitude": -0.1278, "postal_code": "EC1A 1BB"
| # | duns_number | location_type | address_full | street | city | state_province |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Our D&B scraper handles the business directory layer: firmographic profiles, corporate hierarchies, executive lists, and industry classifications, with bypass mechanisms for regional blocks and strict rate limits.
Extract revenue estimates, employee headcounts, year founded, and company descriptions for millions of public and private entities.
Capture the unique nine-digit Data Universal Numbering System identifier to match records against your existing CRM data.
Map parent, subsidiary, and branch relationships to understand ultimate beneficial ownership and corporate structures.
Extract key principals, C-suite executives, and board members associated with specific corporate entities.
Standardise your data with extracted NAICS, SIC, and proprietary D&B industry codes for precise market segmentation.
Scrape regional D&B directories across North America, Europe, and Asia to build international prospect lists.
Capture headquarters addresses, branch locations, and geographic coordinates for spatial analysis.
Monitor specific DUNS numbers for changes in executive leadership, revenue brackets, or corporate structure over time.
Navigate deep category and geographic search results to extract entire industry verticals without hitting display limits.
Brief in. Clean data out.
Provide target industries, geographies, or specific company names. We design the extraction schema together.
We configure Scrapy / Playwright crawlers, proxy rotation, session management, and CAPTCHA handling for dnb.com.
Schema validation, null-rate checks, and DUNS format verification before full launch.
JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.
Dun & Bradstreet protects its directory with aggressive rate limiting and bot detection. Here is how we maintain extraction stability.
D&B monitors request velocity strictly. We distribute requests across large residential ISP proxy pools and enforce strict delays between requests to mimic organic browsing behaviour and prevent IP bans.
Many directory pages load content dynamically via JavaScript. We use Playwright to execute scripts, trigger lazy-loaded elements, and extract data that simple HTTP requests cannot see.
Directory search results often cap visible pages. We use granular search parameters across postal codes and sub-industries to reduce result sets below the cap, ensuring complete coverage of a target sector.
We maintain multiple fallback selectors for firmographic data points, as D&B frequently updates profile layouts and obfuscates class names in their frontend code.
Revenue figures, employee counts, and addresses are cleaned and cast into correct data types before delivery, ensuring they are immediately ready for database insertion.
Sales operations teams append DUNS numbers, revenue data, and accurate industry codes to incomplete Salesforce or HubSpot records.
Data engineering teams use D&B profiles as a source of truth to deduplicate and standardise vendor and customer databases.
Marketing teams extract target accounts by specific NAICS codes and revenue brackets to build highly segmented outbound campaigns.
Compliance officers map corporate hierarchies to identify ultimate beneficial owners and assess third-party risk exposure.
Private equity firms analyze aggregate employee and revenue data across specific sectors to model total addressable market size.
Procurement teams automate vendor verification by matching submitted details against public D&B registry records.
"Dun & Bradstreet is the foundational registry of global commerce, but mapping millions of corporate entities requires purpose-built extraction infrastructure."
Most teams underestimate the investment required: reliable dnb.com scraping requires residential proxies, strict rate-limit management, CAPTCHA handling, and daily selector maintenance. DataFlirt absorbs that complexity so your engineers can focus on the analysis, not the infrastructure.
Everything supported by our dnb.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.
Open-source tooling on proven cloud infra — no vendor lock-in, full observability.
Scrapy handles crawl orchestration, deduplication, and retry logic. Playwright handles JavaScript rendering, cookie sessions, and interaction flows. Combined via scrapy-playwright middleware.
We maintain pools of residential ISP proxies across target regions. Rotation happens per-request with sticky sessions where required. IP score monitoring prevents blacklisted pool contamination.
Pipelines run on AWS Lambda (burst) and ECS (sustained). Airflow handles scheduling, dependency management, and SLA alerting. All state stored in managed Postgres.
Data delivered to where your team already works — no new tooling required.
About dnb.com scraping, legality, and pipeline operations.
Ask us directly →Scraping publicly available directory information is generally permissible under applicable law. DataFlirt targets only public, non-authenticated company profiles and executive data. We do not circumvent authentication walls to access paid credit reports or proprietary financial scores. Clients should review terms of service and consult legal counsel for specific use cases.
We use large residential ISP proxy pools and throttle request concurrency to stay within acceptable thresholds. Our infrastructure mimics human browsing behaviour to prevent automated blocks.
Yes, we can map parent and subsidiary relationships where they are explicitly listed in the public directory profiles, providing a structured view of corporate family trees.
No. D&B credit scores and detailed financial risk metrics are gated behind paid authentication walls (such as D&B Finance Analytics). We only extract publicly accessible firmographic data.
Yes. You can provide a list of company names and addresses, and we will configure the pipeline to search the directory and return the corresponding DUNS numbers and profile data.
We can configure pipelines to run on your required cadence, whether that is a one-off historical extraction or a monthly refresh to detect changes in leadership or company status.
Our minimum engagement typically starts at 10,000 target companies or a specific industry vertical. Contact us with your target criteria for a custom scoping and quote.
20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off firmographic dump or a continuous CRM enrichment feed across millions of entities, we scope, build, and operate the pipeline. Tell us what you need.