Dune Scraper — Blockchain Query & Dashboard Data Extraction

Data Dictionary

Every field we extract from dune.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Dashboards objects from dune.com. All fields typed and schema-versioned.

dashboard_idtitleslugauthorcreated_atupdated_attagsstarsviewsfork_countwidget_count

"dashboard_id": "d_29481",
"title": "Ethereum Gas Tracker",
"author": "hildobby",
"views": 482910,
"stars": 1204,
"tags": "['ethereum', 'gas', 'network']"

#	dashboard_id	title	slug	author	created_at	updated_at
1
2
3

Complete list of extractable fields for SQL Queries objects from dune.com. All fields typed and schema-versioned.

query_idnamedescriptionauthor_handleraw_sqldatasetcreated_atlast_execution_timeexecution_statusis_private

"query_id": "q_847192",
"name": "Daily Active Users",
"author_handle": "optimism_pbc",
"dataset": "dune_sql",
"raw_sql": "SELECT date_trunc('day', block_time) AS day, COUNT(DISTINCT "from") AS users FROM optimism.transactions GROUP BY 1",
"execution_status": "success"

#	query_id	name	description	author_handle	raw_sql	dataset
1
2
3

Complete list of extractable fields for Query Results objects from dune.com. All fields typed and schema-versioned.

execution_idquery_idcolumnsrow_countbyte_sizerun_time_mssubmitted_atcompleted_atresult_data

"execution_id": "exec_01h2v3",
"query_id": "q_847192",
"row_count": 365,
"run_time_ms": 4210,
"completed_at": "2023-10-12T14:32:01Z",
"columns": "['day', 'users']"

#	execution_id	query_id	columns	row_count	byte_size	run_time_ms
1
2
3

Complete list of extractable fields for User Profiles objects from dune.com. All fields typed and schema-versioned.

user_idhandledisplay_namebiotwitter_handlegithub_handlejoined_datefollower_countfollowing_countdashboard_countquery_count

"handle": "hildobby",
"display_name": "hildobby",
"follower_count": 14291,
"dashboard_count": 42,
"query_count": 312,
"joined_date": "2021-04-12T00:00:00Z"

#	user_id	handle	display_name	bio	twitter_handle	github_handle
1
2
3

Complete list of extractable fields for Trending Metrics objects from dune.com. All fields typed and schema-versioned.

rank_positiondashboard_idtitleauthortime_windowview_growthstar_growthcategoryscraped_at

"rank_position": 1,
"dashboard_id": "d_91823",
"title": "Friend.tech metrics",
"view_growth": 482,
"time_window": "24h",
"scraped_at": "2023-10-12T15:00:00Z"

#	rank_position	dashboard_id	title	author	time_window	view_growth
1
2
3

Capabilities

Extract the logic behind Web3 analytics

Our Dune scraper parses the underlying SQL, dashboard configurations, and raw execution results — bypassing complex client-side rendering and internal API obscuration.

Dashboard Metadata Extraction

Capture title, author, tags, view counts, and star metrics across thousands of public dashboards.

SQL Query Parsing

Extract the raw PostgreSQL or DuneSQL syntax powering every public visualisation on the platform.

Result Set Capture

Download the actual tabular data returned by executed queries, avoiding manual CSV exports.

Trending & Discovery Tracking

Monitor the trending page to identify emerging protocols, narratives, and high-growth dashboards.

Author Profile Scraping

Map the top analysts in Web3 by extracting follower counts, social links, and total query output.

Widget & Visualisation Mapping

Link underlying SQL queries to their specific dashboard chart configurations and parameter inputs.

Multi-Chain Context

Identify datasets targeting Ethereum, Solana, Polygon, Arbitrum, and other EVM chains.

Fork & Lineage Tracking

Track query forks to understand how complex metrics are derived and modified by the community.

Execution Metrics

Monitor query run times, byte sizes, and row counts to optimise your own DuneSQL performance.

Scheduled Snapshots

Run pipelines to capture daily snapshots of dashboard metrics and trending positions.

// engagement pipeline

From dashboard URL to warehouse record

Brief in. Clean data out.

Define Scope

d 0

Provide dashboard URLs, author handles, or target categories. We design the extraction schema together.

Pipeline Build

d 2–4

We configure Scrapy / Playwright crawlers, proxy rotation, session management, and CAPTCHA handling for dune.com.

Validation & QA

d 4–6

Schema validation, null-rate checks, and data type normalisation before full launch.

Delivery

ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our Dune pipeline handles the hard parts

Dune relies heavily on Next.js hydration and internal GraphQL APIs. Here is how we extract the data cleanly.

// fingerprinting

Identity rotation

TLS fingerprintrandomised

User-agentrotated

IP poolresidential

Challenges blocked0

// pagination

Page coverage

48,291 pages queued running

// observability

Pipeline health

99.9%

uptime

142ms

p99 lat

0.3%

null rate

alerts

SPA architecture

Next.js state hydration capture

Dune is a heavy Single Page Application. We intercept the Next.js build manifest and extract the raw JSON state hydration props, bypassing the need to scrape the rendered DOM entirely.

WAF bypass

Cloudflare and rate limit circumvention

Dune employs strict rate limiting and Cloudflare protection on its API routes. We distribute requests across residential IP pools with browser-matched TLS fingerprints to maintain access.

Internal API routing

Reverse engineering query execution

To extract query results, our pipeline maps the internal API calls used by the Dune frontend, allowing us to fetch the raw JSON result sets rather than attempting to parse HTML tables.

Complex JSON structures

Parsing nested widget configurations

Dashboard layouts are stored as deeply nested JSON objects. We flatten these structures into relational formats, linking specific visualisations to their underlying SQL query IDs.

Pagination handling

Deep navigation of user profiles

Top analysts have thousands of queries. Our crawlers handle the specific pagination tokens required to extract complete historical query lists without hitting timeout limits.

Applications

Who uses Dune data — and how

Teams across industries use dune.com data to build competitive products and smarter operations.

Protocol Competitive Analysis

Track TVL, user growth, and volume metrics across competing chains by aggregating public dashboard data.

Tokenomics Research

Analyse token distribution, staking ratios, and velocity by extracting the underlying SQL logic used by top researchers.

Alpha Generation

Monitor trending dashboards for emerging narratives and new protocol launches before they hit mainstream crypto media.

Developer Intelligence

Identify top query authors and analysts for recruitment or partnership by tracking view counts and fork metrics.

On-Chain Forensics

Aggregate queries tracking exploits, hacks, or wallet movements to build internal security databases.

Alternative Data for Quants

Feed structured query results into trading models to correlate on-chain activity with price action.

Technical Spec

Dune scraper — technical capabilities

Everything supported by our dune.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

JavaScript rendering

Full Playwright sessions to trigger initial state hydration

Supported

Cloudflare bypass

Automated solver integration and TLS fingerprinting

Supported

Internal API interception

Direct capture of GraphQL and REST responses from the frontend

Supported

Query result pagination

Handling offset/limit tokens for large result sets

Supported

Raw SQL extraction

Capture of the exact syntax used in the query editor

Supported

Dashboard widget mapping

Linking visualisations to their data sources

Supported

Private team queries

Queries marked as private by Dune premium teams

Partial

Premium execution tiers

Triggering high-compute clusters without authentication

Partial

Historical execution logs

Access to previous run states and metrics

Supported

Infrastructure

Infrastructure powering the Dune pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus

SPA Hydration Capture

We parse Next.js __NEXT_DATA__ objects directly from the DOM, avoiding brittle CSS selectors and extracting the raw JSON state.

WAF Circumvention

Residential proxy pools combined with TLS fingerprint spoofing ensure our requests bypass Cloudflare's bot mitigation layers.

High-Volume JSON Parsing

Our pipelines flatten deeply nested dashboard configurations and query results into normalised, warehouse-ready schemas.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON

Nested structures preserving dashboard widget relationships

CSV

Flat tabular data for query results and user profiles

XLS

Excel format for direct analyst consumption

Parquet

Columnar format optimised for analytical queries

AWS S3

Direct delivery to your cloud storage buckets

Webhook

HTTP POST for real-time trending dashboard alerts

API

RESTful endpoints to query extracted datasets

BigQuery

Direct streaming into Google Cloud data warehouses

Snowflake

Automated staging and loading into Snowflake

PostgreSQL

Relational upserts into your existing database schema

Direct bucket delivery — compatible with any data lake

// faq

Common questions.

About dune.com scraping, legality, and pipeline operations.

Ask us directly →

Is scraping Dune legal?

Scraping publicly available information is generally permissible. DataFlirt only targets public dashboards, queries, and profiles on Dune. We do not attempt to bypass authentication to access private team data. Clients should review Dune's Terms of Service and consult legal counsel for specific use cases.

How do you handle Cloudflare protection?

We utilise residential ISP proxies, browser-matched TLS fingerprints, and automated solvers to navigate WAF challenges without triggering blocks or CAPTCHAs.

Can you extract the raw SQL queries?

Yes. We extract the exact PostgreSQL or DuneSQL syntax written by the author for any public query.

Do you support extracting the actual query result data?

Yes. We intercept the internal API responses to capture the tabular data returned by the query execution, up to the limits displayed on the public frontend.

How fresh is the trending data?

We can configure pipelines to poll the trending dashboards page at hourly intervals, providing near real-time visibility into emerging narratives.

What is the minimum viable engagement?

Engagements typically start with a defined list of target authors, categories, or a continuous feed of the top 1,000 trending dashboards. Contact us to scope your specific volume requirements.

Can you scrape private dashboards?

No. DataFlirt strictly targets publicly accessible data. We do not extract information gated behind private team accounts or premium authentication.

Dune analytics data,
at warehouse scale.

Every field we extract from dune.com

Extract the logic behind Web3 analytics

From dashboard URL to warehouse record

How our Dune pipeline handles the hard parts

Who uses Dune data — and how

Dune scraper — technical capabilities

Infrastructure powering the Dune pipeline

Your data, your destination

Common questions.

Tell us what
to extract.
We do the rest.

Data Extraction for Every Industry

Dune analytics data, at warehouse scale.

Every field we extract from dune.com

Extract the logic behind Web3 analytics

From dashboard URL to warehouse record

How our Dune pipeline handles the hard parts

Who uses Dune data — and how

Dune scraper — technical capabilities

Infrastructure powering the Dune pipeline

Your data, your destination

Common questions.

Tell us whatto extract. We do the rest.

Data Extraction for Every Industry

Dune analytics data,
at warehouse scale.

Tell us what
to extract.
We do the rest.