SYSTEM all green source dune.com queue 12,841 dashboards p99 latency 312ms dataflirt.com · scraper/dune-com
RUN · 42 active pipelines · dune.com live

Dune analytics data,
at warehouse scale.

We extract public dashboards, SQL queries, execution metrics, and trending Web3 analytics from Dune. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Dashboards extracted
14.2K /day
Queries parsed
89.4K /24h
Data points
2.1M /run
Active pipelines
42
Uptime
99.94%
Data Dictionary

Every field we extract from dune.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Dashboards objects from dune.com. All fields typed and schema-versioned.

dashboard_idtitleslugauthorcreated_atupdated_attagsstarsviewsfork_countwidget_count
dashboards
● 200 OK
"dashboard_id": "d_29481",
"title": "Ethereum Gas Tracker",
"author": "hildobby",
"views": 482910,
"stars": 1204,
"tags": "['ethereum', 'gas', 'network']"
# dashboard_idtitleslugauthorcreated_atupdated_at
1
2
3

Complete list of extractable fields for SQL Queries objects from dune.com. All fields typed and schema-versioned.

query_idnamedescriptionauthor_handleraw_sqldatasetcreated_atlast_execution_timeexecution_statusis_private
sql_queries
● 200 OK
"query_id": "q_847192",
"name": "Daily Active Users",
"author_handle": "optimism_pbc",
"dataset": "dune_sql",
"raw_sql": "SELECT date_trunc('day', block_time) AS day, COUNT(DISTINCT "from") AS users FROM optimism.transactions GROUP BY 1",
"execution_status": "success"
# query_idnamedescriptionauthor_handleraw_sqldataset
1
2
3

Complete list of extractable fields for Query Results objects from dune.com. All fields typed and schema-versioned.

execution_idquery_idcolumnsrow_countbyte_sizerun_time_mssubmitted_atcompleted_atresult_data
query_results
● 200 OK
"execution_id": "exec_01h2v3",
"query_id": "q_847192",
"row_count": 365,
"run_time_ms": 4210,
"completed_at": "2023-10-12T14:32:01Z",
"columns": "['day', 'users']"
# execution_idquery_idcolumnsrow_countbyte_sizerun_time_ms
1
2
3

Complete list of extractable fields for User Profiles objects from dune.com. All fields typed and schema-versioned.

user_idhandledisplay_namebiotwitter_handlegithub_handlejoined_datefollower_countfollowing_countdashboard_countquery_count
user_profiles
● 200 OK
"handle": "hildobby",
"display_name": "hildobby",
"follower_count": 14291,
"dashboard_count": 42,
"query_count": 312,
"joined_date": "2021-04-12T00:00:00Z"
# user_idhandledisplay_namebiotwitter_handlegithub_handle
1
2
3

Complete list of extractable fields for Trending Metrics objects from dune.com. All fields typed and schema-versioned.

rank_positiondashboard_idtitleauthortime_windowview_growthstar_growthcategoryscraped_at
trending_metrics
● 200 OK
"rank_position": 1,
"dashboard_id": "d_91823",
"title": "Friend.tech metrics",
"view_growth": 482,
"time_window": "24h",
"scraped_at": "2023-10-12T15:00:00Z"
# rank_positiondashboard_idtitleauthortime_windowview_growth
1
2
3

Capabilities

Extract the logic behind Web3 analytics

Our Dune scraper parses the underlying SQL, dashboard configurations, and raw execution results — bypassing complex client-side rendering and internal API obscuration.

Dashboard Metadata Extraction

Capture title, author, tags, view counts, and star metrics across thousands of public dashboards.

SQL Query Parsing

Extract the raw PostgreSQL or DuneSQL syntax powering every public visualisation on the platform.

Result Set Capture

Download the actual tabular data returned by executed queries, avoiding manual CSV exports.

Trending & Discovery Tracking

Monitor the trending page to identify emerging protocols, narratives, and high-growth dashboards.

Author Profile Scraping

Map the top analysts in Web3 by extracting follower counts, social links, and total query output.

Widget & Visualisation Mapping

Link underlying SQL queries to their specific dashboard chart configurations and parameter inputs.

Multi-Chain Context

Identify datasets targeting Ethereum, Solana, Polygon, Arbitrum, and other EVM chains.

Fork & Lineage Tracking

Track query forks to understand how complex metrics are derived and modified by the community.

Execution Metrics

Monitor query run times, byte sizes, and row counts to optimise your own DuneSQL performance.

Scheduled Snapshots

Run pipelines to capture daily snapshots of dashboard metrics and trending positions.

// engagement pipeline

From dashboard URL to warehouse record

Brief in. Clean data out.

Define Scope
d 0

Provide dashboard URLs, author handles, or target categories. We design the extraction schema together.

Pipeline Build
d 2–4

We configure Scrapy / Playwright crawlers, proxy rotation, session management, and CAPTCHA handling for dune.com.

Validation & QA
d 4–6

Schema validation, null-rate checks, and data type normalisation before full launch.

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our Dune pipeline handles the hard parts

Dune relies heavily on Next.js hydration and internal GraphQL APIs. Here is how we extract the data cleanly.

pipeline-monitor · dune.com · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
SPA architecture
Next.js state hydration capture

Dune is a heavy Single Page Application. We intercept the Next.js build manifest and extract the raw JSON state hydration props, bypassing the need to scrape the rendered DOM entirely.

WAF bypass
Cloudflare and rate limit circumvention

Dune employs strict rate limiting and Cloudflare protection on its API routes. We distribute requests across residential IP pools with browser-matched TLS fingerprints to maintain access.

Internal API routing
Reverse engineering query execution

To extract query results, our pipeline maps the internal API calls used by the Dune frontend, allowing us to fetch the raw JSON result sets rather than attempting to parse HTML tables.

Complex JSON structures
Parsing nested widget configurations

Dashboard layouts are stored as deeply nested JSON objects. We flatten these structures into relational formats, linking specific visualisations to their underlying SQL query IDs.

Pagination handling
Deep navigation of user profiles

Top analysts have thousands of queries. Our crawlers handle the specific pagination tokens required to extract complete historical query lists without hitting timeout limits.

Applications

Who uses Dune data — and how

Teams across industries use dune.com data to build competitive products and smarter operations.

01
Protocol Competitive Analysis

Track TVL, user growth, and volume metrics across competing chains by aggregating public dashboard data.

02
Tokenomics Research

Analyse token distribution, staking ratios, and velocity by extracting the underlying SQL logic used by top researchers.

03
Alpha Generation

Monitor trending dashboards for emerging narratives and new protocol launches before they hit mainstream crypto media.

04
Developer Intelligence

Identify top query authors and analysts for recruitment or partnership by tracking view counts and fork metrics.

05
On-Chain Forensics

Aggregate queries tracking exploits, hacks, or wallet movements to build internal security databases.

06
Alternative Data for Quants

Feed structured query results into trading models to correlate on-chain activity with price action.

Why DataFlirt

"Dune holds the most comprehensive community-driven Web3 analytics on the internet, but accessing it systematically requires navigating heavy client-side rendering and aggressive rate limits."

Extracting data from Dune requires more than simple HTTP requests. Their Next.js architecture heavily relies on client-side state hydration, internal API routing, and Cloudflare protection. DataFlirt manages the JavaScript execution, proxy rotation, and complex JSON parsing required to turn Dune's dashboards into queryable warehouse tables.

Technical Spec

Dune scraper — technical capabilities

Everything supported by our dune.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

JavaScript rendering
Full Playwright sessions to trigger initial state hydration
Supported
Cloudflare bypass
Automated solver integration and TLS fingerprinting
Supported
Internal API interception
Direct capture of GraphQL and REST responses from the frontend
Supported
Query result pagination
Handling offset/limit tokens for large result sets
Supported
Raw SQL extraction
Capture of the exact syntax used in the query editor
Supported
Dashboard widget mapping
Linking visualisations to their data sources
Supported
Private team queries
Queries marked as private by Dune premium teams
Partial
Premium execution tiers
Triggering high-compute clusters without authentication
Partial
Historical execution logs
Access to previous run states and metrics
Supported
Infrastructure

Infrastructure powering the Dune pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
SPA Hydration Capture

We parse Next.js __NEXT_DATA__ objects directly from the DOM, avoiding brittle CSS selectors and extracting the raw JSON state.

WAF Circumvention

Residential proxy pools combined with TLS fingerprint spoofing ensure our requests bypass Cloudflare's bot mitigation layers.

High-Volume JSON Parsing

Our pipelines flatten deeply nested dashboard configurations and query results into normalised, warehouse-ready schemas.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Nested structures preserving dashboard widget relationships
CSV
Flat tabular data for query results and user profiles
XLS
Excel format for direct analyst consumption
Parquet
Columnar format optimised for analytical queries
AWS S3
Direct delivery to your cloud storage buckets
Webhook
HTTP POST for real-time trending dashboard alerts
API
RESTful endpoints to query extracted datasets
BigQuery
Direct streaming into Google Cloud data warehouses
Snowflake
Automated staging and loading into Snowflake
PostgreSQL
Relational upserts into your existing database schema
S3
Direct bucket delivery — compatible with any data lake
// faq

Common questions.

About dune.com scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Dune legal?

Scraping publicly available information is generally permissible. DataFlirt only targets public dashboards, queries, and profiles on Dune. We do not attempt to bypass authentication to access private team data. Clients should review Dune's Terms of Service and consult legal counsel for specific use cases.

How do you handle Cloudflare protection?

We utilise residential ISP proxies, browser-matched TLS fingerprints, and automated solvers to navigate WAF challenges without triggering blocks or CAPTCHAs.

Can you extract the raw SQL queries?

Yes. We extract the exact PostgreSQL or DuneSQL syntax written by the author for any public query.

Do you support extracting the actual query result data?

Yes. We intercept the internal API responses to capture the tabular data returned by the query execution, up to the limits displayed on the public frontend.

How fresh is the trending data?

We can configure pipelines to poll the trending dashboards page at hourly intervals, providing near real-time visibility into emerging narratives.

What is the minimum viable engagement?

Engagements typically start with a defined list of target authors, categories, or a continuous feed of the top 1,000 trending dashboards. Contact us to scope your specific volume requirements.

Can you scrape private dashboards?

No. DataFlirt strictly targets publicly accessible data. We do not extract information gated behind private team accounts or premium authentication.

$ dataflirt scope --new-project --source=dune.com ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off export of top DeFi queries or a continuous feed of trending dashboards — we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →