SYSTEM all green source glassnode.com queue 12,409 endpoints p99 latency 218ms dataflirt.com · scraper/glassnode-com
RUN · 42 active pipelines · glassnode.com live

On-chain intelligence,
at institutional scale.

We extract network activity, exchange flows, SOPR, and derivatives data from Glassnode. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Metrics extracted
14.2K /run
Time-series points
4.8M /day
Asset coverage
1,204 /run
Active pipelines
42
Uptime
99.98%
Data Dictionary

Every field we extract from glassnode.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Network Activity objects from glassnode.com. All fields typed and schema-versioned.

assetdateactive_addressesnew_addressestransaction_counttransfer_volumemean_transfer_sizefees_totalfees_mean
network_activity
● 200 OK
"asset": "BTC",
"date": "2026-05-12T00:00:00Z",
"active_addresses": 942105,
"new_addresses": 412590,
"transaction_count": 315420,
"transfer_volume": 45210.5,
"fees_total": 24.5,
"fees_mean": 7.7e-05
# assetdateactive_addressesnew_addressestransaction_counttransfer_volume
1
2
3

Complete list of extractable fields for Exchange Flows objects from glassnode.com. All fields typed and schema-versioned.

assetexchangeinflow_volumeoutflow_volumenet_flowexchange_balancepercentage_of_supplydate
exchange_flows
● 200 OK
"asset": "ETH",
"exchange": "Binance",
"inflow_volume": 145000.2,
"outflow_volume": 121000.5,
"net_flow": 23999.7,
"exchange_balance": 4500120.4,
"percentage_of_supply": 3.75,
"date": "2026-05-12T00:00:00Z"
# assetexchangeinflow_volumeoutflow_volumenet_flowexchange_balance
1
2
3

Complete list of extractable fields for Miner Metrics objects from glassnode.com. All fields typed and schema-versioned.

assetminer_balanceminer_revenuedifficultyhashratepuell_multiplethermocapdate
miner_metrics
● 200 OK
"asset": "BTC",
"miner_balance": 1820450.5,
"miner_revenue": 450.2,
"difficulty": 83500000000000,
"hashrate": 612000000,
"puell_multiple": 1.15,
"thermocap": 54200000000,
"date": "2026-05-12T00:00:00Z"
# assetminer_balanceminer_revenuedifficultyhashratepuell_multiple
1
2
3

Complete list of extractable fields for Market Indicators objects from glassnode.com. All fields typed and schema-versioned.

assetpricemarket_caprealised_capmvrv_z_scoresoprnvt_signaldate
market_indicators
● 200 OK
"asset": "BTC",
"price": 64210.5,
"market_cap": 1260000000000,
"realised_cap": 485000000000,
"mvrv_z_score": 2.14,
"sopr": 1.02,
"nvt_signal": 45.2,
"date": "2026-05-12T00:00:00Z"
# assetpricemarket_caprealised_capmvrv_z_scoresopr
1
2
3

Complete list of extractable fields for Derivatives objects from glassnode.com. All fields typed and schema-versioned.

assetexchangeopen_interestfunding_rateliquidations_longliquidations_shortvolumedate
derivatives
● 200 OK
"asset": "BTC",
"exchange": "Deribit",
"open_interest": 45000.5,
"funding_rate": 0.012,
"liquidations_long": 1200000.0,
"liquidations_short": 450000.0,
"volume": 850000000.0,
"date": "2026-05-12T00:00:00Z"
# assetexchangeopen_interestfunding_rateliquidations_longliquidations_short
1
2
3

Capabilities

Extract every metric — without the frontend friction

Glassnode relies on heavy React applications and complex chart rendering logic to display data. Our pipeline intercepts the raw underlying data streams before they reach the browser, providing pure, structured time-series data.

Network Activity Extraction

Extract active addresses, transaction counts, transfer volumes, and fee metrics across Bitcoin, Ethereum, and major ERC-20 tokens.

Exchange Flow Tracking

Capture inflow, outflow, net flow, and total exchange balance data segmented by individual exchanges or aggregated.

Miner Behaviour Signals

Track miner balances, revenue streams, Puell Multiple, difficulty adjustments, and hashrate distributions in real time.

Market Indicator Parsing

Extract complex derived metrics like MVRV Z-Score, SOPR (Spent Output Profit Ratio), Realised Cap, and NVT Signal.

Derivatives & Futures Data

Monitor open interest, perpetual funding rates, futures volume, and liquidation cascades across major derivatives platforms.

Entity-Adjusted Metrics

Access Glassnode's proprietary entity-adjusted data, filtering out internal exchange transfers and change addresses for cleaner signals.

Historical Backfilling

Extract complete historical time-series data for any supported metric, allowing you to build and test quantitative models from genesis blocks.

Sub-Minute Resolution

Capture high-frequency block-by-block data where available, bypassing the standard daily or hourly aggregation views.

Continuous Sync

Run continuous pipelines that append new metric points as they are published, maintaining an up-to-date data warehouse without manual exports.

// engagement pipeline

From metric selection to warehouse delivery

Brief in. Clean data out.

Define Scope
d 0

Specify the assets (BTC, ETH, DeFi tokens), metrics (SOPR, Flows), and resolution (daily, hourly, block) required.

Pipeline Build
d 2–4

We configure Playwright interceptors to bypass Cloudflare and extract the raw JSON payloads powering Glassnode's charts.

Validation & QA
d 4–6

We validate data continuity, precision matching, and handle timezone normalisation against the raw blockchain state.

Delivery
ongoing

Structured time-series data pushed via Parquet or JSON to your S3, BigQuery, or Snowflake instances.

Under the hood

Overcoming Glassnode's extraction barriers

Extracting data from financial intelligence platforms requires intercepting backend APIs rather than parsing HTML. Here is how we ensure data integrity.

pipeline-monitor · glassnode.com · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
API Interception
Bypassing Canvas chart rendering

Glassnode displays data using complex HTML5 Canvas and SVG charts which are impossible to scrape via DOM parsing. We use Playwright to intercept the underlying XHR/Fetch requests, extracting the raw JSON time-series data before it hits the charting library.

Anti-bot evasion
Cloudflare Turnstile mitigation

Financial data platforms employ strict Cloudflare protection. Our infrastructure utilises TLS fingerprint spoofing, residential proxies, and automated CapSolver integration to maintain uninterrupted access to the public data endpoints.

Pagination handling
Historical data reconstruction

Long-term historical charts often load data in chunks based on the zoom level. Our crawlers programmatically manipulate the date range parameters to extract the complete, unaggregated dataset from genesis to present.

Data normalisation
Standardised timestamps and precision

We normalise all Unix timestamps to UTC ISO-8601 strings and preserve the exact floating-point precision of the original metrics, ensuring your quantitative models receive mathematically accurate inputs.

Change detection
Incremental appending

Instead of redownloading the entire history every day, our pipelines track the latest timestamp per metric and only extract the delta, optimising pipeline execution time and reducing your storage costs.

Applications

Who uses Glassnode data — and how

Teams across industries use glassnode.com data to build competitive products and smarter operations.

01
Quantitative Trading

Algorithmic funds ingest SOPR, NVT, and funding rates to build mean-reversion and momentum models for digital assets.

02
Macro Analysis

Research desks track exchange balances and miner accumulation phases to gauge market cycle positioning and liquidity.

03
Risk Management

Lending protocols and OTC desks monitor derivative liquidations and open interest to adjust collateral requirements dynamically.

04
Fund Reporting

Asset managers aggregate on-chain activity metrics to provide institutional clients with fundamental valuation reports.

05
DeFi Monitoring

Protocol treasuries track capital flows across Ethereum and Layer 2s to optimise yield strategies and liquidity provision.

06
Academic Research

Universities and economic institutes extract historical entity-adjusted data to study market efficiency and network effects.

Why DataFlirt

"On-chain metrics provide the fundamental valuation layer for digital assets, but extracting structured time-series data from charting UIs requires dedicated interception infrastructure."

Attempting to scrape Glassnode via traditional DOM parsing fails immediately due to Canvas rendering and Cloudflare protections. DataFlirt intercepts the raw API traffic, handles the pagination of historical data, and delivers clean, mathematically precise time-series arrays directly to your quantitative infrastructure. We handle the extraction so your quants can focus on alpha.

Technical Spec

Glassnode scraper — technical capabilities

Everything supported by our glassnode.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

API Interception
Extraction of raw JSON payloads powering the frontend charts
Supported
Historical Backfill
Complete data extraction from asset genesis to current block
Supported
Cloudflare Bypass
Automated Turnstile resolution and TLS fingerprinting
Supported
Multi-Asset Coverage
Support for BTC, ETH, and all supported ERC-20 tokens
Supported
Incremental Updates
Delta extraction based on the latest recorded timestamp
Supported
Webhook Delivery
HTTP POST delivery for real-time metric updates
Supported
Tier 3 Advanced Metrics
Extraction of proprietary metrics requiring a paid Glassnode subscription
Partial
Custom Studio Workspaces
Saving or extracting user-specific custom chart configurations
Partial
Infrastructure

Infrastructure powering the Glassnode pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Playwright Interception

Playwright handles the headless browser execution, specifically configured to intercept and dump XHR/Fetch responses containing the raw time-series data before Canvas rendering.

Residential Proxy Rotation

We utilise ISP-grade residential proxies to distribute requests and avoid rate-limiting or IP bans from Glassnode's Web Application Firewall (WAF).

Automated Orchestration

Apache Airflow manages the scheduling and dependency logic, ensuring that daily or hourly metric updates are fetched, validated, and pushed to your warehouse reliably.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Nested arrays of time-series data — ideal for document stores
CSV
Flat tabular data — perfect for immediate quantitative analysis
XLS
Excel-compatible format for manual review and reporting
Parquet
Columnar format optimised for BigQuery, Snowflake, and Athena
AWS S3
Direct delivery to your cloud storage buckets
Webhook
HTTP POST delivery for real-time algorithmic triggers
API
REST endpoint to query your extracted historical data
PostgreSQL
Direct upsert into your relational database schema
S3
Direct bucket delivery — compatible with any data lake
// faq

Common questions.

About glassnode.com scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Glassnode legal?

Scraping publicly accessible, unauthenticated data from Glassnode is generally permissible. DataFlirt extracts only the data that is freely visible on the public frontend. We do not bypass authentication to access paid Tier 2 or Tier 3 metrics. Clients should consult their legal counsel regarding their specific use cases and Glassnode's Terms of Service.

How do you extract data from the charts?

We do not use OCR or DOM parsing on the visual charts. Instead, our infrastructure intercepts the background network requests (XHR/Fetch) that the frontend application makes to Glassnode's backend servers, capturing the raw JSON arrays containing the exact timestamps and metric values.

Can you extract historical data from years ago?

Yes. Our pipelines can manipulate the date parameters in the intercepted requests to paginate backwards, extracting the complete historical dataset for any public metric from the asset's genesis block to the present day.

How frequently can the data be updated?

Depending on the specific metric's publication schedule, we can configure pipelines to run daily, hourly, or at sub-minute intervals to capture the latest data points as soon as they are pushed to the frontend.

Do you support extraction of paid metrics?

No. DataFlirt focuses exclusively on extracting publicly available data. We do not manage authenticated sessions or extract data that requires a paid Glassnode subscription.

What format is best for backtesting?

For quantitative backtesting, we recommend Parquet or CSV. Parquet is highly efficient for large historical datasets loaded into pandas or polars, while CSV offers immediate readability and compatibility with standard financial modelling tools.

$ dataflirt scope --new-project --source=glassnode.com ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a complete historical backfill of Bitcoin metrics or a continuous feed of Ethereum exchange flows — we build and operate the infrastructure. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →