SYSTEM all green source nansen.ai queue 12,408 wallets p99 latency 318ms dataflirt.com · scraper/nansen-ai
RUN * 42 active pipelines * nansen.ai live

Blockchain analytics,
at warehouse scale.

We extract wallet labels, Smart Money token flows, NFT minting volumes, and DeFi contract metrics from Nansen. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Wallet labels extracted
3.8M /week
Token flow updates
912K /24h
Smart Money alerts
14.2K /run
Active pipelines
42
Uptime
99.98%
Data Dictionary

Every field we extract from nansen.ai

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Wallet Profiles objects from nansen.ai. All fields typed and schema-versioned.

wallet_addressnansen_labelentity_namebalance_usdtop_tokentop_token_balancefirst_active_datelast_active_datetransaction_countchain_deployments
wallet_profiles
● 200 OK
"wallet_address": "0x7a250d5630b4cf539739df2c5dacb4c659f2488d",
"nansen_label": "Smart Money",
"entity_name": "Jump Trading",
"balance_usd": 4589201.55,
"top_token": "USDC",
"transaction_count": 84210,
"last_active_date": "2026-05-12T14:32:00Z"
# wallet_addressnansen_labelentity_namebalance_usdtop_tokentop_token_balance
1
2
3

Complete list of extractable fields for Token Flows objects from nansen.ai. All fields typed and schema-versioned.

token_addresstoken_symbolsmart_money_inflow_24hsmart_money_outflow_24hexchange_inflow_24hexchange_outflow_24hunique_sendersunique_receiversnet_flow_usdtimestamp
token_flows
● 200 OK
"token_symbol": "LINK",
"smart_money_inflow_24h": 1250000.0,
"smart_money_outflow_24h": 450000.0,
"exchange_inflow_24h": 8900000.0,
"net_flow_usd": 800000.0,
"unique_senders": 1432,
"timestamp": "2026-05-12T14:00:00Z"
# token_addresstoken_symbolsmart_money_inflow_24hsmart_money_outflow_24hexchange_inflow_24hexchange_outflow_24h
1
2
3

Complete list of extractable fields for NFT Analytics objects from nansen.ai. All fields typed and schema-versioned.

collection_addresscollection_namefloor_price_ethvolume_24h_ethsmart_money_owners_counttotal_mintsunique_mintersmint_revenue_ethroyalty_fee_pctblue_chip_holders
nft_analytics
● 200 OK
"collection_name": "Bored Ape Yacht Club",
"floor_price_eth": 24.5,
"volume_24h_eth": 156.2,
"smart_money_owners_count": 412,
"unique_minters": 6400,
"blue_chip_holders": 3210,
"royalty_fee_pct": 2.5
# collection_addresscollection_namefloor_price_ethvolume_24h_ethsmart_money_owners_counttotal_mints
1
2
3

Complete list of extractable fields for DeFi Contracts objects from nansen.ai. All fields typed and schema-versioned.

contract_addressprotocol_nametvl_usddaily_active_usersgas_consumed_ethtransaction_volume_24htop_depositor_addresstop_withdrawer_addresschain_idaudit_status
defi_contracts
● 200 OK
"protocol_name": "Aave V3",
"tvl_usd": 4200500000.0,
"daily_active_users": 12450,
"gas_consumed_eth": 45.2,
"transaction_volume_24h": 85000000.0,
"chain_id": 1,
"audit_status": "Verified"
# contract_addressprotocol_nametvl_usddaily_active_usersgas_consumed_ethtransaction_volume_24h
1
2
3

Complete list of extractable fields for Smart Money Alerts objects from nansen.ai. All fields typed and schema-versioned.

alert_idwallet_addresswallet_labeltransaction_hashtoken_symbolamountusd_valueaction_typegas_paid_ethtimestamp
smart_money alerts
● 200 OK
"wallet_label": "Heavy DEX Trader",
"transaction_hash": "0x9f8e7d6c5b4a39281706f5e4d3c2b1a09f8e7d6c5b4a39281706f5e4d3c2b1a0",
"token_symbol": "PEPE",
"amount": 5000000000.0,
"usd_value": 45000.0,
"action_type": "DEX Swap",
"timestamp": "2026-05-12T14:45:12Z"
# alert_idwallet_addresswallet_labeltransaction_hashtoken_symbolamount
1
2
3

Capabilities

Extract the signal from the noise

Our Nansen scraper bypasses Cloudflare protections and parses dynamic dashboard visualisations, extracting raw time-series data and entity labels directly into your warehouse.

Wallet Label Mining

Extract Nansen proprietary labels for millions of addresses, categorising entities as Smart Money, funds, exchanges, or heavy DEX traders.

Smart Money Flow Tracking

Monitor token inflows and outflows specific to highly profitable wallets, capturing early accumulation or distribution patterns.

NFT Collection Metrics

Capture minting volumes, unique minter counts, and Smart Money ownership percentages across new and established NFT contracts.

DeFi Protocol TVL

Track Total Value Locked, daily active users, and dominant depositor addresses across major decentralised finance protocols.

Cross-Chain Entity Mapping

Aggregate wallet behaviour and balances across Ethereum, Arbitrum, Optimism, Polygon, and other supported EVM chains.

Token Balances & PnL

Extract historical token balances and estimated profit/loss metrics for specific high-value entities and funds.

Exchange Inflow/Outflow

Monitor net token movements into and out of known centralised exchange hot wallets to gauge market selling pressure.

Gas Consumption Tracking

Identify trending contracts and protocols by extracting real-time gas consumption metrics from Nansen dashboards.

Automated Alert Extraction

Capture real-time Smart Money alerts and significant fund movements as structured JSON payloads.

Scheduled + Streaming Modes

Run one-off bulk exports or configure continuous pipelines at hourly, daily, or real-time cadences with change-detection diffing.

// engagement pipeline

From blockchain dashboard to warehouse record

Brief in. Clean data out.

Define Scope
d 0

Provide target tokens, wallet addresses, or specific Nansen dashboards. We design the extraction schema together.

Pipeline Build
d 2–4

We configure Playwright crawlers, Cloudflare bypass mechanisms, and API interception logic for nansen.ai.

Validation & QA
d 4–6

Schema validation, null-rate checks, and data consistency verification against on-chain realities before full launch.

Delivery
ongoing

JSON, CSV, or Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our Nansen pipeline handles the hard parts

Financial data platforms invest heavily in scraping detection. Here is how we stay resilient, and why quantitative teams choose managed infrastructure over DIY.

pipeline-monitor · nansen.ai · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Anti-bot layer
Cloudflare Turnstile bypass

Nansen relies heavily on Cloudflare for bot mitigation. Our crawlers use TLS fingerprint spoofing, residential IP rotation, and automated Turnstile solvers to maintain persistent, unflagged sessions.

Dynamic rendering
Dashboard API interception

Instead of parsing complex Canvas or SVG charts, our Playwright instances intercept the underlying GraphQL and REST API calls that populate Nansen dashboards, extracting the raw time-series arrays.

Session management
Persistent authentication handling

For data requiring free-tier or standard authentication, we manage cookie persistence and JWT token refresh cycles automatically, ensuring pipelines do not fail due to expired sessions.

Change detection
Only re-scrape what has changed

For large wallet lists, we maintain a hash index of last-seen values per address. Subsequent runs only push diffs, reducing compute cost and downstream processing load for your data engineering team.

Monitoring & alerting
24/7 pipeline health with anomaly detection

Every run emits structured logs to our observability stack. We alert on null-rate spikes, missing labels, and schema drift, and respond before you notice. SLA uptime is contractual.

Applications

Who uses Nansen data, and how

Teams across industries use nansen.ai data to build competitive products and smarter operations.

01
Hedge Fund Alpha Generation

Quantitative funds ingest Smart Money token flows and wallet labels to build predictive models for asset price movements.

02
Token Market Making

Market makers monitor exchange inflows and outflows for specific tokens to adjust liquidity provision and spread strategies.

03
NFT Collection Valuation

NFT funds track Smart Money ownership concentration and minting velocity to price illiquid digital assets.

04
Competitor Protocol Analysis

DeFi founders track TVL, daily active users, and user retention metrics of competing protocols to optimise their own tokenomics.

05
Security & Exploit Tracking

Security firms monitor fund movements from known exploiter addresses to trace laundered assets across bridges and mixers.

06
Retail Trading Signals

Trading platforms integrate Nansen alert feeds to provide their retail users with institutional-grade on-chain signals.

Why DataFlirt

"Nansen holds the most comprehensive wallet labelling dataset in crypto, but extracting time-series data from their dynamic dashboards requires extensive infrastructure."

Most teams underestimate the investment required: reliable Nansen scraping requires Cloudflare bypass, API interception, session persistence, and anomaly monitoring. DataFlirt absorbs that complexity so your quants can focus on the analysis, not the infrastructure.

Technical Spec

Nansen scraper: technical capabilities

Everything supported by our nansen.ai scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

JavaScript rendering
Full Playwright sessions required for dashboard hydration and API call interception
Supported
Cloudflare bypass
Automated Turnstile solving and TLS fingerprint spoofing
Supported
Wallet label extraction
Capture entity names and categories for millions of addresses
Supported
Token flow time-series
Historical inflow and outflow data aggregated by entity type
Supported
NFT minting feeds
Real-time extraction of minting volumes and minter profiles
Supported
Smart Money alerts
Structured capture of high-value transaction notifications
Supported
Webhook delivery
HTTP POST per record or batch, useful for real-time trading workflows
Supported
Nansen Alpha exclusive reports
Research reports gated behind high-tier institutional subscriptions
Partial
VIP tiered wallet tracking
Custom wallet profiling gated behind authenticated VIP tiers
Partial
Infrastructure

Infrastructure powering the Nansen pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Playwright API Interception

Instead of fragile DOM scraping, Playwright intercepts the underlying GraphQL and REST API responses that populate Nansen dashboards, ensuring clean data extraction.

Residential Proxy Infrastructure

We maintain pools of residential ISP proxies across US and EU regions. Rotation happens per-request with sticky sessions where required to bypass Cloudflare.

Cloud-Native Orchestration

Pipelines run on AWS Lambda and ECS. Airflow handles scheduling, dependency management, and SLA alerting. All state is stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested, schema versioned per run
CSV
Flat file with typed columns, Excel and Sheets compatible
XLS
Excel spreadsheet format for analyst workflows
Parquet
Columnar format for BigQuery, Snowflake, and Athena
AWS S3
Direct bucket delivery, compatible with any data lake
Webhook
HTTP POST per record for real-time downstream processing
API
RESTful endpoints to query your extracted datasets
BigQuery
Streamed directly into your dataset with schema auto-detect
S3
Direct bucket delivery — compatible with any data lake
// faq

Common questions.

About nansen.ai scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Nansen legal?

Scraping publicly accessible data is generally permissible. DataFlirt targets only non-authenticated or free-tier dashboard data. We do not circumvent high-tier authentication walls or extract proprietary research reports gated behind Nansen Alpha subscriptions. Clients should review Nansen Terms of Service and consult legal counsel for specific use cases.

How do you handle Nansen Cloudflare protections?

We use residential ISP proxies, full Playwright browser sessions with realistic TLS fingerprints, and automated Turnstile solvers. Our request timing is modelled on human behaviour to prevent session invalidation.

Can you extract data from dynamic charts?

Yes. Rather than attempting to parse Canvas or SVG elements visually, our infrastructure intercepts the network requests fetching the chart data, extracting the raw time-series arrays directly.

How fresh is the data?

Real-time streaming pipelines can achieve sub-minute latency for specific wallet alerts or token flows. Full dashboard refreshes typically run at hourly or daily cadences depending on your requirements.

Can you track historical token flows?

Yes. We can extract historical time-series data available on the dashboards, and maintain a continuous append-only table in your warehouse from the date your pipeline starts.

What is the minimum viable engagement?

Our smallest packages start at a defined list of tokens or wallets with daily delivery. For larger cross-chain extractions or custom schema requirements, we price based on compute volume and delivery frequency.

Do you support cross-chain data extraction?

Yes. We extract data across all EVM-compatible chains supported by Nansen, including Ethereum, Arbitrum, Optimism, Polygon, and Base, normalising the output into a single unified schema.

Can I request a sample dataset before committing?

Absolutely. We provide a sample run for a specific token or wallet set as part of the pre-engagement scoping process, allowing you to validate schema fit and data quality before signing any contract.

$ dataflirt scope --new-project --source=nansen.ai ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off wallet label export or a continuous Smart Money flow feed, we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →