We extract wallet labels, Smart Money token flows, NFT minting volumes, and DeFi contract metrics from Nansen. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.
Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.
Complete list of extractable fields for Wallet Profiles objects from nansen.ai. All fields typed and schema-versioned.
"wallet_address": "0x7a250d5630b4cf539739df2c5dacb4c659f2488d", "nansen_label": "Smart Money", "entity_name": "Jump Trading", "balance_usd": 4589201.55, "top_token": "USDC", "transaction_count": 84210, "last_active_date": "2026-05-12T14:32:00Z"
| # | wallet_address | nansen_label | entity_name | balance_usd | top_token | top_token_balance |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Token Flows objects from nansen.ai. All fields typed and schema-versioned.
"token_symbol": "LINK", "smart_money_inflow_24h": 1250000.0, "smart_money_outflow_24h": 450000.0, "exchange_inflow_24h": 8900000.0, "net_flow_usd": 800000.0, "unique_senders": 1432, "timestamp": "2026-05-12T14:00:00Z"
| # | token_address | token_symbol | smart_money_inflow_24h | smart_money_outflow_24h | exchange_inflow_24h | exchange_outflow_24h |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for NFT Analytics objects from nansen.ai. All fields typed and schema-versioned.
"collection_name": "Bored Ape Yacht Club", "floor_price_eth": 24.5, "volume_24h_eth": 156.2, "smart_money_owners_count": 412, "unique_minters": 6400, "blue_chip_holders": 3210, "royalty_fee_pct": 2.5
| # | collection_address | collection_name | floor_price_eth | volume_24h_eth | smart_money_owners_count | total_mints |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for DeFi Contracts objects from nansen.ai. All fields typed and schema-versioned.
"protocol_name": "Aave V3", "tvl_usd": 4200500000.0, "daily_active_users": 12450, "gas_consumed_eth": 45.2, "transaction_volume_24h": 85000000.0, "chain_id": 1, "audit_status": "Verified"
| # | contract_address | protocol_name | tvl_usd | daily_active_users | gas_consumed_eth | transaction_volume_24h |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Complete list of extractable fields for Smart Money Alerts objects from nansen.ai. All fields typed and schema-versioned.
"wallet_label": "Heavy DEX Trader", "transaction_hash": "0x9f8e7d6c5b4a39281706f5e4d3c2b1a09f8e7d6c5b4a39281706f5e4d3c2b1a0", "token_symbol": "PEPE", "amount": 5000000000.0, "usd_value": 45000.0, "action_type": "DEX Swap", "timestamp": "2026-05-12T14:45:12Z"
| # | alert_id | wallet_address | wallet_label | transaction_hash | token_symbol | amount |
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 |
Our Nansen scraper bypasses Cloudflare protections and parses dynamic dashboard visualisations, extracting raw time-series data and entity labels directly into your warehouse.
Extract Nansen proprietary labels for millions of addresses, categorising entities as Smart Money, funds, exchanges, or heavy DEX traders.
Monitor token inflows and outflows specific to highly profitable wallets, capturing early accumulation or distribution patterns.
Capture minting volumes, unique minter counts, and Smart Money ownership percentages across new and established NFT contracts.
Track Total Value Locked, daily active users, and dominant depositor addresses across major decentralised finance protocols.
Aggregate wallet behaviour and balances across Ethereum, Arbitrum, Optimism, Polygon, and other supported EVM chains.
Extract historical token balances and estimated profit/loss metrics for specific high-value entities and funds.
Monitor net token movements into and out of known centralised exchange hot wallets to gauge market selling pressure.
Identify trending contracts and protocols by extracting real-time gas consumption metrics from Nansen dashboards.
Capture real-time Smart Money alerts and significant fund movements as structured JSON payloads.
Run one-off bulk exports or configure continuous pipelines at hourly, daily, or real-time cadences with change-detection diffing.
Brief in. Clean data out.
Provide target tokens, wallet addresses, or specific Nansen dashboards. We design the extraction schema together.
We configure Playwright crawlers, Cloudflare bypass mechanisms, and API interception logic for nansen.ai.
Schema validation, null-rate checks, and data consistency verification against on-chain realities before full launch.
JSON, CSV, or Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.
Financial data platforms invest heavily in scraping detection. Here is how we stay resilient, and why quantitative teams choose managed infrastructure over DIY.
Nansen relies heavily on Cloudflare for bot mitigation. Our crawlers use TLS fingerprint spoofing, residential IP rotation, and automated Turnstile solvers to maintain persistent, unflagged sessions.
Instead of parsing complex Canvas or SVG charts, our Playwright instances intercept the underlying GraphQL and REST API calls that populate Nansen dashboards, extracting the raw time-series arrays.
For data requiring free-tier or standard authentication, we manage cookie persistence and JWT token refresh cycles automatically, ensuring pipelines do not fail due to expired sessions.
For large wallet lists, we maintain a hash index of last-seen values per address. Subsequent runs only push diffs, reducing compute cost and downstream processing load for your data engineering team.
Every run emits structured logs to our observability stack. We alert on null-rate spikes, missing labels, and schema drift, and respond before you notice. SLA uptime is contractual.
Quantitative funds ingest Smart Money token flows and wallet labels to build predictive models for asset price movements.
Market makers monitor exchange inflows and outflows for specific tokens to adjust liquidity provision and spread strategies.
NFT funds track Smart Money ownership concentration and minting velocity to price illiquid digital assets.
DeFi founders track TVL, daily active users, and user retention metrics of competing protocols to optimise their own tokenomics.
Security firms monitor fund movements from known exploiter addresses to trace laundered assets across bridges and mixers.
Trading platforms integrate Nansen alert feeds to provide their retail users with institutional-grade on-chain signals.
"Nansen holds the most comprehensive wallet labelling dataset in crypto, but extracting time-series data from their dynamic dashboards requires extensive infrastructure."
Most teams underestimate the investment required: reliable Nansen scraping requires Cloudflare bypass, API interception, session persistence, and anomaly monitoring. DataFlirt absorbs that complexity so your quants can focus on the analysis, not the infrastructure.
Everything supported by our nansen.ai scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.
Open-source tooling on proven cloud infra — no vendor lock-in, full observability.
Instead of fragile DOM scraping, Playwright intercepts the underlying GraphQL and REST API responses that populate Nansen dashboards, ensuring clean data extraction.
We maintain pools of residential ISP proxies across US and EU regions. Rotation happens per-request with sticky sessions where required to bypass Cloudflare.
Pipelines run on AWS Lambda and ECS. Airflow handles scheduling, dependency management, and SLA alerting. All state is stored in managed Postgres.
Data delivered to where your team already works — no new tooling required.
About nansen.ai scraping, legality, and pipeline operations.
Ask us directly →Scraping publicly accessible data is generally permissible. DataFlirt targets only non-authenticated or free-tier dashboard data. We do not circumvent high-tier authentication walls or extract proprietary research reports gated behind Nansen Alpha subscriptions. Clients should review Nansen Terms of Service and consult legal counsel for specific use cases.
We use residential ISP proxies, full Playwright browser sessions with realistic TLS fingerprints, and automated Turnstile solvers. Our request timing is modelled on human behaviour to prevent session invalidation.
Yes. Rather than attempting to parse Canvas or SVG elements visually, our infrastructure intercepts the network requests fetching the chart data, extracting the raw time-series arrays directly.
Real-time streaming pipelines can achieve sub-minute latency for specific wallet alerts or token flows. Full dashboard refreshes typically run at hourly or daily cadences depending on your requirements.
Yes. We can extract historical time-series data available on the dashboards, and maintain a continuous append-only table in your warehouse from the date your pipeline starts.
Our smallest packages start at a defined list of tokens or wallets with daily delivery. For larger cross-chain extractions or custom schema requirements, we price based on compute volume and delivery frequency.
Yes. We extract data across all EVM-compatible chains supported by Nansen, including Ethereum, Arbitrum, Optimism, Polygon, and Base, normalising the output into a single unified schema.
Absolutely. We provide a sample run for a specific token or wallet set as part of the pre-engagement scoping process, allowing you to validate schema fit and data quality before signing any contract.
20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off wallet label export or a continuous Smart Money flow feed, we scope, build, and operate the pipeline. Tell us what you need.