SYSTEM all green source kraken.com queue 12,491 pairs p99 latency 84ms dataflirt.com · scraper/kraken-com
RUN - 84 active pipelines - kraken.com live

Kraken market data,
at warehouse scale.

We extract L2 order books, historical trades, OHLCV candles, and asset metadata from Kraken. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Tickers tracked
8,492 /run
Order book updates
42.1M /day
Trade records
18.5M /24h
Active pipelines
84
Uptime
99.99%
Data Dictionary

Every field we extract from kraken.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Ticker & Pricing objects from kraken.com. All fields typed and schema-versioned.

pair_nameask_priceask_whole_lot_volumeask_lot_volumebid_pricebid_whole_lot_volumebid_lot_volumelast_trade_pricelast_trade_volumevolume_todayvolume_24hvwap_todayvwap_24htrades_todaytrades_24hlow_todaylow_24hhigh_todayhigh_24hopen_todaytimestamp
ticker_& pricing
● 200 OK
"pair_name": "XXBTZUSD",
"ask_price": 64231.1,
"bid_price": 64231.0,
"last_trade_price": 64231.1,
"volume_24h": 4192.4819,
"vwap_24h": 63912.4,
"low_24h": 62100.0,
"high_24h": 64890.5,
"timestamp": "2026-05-12T09:14:00Z"
# pair_nameask_priceask_whole_lot_volumeask_lot_volumebid_pricebid_whole_lot_volume
1
2
3

Complete list of extractable fields for L2 Order Book objects from kraken.com. All fields typed and schema-versioned.

pair_namesnapshot_timestampsideprice_levelvolumeorder_timestampdepth_rankis_updatechecksum
l2_order book
● 200 OK
"pair_name": "XETHZUSD",
"snapshot_timestamp": "2026-05-12T09:14:00Z",
"side": "ask",
"price_level": 3450.25,
"volume": 12.45,
"order_timestamp": 1715505231,
"depth_rank": 1,
"is_update": false
# pair_namesnapshot_timestampsideprice_levelvolumeorder_timestamp
1
2
3

Complete list of extractable fields for OHLCV Candles objects from kraken.com. All fields typed and schema-versioned.

pair_nameinterval_minutestimestampopen_pricehigh_pricelow_priceclose_pricevwapvolumetrade_count
ohlcv_candles
● 200 OK
"pair_name": "XXBTZUSD",
"interval_minutes": 60,
"timestamp": 1715504400,
"open_price": 64100.5,
"high_price": 64350.0,
"low_price": 63980.2,
"close_price": 64231.1,
"vwap": 64185.3,
"volume": 342.15,
"trade_count": 1842
# pair_nameinterval_minutestimestampopen_pricehigh_pricelow_price
1
2
3

Complete list of extractable fields for Historical Trades objects from kraken.com. All fields typed and schema-versioned.

pair_namepricevolumetimestampbuy_sellmarket_limitmiscellaneoustrade_id
historical_trades
● 200 OK
"pair_name": "XXRPZUSD",
"price": 0.5124,
"volume": 1500.0,
"timestamp": 1715505231.412,
"buy_sell": "buy",
"market_limit": "market",
"miscellaneous": "",
"trade_id": 14928192
# pair_namepricevolumetimestampbuy_sellmarket_limit
1
2
3

Complete list of extractable fields for Asset Metadata objects from kraken.com. All fields typed and schema-versioned.

asset_idasset_classaltnamedecimalsdisplay_decimalscollateral_valuestatusmargin_rateminimum_order
asset_metadata
● 200 OK
"asset_id": "XXBT",
"asset_class": "currency",
"altname": "XBT",
"decimals": 10,
"display_decimals": 5,
"collateral_value": 1.0,
"status": "enabled",
"minimum_order": 0.0001
# asset_idasset_classaltnamedecimalsdisplay_decimalscollateral_value
1
2
3

Capabilities

Extract crypto market data with precision

Our Kraken scraper handles high-frequency exchange feeds, WebSocket streams, REST rate limits, and precision decimal formatting to deliver clean quantitative data.

L2 Order Book Snapshots

Extract deep order book state across hundreds of pairs. Bid and ask levels, volumes, and timestamps captured at specified polling intervals.

OHLCV Candle Extraction

Retrieve historical pricing data at 1m, 5m, 15m, 1h, 4h, and 1d intervals. Open, high, low, close, volume, and VWAP metrics included.

Real-Time Ticker Feeds

Monitor 24-hour volume, VWAP, last trade price, and current spread across the entire Kraken asset catalogue.

Historical Trade Logs

Extract every public trade execution for a given pair. Includes price, volume, side, order type, and millisecond timestamps.

Margin & Futures Data

Track margin fee rates, collateral weights, and futures contract specifications across Kraken platforms.

Staking Yield Tracking

Monitor advertised APY and reward rates for proof-of-stake assets listed on the exchange.

Asset & Pair Metadata

Extract decimal precision parameters, minimum order sizes, and trading status flags for all listed instruments.

High-Frequency Polling

Optimised request scheduling to capture high-frequency market movements without triggering Cloudflare blocks.

Multi-Format Delivery

Export tick data as lightweight JSON, tabular CSV, or compressed Parquet files for time-series databases.

// engagement pipeline

From target pairs to structured datasets

Brief in. Clean data out.

Define Scope
d 0

Specify trading pairs, required depth, time intervals, and historical lookback periods. We map the schema.

Pipeline Build
d 2–4

We configure extraction logic, handle rate limits, and implement precision decimal parsing for crypto assets.

Validation & QA
d 4–6

Schema validation, gap detection in time-series data, and outlier checks before production deployment.

Delivery
ongoing

JSON, CSV, or Parquet pushed to your S3 bucket, BigQuery dataset, or delivered via Webhook on schedule.

Under the hood

Handling exchange infrastructure challenges

Crypto exchanges deploy aggressive rate limiting and complex data structures. Here is how we maintain data integrity.

pipeline-monitor · kraken.com · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Rate limits
Distributed IP rotation for high-frequency polling

Kraken enforces strict API call limits based on a decay model. We distribute requests across our proxy network to maintain high-frequency polling for order books without hitting HTTP 429 errors or temporary bans.

Data precision
String preservation for crypto decimals

Floating-point math destroys cryptocurrency data integrity. We extract and deliver all pricing and volume metrics as high-precision strings or exact numeric types to prevent rounding errors in downstream systems.

Time-series gaps
Automated pagination and gap filling

Extracting historical trades requires precise timestamp pagination. Our state management tracks the exact millisecond of the last extracted trade, ensuring zero duplicate records and zero missing ticks during high-volume periods.

Cloudflare bypass
Handling edge protection on web endpoints

While public APIs exist, certain metadata and staking details require scraping Kraken web properties. We use Playwright with residential proxies to bypass Cloudflare turnstiles and extract undocumented frontend state.

Monitoring
Schema drift and market halt detection

Exchanges frequently update asset tickers or halt trading pairs. Our pipeline monitors for structural changes, delistings, and anomalous volume drops, alerting you immediately if market conditions affect data availability.

Applications

Who uses Kraken market data

Teams across industries use kraken.com data to build competitive products and smarter operations.

01
Algorithmic Backtesting

Quantitative trading desks ingest historical OHLCV and trade tick data to backtest strategies and tune execution algorithms.

02
Arbitrage Monitoring

Market participants track L2 order book depth across multiple exchanges to identify and execute cross-exchange arbitrage.

03
Market Making

Liquidity providers analyse historical spread dynamics and order book density to optimise quoting strategies.

04
Portfolio & Tax Software

Fintech applications pull daily close prices and historical exchange rates to calculate portfolio valuations and tax liabilities.

05
Academic Research

Universities and research institutions analyse market microstructure, volatility patterns, and liquidity events in crypto markets.

06
Sentiment & Volume Analysis

Analysts correlate trade volume spikes and order book imbalances with external news events to model market sentiment.

Why DataFlirt

"Kraken provides deep liquidity across hundreds of pairs, but reconstructing historical order books requires infrastructure built for high-throughput extraction."

Most teams underestimate the compute required to process raw crypto exchange feeds. Extracting L2 depth and historical ticks requires persistent connection management, rate-limit circumvention, and precision decimal handling. DataFlirt absorbs that complexity so your quants can focus on alpha generation.

Technical Spec

Kraken scraper technical specifications

Everything supported by our kraken.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

L2 Order Book Depth
Capture up to 500 levels of bid/ask depth per pair
Supported
Historical OHLCV
Extract candles across multiple timeframes from inception
Supported
Public Trade History
Millisecond-precision tick data for all public executions
Supported
Staking Yield Rates
Track advertised APY across supported assets
Supported
Spot Trading Pairs
Comprehensive coverage of all fiat and crypto pairings
Supported
Futures Contract Specs
Extract margin requirements and contract details
Supported
Decimal Preservation
Deliver values as strings to prevent floating-point errors
Supported
Private Account Balances
Requires authenticated API keys and user permissions
Partial
OTC Desk Quotes
Requires verified institutional account login
Partial
Infrastructure

Infrastructure powering the Kraken pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
High-Frequency Polling

Distributed request architecture allows sub-second polling of public endpoints without triggering rate limit bans.

Stateful Pagination

Redis-backed state management ensures exact millisecond continuation for historical trade extraction, preventing data gaps.

Cloud-Native Storage

High-volume tick data is batched, compressed, and written directly to object storage via AWS Lambda and ECS workers.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited JSON for easy ingestion
CSV
Flat tabular data for quantitative analysis
XLS
Excel format for manual review and reporting
Parquet
Columnar format optimised for time-series databases
AWS S3
Direct delivery to your cloud storage buckets
Webhook
HTTP POST for real-time order book updates
API
Queryable REST interface for extracted datasets
BigQuery
Direct streaming into Google Cloud analytics
S3
Direct bucket delivery — compatible with any data lake
// faq

Common questions.

About kraken.com scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Kraken public data legal?

Scraping public market data, such as order books and trade histories, is generally permissible. DataFlirt extracts only publicly accessible, non-authenticated market information. We do not handle private keys or execute trades. Clients should review exchange terms of service and consult legal counsel.

How do you manage Kraken API rate limits?

Kraken uses a complex call rate decay system. We model this decay precisely and distribute requests across a large pool of residential and datacenter IPs, ensuring we remain well below the threshold for any single connection.

How far back can you extract historical trades?

We can extract trade history back to the inception of the trading pair on Kraken. For high-volume pairs like BTC/USD, this requires processing millions of records via stateful pagination.

How do you handle floating-point precision issues?

Cryptocurrency volumes and prices require high decimal precision. We extract and deliver all numerical data as exact strings by default, ensuring your downstream systems can parse them into appropriate Decimal types without data loss.

Can you track delisted or renamed assets?

Our pipelines monitor asset metadata continuously. If an asset is halted, delisted, or undergoes a ticker change, the pipeline logs the event and continues extraction based on your configured fallback rules.

What is the delivery latency for order book snapshots?

For REST-based polling pipelines, we can deliver snapshots at sub-minute intervals depending on the number of pairs tracked. Data is pushed immediately via Webhook or batched to S3.

$ dataflirt scope --new-project --source=kraken.com ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a historical dump of OHLCV candles or continuous L2 order book snapshots across 50 pairs, we build and operate the infrastructure. Tell us your requirements.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →