
What is Proxy Scoring?

Proxy scoring is the continuous, automated evaluation of an IP address's viability for data extraction before it is assigned to a scraper worker. Instead of blindly routing requests and waiting for a 403 Forbidden, a scoring engine pre-flights IPs against target-specific heuristics—checking ASN reputation, latency, historical block rates, and fraud scores. For high-throughput pipelines, it's the difference between burning through a residential pool in an hour and sustaining a steady, undetected crawl for weeks.

IP Proxies · Routing · Reputation · ASN · Health Check
// 02 — definitions

Grade before
you route.

The mechanics of evaluating proxy health and reputation in real time to prevent burning IPs on hardened targets.


TL;DR

Proxy scoring assigns a dynamic health and reputation metric to every IP in a pool based on latency, ASN history, and target-specific success rates. It prevents scrapers from using burned IPs, reduces CAPTCHA triggers, and optimizes the cost of residential bandwidth by routing low-score IPs to easier targets.

01 Definition & structure
Proxy scoring is the algorithmic evaluation of an IP address to determine its suitability for a specific scraping task. A comprehensive score evaluates network metrics (latency, jitter, uptime) alongside reputation metrics (fraud score, historical block rate, ASN trust). The output is a dynamic value that dictates whether the proxy gateway will route a request through that IP.
02 How it works in practice
When a scraper requests a URL, the proxy gateway intercepts the request. Instead of picking a random IP from the pool, the gateway queries its scoring ledger for the target domain. It filters out IPs with low scores for that target, selects the highest-scoring available IP, and routes the request. When the response returns, the gateway inspects the HTTP status code and any CAPTCHA challenge it carries, then updates the IP's score in real time.
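The select-then-update loop described above could be sketched roughly as follows. This is an illustration only, not DataFlirt's gateway: the `Ledger` class, the neutral starting score of 0.5, the `min_score` cut-off, the reward and penalty step sizes, and the treatment of 429 as a block signal are all assumptions made for the example.

```python
from collections import defaultdict


class Ledger:
    """Hypothetical in-memory scoring ledger keyed by (target, ip)."""

    def __init__(self, default_score=0.5):
        # Unknown (target, ip) pairs start at an assumed neutral score.
        self.scores = defaultdict(lambda: default_score)

    def pick(self, target, pool, min_score=0.5):
        """Return the highest-scoring eligible IP for this target, or None."""
        eligible = [ip for ip in pool if self.scores[(target, ip)] >= min_score]
        if not eligible:
            return None
        return max(eligible, key=lambda ip: self.scores[(target, ip)])

    def record(self, target, ip, status_code, captcha=False):
        """Update the score as soon as the response has been inspected."""
        blocked = captcha or status_code in (403, 429)  # assumed block signals
        # Assumed step sizes: a block costs far more than a success earns.
        delta = -0.2 if blocked else 0.02
        new_score = self.scores[(target, ip)] + delta
        self.scores[(target, ip)] = min(1.0, max(0.0, new_score))
        return self.scores[(target, ip)]


ledger = Ledger()
pool = ["ip-a", "ip-b"]
ledger.record("shop.example", "ip-a", 200)  # small reward for ip-a
ledger.record("shop.example", "ip-b", 403)  # ip-b drops below min_score
chosen = ledger.pick("shop.example", pool)
```

Here `chosen` ends up as `"ip-a"`, because `ip-b` falls under the cut-off after its 403 and is filtered out before the highest-scoring IP is selected.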
03 The target-specific matrix
A critical concept in proxy scoring is that there is no such thing as a "good" proxy—only a proxy that is good for a specific target. An IP might be permanently banned on a DataDome-protected site but have a pristine reputation on an Akamai-protected site. Production scoring engines maintain a multi-dimensional matrix, tracking an IP's score independently across dozens of WAF and target profiles.
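One way to picture the matrix is a per-IP record holding an independent score for each anti-bot profile. The sketch below is a simplified assumption of how such a structure might look; `IPRecord`, `best_ip_for`, the profile names as dictionary keys, and the example scores are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class IPRecord:
    """Scores for one IP, tracked separately per anti-bot profile."""
    ip: str
    scores: dict = field(default_factory=dict)  # profile name -> score in [0, 1]

    def score_for(self, profile, default=0.0):
        # Assumption: an IP with no history on a profile counts as unproven.
        return self.scores.get(profile, default)


def best_ip_for(profile, records):
    """Pick the record with the highest score on one specific profile."""
    return max(records, key=lambda r: r.score_for(profile))


records = [
    IPRecord("ip-a", {"datadome": 0.10, "akamai": 0.95}),
    IPRecord("ip-b", {"datadome": 0.90, "akamai": 0.40}),
]
best_datadome = best_ip_for("datadome", records).ip
best_akamai = best_ip_for("akamai", records).ip
```

The same two IPs rank in opposite order depending on the profile: `ip-b` wins for `datadome` and `ip-a` wins for `akamai`, which is why a single column of scores is not enough.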
04 How DataFlirt handles it
We maintain a real-time ledger of over 5 million IPs. Our gateway scores every IP across 40+ distinct anti-bot profiles. If an IP's score drops below our 0.85 threshold for a strict target, it is instantly quarantined from that target's routing table and placed on a cooldown timer. This prevents cascading failures and ensures our clients never pay for bandwidth wasted on predictable 403s.
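A per-target quarantine with a cooldown timer might look something like the sketch below. Only the 0.85 threshold comes from the text above; the `Quarantine` class, its method names, and the cooldown duration are assumptions for illustration.

```python
import time


class Quarantine:
    """Hypothetical per-target quarantine with a cooldown timer."""

    def __init__(self, threshold=0.85, cooldown_seconds=3600.0):
        self.threshold = threshold                # 0.85 as quoted above
        self.cooldown_seconds = cooldown_seconds  # assumed duration
        self._release_at = {}                     # (target, ip) -> release time

    def observe(self, target, ip, score, now=None):
        """Quarantine the IP for this target if its score is below threshold."""
        now = time.monotonic() if now is None else now
        if score < self.threshold:
            self._release_at[(target, ip)] = now + self.cooldown_seconds

    def is_routable(self, target, ip, now=None):
        """True unless the IP is still cooling down for this target."""
        now = time.monotonic() if now is None else now
        release = self._release_at.get((target, ip))
        if release is None:
            return True
        if now >= release:
            del self._release_at[(target, ip)]    # cooldown has elapsed
            return True
        return False


q = Quarantine(cooldown_seconds=60.0)
q.observe("strict-target", "ip-a", score=0.80, now=0.0)
during_cooldown = q.is_routable("strict-target", "ip-a", now=30.0)
other_target = q.is_routable("easy-target", "ip-a", now=30.0)
after_cooldown = q.is_routable("strict-target", "ip-a", now=61.0)
```

The quarantine is keyed by target, so the IP stays available for other targets while it cools down on the strict one.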
05 The "clean IP" misconception
Many engineers assume a "fresh" or newly acquired IP is the best choice for scraping. In reality, anti-bot systems often penalize IPs with zero history. A residential IP that has a history of normal, human-like browsing (and therefore a high proxy score) will almost always outperform a completely blank-slate datacenter IP, even if the datacenter IP has never been blocked.
// 03 — the math

How do you quantify
proxy health?

A proxy score is a composite of network performance and target-specific reputation. DataFlirt recalculates these weights dynamically based on the target's anti-bot strictness.

Base Proxy Score (standard routing heuristic)
$S = w_1 \cdot \mathrm{SuccessRate} - w_2 \cdot \mathrm{Latency} - w_3 \cdot \mathrm{FraudScore}$
Weights adjust per target. High fraud score heavily penalizes the IP.

IP Burn Rate (DataFlirt gateway logic)
$B = \dfrac{\mathrm{Blocks}}{\mathrm{TotalRequests} + 1}$
Calculated per target domain. $B > 0.05$ usually triggers a cooldown.

DataFlirt Routing Threshold (internal SLO)
$T_{\mathrm{route}} = 0.85$, route only if $S > T_{\mathrm{route}}$
Only IPs scoring above 0.85 for a specific target are assigned to its queue.
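The three formulas can be worked through in a few lines of code. The sketch below assumes that latency and fraud score have already been normalized to the range 0 to 1 (the text does not say how inputs are scaled), and the weight values are made up for the example rather than taken from any real configuration.

```python
def base_score(success_rate, latency_norm, fraud_norm, w1=1.0, w2=0.1, w3=0.3):
    """S = w1*SuccessRate - w2*Latency - w3*FraudScore.

    All three inputs are assumed to be pre-normalized to [0, 1];
    the default weights are hypothetical.
    """
    return w1 * success_rate - w2 * latency_norm - w3 * fraud_norm


def burn_rate(blocks, total_requests):
    """B = Blocks / (TotalRequests + 1); defined even with zero requests."""
    return blocks / (total_requests + 1)


def needs_cooldown(blocks, total_requests, limit=0.05):
    """Apply the B > 0.05 cooldown rule for one target domain."""
    return burn_rate(blocks, total_requests) > limit


def is_routable(score, t_route=0.85):
    """Apply the routing threshold: route only if S > T_route."""
    return score > t_route


s = base_score(success_rate=0.95, latency_norm=0.2, fraud_norm=0.1)
# 0.95 - 0.1*0.2 - 0.3*0.1 = 0.95 - 0.02 - 0.03, roughly 0.90
routable = is_routable(s)
b = burn_rate(blocks=6, total_requests=99)   # 6 / 100 = 0.06
cooldown = needs_cooldown(blocks=6, total_requests=99)
```

With these illustrative numbers the IP clears the 0.85 routing threshold on score alone, yet its burn rate of 0.06 exceeds 0.05, so the cooldown rule would still apply.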
// 04 — proxy evaluation trace

Scoring a residential IP
against Cloudflare.

A live trace of our proxy gateway evaluating a newly rotated Indian residential IP before assigning it to an e-commerce pipeline.

IP: 49.36.x.x · ASN: 55836 · Target: CF-Strict
edge.dataflirt.io — live
CAPTURED
// pre-flight check
ip.asn: "AS55836 Reliance Jio"
ip.type: "residential"
ip.fraud_score: 12 // low risk

// latency probe
ping.target: 142ms
ping.jitter: 18ms

// historical ledger lookup
history.target_cf: "no blocks in 72h"
history.global_success: 0.98

// active probe (headless)
probe.tls_ja4: "t13d1516h2_8daaf6152771"
probe.response: 200 OK
probe.captcha: false

// scoring engine
score.calculated: 0.94
routing.status: ASSIGNED TO POOL
// 05 — scoring penalties

What degrades a
proxy's score.

The primary factors that cause a proxy to be down-ranked or quarantined in a high-volume scraping pool. Scores are recalculated after every request.

POOL SIZE: 5.2M IPs
EVAL RATE: Real-time
UPDATED: 2026-05-19
01 Target-specific 403s / CAPTCHAs
immediate penalty · Direct evidence of detection

02 High fraud score (IPAM flag)
pre-flight penalty · IP listed in spam/bot databases

03 Connection timeouts
reliability penalty · Unstable residential nodes

04 ASN concentration limits
diversity penalty · Too many requests from one ISP

05 Concurrent session overload
capacity penalty · IP handling too many parallel streams
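Of the penalties listed above, the ASN concentration limit is the least obvious to implement, so here is one possible sketch. It assumes the limit is expressed as a maximum share of active sessions per ASN; the `within_asn_limit` helper and the 25% cap are hypothetical, and the `AS645xx` values are documentation-range placeholders rather than real ISPs.

```python
from collections import Counter


def within_asn_limit(candidate_asn, active_asns, max_share=0.25):
    """Return True if adding one more session on candidate_asn keeps that
    ASN's share of all active sessions at or below max_share (assumed cap)."""
    counts = Counter(active_asns)
    total_after = len(active_asns) + 1
    share_after = (counts[candidate_asn] + 1) / total_after
    return share_after <= max_share


active = [
    "AS55836", "AS55836",            # two sessions already on one ISP
    "AS64500", "AS64501", "AS64502",
    "AS64503", "AS64504", "AS64505",
]
crowded = within_asn_limit("AS55836", active)  # 3 of 9 sessions, over the cap
spread = within_asn_limit("AS64500", active)   # 2 of 9 sessions, under the cap
```

A gateway could treat a `False` result as a diversity penalty on the candidate IP rather than a hard rejection; which of the two is appropriate depends on pool size.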
// 06 — routing intelligence

Don't guess the route,

calculate it.

DataFlirt's proxy gateway doesn't just round-robin requests. Every IP in our residential and mobile pools is continuously scored against specific anti-bot profiles. If an IP starts seeing elevated challenge rates on a DataDome-protected target, its score for that specific target drops, and the gateway automatically routes it to lower-security endpoints to cool down. This preserves the IP's utility and keeps pipeline success rates above 99%.

gateway.score_profile

Real-time scoring matrix for IP 49.36.x.x across different target profiles.

target.cloudflare    0.94     route
target.datadome      0.42     cooldown
target.perimeterx    0.88     route
target.akamai        0.91     route
ip.fraud_risk        low      clean
ip.latency_avg       142ms
pool.status          active
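The route and cooldown labels in this matrix follow directly from the 0.85 routing threshold. A minimal sketch, using the four scores shown above and a hypothetical `routing_decision` helper:

```python
T_ROUTE = 0.85  # routing threshold quoted in this article

# Scores for IP 49.36.x.x, copied from the matrix above.
profile_scores = {
    "cloudflare": 0.94,
    "datadome": 0.42,
    "perimeterx": 0.88,
    "akamai": 0.91,
}


def routing_decision(score, threshold=T_ROUTE):
    """Label a per-target score as routable or cooling down."""
    return "route" if score > threshold else "cooldown"


decisions = {name: routing_decision(s) for name, s in profile_scores.items()}
```

Only the DataDome score falls below the threshold, so that is the one profile for which the IP is placed on cooldown while it stays routable for the other three.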


// 07 — FAQ

Common
questions.

About proxy evaluation, fraud scores, routing logic, and how DataFlirt maximizes residential IP lifespan.

What is the difference between proxy scoring and proxy rotation?
Proxy rotation is the mechanism of switching IPs between requests. Proxy scoring is the intelligence that decides which IP to rotate to. Without scoring, rotation is blind: you might rotate from a burned IP to another burned IP. Scoring ensures you only rotate to IPs that are statistically likely to succeed on that specific target.
How do fraud scores affect proxy viability?
Anti-bot vendors subscribe to the same IPAM and threat intelligence databases that security teams use. If an IP has a high fraud score (due to past spam, credential stuffing, or malware activity), the anti-bot classifier starts your session with a massive penalty. An IP with a high fraud score will often be served a CAPTCHA on its very first request, before the classifier has weighed any fingerprint or behavioral signal.
Why does an IP work for one site but fail on another?
Because proxy reputation is decentralized. An IP might have triggered a rate limit on an Akamai-protected airline site, burning its reputation there, but remain completely pristine in the eyes of a Cloudflare-protected e-commerce site. This is why a single global proxy score is a poor routing signal on its own; scores must be maintained per target or per WAF.
How does DataFlirt prevent burning residential IPs?
We use predictive scoring and aggressive cooldowns. If an IP's success rate on a target drops by even 2%, we pull it from that target's routing table immediately. We then route it to lower-tier targets (like basic surface web catalogs) while its reputation on the strict target naturally resets over 24-72 hours.
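A drop of this size can be detected with a sliding window over recent outcomes. The sketch below is one possible approach, not a description of DataFlirt's predictive model: it reads "2%" as two percentage points below a known baseline, and the `DropDetector` class, the window size, and the baseline value are assumptions.

```python
from collections import deque


class DropDetector:
    """Flags an IP when its recent success rate falls below its baseline."""

    def __init__(self, baseline, window=100, max_drop=0.02):
        self.baseline = baseline              # long-run success rate, e.g. 0.98
        self.max_drop = max_drop              # assumed: 2 percentage points
        self.outcomes = deque(maxlen=window)  # keeps only the latest results

    def record(self, success):
        """Record one request outcome; return True if the IP should be pulled."""
        self.outcomes.append(1 if success else 0)
        if len(self.outcomes) < self.outcomes.maxlen:
            return False                      # window not full yet
        recent = sum(self.outcomes) / len(self.outcomes)
        return (self.baseline - recent) >= self.max_drop


det = DropDetector(baseline=0.98, window=50)
warmup = [det.record(True) for _ in range(50)]  # fills the window, no flags
after_one = det.record(False)                   # 49 of 50 succeed: at baseline
det.record(False)
after_three = det.record(False)                 # 47 of 50 succeed: flagged
```

A single failure leaves the windowed rate at the 0.98 baseline, so nothing happens; by the third failure the rate has fallen to 0.94 and the detector signals that the IP should be pulled from that target.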
How often should proxy scores be updated?
In a production pipeline, scores must be updated synchronously with every response. If a worker receives a 403, the gateway must instantly decrement that IP's score for that target. Batch updating scores every hour means you will burn thousands of requests in the interim.
Is it legal to test proxies against targets?
Sending standard HTTP probes to evaluate latency and response codes is standard network behavior. However, aggressively probing a target to map its anti-bot thresholds can be viewed as hostile. We rely primarily on passive scoring—updating scores based on the results of actual, legitimate data extraction requests rather than synthetic probes.