What is Requests Per Second (RPS)?
Requests Per Second (RPS) is the fundamental throughput metric of a scraping pipeline: the number of HTTP requests that are sent and receive a response within a one-second window. It determines how long a pipeline run takes, how quickly the proxy pool is exhausted, and how likely the target is to rate-limit you. For data engineers, tuning RPS is a balancing act between data freshness and pipeline survival: push too hard, and you burn your IP reputation; go too slow, and your data is stale before it hits S3.
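To make the definition concrete, here is a minimal sketch of how RPS could be measured over a sliding one-second window, and how a sustained rate translates into pipeline duration. The names `RpsMeter` and `pipeline_duration_seconds` are hypothetical and chosen for illustration. The sketch assumes a request counts once its response has arrived.

```python
from collections import deque


class RpsMeter:
    """Counts completed requests inside a sliding window (hypothetical helper)."""

    def __init__(self, window_seconds: float = 1.0):
        self.window_seconds = window_seconds
        self._completions = deque()  # completion timestamps, oldest first

    def record(self, timestamp: float) -> None:
        # Call once per request, at the moment its response arrives.
        self._completions.append(timestamp)

    def rps(self, now: float) -> float:
        # Drop completions that have aged out of the window, then
        # divide what is left by the window length.
        cutoff = now - self.window_seconds
        while self._completions and self._completions[0] <= cutoff:
            self._completions.popleft()
        return len(self._completions) / self.window_seconds


def pipeline_duration_seconds(total_requests: int, rps: float) -> float:
    # Time needed to finish a run at a sustained request rate.
    if rps <= 0:
        raise ValueError("rps must be positive")
    return total_requests / rps
```

In a real pipeline the timestamps would typically come from `time.monotonic()`. As a rough illustration of the duration trade-off, one million requests at a sustained 50 RPS take 20,000 seconds, a little over five and a half hours.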