What is Pages Per Minute?
Pages per minute (PPM) is the aggregate throughput metric of a scraping pipeline, measuring how many complete, validated HTML documents or JSON payloads are successfully fetched and parsed within a 60-second window. While engineers often focus on requests per second (RPS) at the network layer, PPM is the actual business metric: it accounts for retries, proxy timeouts, CAPTCHA blocks, and extraction failures. A high RPS with a low PPM means you are burning bandwidth on failed requests.