What is Change Data Capture (CDC)?
Change Data Capture (CDC) is an architectural pattern that identifies and tracks row-level changes in a database (inserts, updates, and deletes) and streams those events to downstream consumers in near real time. In the context of scraping pipelines, CDC flips the extraction model from batch polling to event-driven delivery. Instead of re-querying a massive target catalog every night to see what changed, a CDC-enabled pipeline streams only the changed records (the delta), so compute costs, egress bandwidth, and target server load scale with the number of changes rather than with the size of the catalog.
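To make the event-driven model concrete, here is a minimal sketch in Python of a downstream consumer applying a stream of row-level change events to its own copy of a catalog. The `ChangeEvent` shape, the `apply_event` helper, and the `sku-*` records are all hypothetical names chosen for illustration; real CDC tools define their own event formats.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ChangeEvent:
    """One row-level change. Field names are illustrative, not a standard."""
    op: str                      # "insert", "update", or "delete"
    key: str                     # primary key of the affected row
    row: Optional[dict] = None   # new row contents; None for deletes


def apply_event(replica: dict, event: ChangeEvent) -> None:
    """Apply a single change event to a downstream copy of the table."""
    if event.op in ("insert", "update"):
        replica[event.key] = event.row
    elif event.op == "delete":
        replica.pop(event.key, None)
    else:
        raise ValueError(f"unknown op: {event.op!r}")


# Downstream copy of a hypothetical product catalog.
replica = {
    "sku-1": {"price": 10},
    "sku-2": {"price": 20},
    "sku-3": {"price": 30},
}

# Only the rows that changed arrive; the unchanged sku-1 is never re-read.
events = [
    ChangeEvent("update", "sku-2", {"price": 25}),
    ChangeEvent("delete", "sku-3"),
    ChangeEvent("insert", "sku-4", {"price": 40}),
]

for event in events:
    apply_event(replica, event)
```

The consumer does work once per event, not once per catalog row. A nightly batch poll would have to read all rows again just to discover the same three changes.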