What is Message Queue?
Message queue is the asynchronous communication layer that decouples URL discovery from data extraction in a scraping pipeline. Instead of a crawler fetching a page and immediately parsing it, it pushes the URL payload to a queue where idle worker nodes consume it at their own pace. This architecture absorbs traffic spikes, handles retries without blocking the main thread, and ensures that a temporary database outage doesn't crash an entire multi-day crawl.