What is Scrapy Spider?
Scrapy Spider is the core Python class in the Scrapy framework that defines how a specific site should be crawled and parsed. It dictates the initial requests, follows pagination links, and extracts structured data from the response. When a spider is poorly written, it blocks the reactor thread, leaks memory, and turns a high-throughput async pipeline into a fragile, single-threaded bottleneck.