What is Node Selection?
Node selection is the process of targeting specific elements within a parsed HTML or XML document to extract their underlying data. It is the foundational mechanism of the extraction layer, relying on query languages like CSS selectors or XPath to traverse the Document Object Model (DOM). In production pipelines, node selection is the most brittle component—when a target website updates its layout, selectors fail, causing silent data loss or pipeline crashes if not actively monitored.