What is Schema Evolution?
Schema evolution is the controlled process of adapting data structures over time without breaking downstream consumers or losing historical context. In scraping pipelines, target websites change their DOM and API payloads constantly. Evolution dictates how those upstream changes—new fields, renamed keys, altered types—are safely propagated into your data warehouse. Without a formal evolution strategy, schema drift causes silent data loss and pipeline failures.