What is Data Validity?
Data validity is the degree to which extracted records conform to the defined structural, typological, and business rules of your schema. Unlike accuracy — which measures if a value is true to the real world — validity only cares if the value is formatted correctly and falls within acceptable bounds. In a scraping pipeline, failing to enforce validity at the extraction layer means you are quietly writing poisoned types into your data warehouse, breaking downstream aggregations.