What is Snowflake Schema?
Snowflake schema is a multidimensional data model where a central fact table is connected to multiple dimension tables, which are themselves normalized into sub-dimensions. For scraped data pipelines, it trades query simplicity for storage efficiency and strict data integrity. While it reduces redundancy when storing complex hierarchical data — like nested e-commerce categories or multi-level corporate ownership — it requires heavy join operations that can bottleneck downstream analytics if not indexed correctly.