漏 2026 Hashnode
The broken dashboard at 2 AM. The failing ML model. The frantic Slack message: "Why did this column disappear?" Data teams everywhere face these crises because organizations often manage data reactively, allowing schema changes to occur without warni...

Data lineage is the practice of tracking where data comes from, how it changes, and where it ends up.For data engineers, it鈥檚 like a map showing every step data takes so you can understand dependencies and the impact of changes, from the very beginni...

Preface 馃摎 I鈥檝e actually written a post on data contracts before, so have a quick scan here if you want to see a project I created on them using Python, AWS S3 and other libraries (like Selenium and Soda). What is a contract? 馃 Let鈥檚 first talk abou...

Preface 馃挮 This is a practical end-to-end data pipeline demo to show what a data project incorporating data contracts looks like. We鈥檒l be scraping the Premier League table standings for the 2023/24 season, as of 13th February 2024 (the date this art...
