Apr 25 · 3 min read · Every application you use generates data. But raw data is messy and not useful on its own. If you’ve ever wondered how companies turn messy data into meaningful insights—this is where ETL pipelines co
Join discussionMar 1 · 8 min read · When a function throws, the error channel disappears from the type system. The caller gets unknown in a catch block. There's no equivalent to Java's throws clause. No way for the compiler to say: Thi
Join discussion
Feb 13 · 27 min read · Mastering Complex Orchestration Scenarios. Introduction: The Orchestrator's Toolkit Imagine you're conducting a symphony. You don't just wave your baton - you cue sections, adjust tempo, handle surprises, and ensure harmony. That's what advanced work...
Join discussionFeb 3 · 8 min read · From Legacy Workflow Tools to Declarative Data Pipelines Introduction: The Orchestration Evolution Imagine you're building a complex data pipeline. You have data scattered across cloud storage, databases, and APIs. You need to transform it, validate ...
Join discussionJan 23 · 12 min read · Project aim The main aim of this project, I would say, was to pick up Kubernetes while exercising my Python, Docker, and Postgres muscles. I have always heard of Kubernetes (I don’t feel competent enough to graduate to K8s yet), but I chalked it off ...
Join discussionJan 18 · 11 min read · Big Picture: ETL vs ELT ETL (Extract → Transform → Load): Transform happens before loading into the target warehouse/mart. Common when target compute is limited or rules must be enforced early. ELT (Extract → Load → Transform): Raw data is loaded ...
Join discussionJan 11 · 4 min read · If you’ve ever heard terms like data lake, data warehouse, ETL, streaming, or lakehouse and thought: “I kind of get it… but not really” —you’re not alone. This post breaks down modern data architecture in simple terms, explains why each component e...
Join discussionNov 16, 2025 · 5 min read · From Raw to Ready: Building ETL Pipelines on GCP with Dataflow and Dataproc Data. Data everywhere. But what good is it if it's just sitting there, raw and unorganized? That's where ETL pipelines come in. They're like the chefs of the data world, taki...
Join discussion
Nov 15, 2025 · 19 min read · Part 1 (Introduction + Project Overview + Architecture Diagram explanation) 1. Introduction The Forever Retail Data Engineering Project is a fully functional, production-grade pipeline designed to showcase: cloud data ingestion scalable warehouse d...
Join discussion