Apr 1 · 3 min read · Data engineering has undergone a fundamental transformation over the past decade, driven by exponential data growth, cloud computing, and the demand for real-time analytics. Traditionally, data pipeli
Mar 16 · 18 min read · Why Your 10,000+ Queries Break After Migration — and How to Fix Them If you have ever migrated a data warehouse from Teradata to Google Cloud BigQuery, you have probably experienced that sinking feel
Feb 15 · 27 min read · Extending Kestra to Every Corner of Your Data Stack. Introduction: The Power of Plugins Imagine you're a master chef. You don't just have one knife - you have specialized tools for every task: a paring knife for delicate work, a chef's knife for chop...
Dec 18, 2025 · 3 min read · 📜 What Is the ETL/ELT Evolution? For decades, ETL (Extract, Transform, Load) was the standard approach to building data pipelines. Data was extracted from source systems, transformed in intermediate servers, and then loaded into warehouses. With clou...
Dec 16, 2025 · 3 min read · 📜 What Does “Raw to Ready” Mean? Every data journey begins with raw data: logs, events, transactions, files, APIs, sensors, and user interactions. Raw data is often incomplete, inconsistent, duplicated, or unstructured. Before it can be analysed, it...
Sep 4, 2025 · 3 min read · Imagine trying to manually process millions of records every day… that’s why data pipelines are essential. In data engineering, pipelines are the heart of the process. Without them, data engineers and analysts would need to manually transfer and proc...
Jun 27, 2025 · 2 min read · 1. ETL (Extract - Transform - Load) ETL stands for the three steps of a data processing workflow: Extract: Collect data from many different sources (databases, APIs, files, the web, …). Transform: Clean, standardize, compute, and change the format of the dat...
May 28, 2025 · 4 min read · This is the second post in my DBT blog series. If you missed the first part introducing DBT and its role in modern data workflows, you can find it here. After understanding what DBT is and why it's revolutionary for data transformation workflows, it'...
May 27, 2025 · 2 min read · Should I transform the data before storing it? Or after? Welcome to the world of ETL vs ELT. In this blog, let us go through some topics such as: What ETL and ELT actually mean (without any jargon!) Key differences and trade-offs Which is better f...