Apr 27 · 14 min read · How I combined Apache Flink, Redpanda, Airflow, dbt Cloud, and Grafana to track Bitcoin, Ethereum, Solana, BNB, and Cardano in real time — all running on Google Cloud for less than a cup of coffee per
Join discussion
Feb 12 · 9 min read · Why Traditional Stateless Processing Falls Short Stateless stream processors treat each event independently, making them simple but fundamentally limited. They cannot answer questions like "how many unique users visited in the last hour?" or "what's ...
Join discussionJan 18 · 8 min read · This blog walks through Apache Flink end to end, starting from what Flink is, how its architecture works, and how to deploy and properly utilize a Kubernetes cluster using Flink’s standalone Kubernetes mode. The goal is not just to get Flink running,...
Join discussion
Jan 14 · 4 min read · Imagine you have a factory that processes things. Flink is like that factory, and Kubernetes is like the factory floor manager that decides where machines go and how they run. 1. What is Flink? Apache Flink is a distributed data processing engine. ...
Join discussion
Dec 9, 2025 · 14 min read · A recent piece titled "Event Streaming is Topping Out" has been making the rounds in the data infrastructure community. While I respect the author's willingness to share uncomfortable truths about the industry, I think the argument conflates Confluen...
Join discussion
Aug 20, 2025 · 7 min read · Introduction You have done everything right. Your team has modernized to Java 17 LTS, your builds are clean, your Flink job compiles without warnings, and you have successfully pushed it to Amazon Managed Service for Apache Flink. You sit back, expec...
Join discussion
Jul 22, 2025 · 9 min read · Overview Change Data Capture (CDC) is a design pattern used to track and respond to data changes. This document provides an overview and practical approaches to implement CDC without using Kafka and NiFi. The methods discussed include: Custom Applic...
Join discussion
Jul 22, 2025 · 7 min read · Overview This page provides a step-by-step guide to setting up and running a Change Data Capture (CDC) pipeline using Flink CDC to capture changes in a PostgreSQL database and replicate them to another PostgreSQL database. Prerequisites PostgreSQL:...
Join discussion
Jul 20, 2025 · 4 min read · Introduction In today’s data-first landscape, organizations are under increasing pressure to make decisions in real time. While traditional batch ETL systems have served us well, they come with inherent limitations — data latency, operational complex...
Join discussion