© 2026 Hashnode
A streaming data warehouse built on RisingWave and Apache Iceberg combines the freshness of a real-time pipeline with the analytical depth of a traditional data warehouse—without the cost of either a dedicated streaming engine or a commercial warehou...

Apache Iceberg combined with RisingWave provides the ideal foundation for IoT data lakehouses. RisingWave ingests high-frequency sensor streams via Kafka, applies real-time transformations using materialized views, and sinks enriched data into Iceber...

Metadata Is the Real Index: How Lakehouse Systems Avoid Scanning the World
Today I want to talk about something that separates toy systems from production-grade data infrastructure: Metadata. Specifically, how modern lakehouse systems use metadata to...

Modern data platforms promise scale, speed, and flexibility, yet they often feel overly complex. Data lakes, warehouses, pipelines, BI layers, governance tools: instead of simplifying analytics, many platforms seem to add more moving parts. This com...

The data world, my friends, is in a fascinating state of flux. For years, we've chased the elusive "single source of truth," battling data silos and wrestling with the inherent trade-offs between the flexibility of data lakes and the robust governanc...

In the universe of data, the presence of duplicates is almost guaranteed. From customer records that repeat to transactions that appear more than once, duplicate data is a silent problem that can undermine the reliability ...
