Tag feed

#delta-lake

11 posts · 5 followers

MyData Insights · mydatainsights-blogs.hashnode.dev · Jul 15 · 9 min read

Delta Lake on Microsoft Fabric: Migration Gotchas & Fixes

Migrating Delta Lake workloads to Microsoft Fabric is not a lift-and-shift. The OneLake storage model, the way Fabric handles Delta table registration, and the quirks around external shortcuts mean th

SapotaCorp · sapotacorpvn.hashnode.dev · Jul 12 · 7 min read

Delta Lake schema enforcement and evolution: the guardrail and the trap

There is a moment that happens on almost every lakehouse project, usually a few weeks in. A scheduled job that has been running fine suddenly fails, and the error says Delta Lake refused the write bec

SapotaCorp · sapotacorpvn.hashnode.dev · Jul 12 · 7 min read

CDC into a lakehouse: Change Data Feed, MERGE, and not reprocessing everything

Most pipelines start append-only, and for a while that is fine. Data arrives, you add it to the table, life is simple. Then the source starts sending updates to records you already have, and deletes f

SapotaCorp · sapotacorpvn.hashnode.dev · Jul 12 · 7 min read

The Delta Lake performance gotchas nobody warns you about

A Delta table that flew at launch and crawls six months later is one of the most common things teams bring us, and the cause is almost never the thing they suspect. They look at cluster size and query

Abstract Algorithms · abstractalgorithms.hashnode.dev · Apr 19 · 34 min read

Reading and Writing Data in Spark: Parquet, Delta, JSON, and JDBC

TLDR: Parquet's columnar layout with row-group statistics enables predicate pushdown that can reduce a 500 GB scan to 8 GB. Delta Lake wraps Parquet with a JSON transaction log to add ACID semantics a

RisingWave Labs · risingwave.com · Apr 3 · 7 min read

Apache Iceberg vs Delta Lake vs Apache Hudi: Table Format Comparison

Apache Iceberg, Delta Lake, and Apache Hudi are all open table formats that bring ACID transactions and reliable analytics to data lakes. Iceberg offers the broadest multi-engine support and most flexible metadata design. Delta Lake has the deepest S...

Abstract Algorithms · abstractalgorithms.hashnode.dev · Mar 28 · 23 min read

Medallion Architecture: Bronze, Silver, and Gold Layers in Practice

TLDR: Medallion Architecture solves the "data swamp" problem by organizing a data lake into three progressively refined zones — Bronze (raw, immutable), Silver (cleaned, conformed), Gold (aggregated,

Abstract Algorithms · abstractalgorithms.hashnode.dev · Mar 28 · 24 min read

Modern Table Formats: Delta Lake vs Apache Iceberg vs Apache Hudi

TLDR: Delta Lake, Apache Iceberg, and Apache Hudi are open table formats that wrap Parquet files with a transaction log (or snapshot tree) to deliver ACID guarantees, time travel, schema evolution, an

Arnaud Pouppeville · arnaudp.dev · Feb 18 · 7 min read

How Z-Order Cut My Databricks ETL Time in Half

I spent years working with SQL Server before moving to Databricks. In SQL Server, clustered indexes are second nature. You define them, the engine physically organizes rows on disk in that order, and your queries fly. When I started building data pip...

Alex Merced · techblog.alexmerced.com · Jan 7, 2025 · 5 min read

When to use Apache Xtable or Delta Lake Uniform for Data Lakehouse Interoperability

Blog: What is a Data Lakehouse and a Table Format? · Free Copy of Apache Iceberg the Definitive Guide · Free Apache Iceberg Crash Course · Lakehouse Catalog Course · Iceberg Lakehouse Engineering Video Playlist. The value of the lakehouse model, along with t...
