Alex Merced · techblog.alexmerced.com · May 15, 2024
3 Reasons Data Engineers Should Embrace Apache Iceberg
Data engineers are constantly seeking ways to streamline workflows and enhance data management efficiency. Apache Iceberg, a high-performance table format for huge analytic datasets, has emerged as a game-changer in the field. By offering powerful fe...
Tag: apacheiceberg

Farbod Ahmadian for DataChef's Blog · blog.datachef.co · May 14, 2024
Apache Iceberg Compaction
Introduction: Apache Iceberg introduces a powerful compaction feature, especially beneficial for Change Data Capture (CDC) workloads. This document outlines the key properties and commands necessary for effective Iceberg table management, focusing on ...
Tag: apacheiceberg

Alex Merced · techblog.alexmerced.com · Apr 21, 2024
A Deep Intro to Apache Iceberg and Resources for Learning More
For a long time, siloed data systems such as databases and data warehouses were sufficient. These systems provided convenient abstractions for various data management tasks, including: Storage locations and methods for data. Identification and recog...
Tag: apacheiceberg

Alex Merced · techblog.alexmerced.com · Apr 18, 2024
Collection of Hands-on Exercises to Get Started with Apache Iceberg
Don't Miss Out on Several Great Talks on Apache Iceberg at the Subsurface Conference on May 2nd and 3rd, 2024. Register now for free. Apache Iceberg is an innovative data lakehouse table format designed to revolutionize how you manage large-scale data...
Tag: apacheiceberg

Alex Merced · techblog.alexmerced.com · Apr 4, 2024
Understanding the Future of Apache Iceberg Catalogs
Apache Iceberg is revolutionizing the data industry as an open-source table format that allows data lake storage layers to function as full-fledged data warehouses, a concept known as a data lakehouse. This transformation has led to the development o...
Tag: apacheiceberg

Prabodh Agarwal for CMD-LYNE · toplyne.hashnode.dev · Apr 3, 2024
Skiing with Snowflake
In this article, I will demonstrate how to formulate a lakehouse strategy that pairs well with Snowflake. A few months ago, I began exploring opportunities to develop ETL pipelines in Ray. I had to perform my PoC on SnowflakeDB. Unfortunately, Ray C...
Tag: #apache-spark

Alex Merced · techblog.alexmerced.com · Apr 1, 2024
End-to-End Basic Data Engineering Tutorial (Spark, Dremio, Superset)
Data engineering aims to make data accessible and usable for data analytics and data science purposes. This involves several key aspects: Transferring data from operational systems like databases to systems optimized for analytical access. Modeling...
Tag: data lakehouse

Alex Merced · techblog.alexmerced.com · Mar 19, 2024
5 Open Source Data Projects You Should Be Following
Follow Me On Social. Subscribe to my SubStack. Open source technology significantly impacts various development areas, and the data sector is no exception. Today's data landscape features increasingly large datasets that often rely on external sources ...
Tag: ibis

Alex Merced · techblog.alexmerced.com · Mar 9, 2024
5 reasons Dremio is the ideal Apache Iceberg Lakehouse Platform
The Apache Iceberg table format has seen an impressive expansion in its compatibility with a vast spectrum of data platforms and tools. Among these, Dremio stands out as a pioneer, having embraced Apache Iceberg early on. In this article, we delve in...
Tag: data-engineering

Alex Merced · techblog.alexmerced.com · Mar 6, 2024
The Apache Iceberg Lakehouse: The Great Data Equalizer (disrupting the Snowflake/Databricks status quo)
Get an Early Release Copy of Apache Iceberg the Definitive Guide. Follow this tutorial to create a Data Lakehouse on your Laptop. Iceberg Lakehouse Engineering Video Playlist. In the dynamic realm of data platform development, competition among vendors ...
Tag: snowflake