Alex Mercedtechblog.alexmerced.com·Apr 4, 2024Understanding the Future of Apache Iceberg CatalogsApache Iceberg is revolutionizing the data industry as an open-source table format that allows data lake storage layers to function as full-fledged data warehouses, a concept known as a data lakehouse. This transformation has led to the development o...Discussapacheiceberg
Alex Mercedtechblog.alexmerced.com·Apr 1, 2024End-to-End Basic Data Engineering Tutorial (Spark, Dremio, Superset)Data engineering aims to make data accessible and usable for data analytics and data science purposes. This involves several key aspects: Transferring data from operational systems like databases to systems optimized for analytical access. Modeling...Discuss·27 readsdata lakehouse
Alex Mercedtechblog.alexmerced.com·Mar 28, 2024Great Blogs on DataOps for Apache Iceberg LakehousesDataOps, short for Data Operations, represents the seamless orchestration of people, processes, and technology to enhance the quality and reduce the cycle time of data analytics. At the heart of this approach is data versioning, a critical practice t...Discussdata lakehouse
Alex Mercedtechblog.alexmerced.com·Mar 6, 2024The Apache Iceberg Lakehouse: The Great Data Equalizer (disrupting the Snowflake/Databricks status quo)Get an Early Release Copy of Apache Iceberg the Definitive Guide Follow this tutorial to create a Data Lakehouse on your Laptop Iceberg Lakehouse Engineering Video Playlist In the dynamic realm of data platform development, competition among vendors ...Discusssnowflake
Alex Mercedtechblog.alexmerced.com·Mar 1, 2024A Deep Dive into the Concept and World of Apache Iceberg CatalogsGet a Free Copy of "Apache Iceberg: The Definitive Guide" Build an Iceberg Lakehouse on Your Laptop Apache Iceberg is an open-source table format designed for data lakehouse architectures, enabling the organization of data on data lakes in a manner ...Discussapacheiceberg
ByteHousebytehouse.hashnode.dev·Dec 20, 202310 use cases of a data lakehouse for modern businessesA data lakehouse is a contemporary data architecture that merges the attributes of a "data lake" and a "data warehouse." This approach offers a cohesive method for storing, governing, and analysing data within an organisation. The concept of a data l...Discussiceberg
Philip Bellbigdataevangelism.hashnode.dev·Apr 14, 2023Lessons Learned Running Presto at Meta ScalePresto is a free, open-source SQL query engine. We’ve been using it at Meta for the past ten years and learned a lot while doing so. Running anything at scale - tools, processes, services - takes problem-solving to overcome unexpected challenges. Her...Discuss·27 readsdata lakehouse
dataguruthedataguru.hashnode.dev·Feb 4, 2023Lakehouse Architecture - will it be the future of data warehouse?Image Source: Data Lakehouse – Databricks One of the foundational papers (Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics (cidrdb.org)) coined the idea of Lakehouse which explores the opportunity to ha...Discuss·82 readsdata-engineering