Kay SauterProkaysauter.hashnode.dev·Nov 19, 2024Loading files automatically to bronze lakehouseIn my last post on LinkedIn, I explained how to export AdventureWorks2022 tables to csv files. If you don’t want to generate them, you can get them here, as stated in the last post (after my edit in which I realized I made a mistake). My actual blog ...Discussdata-engineering
Alex Mercedalexmerced.hashnode.dev·Nov 15, 2024Deep Dive into Dremio's File-based Auto Ingestion into Apache Iceberg TablesBlog: What is a Data Lakehouse and a Table Format? Free Copy of Apache Iceberg the Definitive Guide Free Apache Iceberg Crash Course Lakehouse Catalog Course Iceberg Lakehouse Engineering Video Playlist Manually orchestrating data pipelines to hand...Discussapacheiceberg
Alex Mercedalexmerced.hashnode.dev·Oct 16, 2024Data Lakehouse Roundup #1 - News and Insights on the LakehouseI’m excited to kick off a new series called "Data Lakehouse Roundup," where I’ll cover the latest developments in the data lakehouse space, approximately every quarter. These articles are designed to quickly bring you up to speed on new releases and ...Discussdata lakehouse
Alex Mercedalexmerced.hashnode.dev·Oct 5, 2024Ultimate Directory of Apache Iceberg ResourcesThis article is a comprehensive directory of Apache Iceberg resources, including educational materials, tutorials, and hands-on exercises. Whether you're a beginner or an experienced data engineer, this guide will help you navigate the world of Apach...Discussapacheiceberg
Alex Mercedalexmerced.hashnode.dev·Sep 25, 2024Virtualization + Lakehouse + Mesh = Data At ScaleFree Copy of Apache Iceberg: The Definitive Guide Free Apache Iceberg Crash Course As data continues to grow exponentially in scale, speed, and variety, organizations are grappling with the challenges of managing and leveraging vast amounts of infor...Discussdata virtualization
Alex Mercedalexmerced.hashnode.dev·Jul 31, 2024Understanding the Polaris Iceberg Catalog and Its ArchitectureNOTE: I am working on a hands-on tutorial for Polaris, so please watch for the Dremio Blog in the coming days. Also, check out many other great articles on the Dremio blog about Apache Iceberg, Data Lakehouses, and more. Apache Iceberg Crash Course ...Discussdata lakehouse
Alex Mercedalexmerced.hashnode.dev·Jul 26, 2024Reliability with Apache IcebergGet a Free Copy of "Apache Iceberg: The Definitive Guide" Sign Up for the Free Apache Iceberg Crash Course Calendar of Data Lakehouse Events Apache Iceberg is a powerful table format designed to handle large analytic datasets reliably and efficientl...Discussapacheiceberg
Alex Mercedalexmerced.hashnode.dev·Jul 12, 2024Databases Deconstructed: The Value of Data Lakhouses and Table FormatsCheckout out my Apache Iceberg Crash Course Get a free copy of Apache Iceberg the Definitive Guide Databases and data warehouses are powerful systems that simplify working with data by abstracting many of the inherent challenges, including: Storage...Discussdata engineer
Alex Mercedalexmerced.hashnode.dev·Jun 4, 2024Open Source Table Format + Open Source Catalog = No Vendor Lock-in (Nessie, Polaris, Gravitino)Two key components enable the data lakehouse to reach its full potential: the table format and the data catalog. A table format allows collections of files in your data lakehouse to be recognized as database tables, while a catalog facilitates tracki...Discussapacheiceberg
StarRocks Engineeringstarrocks.hashnode.dev·May 24, 2024Comparison of the Open Source Query Engines: Trino and StarRocksIn this post, we want to compare Trino, the popular distributed query engine that runs analytical queries over big volumes of data with interactive latencies with StarRocks. Sources of Information We’ve consulted StarRocks committers (Heng Zhao, Star...DiscussDatabases