Nicolas Fränkelfrankel.hashnode.dev·Nov 7, 2024DuckDB in ActionDisclaimer: this post includes affiliate links; I may receive compensation if you purchase the book from the different links provided in this post. This review is about DuckDB in Action by JoMark Needham, Michael Hunger, and Michael Simons from Manni...Discussbooks
Alex Mercedalexmerced.hashnode.dev·Oct 7, 2024Exploring Data Operations with PySpark, Pandas, DuckDB, Polars, and DataFusion in a Python NotebookApache Iceberg Crash Course: What is a Data Lakehouse and a Table Format? Free Copy of Apache Iceberg the Definitive Guide Free Apache Iceberg Crash Course Iceberg Lakehouse Engineering Video Playlist Data engineers and scientists often work wit...DiscussPython
Tobias Müllertobilg.com·Sep 20, 2024Querying IP addresses and CIDR ranges with DuckDBI had a use case that eventually required performing IP address lookups in a given list of CIDR ranges, as I maintain an open source project that gathers IP address range data from public cloud providers, and also wrote an article in my blog about an...Discuss·548 readsduckDB
Sandeep Pawarfabric.guru·Jul 30, 2024Using fsspec To Define OneLake Filesystem in FabricI mentioned on X the other day that, like other filesystem backends such as S3 and GCS, you can use fsspec to define the OneLake filesystem too. In this blog, I will explain how to define it and why it's important to know about it. https://x.com/Pawa...Sachin Nandanwar and 1 other are discussing this2 people are discussing thisDiscuss·2 likes·715 readsdaft
Manuel Schmidbauerdeltaload.hashnode.dev·May 24, 2024Building a poor man's data lake for Shopify dataInspired by a recent blog post, I decided to experiment with various technologies and build a small data lake for Shopify data. In this project, the following technologies are used: Data Ingestion:dlthub I use the dlt connector to push data from the ...DiscussPoor Man's Data LakeData-lake
Cloud Tunedcloudtuned.hashnode.dev·May 19, 2024Daily Hack #day46 - DuckDB Minimalistic Analytics DatabaseDuckDB is an open-source, lightweight database engine optimized for analytical workloads. It is characterized by its exceptional query performance, minimal resource requirements, and ease of integration into existing applications. DuckDB is particula...DiscussDaily HacksduckDB
Tobias Müllertobilg.com·Apr 16, 2024Chat with a DuckA while ago I published sql-workbench.com and the accompanying blog post called "Using DuckDB-WASM for in-browser Data Engineering". The SQL Workbench enables its users to analyze local or remote data directly in the browser. This lowers the bar rega...Discuss·2 likes·756 readsduckDB
Fritz Larcoblog.slingdata.io·Apr 12, 2024Export Data From StarRocks into DuckDBIntroduction Let's look at how we can easily export data from StarRocks into a local DuckDB database with Sling, a flexible command-line interface (CLI) data integration tool that enables rapid extraction and loading of data directly from the termina...Discuss·271 readsduckDB
Constantin Lungudatawise.dev·Apr 5, 2024A portable Data Analytics stack using Docker, Mage, dbt-core, DuckDB and SupersetJust wanted to share a small learning-by-doing project of mine. It's a containerized Data Analytics suite, covering end-to-end analytics process for a small (imaginary) company. We're talking about:- generating example data in parquet files using Pyt...Discuss·433 readsMy Data JourneyDocker
Tobias Müllertobilg.com·Jan 27, 2024Using DuckDB-WASM for in-browser Data EngineeringIntroduction DuckDB, the in-process DBMS specialized in OLAP workloads, had a very rapid growth during the last year, both in functionality, but also popularity amongst its users, but also with developers that contribute many projects to the Open Sou...Fabio Chen and 6 others are discussing this7 people are discussing thisDiscuss·8 likes·18.7K readsdata-engineering