Peter Marshall · pmio.hashnode.dev · Dec 9, 2024 · Modelling and analysing behavioural systems with Apache Druid · In his Druid Summit 2024 keynote, Eric Tschetter emphasized Druid’s potential for behavioral analytics. This article builds on Eric’s keynote, offering insights and strategies for designing analytics that prioritise behaviour. Apache Druid is an esta... · apachedruid
Analytics Engineering Digest · thatjuanpablo.hashnode.dev · Oct 4, 2024 · Dimensional Data Model for App Help Center Page · Introduction: The success of a digital product heavily depends on providing users with efficient support and guidance. One of the key tools for achieving this is a well-structured help center that users can navigate to find solutions to their problems... · dimensional modelling
Analytics Engineering Digest · thatjuanpablo.hashnode.dev · Sep 27, 2024 · Dimensional Data Modeling for an E-Commerce Store · Introduction: Understanding the Business Process. In an online retail environment like Walmart, there are multiple critical processes involved in fulfilling customer orders. These include capturing customer details, managing product inventories, proce... · data-engineering
Analytics Engineering Digest · thatjuanpablo.hashnode.dev · Sep 20, 2024 · Dimensional Data Model for Airbnb and VRBO · Understanding the Business Domain. Before diving into the data model, it's crucial to understand the key entities and relationships in an online marketplace for short-term homestays. Here are the primary entities: Users: Guests and hosts; Properties:... · analytics engineering
Vivek Khatri · tech.peculiarvivek.com · Aug 17, 2024 · DDIA - Chapter 3 - Storage and Retrieval - (Part 2) · Henlo frens. We discussed till LSM-tree in the last blog, had to break it down into 2 cause this chapter has too much information. Many databases use B-trees, it has become the standard. LSM-tree and B-tree both keep the key value pair sorted, but bot... · Databases
Analytics Engineering Digest · thatjuanpablo.hashnode.dev · Aug 16, 2024 · Dimensional Data Model for Ride-Sharing App Analytics · Cover photo credits: Ehsan Haque via Pexels. "Dimensional models are gracefully extensible to accommodate change. The predictable framework of a dimensional model withstands unexpected changes in user behavior" - Ralph Kimball. Understanding the Busi... · analytics engineering
Shiv Iyer · shiviyer.hashnode.dev · Jun 30, 2024 · Strategies to Optimize Hierarchical Data in SQL Databases for Better Performance · Applying hierarchical optimization to network structures in SQL-based database engineering involves organizing and optimizing hierarchical data to improve query performance, data integrity, and overall system efficiency. Here’s a detailed guide on ho... · 59 reads · Databases
Shiv Iyer · shiviyer.hashnode.dev · Apr 29, 2024 · Best Practices for Pagination in ClickHouse Databases · Implementing pagination in ClickHouse is crucial for efficiently handling large datasets, especially when building applications that require user interaction with substantial amounts of data. ClickHouse provides several approaches to achieve paginati... · 392 reads · ClickHouse
Shiv Iyer · shiviyer.hashnode.dev · Apr 28, 2024 · How to Use the CAST Operator for Data Type Conversion in ClickHouse · In ClickHouse, the CAST operator is used to convert a value from one data type to another. This can be particularly useful in scenarios where you need to ensure that data types match between different parts of your queries, such as when comparing col... · 337 reads · OLAP
Shiv Iyer · shiviyer.hashnode.dev · Apr 23, 2024 · Optimizing Linux Server Settings for Enhanced ClickHouse Performance: A Guide for High-Volume Data Ingestion · Optimizing a Linux server for ClickHouse, especially to handle high-velocity, high-volume data ingestion, involves several layers of system tuning. These enhancements are designed to maximize the performance of ClickHouse by leveraging the full poten... · 104 reads · ClickHouse