nikhilrao.blog

Harnessing the IP Address Data Type in Apache Druid (2025)
Everything you need to ingest, store, and analyze billions of IPv4 & IPv6 addresses at interactive speed. Why a native IP data type? Storing every address as a plain string inflates segment size, hurts bitmap selectivity, and forces costly runtime ...
May 18, 2025 · 4 min read

Geospatial Queries in Apache Druid: A Ride-Sharing Example
Apache Druid is a high-performance, real-time analytics database designed for large-scale data processing. Among its powerful features is the ability to handle geospatial data, enabling fast and efficient queries over latitude and longitude coordinat...
May 14, 2025 · 6 min read

Automating Google Cloud with GitHub Actions using gcloud CLI
In today's DevOps landscape, automating tasks and deployments is essential for efficiency and reliability. One powerful way to achieve this is by leveraging GitHub Actions to connect with Google Cloud Platform (GCP) using the gcloud CLI. In this post...
Jan 22, 2025 · 2 min read

How to Run a Scheduled Bash Script with Google Cloud Platform
Overview: Why would you want to run a scheduled Bash script in the cloud? Couldn’t you just run cron on your local machine? Running a Bash script in the cloud rather than on a local machine offers developers a multitude of advantages that can enhance ...
Jan 22, 2025 · 4 min read

Google Dataflow Optimization: Streaming Engine
What is Streaming Engine: "By default, the Dataflow pipeline runner executes the steps of your streaming pipeline entirely on worker virtual machines, consuming worker CPU, memory, and Persistent Disk storage. Dataflow's Streaming Engine moves pipelin...
Sep 13, 2023 · 2 min read