Feb 25 · 14 min read · Setting up the OpenTelemetry 'Astronomy Shop' demo on your local machine is easy, but the real adventure begins when you move it to a public GKE cluster with a custom domain, HTTPS, and proper routing
Nov 30, 2025 · 8 min read · Introduction Picture this: A user reports that your checkout flow is slow. You check the logs. Everything looks fine. You check the metrics. P99 latency is... bad. But why? Without tracing, you're guessing. Maybe it's the payment service? The invento...
Nov 5, 2025 · 4 min read · Introduction Distributed tracing helps understand how services behave when requests travel through multiple microservices. Tools like Jaeger are very useful, but they can also consume significant memory and CPU resources. This becomes especially noti...
Oct 7, 2025 · 18 min read · The operational landscape of modern software has fundamentally shifted. The monolithic applications of yesteryear, while presenting their own set of challenges, at least offered a singular, albeit complex, execution path. Today, the pervasive adoptio...
Jul 15, 2025 · 3 min read · Observability isn't just about logs anymore. In 2025, understanding how your API behaves under load, in production, across services is critical. That’s where OpenTelemetry shines. Unified logs, traces, and metrics Framework-agnostic setup Vendor-n...
Jun 15, 2025 · 12 min read · In my last article, I wrote about monitoring using Grafana and Prometheus. Today I would like to add tracing to the mix. There are three types of data when it comes to monitoring: logging, metrics, and tracing. We already have the first two types of da...
Jun 3, 2025 · 8 min read · Introduction Have you ever had a bug occur in production and had no idea what went wrong because your logs won’t tell you exactly what happened, or a request that takes unusually long to process? Sometimes debugging these issues without a...
May 8, 2025 · 4 min read · Observability is a critical part of software engineering. I like to think about it in a medical context: the engineer is the doctor and the server/cluster/app is the patient, who is not necessarily sick but still needs check-ups to make sure ...
Mar 7, 2025 · 3 min read · Introduction In Part 4, we implemented the orderd service. In a distributed system, tracking what happens across multiple services is crucial. Logging provides insights into what occurred, while tracing helps follow a request's journey across service...