4d ago · 9 min read · A team I worked with tried to build their billing system on top of their tracing pipeline. The idea was clean: every operation already generates a span; spans already have duration and attributes; adding user_id and billable_units to each span lets f...
Join discussionApr 13 · 11 min read · Advanced Observability on AWS ECS Fargate: OpenTelemetry, Jaeger, and Autonomous Self-Healing Introduction Production-grade deployments require more than automation — they require instrumentation. Automated deployments that succeed silently and fail ...
Join discussion
Apr 1 · 4 min read · "Why did the Agent do that?" If you are building Agentic systems today, this is the question that keeps you up at night. AI Agents are inherently non-deterministic. They loop, they reason, and they c
Join discussion
Mar 29 · 12 min read · Node.js OpenTelemetry in Production: Distributed Tracing, Custom Spans, and OTLP Distributed systems fail in distributed ways. A request enters your API gateway, fans out across a dozen microservices, hits three databases, publishes to a message queu...
Join discussionMar 29 · 12 min read · Spry Monitoring and Observability: Logging, Metrics, Tracing, and Alerting Monitoring and observability are critical for maintaining production‑grade Spry applications. In this comprehensive guide, we'll explore how to implement a complete observabil...
Join discussionMar 29 · 31 min read · Spry with Monitoring and Observability: Metrics, Logging, and Tracing with OpenTelemetry Building reliable Spry applications requires more than just writing code—you need visibility into how your application behaves in production. Monitoring and obse...
Join discussion