How I Instrumented vLLM on Kubernetes: The Dashboards, Queries, and SLOs
A practical observability setup for LLM inference on KServe — and the one-line misconfiguration it caught.
LLM serving breaks the assumptions behind ordinary service dashboards. A single "request late
vinayakgajare.hashnode.dev · 8 min read