Aalexeyg377inalexeygolev.hashnode.dev00Tracing in Production: When 1–150ms Turns Into 700ms May 18 · 4 min read · We run machine learning models in production serving real-time traffic. When our managed serving platform introduced built-in MLflow tracing, we wanted to enable it. Tracing promised visibility into iJoin discussion
Aalexeyg377inalexeygolev.hashnode.dev00The Hidden Cost Curve of “One-Click” ML ObservabilityMay 13 · 4 min read · The ask was reasonable. We had a high-volume ML serving endpoint and we, engineers and data scientists, wanted visibility into what requests and responses looked like in production. The managed ML serJoin discussion