One thing that's becoming clear with agentic systems is that traditional monitoring isn't enough anymore. Uptime can be green while an agent is quietly burning tokens, looping on tool calls, or making risky decisions.
The session-level visibility and security audit aspects here are what stood out to me. As agents get access to more tools and workflows, understanding why an action happened becomes just as important as knowing that it happened.
We've seen similar challenges at IT Path Solutions when working on AI agent deployments teams usually start by tracking infrastructure metrics, but the real operational insights come from tracing sessions, tool usage, token consumption, and abnormal behavior patterns.
Observability for AI is quickly evolving from a nice-to-have into a core part of running agents safely and cost-effectively in production.