We hit this exact constraint when building a multi-agent system — the moment our codebase exceeded the context window, agent reasoning quality dropped noticeably on cross-module tasks. Having a visible badge that tracks this would have caught the drift earlier instead of debugging degraded output quality. Curious whether the badge accounts for the effective context after system prompts and tool definitions consume their share of the window.