OpenAI Prompt Caching: Undocumented Cross-Model Behavior and Production Cost Implications
I'm building an AI agent from scratch—no frameworks, no abstractions—specifically to understand where every token goes and how much it costs. This is Phase 3 of my token economics research.
Phase 1 covered basic tool calling mechanics. Phase 2 reveal...
blog.pragmaticbyharsh.com12 min read