Three Kinds of Caching: Prompt, Semantic, Result
Every "AI app optimisation" post tells you to cache. None of them tell you which cache. There are at least three distinct caches that could live in an LLM pipeline, and they win in different places, stack in different orders, and fail in different wa...
ai-zero-to-hero.hashnode.dev12 min read