Semantic Cache for LLM responses
Prerequisites
API Management instance with an Azure OpenAI model deployment as an API
Deployment for the following APIs:
Chat Completion API
Embeddings API
Configured API Management instance to use managed identity authentication to the Azur...
dimitaronai.com2 min read