Semantic caching

James Perkins for Unkey · unkey.hashnode.dev · Jun 26, 2024

Large language models are getting faster and cheaper. The charts below show progress in OpenAI's GPT family of models over the past year:

[Charts: cost per million tokens ($); tokens per second]

Recent releases like Meta's Llama 3 and Gemini Flash have pushed...

Unkey Launchweek 1
