Layer your cache system like a wedding cake
In this article, we'll discuss how we reduced inference latency by over 30% using a series of caches.
One of the most important services in building conversational technology like voice assistants is the infer service: the one responsible for handling...
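The "wedding cake" idea is to stack a small, fast cache in front of a larger, slower one, so hot inference results are served from the top tier and only misses fall through. The article's own implementation isn't shown here, so the following is a minimal hypothetical sketch of a two-tier LRU cache with promotion and demotion between tiers; the class and parameter names are assumptions, not the author's code.

```python
from collections import OrderedDict

class LayeredCache:
    """Hypothetical two-tier cache sketch: a small hot L1 in front of
    a larger L2. Lookups check L1 first; an L1 miss that hits L2
    promotes the entry back into L1."""

    def __init__(self, l1_size=2, l2_size=8):
        self.l1 = OrderedDict()  # small, fast tier (most recent last)
        self.l2 = OrderedDict()  # larger, slower tier
        self.l1_size, self.l2_size = l1_size, l2_size

    def get(self, key):
        if key in self.l1:
            self.l1.move_to_end(key)  # refresh LRU position
            return self.l1[key]
        if key in self.l2:
            value = self.l2.pop(key)
            self._put_l1(key, value)  # promote hot entry to L1
            return value
        return None  # full miss: caller runs inference and calls put()

    def put(self, key, value):
        self._put_l1(key, value)

    def _put_l1(self, key, value):
        self.l1[key] = value
        self.l1.move_to_end(key)
        if len(self.l1) > self.l1_size:
            old_key, old_value = self.l1.popitem(last=False)
            self._put_l2(old_key, old_value)  # demote LRU entry to L2

    def _put_l2(self, key, value):
        self.l2[key] = value
        self.l2.move_to_end(key)
        if len(self.l2) > self.l2_size:
            self.l2.popitem(last=False)  # evict from the system entirely
```

In practice the tiers might be an in-process dict in front of something like Redis, but the promotion/demotion flow between layers is the same.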
immohsin.hashnode.dev · 13 min read