VRVijay Ram Enagantiinvijay-ram.hashnode.dev·Apr 2 · 16 min readTurboQuant by GoogleImplementing an ICLR 2026 paper on KV cache compression, discovering the gap between theory and practice, and building something that actually works. Idea The idea to try and build a justified clone 10