VRVijay Ram Enagantiinvijay-ram.hashnode.dev10TurboQuant by Google1d ago · 16 min read · Implementing an ICLR 2026 paper on KV cache compression, discovering the gap between theory and practice, and building something that actually works. Idea The idea to try and build a justified clone Join discussion