© 2026 LinearBytes Inc.
Search posts, tags, users, and pages
Yuxin Xu
Can I ask how we got the calculation of "can reduce memory usage by up to 32 times"? Thank you! Been stuck in that step for a while not knowing why.
Is it because of the below example (20/0.6 = 33.3333...)?
Ce Gao
Cofounder @TensorChord
Yuxin Xu Hello, that statement represents a theoretical result. In the case of FP32 (floating-point 32-bit) format, it requires 32 bits to store values, whereas a binary vector only needs 1 bit per element.