Great article. The comparison to the post-quantization model should certainly be done more thoroughly.

Comment · Article · Apr 18, 2024 · Are All Large Language Models Really in 1.58 Bits?