Types of LLM Quantization: By Timing, Scope, and Mapping
9h ago · 14 min read · TLDR: There is no single "best" LLM quantization. You classify and choose quantization along three axes: when you quantize (timing), what you quantize (scope), and how values are encoded (mapping). In practice, most teams start with weight quantizati...
Join discussion



