BitNet's 100B-on-a-CPU Achievement Isn't What You Think It Is
Originally published at lizecheng.net
Microsoft open-sourced bitnet.cpp yesterday. The headline landed with predictable breathlessness: a 100-billion-parameter model running on a single commodity CPU at 5-7 tokens per second. That's approximately hu...
lizecheng.hashnode.dev · 5 min read