Try it Live: BitNet
Microsoft launched a language model called BitNet with 2 billion parameters, trained on 4 trillion tokens. The distinctive feature of this model is how its parameter values are stored. BitNet b1.58 uses only about 1.58 bits per parameter. Pa...
blog.sribalaji.io · 2 min read
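To make the "about 1.58 bits per parameter" figure concrete, here is a minimal back-of-the-envelope sketch. It assumes each weight takes one of three values (-1, 0, +1), which would give log2(3) ≈ 1.585 bits of information per weight; the excerpt does not spell this out, so treat it as an assumption. The helper `weight_storage_gb` is hypothetical and ignores packing overhead, activations, and embeddings.

```python
import math

# Assumption (not stated in the excerpt): each weight is one of {-1, 0, +1}.
TERNARY_STATES = 3
bits_per_weight = math.log2(TERNARY_STATES)  # information content of one ternary weight


def weight_storage_gb(num_params: int, bits_per_param: float) -> float:
    """Idealized weight storage in GB (10**9 bytes), ignoring packing overhead."""
    return num_params * bits_per_param / 8 / 1e9


NUM_PARAMS = 2_000_000_000  # "2 billion parameters" from the post

ternary_gb = weight_storage_gb(NUM_PARAMS, bits_per_weight)
fp16_gb = weight_storage_gb(NUM_PARAMS, 16)  # 16-bit baseline for comparison

print(f"bits per ternary weight: {bits_per_weight:.3f}")
print(f"ternary weights: {ternary_gb:.2f} GB")
print(f"16-bit weights:  {fp16_gb:.2f} GB")
```

Under these assumptions the weights come to roughly 0.4 GB, against 4 GB for the same parameter count at 16 bits. This is an idealized lower bound; a real on-disk or in-memory format may use more bits per weight.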