Mercury 2: Inception's Diffusion LLM at 1,000 Tokens/s
4d ago · 10 min read

Most language models work like a very fast typist: one token at a time, left to right, no going back. Mercury 2 from Inception Labs works more like an editor: it starts with a rough draft and refines the whole thing in parallel until it converges on ...