Ollama Just Got 93% Faster on Mac. Here's How to Enable It.
My M4 Max was decoding Qwen3.5 at 58 tokens per second yesterday. Today it's doing 112. Same model, same hardware, same prompt. The only thing that changed was a single environment variable.
Ollama 0.19 shipped on March 31, 2026 with a preview of its...
alan-west.hashnode.dev5 min read