© 2026 Hashnode
Building a phone agent on a multimodal LLM: dropping faster-whisper and letting Gemma 4 hear the caller directly. A response-time and reply-accuracy benchmark across English, French, and Mandarin

When Alibaba's Qwen team released Qwen3.6-27B on April 22, 2026, it flipped an assumption that had quietly become dogma in the open-source AI space: that more parameters meant better code. A 27-billion-parameter dense model, running on hardware you ...
