5d ago · 10 min read · Alibaba's Qwen team dropped Qwen 3.6 Max Preview on April 20, 2026, and it immediately claimed the top spot on six major coding and agentic benchmarks — including SWE-bench Pro, where it posted a 57.3% score that surpassed Claude Opus 4.7 and GPT-5.5...
Join discussion5d ago · 10 min read · Alibaba's Qwen team released Qwen 3.6 Plus in late March 2026, and the benchmarks sent a clear message to the agentic coding community: a model outside the usual Claude/GPT duopoly now leads on the benchmark that matters most to developers running mu...
Join discussion
May 6 · 10 min read · TL;DR — I was happily running Qwen3.6 on llama.cpp. Then I saw claims of 2× speed with vLLM + NVFP4 + DFlash. So I installed it, fought through crashes, and measured it myself. Verdict: it's real. 88–
Join discussionMay 6 · 10 min read · Alibaba's Qwen team dropped Qwen 3.6 Max Preview on April 20, 2026, and it immediately claimed the top spot on six major coding and agentic benchmarks — including SWE-bench Pro, where it posted a 57.3% score that surpassed Claude Opus 4.7 and GPT-5.5...
Join discussionMay 2 · 10 min read · Alibaba dropped Qwen3.6-Max-Preview on April 20, 2026, and it immediately claimed the top score on six of the most demanding AI benchmarks: SWE-bench Pro, Terminal-Bench 2.0, SkillsBench, QwenClawBench, QwenWebBench, and SciCode. It scores 52 on the ...
Join discussionMay 2 · 10 min read · While the AI industry's attention has been locked on the OpenAI-Anthropic-Google triad, Alibaba's Qwen team has been shipping models at a pace that is quietly redrawing the competitive map. Qwen 3.5, released in March 2026, is the most capable open-w...
Join discussionApr 30 · 10 min read · Alibaba dropped Qwen3.6-Max-Preview on April 20, 2026, and it immediately claimed the top score on six of the most demanding AI benchmarks: SWE-bench Pro, Terminal-Bench 2.0, SkillsBench, QwenClawBench, QwenWebBench, and SciCode. It scores 52 on the ...
Join discussionApr 29 · 8 min read · TL;DR: Qwen3.6-Max-Preview is Alibaba's new proprietary flagship, released April 20, 2026. No open weights — API only. It tops six agentic coding benchmarks including SWE-bench Pro and Terminal-Bench 2.0. On Terminal-Bench 2.0, it ties Claude Opus 4....
Join discussionApr 25 · 5 min read · An RTX 3090 costs maybe $700 used. A single card with 24GB VRAM. And as of April 17, you can run a model on it that scores 73.4% on SWE-bench Verified — the industry's hardest real-world software engineering benchmark — at 50 to 65 tokens per second....
Join discussion