Do Open Frontier Models Have A Chance Against Closed Models ?
Which of the new open-ish frontier models has the best chance to stand up against closed-source models on both cost and quality?
I ran Ship-Bench against Kimi K2.6, Qwen 3.6 Plus, and DeepSeek v4 Pro
jason.agostoni.net12 min read