New Ovis2.6-30B-A3B, a lil better than Qwen3-VL-30B-A3B
A new MoE-powered vision model just dethroned the previous champion...
Ovis 2.6-30B-A3B has reportedly surpassed Qwen3-VL-30B-A3B to become the new top vision model at its size. The key upgrade is a shift to a Mixture-of-Experts (MoE) architecture, which promises superior multimodal performance at a lower serving cost. It also features major improvements in long-context understanding, high-resolution analysis, and document comprehension. While its coding may not beat GLM 4.7 Flash, it's now the benchmark for vision in the 30B-A3B class.
Why It Matters
This leap in efficient, high-performance vision AI lowers the barrier for advanced multimodal applications, from document analysis to complex visual reasoning.