
Baidu has released another open-source ERNIE model, and it’s a doozy. ERNIE-4.5-VL-28B-A3B-Thinking is a lightweight multimodal reasoning model with 3B active parameters that, according to Baidu, can outperform Gemini 2.5 Pro and GPT-5-High on some benchmarks. It has an interesting feature called “Thinking with Images” that lets the model zoom in and out of an image to capture finer details. ERNIE is very competitive with GPT-5-High and Gemini 2.5 Pro across the board, but it shines on MathVista, ChartQA, and AI2D (transparent).
A new addition to the ERNIE open-source model family is here!
Meet ERNIE-4.5-VL-28B-A3B-Thinking, our lightweight multimodal reasoning model.
> 3B active parameters with enhanced semantic alignment between visual and language modalities
> Outperforming Gemini-2.5-Pro and… pic.twitter.com/jRuSqLOOpV— Baidu Inc. (@Baidu_Inc) November 11, 2025

