    ERNIE-4.5-VL-28B-A3B-Thinking Multimodal Outperforms GPT-5?

    By AI Ninja, November 11 (1 min read)

    Baidu's ERNIE family has another open-source model, and it’s a doozy. ERNIE-4.5-VL-28B-A3B-Thinking is a lightweight multimodal reasoning model with 3B active parameters that, according to Baidu, can outperform Gemini 2.5 Pro and GPT-5-High. It has an interesting feature called “Thinking with Images” that lets the model zoom in and out of an image to capture finer details. ERNIE is competitive with GPT-5-High and Gemini 2.5 Pro across the board, but it shines on MathVista, ChartQA, and AI2D (transparent).

    A new addition to the ERNIE open-source model family is here!

    Meet ERNIE-4.5-VL-28B-A3B-Thinking, our lightweight multimodal reasoning model.

    > 3B active parameters with enhanced semantic alignment between visual and language modalities
    > Outperforming Gemini-2.5-Pro and… pic.twitter.com/jRuSqLOOpV

    — Baidu Inc. (@Baidu_Inc) November 11, 2025
