Close Menu
    What's Hot

    Jules Tools Lets You Do Tasks via Command Line

    October 3

    GLM-4.6 LLM for Advanced Agentic, Reasoning & Coding

    October 3

    Rodin Gen-2 Turns Any Image Into a 3D Model

    October 2
    Facebook X (Twitter) Instagram
    • AI Robots
    • AI News
    • Text to Video AI Tools
    • ChatGPT
    Facebook X (Twitter) Instagram Pinterest Vimeo
    Rad NeuronsRad Neurons
    • AI Robots
      • AI Coding
    • ChatGPT
    • Text to Video AI
    Subscribe
    Rad NeuronsRad Neurons
    Home » GPT 4.5 Leads in Elimination Game That Tests Reasoning, Strategy & Deception
    AI News

    GPT 4.5 Leads in Elimination Game That Tests Reasoning, Strategy & Deception

    AI NinjaBy AI NinjaMarch 31 Min Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Since its release, there has been a lot of discussion about how smart GPT 4.5 really is. It doesn’t score as high as o3-mini when it comes to coding in certain benchmarks. It is also very expensive to use for developers. But as it turns out, in the Elimination Game, which tests LLMs in social reasoning, strategy, and deception, it is leading other models.

     

     

    The idea is simple: in this game, players engage in public and private conversations, form alliances and vote to eliminate each other round by round. A jury of eliminated players then casts deciding votes to crown the winner. As far as double crossing, Claude 3.7 Sonnet had a greater tendency to do so.

    [HT]

    GPT 4.5
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleMiniMax Image-01: Cheap Cinematic Text-to-Image Model
    Next Article Opera’s Browser Operator: AI Agent That Does Things For You
    AI Ninja
    • Website

    Related Posts

    AI News

    GLM-4.6 LLM for Advanced Agentic, Reasoning & Coding

    October 3
    AI News

    Rodin Gen-2 Turns Any Image Into a 3D Model

    October 2
    AI News

    DeepSeek-V3.2-Exp with DeepSeek Sparse Attention(DSA) for Efficient Long-Context Handling

    September 29
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    Higgsfield Now Integrates Flux.1 Kontext

    June 1216 Views

    RayNeo X3 Pro Augmented Reality Glasses with AI

    June 1310 Views

    ASUS RUC-1000G Edge AI Computer with 600W GPU

    May 220 Views
    Most Popular

    Prompt Cannon: Run Prompts Across Multiple Models

    June 242,877 Views

    Dipal D1 2.5K Curved Screen 3D AI Character

    June 23876 Views

    GPTARS: GPT Powered TARS Robot

    November 21635 Views
    Our Picks

    Jules Tools Lets You Do Tasks via Command Line

    October 3

    GLM-4.6 LLM for Advanced Agentic, Reasoning & Coding

    October 3

    Rodin Gen-2 Turns Any Image Into a 3D Model

    October 2
    Tags
    3D agent AI AI model ai video app avatar browser canvas ChatGPT Chess Claude coding Cursor DeepSeek ElevenLabs Gemini glasses GPT Grok Hailuo Higgsfield image kling leonardo LLM MCP midjourney model music nano banana o3 OpenAI open source QWEN robot runway sora text to video Veo 2 Veo 3 Vibe coding video video model Voice

    © 2025 Rad Neurons. Inspired by Entropy Grid
    • Home
    • Terms of Use
    • Privacy Policy
    • Disclaimer

    Type above and press Enter to search. Press Esc to cancel.