Close Menu
    What's Hot

    Runway Gen 4.5 Gets Image to Video

    January 23

    Remotion Agent Skills: Now You Can Generate Videos with Claude Code

    January 22

    World API is Now Live for Generating 3D Worlds from Text, Images, Videos

    January 22
    Facebook X (Twitter) Instagram
    • AI Robots
    • AI News
    • Text to Video AI Tools
    • ChatGPT
    Facebook X (Twitter) Instagram Pinterest Vimeo
    Rad NeuronsRad Neurons
    • AI Robots
      • AI Coding
    • ChatGPT
    • Text to Video AI
    Subscribe
    Rad NeuronsRad Neurons
    Home » Gemini 2.5 Flash and Pro Text-to-Speech Updates Announced
    AI Audio

    Gemini 2.5 Flash and Pro Text-to-Speech Updates Announced

    AI NinjaBy AI NinjaDecember 113 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Google has already managed to take the lead with Gemini 3 Pro in many areas. It also has incredibly powerful TTS models. The latest updates can now give you more control over style, tone, pace, and accents. These models now offer context-aware speed adjustments and follow instructions better

    We’re launching Gemini 2.5 Flash and Pro Text-to-Speech (TTS) model updates 🚀

    Improvements include:

    – Emotional style and tone versatility
    – Context-aware pacing control
    – Improved multiple-speaker capabilities

    Dive into the blog to learn how these advancements are giving…

    — Google AI Developers (@googleaidevs) December 10, 2025

    Here is a sample prompt you can use to generate your own voice:

    ASMR Pro
    # AUDIO PROFILE: Willow T.
    ## "The ASMR Whisperer"
    
    ## The Scene: Recorded inside a converted Sprinter van parked near Burleigh Heads. The space is small and padded with tapestries and macramé, creating a very "dry" but warm acoustic environment. The microphone is a Neumann KU 100 Dummy Head (binaural), meaning the audio should pan slightly left and right as the character moves, simulating 3D space.
    
    ### DIRECTORS NOTES
    Style: Relaxed Gold Coast bohemian style ASMR content creator.
    Accent: Gold Coast, Australia
    The "Grounding" Breath: Deep, diaphragmatic exhales that sound like ocean waves. Not sharp, but long and audible releases of air.
    Wetness/Mouth Sounds: Essential for ASMR. The listener should hear the sticky, subtle sounds of the tongue moving against the roof of the mouth (the "clicks" and "smacks") between words.
    Prosody & Pacing: The "Drift": The tempo is incredibly slow and liquid. Words bleed into each other. There is zero urgency.
    The "Smile" filter: The voice must sound like the speaker is constantly smiling. This brightens the tone even when whispering.
    High Rising Terminal (Softened): The classic Australian upward inflection at the end of sentences, but slowed down. It shouldn't sound questioning, just open and inviting.
    Tone & Articulation:
    The Gold Coast Vowel Shift: "I" (as in "light") becomes a wide, slow "loit" or "lah-ee-t." "O" (as in "no") drifts into the classic Aussie "naur," but breathy and soft, not harsh. Sibilance: The 'S' sounds should be prominent but crisp, creating a high-frequency "tingle" trigger.
    Vocal Fry (The "Morning Voice"): A rumbly, relaxed texture in the lower register, sounding like they just woke up from a nap on the beach.

    You can try these new models in Google AI Studio. There is also a playground app for playing around with this.

    [HT]

    fTTS
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleCue Chef Cube O1 AI Cooking Gadget with Thermal Imager
    Next Article OpenAI Steps Up with GPT 5.2, Overtakes Claude Opus 4.5 in GDPval-AA
    AI Ninja
    • Website

    Related Posts

    AI Audio

    Lipsync-2-pro: Edit What Anyone Says In Any Video

    September 2
    AI Audio

    ElevenLabs Voice Design v3 Announced

    June 26
    AI Audio

    Nari Labs Dia Outperforms ElevenLabs, Sesame CSM-1B

    April 23
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    QwQ-32B DeepSeek R1 Comparable Model

    March 60 Views

    Google Shares Tips on Veo 3 Prompts

    August 11 Views

    ChatGPT Search Now Available without a Sign Up

    February 64 Views
    More
    AI Audio

    Lipsync-2-pro: Edit What Anyone Says In Any Video

    AI NinjaSeptember 2
    AI Audio

    ElevenLabs Voice Design v3 Announced

    AI NinjaJune 26
    AI Audio

    Nari Labs Dia Outperforms ElevenLabs, Sesame CSM-1B

    AI NinjaApril 23
    Most Popular

    Prompt Cannon: Run Prompts Across Multiple Models

    June 243,558 Views

    Dipal D1 2.5K Curved Screen 3D AI Character

    June 231,026 Views

    How to Use Claude in Unity & Unreal Engine with MCP

    March 19755 Views
    Our Picks

    Runway Gen 4.5 Gets Image to Video

    January 23

    Remotion Agent Skills: Now You Can Generate Videos with Claude Code

    January 22

    World API is Now Live for Generating 3D Worlds from Text, Images, Videos

    January 22
    Tags
    3D agent AI AI model ai video avatar canvas ChatGPT Chess Claude Claude Code coding DeepSeek ElevenLabs Gemini glasses GPT Grok Hailuo Higgsfield image kling leonardo LLM Manus midjourney Mini PC model music nano banana o3 offline OpenAI open source QWEN robot runway sora text to video Veo 2 Veo 3 Vibe coding video video model Voice

    © 2026 Rad Neurons. Inspired by Entropy Grid
    • Home
    • Terms of Use
    • Privacy Policy
    • Disclaimer

    Type above and press Enter to search. Press Esc to cancel.