Hunyuan Sonic Turns Images with Audio Into Speeches, Songs

In this day and age, you don’t need a whole lot to generate stunning videos with AI. Hunyuan Sonic is a nifty approach to breathing life into static images. It uses temporal audio learning for accurate lip-sync and natural expressions. By using a motion-decoupled controller, motion of the head and expression movement “are disentangled and independently controlled by intra-audio clips.”

Sonic can generate stunning videos with an image and audio input. It can generate long videos up to 10 minutes. As the above video shows, Sonic can create more dynamic, natural videos. Sonic works well with images that are not real humans.

[HT: Zhejiang University,Tencent ]

What's Hot

Video & Image JSON Prompts Cheatsheet

Deepseek V3.2 Changes the Game, Competes with GPT 5, Gemini 3.0

Top Black Friday Deals for AI: Higgsfield, Suno, Freepik

Video & Image JSON Prompts Cheatsheet

Deepseek V3.2 Changes the Game, Competes with GPT 5, Gemini 3.0

Top Black Friday Deals for AI: Higgsfield, Suno, Freepik

Midjourney Enhances Moodboards with Code Mixing

Apple Announces iPad mini Made for Apple Intelligence with ChatGPT

Cursor 2.0 Composer: Coding Model for Agentic Use

SOUYIE SW-9 GPT Powered Smartwatch

Pi GPT: Vibe Coding for Raspberry Pi

NarTick GPT Powered E-Ink Calendar

Most Popular

Prompt Cannon: Run Prompts Across Multiple Models

Dipal D1 2.5K Curved Screen 3D AI Character

GPTARS: GPT Powered TARS Robot

Our Picks

Video & Image JSON Prompts Cheatsheet

Deepseek V3.2 Changes the Game, Competes with GPT 5, Gemini 3.0

Top Black Friday Deals for AI: Higgsfield, Suno, Freepik

What's Hot

Hunyuan Sonic Turns Images with Audio Into Speeches, Songs

Related Posts