Many top AI video models now include an avatar feature that lets you upload an image and an audio clip and turn them into a talking video. Wan is no different. With Alibaba’s Wan2.2-S2V, you can now create dynamic talking and singing avatars. It supports multi-format output, including portrait, bust, and full-body formats.
Alibaba’s Wan2.2-S2V turns a single image and voice into avatars that talk, sing, and move! With dynamic scenes, smart motion, and multi-format output, the possibilities are endless 🪄
Try it out now to explore how you can create your own AI-powered avatar today! pic.twitter.com/EVidtBaouk
— Alibaba Group (@AlibabaGroup) September 2, 2025
The model can generate natural facial expressions, body movements, and camera work, and it supports both full-body and half-body character generation. The video above gives you an idea of how the model works.