
The Grok Voice Agent API was announced yesterday, and people are already doing cool things with it. It is designed for building voice agents that speak many languages. It ranks #1 on Big Bench Audio and is 5 times faster than its closest competitor. atariorbit has already gotten it running on ReachyMini, where it lets the robot interact with humans in a smarter, more natural way.
Grok Voice and realtime api make the new ReachyMini into the coolest robot for your desk. Puts life into this robot! Got the Voice API up and running in just moments. #huggingface #reachy #grokVoice @ClementDelangue pic.twitter.com/O3XyJuzyln
— atariorbit (@atariorbit) December 17, 2025
As xAI explains:
“We built our entire voice stack in-house, training our VAD, tokenizer, and audio models from scratch. In blind head-to-head human evaluations against OpenAI, Grok is consistently rated as the preferred model across axes such as pronunciation, accent, and prosody.”
Agents built with the API can call custom tools or use xAI’s real-time search.
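To make the custom tools idea concrete, here is a minimal sketch of the application side of tool calling: a registry of Python functions and a dispatcher that runs whichever one the agent asks for. It is not xAI's API. The `tool` decorator, `handle_tool_call`, the `get_battery_level` example, and the assumption that a tool request arrives as a name plus a JSON string of arguments are all hypothetical, chosen only for illustration.

```python
import json
from typing import Any, Callable, Dict

# Hypothetical registry mapping tool names to Python callables.
TOOLS: Dict[str, Callable[..., Any]] = {}


def tool(fn: Callable[..., Any]) -> Callable[..., Any]:
    """Register a function under its own name so the agent can request it."""
    TOOLS[fn.__name__] = fn
    return fn


@tool
def get_battery_level(robot_id: str) -> dict:
    # Placeholder value; a real tool would query the robot itself.
    return {"robot_id": robot_id, "battery_percent": 80}


def handle_tool_call(name: str, arguments_json: str) -> str:
    """Run the requested tool and return its result as a JSON string.

    Assumes (hypothetically) that the agent sends a tool name and a
    JSON-encoded object of keyword arguments.
    """
    if name not in TOOLS:
        return json.dumps({"error": f"unknown tool: {name}"})
    try:
        args = json.loads(arguments_json)
        result = TOOLS[name](**args)
    except (json.JSONDecodeError, TypeError) as exc:
        # Malformed JSON or arguments that do not match the function signature.
        return json.dumps({"error": str(exc)})
    return json.dumps(result)
```

Calling `handle_tool_call("get_battery_level", '{"robot_id": "reachy"}')` returns the placeholder reading as JSON. In a real integration, that string would be sent back to the voice session in whatever message format the API defines.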

