News

A new update has been rolled out to the Figure 01 humanoid robot equipping it with ChatGPT Vision to see and speech-to-speech communications.
Text-to-speech models from ElevenLabs, Hume AI, and Descript are all pushing the limits of AI-generated voice technology.
Researchers at Amazon have trained the largest ever text-to-speech model yet, which they claim exhibits “emergent” qualities improving its ability to speak even complex sentences naturally ...
ElevenLabs, the highly-valued AI voice cloning and generation startup from former Palantir alumni, today launched Scribe v1, a new speech-to-text model that reportedly achieves the highest ...
Hume claims Octave is the first text-to-speech system powered by a large language model (LLM) trained not only on text but on speech and emotion tokens, enabling it to understand words in context ...
AI text-to-speech programs could “unlearn” how to imitate certain people New research shows models can be directly edited to hide selected voices, even when users specifically ask for them.
Google is enhancing Gemini's text-to-speech (TTS). On Tuesday at Google I/O 2025, the company previewed a new TTS feature, built on native audio output, that can "converse in more expressive ways ...