Release | March 26, 2026 | 1 min read
Voxtral TTS Revolutionizes Voice Cloning with 3-Second Audio Samples
Mistral's Voxtral TTS model can clone a voice from just three seconds of audio and supports nine languages. It is reported to score well on latency and naturalness and to outperform competing text-to-speech systems in key areas.
French AI startup Mistral has released Voxtral TTS, its first text-to-speech model. It supports nine languages and can clone a voice from just three seconds of audio. Source: The Decoder, "Mistral's first open-weight TTS model Voxtral clones voices from three seconds of audio across nine languages."