ReleaseMarch 26, 20261 min read

Voxtral TTS Revolutionizes Voice Cloning with 3-Second Audio Samples

Mistral's groundbreaking Voxtral TTS model can clone voices from just three seconds of audio, supporting nine languages and boasting impressive latency and naturalness scores. This innovation sets a new standard for text-to-speech technology, outperforming competitors in key areas.

French AI startup Mistral has released Voxtral TTS, its first text-to-speech model that supports nine languages and can clone voices from just three seconds of audio. The article Mistral's first open-weight TTS model Voxtral clones voices from three seconds of audio across nine languages appeared first on The Decoder.

Browse All Models Compare Models All News

Voxtral TTS Revolutionizes Voice Cloning with 3-Second Audio Samples

Explore