This demo is running a fine-tuned XTTS model. XTTS is a multilingual text-to-speech and voice-cloning model. This demo features zero-shot voice cloning.

Supported languages: Finnish: fi, English: en, Estonian: et, German: de, Russian: ru


Language

Select an output language for the synthesised speech

This check can improve output if your microphone or reference voice is noisy

I agree to the terms of the CPML: https://coqui.ai/cpml

Examples
Text Prompt Language Reference Audio Cleanup Reference Voice Agree