Training a new voice for Piper TTS with only a single phrase
3 naggie 1 7/3/2025, 12:31:02 PM calbryant.uk ↗
Comments (1)
magicalhippo · 14h ago
Author uses Chatterbox TTS' zero-shot voice cloning to generate synthetic training data from a single phrase, Whisper STT to verify the generated voice sample to catch generation errors, and then uses the synthetic data set to fine-tune Piper TTS the standard way.