I am sitting in a room: Finding the fixpoint of Chatterbox voice cloning [video]

1 gcr 1 7/19/2025, 1:34:26 AM youtube.com โ†—

Comments (1)

gcr ยท 19h ago
I applied Chatterbox TTS's open-source voice cloning in a loop!

At iteration i, Chatterbox attempts to mimic the vocal style of `output[i-1]` by copying the content, rhythm, and prosody of `output[i-2]`.

It takes around ten minutes to become quite bad, and it gets unrecognizable within about 20. By the middle, it starts sounding like some new human language, but becomes completely unrecognizable glossolalia by the end.