Issue: Voice Quality Degradation (Childish Tone) After Dataset & Config Changes in Coqui VITS Voice Cloning
Summary: The user experienced a significant regression in voice prosody and pitch (resulting in a “childish” tone) while attempting to fix pronunciation issues in a Coqui TTS VITS voice cloning pipeline. The regression followed two distinct missteps: a naive dataset expansion, then a drastic dataset reduction combined with a configuration change. The core failure …