Questions about training time, training set sizes and expectations #3791
Replies: 1 comment
-
Just to update on this: One sounds like static, the other is more of grating high pitched sound: |
Beta Was this translation helpful? Give feedback.
-
Hi there, I've been trudging my way through this, me being relatively inexperienced with coding and TTS in general, trying to get somewhere. My main goal is to end up with a custom model that produces audio which sounds like the speaker in my sample files.
I have about 800 1 to 5 second long wavs with transcribed with corresponding metadata file for training. I attempted to train a WaveRNN vocoder and ran ~700 epochs (14 hours on a GTX1080) but whenever I went to test the .pth on some text, the output was just static noise.
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions