While the vocoder is training, it writes audio to three folders: "audio", "dev_eval", and "train_no_dev_eval".
The audio files in "dev_eval" and "train_no_dev_eval" all sound very good. However, the files in "audio" all have an audible hissing background noise.
What's the difference between those three folders?
Is it possible that the files written to the "audio" folder were generated differently from the ones in the other two folders?
Thanks!
Does it make sense that a waveform generated in training mode sounds worse than one generated in inference mode?
I mean, in training mode the model has access to the previous ground-truth samples, so its output should sound at least as good as the output from inference mode.
What's your opinion on this?
Thanks!
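To make the reasoning above concrete, here is a minimal toy sketch of the two modes. It is not the vocoder's code: the one-coefficient "model", the function names, and the signal are all hypothetical, and it only compares per-sample prediction error for a deterministic predictor. A real vocoder may sample from an output distribution at each step, and whether the "audio" folder is actually produced in training mode is an assumption that this thread does not confirm.

```python
def teacher_forced(truth, coeff):
    """Predict each sample from the previous ground-truth sample (training mode)."""
    return [truth[0]] + [coeff * truth[t - 1] for t in range(1, len(truth))]


def free_running(truth, coeff):
    """Predict each sample from the model's own previous output (inference mode)."""
    out = [truth[0]]  # both modes start from the same first sample
    for _ in range(1, len(truth)):
        out.append(coeff * out[-1])
    return out


def mean_abs_error(pred, truth):
    """Average absolute difference between prediction and ground truth."""
    return sum(abs(p - t) for p, t in zip(pred, truth)) / len(truth)


# Hypothetical toy signal x_t = 0.99 * x_{t-1}; the "model" has learned 0.98 instead.
true_coeff, model_coeff = 0.99, 0.98
truth = [1.0]
for _ in range(199):
    truth.append(true_coeff * truth[-1])

tf_err = mean_abs_error(teacher_forced(truth, model_coeff), truth)
fr_err = mean_abs_error(free_running(truth, model_coeff), truth)
print(f"teacher-forced error: {tf_err:.4f}")
print(f"free-running error:   {fr_err:.4f}")
```

In this toy setup the teacher-forced error is the smaller of the two, which matches the expectation stated above. If the real training-mode output still sounds worse, the cause is probably something this sketch does not model, for example how each output sample is chosen from the predicted distribution.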