-
Hi, I am coming to k2/icefall with some experience training kaldi (chain) models, and am wondering how the two compare in terms of finetuning. I am in a situation where I have a "generic" dataset of a few thousand hours and a growing domain-specific dataset of a few dozen hours. It would be great if I could train a model once on the generic dataset (costs a lost of GPU hours) and then periodically finetune it on the latest batch of in-domain data. Would such an approach be more feasible in k2/icefall? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
For our latest (RNN-T) models we are being careful not to use batchnorm, so that is one fewer problem that you would have. |
Beta Was this translation helpful? Give feedback.
For our latest (RNN-T) models we are being careful not to use batchnorm, so that is one fewer problem that you would have.
Also, it should be possible with Lhotse to mix your data in with other data.