-
Notifications
You must be signed in to change notification settings - Fork 245
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
training problem with rnn_transducer #279
Comments
Hi @yiqiaoc11 |
@usimarit Thanks for comments. All my test were conducted on GPU as bs = 2. 4/6-layer encoder worked with different warmup steps in case of rnnt-loss, but not 8. Just trying to recover the performance of the pretrained. Conformer reportedly works which differs only with rnn_transducer. Feel to advise and I can try it on GPU here. |
@yiqiaoc11 Could you help me train 2 models for 30 epochs using rnnt-loss:
Then plot the loss of 2 models for better comparison? |
@usimarit, using the streaming config.yml (https://drive.google.com/file/d/1xYFYi3z94ZqaQZ-cTyiNekBwhITh1Ru4l) with warmup_steps=40000 , right? From the timeline, you seemed to apply warp-transducer loss to get the pretrained .h5 weights. |
@yiqiaoc11 Yes, with the pretrained config I trained the rnn transducer on TPUs so warp-transducer loss cannot be applied, only rnnt-loss can be used here. But you can experiment with warp-transducer loss too, plotting the loss of 2 models for better comparison. |
@usimarit, Now I'm having 2 x 3090, 2 x 30 epochs will take fairly long time with rnnt-loss. Now 8-layer doesn't converge and 4-layer converge with > 40000 warmup-steps. Conformer using the same rnnt-loss works. Could rnn_transducer differ while you pretrained it giving same loss, same optimizer, same number of weights? |
@yiqiaoc11 The rnn_transducer structure stays the same in version v1.0.x |
The transducer in TensorFlowASR\examples\rnn_transducer doesn't work for the current version with either current or pretrained config.yml. This is a fundamental function. Can the author or someone give it a try to validate it?
The text was updated successfully, but these errors were encountered: