Request support of hessian-free optimization in rnn training. #93

Open
linjing-lab opened this issue Jul 27, 2024 · 0 comments
Labels
bug Something isn't working

Comments

@linjing-lab

I have been reading the 63-page paper "Training Deep and Recurrent Networks with Hessian-Free Optimization". Hessian-free optimization keeps the main benefit of Hessian-based (second-order) training while avoiding most of the memory needed to store the Hessian, and it can make progress in far fewer steps than SGD-type methods.

According to its description, uis-rnn is an implementation of an RNN-based algorithm, so I suggest adding a Hessian-free training mode for tasks of a suitable scale, built on top of what uis-rnn already provides.

As one example, take the preconditioned conjugate gradient (PCG) algorithm, described on page 9 of "Training Deep and Recurrent Networks with Hessian-Free Optimization". The paper's treatment of it covers the generalized Gauss-Newton matrix, damping, and preconditioning. Gauss-Newton and RNN training are comparable in that both work with sequences of residuals and their evaluation.
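To make the request more concrete, here is a minimal pure-Python sketch of PCG with damping. It is not code from uis-rnn or from the paper: the names `pcg`, `matvec`, `precond_diag` and `damping` are my own, and the small explicit matrix only stands in for a Gauss-Newton-vector product that a real implementation would compute without ever forming the matrix.

```python
def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(ui * vi for ui, vi in zip(u, v))


def pcg(matvec, b, precond_diag, damping=0.0, tol=1e-10, max_iter=100):
    """Approximately solve (A + damping * I) x = b.

    A is only accessed through matvec(v) -> A v, so it is never stored.
    precond_diag is an assumed diagonal (Jacobi-style) preconditioner.
    """
    x = [0.0] * len(b)
    r = list(b)                                   # residual b - A x for x = 0
    z = [ri / di for ri, di in zip(r, precond_diag)]
    p = list(z)                                   # first search direction
    rz = dot(r, z)
    for _ in range(max_iter):
        if dot(r, r) ** 0.5 < tol:                # converged
            break
        # Damped product: (A + damping * I) p
        Ap = [ai + damping * pi for ai, pi in zip(matvec(p), p)]
        alpha = rz / dot(p, Ap)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, Ap)]
        z = [ri / di for ri, di in zip(r, precond_diag)]
        rz_new = dot(r, z)
        p = [zi + (rz_new / rz) * pi for zi, pi in zip(z, p)]
        rz = rz_new
    return x


# Toy symmetric positive-definite matrix standing in for a curvature matrix.
A = [[4.0, 1.0], [1.0, 3.0]]
b = [1.0, 2.0]


def matvec(v):
    return [dot(row, v) for row in A]


x = pcg(matvec, b, precond_diag=[4.0, 3.0])
x_damped = pcg(matvec, b, precond_diag=[5.0, 4.0], damping=1.0)
```

In a Hessian-free trainer, `b` would be the negative gradient and `x` the parameter update; the damping term keeps the update small when the local quadratic model cannot be trusted.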

This is a feature request for Hessian-free support, motivated by the existing uis-rnn testing setup. It is not tied to any particular downloaded version; it is only a suggestion to add Hessian-free optimization for training on large-scale residual datasets.

linjing-lab added the bug label Jul 27, 2024