Provide an Adam solver #2827

Closed
slavnyk opened this issue Jul 27, 2015 · 4 comments

Comments


slavnyk commented Jul 27, 2015

Are there any plans to add an Adam solver to Caffe in the near future?
Adam: A Method for Stochastic Optimization
Python implementation (Theano)
Lua implementation (Torch)
I have been using the Lua implementation to train neural networks for low-level image processing (such as noise reduction and super-resolution) with great success, but I could not reproduce results of similar quality in Caffe with SGD, AdaGrad, or Nesterov. There are reports of better results (faster, more accurate) when training various types of deep networks with Adam.

pathak22 added the JL label Jul 30, 2015

kashif commented Jul 30, 2015

I would like to try this, if no one else has started. I took the time to update the Adam implementations for Lasagne and Torch to the latest version of the paper, so next I will try to implement it in Caffe.

longjon changed the title from "Adam solver?" to "Provide an ADAM solver" Aug 2, 2015
longjon added the enhancement label and removed the JL label Aug 2, 2015
longjon changed the title from "Provide an ADAM solver" to "Provide an Adam solver" Aug 2, 2015
Contributor

PatWie commented Aug 3, 2015

I have just started writing the solver:
https://github.com/PatWie/caffe/blob/adam/src/caffe/solver.cpp

I'm not sure whether we really want to use

this->iter_

for computing the effective stepsize.
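
For context, the "effective stepsize" in the Adam paper is the base learning rate scaled by a bias-correction factor that depends on the timestep t, which is why the choice of iteration counter matters. The following is a minimal standalone sketch of that update, not the code in the linked branch. The names `adam_effective_stepsize`, `AdamState`, and `adam_update` are hypothetical, and the default hyperparameters are the ones suggested in the paper.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper, not part of Caffe: bias-corrected step size
// alpha_t = alpha * sqrt(1 - beta2^t) / (1 - beta1^t), for t >= 1.
double adam_effective_stepsize(double alpha, double beta1, double beta2, int t) {
  return alpha * std::sqrt(1.0 - std::pow(beta2, t)) / (1.0 - std::pow(beta1, t));
}

// Hypothetical per-parameter state: first and second moment estimates.
struct AdamState {
  std::vector<double> m;
  std::vector<double> v;
  explicit AdamState(std::size_t n) : m(n, 0.0), v(n, 0.0) {}
};

// One Adam step on `weights` given `grad`; t is the 1-based timestep.
void adam_update(std::vector<double>& weights, const std::vector<double>& grad,
                 AdamState& state, int t, double alpha = 0.001,
                 double beta1 = 0.9, double beta2 = 0.999, double eps = 1e-8) {
  const double alpha_t = adam_effective_stepsize(alpha, beta1, beta2, t);
  for (std::size_t i = 0; i < weights.size(); ++i) {
    // Exponential moving averages of the gradient and its square.
    state.m[i] = beta1 * state.m[i] + (1.0 - beta1) * grad[i];
    state.v[i] = beta2 * state.v[i] + (1.0 - beta2) * grad[i] * grad[i];
    weights[i] -= alpha_t * state.m[i] / (std::sqrt(state.v[i]) + eps);
  }
}
```

The correction factor is far from 1 only for small t and approaches 1 as t grows, so the effective stepsize converges to the base learning rate.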


kashif commented Aug 3, 2015

OK, I was thinking perhaps:

int t = this->iter_ / this->param_.stepsize() + 1;
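
To make the effect of this proposal concrete: assuming both operands are integers (as the `int t` suggests) and the step size is positive, the division truncates, so t stays constant for `stepsize` consecutive iterations before it increments. The sketch below uses hypothetical free functions in place of the solver members, purely for illustration.

```cpp
// Hypothetical stand-ins for the solver members used in the thread:
// `iter` plays the role of this->iter_, `stepsize` of this->param_.stepsize().

// Timestep as proposed above: advances once every `stepsize` iterations.
// Assumes stepsize > 0.
int timestep_per_lr_step(int iter, int stepsize) {
  return iter / stepsize + 1;  // integer division truncates
}

// Alternative for comparison: advances on every iteration.
int timestep_per_iteration(int iter) {
  return iter + 1;
}
```

With a step size of 1000, iterations 0 through 999 all map to t = 1 under the first definition, while the second reaches t = 1000 by iteration 999.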

PatWie mentioned this issue Aug 4, 2015
Member

shelhamer commented

Thanks for the suggested solver. Closing as addressed by #2856 -- see the PR for status.
