RMSprop clean up and rebase #2867

ronghanghu · 2015-08-06T00:41:24Z

Rebased and adapted RMSprop implementation #1890 to the new solver interface #2518 and #1977. The original author is @erogol. Pulled against master instead of dev.

The RMSprop solver is based on G. Hinton's lecture (http://www.cs.toronto.edu/~tijmen/csc321/slides/lecture_slides_lec6.pdf). Param gradients are divided by average root mean square of gradients in recent batches. It can be seen as a mini-batch version of using only the sign of gradients.

Update rule:

MeanSquare(t) = rms_decay * MeanSquare(t-1) + (1 - rms_decay) * gradient(t)^2
param_update(t) = gradient(t) / (sqrt(MeanSquare(t)) + delta)

Momentum is not supported for RMSprop solver, as in #1890.

erogol · 2015-08-07T11:01:47Z

thanks for handling this :)

jeffdonahue · 2015-08-07T18:50:36Z

src/caffe/test/test_gradient_based_solver.cpp

@@ -521,7 +531,7 @@ TYPED_TEST(NesterovSolverTest, TestNesterovLeastSquaresUpdateWithMomentum) {
  const Dtype kMomentum = 0.5;
  const int kNumIters = 1;
  for (int i = 0; i <= kNumIters; ++i) {
-    this->TestLeastSquaresUpdate(kLearningRate, kWeightDecay, kMomentum, i);
+    this->TestLeastSquaresUpdate(kLearningRate, kWeightDecay, kMomentum, 0., i);


These should be declared as constants (e.g. const Dtype kRMSDecay = 0) like the other args to make the meaning clear.

jeffdonahue · 2015-08-07T18:51:41Z

Thanks @erogol for the original work and thanks @ronghanghu for the rebase. This looks good except as noted above.

ronghanghu · 2015-08-07T19:22:57Z

@jeffdonahue OK, I'll handle them. Thanks for the comments!

ronghanghu · 2015-08-07T19:52:13Z

Fixed those issues. ~~I expect this PR to be merged after #2856 and #2782.~~

jeffdonahue · 2015-08-07T20:32:55Z

Cool, LGTM. @ronghanghu feel free to merge whenever it's easiest for you, before or after the other two PRs.

Implement RMSProp solver and cleaned up to adjust to new solver interface that uses accumulated gradients and refactored regularization.

ronghanghu · 2015-08-09T07:29:49Z

Took a further rebase on #2866. Authorship preserved for @erogol in commit

Ready to merge.

RMSProp clean up and rebase

shelhamer · 2015-08-09T07:48:12Z

src/caffe/test/test_gradient_based_solver.cpp

+
+ protected:
+  virtual void InitSolver(const SolverParameter& param) {
+    this->solver_.reset(new RMSPropSolver<Dtype>(param));


Could you set the RMS decay here, instead of introducing the decay argument to least squares and snapshotting tests? Since it is unique to this solver I think it is best handled here.

shelhamer · 2015-08-09T07:51:46Z

@ronghanghu Sorry I didn't catch this earlier, but I have a suggestion for the RMS decay parameter in the tests. Instead of introducing another argument and setting it for every test, this param could be set by the RMSProp test class for encapsulation. Could you send a follow-up PR to make this change?

ronghanghu · 2015-08-09T08:18:44Z

@shelhamer Yes, I can send another PR to do that. Adam solver is also going to introduce a momentum2 parameter, which can be handle in the same way (put into InitSolver()).

Addressed in #2888.

shelhamer added focus JD labels Aug 6, 2015

shelhamer mentioned this pull request Aug 6, 2015

RMSprop implementation based on G. Hinton Lecture 6 #1890

Closed

ronghanghu force-pushed the rms-prop branch 5 times, most recently from 0bd5360 to 3922737 Compare August 6, 2015 01:34

beniz mentioned this pull request Aug 6, 2015

Add support for RMSProp jolibrain/deepdetect#21

Closed

shelhamer mentioned this pull request Aug 6, 2015

Adaptive Solvers: AdaDelta, RMSprop, and ADAM #2860

Closed

3 tasks

shelhamer added the ready for review label Aug 6, 2015

jeffdonahue reviewed Aug 7, 2015
View reviewed changes

ronghanghu force-pushed the rms-prop branch 2 times, most recently from 3e8ab30 to fbd0533 Compare August 7, 2015 19:46

ronghanghu force-pushed the rms-prop branch from fbd0533 to 430db2b Compare August 7, 2015 20:17

ronghanghu force-pushed the rms-prop branch 2 times, most recently from 7f61b86 to abe99e8 Compare August 9, 2015 06:44

Implement RMSProp Solver

abe99e8

Implement RMSProp solver and cleaned up to adjust to new solver interface that uses accumulated gradients and refactored regularization.

ronghanghu added a commit that referenced this pull request Aug 9, 2015

Merge pull request #2867 from ronghanghu/rms-prop

698fc76

RMSProp clean up and rebase

ronghanghu merged commit 698fc76 into BVLC:master Aug 9, 2015

ronghanghu deleted the rms-prop branch August 9, 2015 07:37

shelhamer reviewed Aug 9, 2015
View reviewed changes

ronghanghu restored the rms-prop branch August 9, 2015 07:55

ronghanghu mentioned this pull request Aug 9, 2015

Encapsulate kRMSDecay in solver tests #2888

Merged

ronghanghu deleted the rms-prop branch August 9, 2015 08:38

ronghanghu mentioned this pull request Aug 12, 2015

Multi-GPU Data Parallelism (with Parallel Data Layers) #2903

Merged

9 tasks

PatWie mentioned this pull request Aug 14, 2015

information about new implemented solvers #2920

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RMSprop clean up and rebase #2867

RMSprop clean up and rebase #2867

ronghanghu commented Aug 6, 2015

erogol commented Aug 7, 2015

jeffdonahue Aug 7, 2015

jeffdonahue commented Aug 7, 2015

ronghanghu commented Aug 7, 2015

ronghanghu commented Aug 7, 2015

jeffdonahue commented Aug 7, 2015

ronghanghu commented Aug 9, 2015

shelhamer Aug 9, 2015

shelhamer commented Aug 9, 2015

ronghanghu commented Aug 9, 2015

RMSprop clean up and rebase #2867

RMSprop clean up and rebase #2867

Conversation

ronghanghu commented Aug 6, 2015

erogol commented Aug 7, 2015

jeffdonahue Aug 7, 2015

Choose a reason for hiding this comment

jeffdonahue commented Aug 7, 2015

ronghanghu commented Aug 7, 2015

ronghanghu commented Aug 7, 2015

jeffdonahue commented Aug 7, 2015

ronghanghu commented Aug 9, 2015

shelhamer Aug 9, 2015

Choose a reason for hiding this comment

shelhamer commented Aug 9, 2015

ronghanghu commented Aug 9, 2015