Deduplicate solver regularization, logging, and local rates and decays #2518

shelhamer · 2015-05-27T03:11:13Z

This simplifies the solver code by de-duplicating shared logic.

ca81667 is taken from Refactor solvers regularization and logging code #2397 by @cypof but renames Solver::Iteration() to Solver::MakeUpdate()
to verb.
a85f7f1 is a minor follow-up to simplify Decouple the computational batch size and minibatch size by accumulating gradients #1977

I plan to merge this shortly to make way for an updated #1977.

jeffdonahue · 2015-05-27T04:15:08Z

I just looked over this -- looks great! This refactoring was much needed. Thanks @cypof and @shelhamer. Only issue I see is I'm not sure about the verb in MakeUpdate -- I think DoUpdate or PerformUpdate would be more suggestive of what it does. Personally when I first saw the name MakeUpdate in the PR text, I guessed its purpose was that of ComputeUpdateValue.

jeffdonahue · 2015-05-27T04:19:22Z

Actually, I guess it is replacing ComputeUpdateValue, but also doing net_->Update() at the end. Would it not be better to keep calling it ComputeUpdateValue and just keep net_->Update at the end of the body of Step?

cypof · 2015-05-27T04:44:51Z

No the net_->Update() needs to be in, so that it is not executed by solvers that are not of type SGDSolver. This way, only the root solver in a parallel setup will apply the update. That was actually the race I introduced when I split the big PR into small ones. Another name could be ApplyGradients?

jeffdonahue · 2015-05-27T05:25:54Z

Okay, I see -- thanks for the explanation @cypof. I like Apply but also like Update; how about ApplyUpdate?

Designate `Solver::ApplyUpdate()` as the core method to compute and apply parameter updates given the current state of the Net. Make `Solver::ComputeUpdateValue()` a subordinate call overloaded by the `SGDSolver`s to take care of optimization algorithm details.

shelhamer · 2015-05-27T19:29:04Z

Thanks for the comments @cypof and @jeffdonahue. I went with ApplyUpdate. Please merge if this looks right to you, Jeff.

shelhamer · 2015-05-27T19:38:06Z

p.s. ignore the travis push check -- the travis Pr check is the one to heed. The push check was triggered by my accidental push to BVLC/caffe and then my deleting the branch made it fail.

jeffdonahue · 2015-05-27T20:42:44Z

Cool, LGTM

Deduplicate solver regularization, logging, and local rates and decays

cypof and others added 2 commits May 26, 2015 20:07

Refactor solvers regularization and logging code

ca81667

deduplicate decay and local rate in solver updates

a85f7f1

shelhamer added JL ready for review labels May 27, 2015

shelhamer mentioned this pull request May 27, 2015

Decouple the computational batch size and minibatch size by accumulating gradients #1977

Merged

3 tasks

jeffdonahue added a commit that referenced this pull request May 27, 2015

Merge pull request #2518 from shelhamer/dedup_solvers

b12c171

Deduplicate solver regularization, logging, and local rates and decays

jeffdonahue merged commit b12c171 into BVLC:master May 27, 2015

shelhamer deleted the dedup_solvers branch May 27, 2015 21:02

shelhamer mentioned this pull request May 27, 2015

Refactor solvers regularization and logging code #2397

Closed

ronghanghu mentioned this pull request Aug 6, 2015

RMSprop clean up and rebase #2867

Merged

This was referenced Aug 6, 2015

Adaptive Solvers: AdaDelta, RMSprop, and ADAM #2860

Closed

AdaDelta Solver (v3) #2782

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deduplicate solver regularization, logging, and local rates and decays #2518

Deduplicate solver regularization, logging, and local rates and decays #2518

shelhamer commented May 27, 2015

jeffdonahue commented May 27, 2015

jeffdonahue commented May 27, 2015

cypof commented May 27, 2015

jeffdonahue commented May 27, 2015

shelhamer commented May 27, 2015

shelhamer commented May 27, 2015

jeffdonahue commented May 27, 2015

Deduplicate solver regularization, logging, and local rates and decays #2518

Deduplicate solver regularization, logging, and local rates and decays #2518

Conversation

shelhamer commented May 27, 2015

jeffdonahue commented May 27, 2015

jeffdonahue commented May 27, 2015

cypof commented May 27, 2015

jeffdonahue commented May 27, 2015

shelhamer commented May 27, 2015

shelhamer commented May 27, 2015

jeffdonahue commented May 27, 2015