Removing updates of Beta1 and Beta2 power accumulators outside the op #4925

Merged 1 commit into PaddlePaddle:develop on Oct 19, 2017

Conversation

abhinavarora (Contributor) commented:

Fixes #4909

@abhinavarora self-assigned this on Oct 19, 2017

A review discussion followed on this hunk of the diff, where the bias-corrected learning rate now reads the unmodified power accumulators:

  learning_rate_t = learning_rate_t *
-                   sqrt(1 - beta2_pow_out) / (1 - beta1_pow_out)
+                   sqrt(1 - beta2_pow) / (1 - beta1_pow)
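For reference, this is Adam's standard bias-corrected learning rate (Kingma & Ba, 2015), with beta1_pow and beta2_pow holding the running powers $\beta_1^t$ and $\beta_2^t$ at step $t$:

$$\mathrm{lr}_t = \mathrm{lr} \cdot \frac{\sqrt{1 - \beta_2^t}}{1 - \beta_1^t}$$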

tonyyang-svail commented:

Looks like we no longer need to update beta_pow to beta_pow * beta inside the operator kernel. I am wondering when and where beta_pow will be updated?

abhinavarora (Contributor, Author) replied:

We do not need those updates here because the beta1 and beta2 power accumulators are the same for all parameters, while there is one instance of adam_op for every parameter. The power updates will instead be applied once per step, to accumulators that the Python API controls.
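As a rough illustration of that division of labor, here is a minimal NumPy sketch, not the actual PaddlePaddle API; the names adam_update, m1s, and m2s are made up for this example. Each per-parameter update only reads the shared power accumulators, and a single per-step update advances them outside the op:

```python
import numpy as np

def adam_update(param, grad, m1, m2, lr, beta1_pow, beta2_pow,
                beta1=0.9, beta2=0.999, eps=1e-8):
    # Hypothetical stand-in for one adam_op instance: it reads the shared
    # beta power accumulators but never writes beta*_pow_out back.
    m1 = beta1 * m1 + (1 - beta1) * grad
    m2 = beta2 * m2 + (1 - beta2) * grad * grad
    lr_t = lr * np.sqrt(1 - beta2_pow) / (1 - beta1_pow)
    param = param - lr_t * m1 / (np.sqrt(m2) + eps)
    return param, m1, m2

beta1, beta2, lr = 0.9, 0.999, 0.001
beta1_pow, beta2_pow = 1.0, 1.0
params = [np.ones(3), np.ones(2)]        # one adam_op instance per parameter
m1s = [np.zeros_like(p) for p in params]
m2s = [np.zeros_like(p) for p in params]

for step in range(10):
    # The single pair of accumulators is advanced once per step on the
    # Python API side, not inside any op kernel.
    beta1_pow *= beta1
    beta2_pow *= beta2
    grads = [0.1 * p for p in params]    # dummy gradients for the sketch
    for i in range(len(params)):
        params[i], m1s[i], m2s[i] = adam_update(
            params[i], grads[i], m1s[i], m2s[i], lr, beta1_pow, beta2_pow)
```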

tonyyang-svail replied:

Great.

@tonyyang-svail left a comment:

LGTM.

@abhinavarora merged commit 11bebeb into PaddlePaddle:develop on Oct 19, 2017