-
Notifications
You must be signed in to change notification settings - Fork 18.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adaptive Solvers: AdaDelta, RMSprop, and ADAM #2860
Comments
OK, I'll go through these PRs. Also, I agree that |
AdaDelta should already conform to the new solver interface. Unless I missed something, I ported it over in #2782 when merging in master a couple of weeks ago. |
Since right now these 3 PRs are blocking each other by enum value in SolverType and SolverParameter next available ID, After that, a solver refactor PR can follow to address the |
@ronghanghu The merging sequence It is becoming to be a "political" issue because there isn't a clear policy for reviews and merges. Fast track merges on younger PRs push adaptation work over the older one (apart of the quality, the in shape status, the historical rebasing effort a maintainer has done on this older PRs) |
@bhack Indeed, there isn't a clear policy for PR reviewing process right now. Our merge sequence is mainly based on the status of PRs, that is, whether a PR is ready to merge or needs further modification. Since these 2 out of 3 here (Adam and AdaDelta) are still not ready, I expect to wait to merge them after #2870. |
The problem is that there isn't almost a rough policy for the community. Readiness it is influenced also by reviews timing other that proposer availability. #2836 that brake this was a fast track review. There are many staging PR in the queue that need to support rebasing efforts of fast track Pr merges. |
Now all of them done. Address solver refactor in #2890. |
There are open PRs for adaptive solvers that need review, revision, and merge:
As a last step AdaGrad, AdaDelta, and RMSprop could likely be improved by refactoring out shared code into an
AdaptiveSGDSolver
@ronghanghu these could be good warm-up projects for development.
@longjon @jeffdonahue thoughts on posting tracking issues of this kind? Does this help navigate or is it noise?
The text was updated successfully, but these errors were encountered: