[rllib] Fix torch c51 dqn #16716

gbartyzel · 2021-06-28T20:12:39Z

Why are these changes needed?

Currently, it is not possible to train a c51 torch agent. The output shape of the value head in DqnTorchModel should be equal to the num_atoms (now it is equal to 1). Also in the current implementation, the noisy option doesn't affect the value head. There are also minor fixes in QLoss, like target probs should be detached before loss calculation.

Related issue number

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

…torch_c51_dqn

richardliaw · 2021-07-10T22:06:32Z

Hmm @Souphis it looks like TestExplorations.test_td3 is failing because of this?

Is it possible to investigate that failure?

gbartyzel · 2021-07-11T09:40:37Z

I don't think that this PR is related to this failure, but I will investigate this.

sven1977 · 2021-07-13T18:12:05Z

@Souphis , thanks for this PR! Taking a look at the failing test. I think it's not related to this PR, though, so should be good, but I'll confirm. ...

gbartyzel · 2021-07-13T18:17:10Z

@sven1977 I ran tests once again locally, both td3 and dqn passed them. So, I think that this error is not related to this PR.

…torch_c51_dqn

sven1977 · 2021-07-13T18:25:51Z

Merged with master, which has a fix for this test case. Waiting for tests to pass again. ...

sven1977 · 2021-07-13T18:26:26Z

@Souphis , agree. Will merge as soon as everything passes. Should be today :)

sven1977

Thanks for the fixes @Souphis !

gbartyzel added 2 commits June 28, 2021 21:26

[rllib] Fixed value in DqnTorchModel and detach target in c51 loss

fc83339

[rllib] Apply formatting

7d531c0

architkulkarni assigned sven1977 Jun 29, 2021

gbartyzel added 2 commits July 10, 2021 18:40

Merge branch 'master' of https://github.com/ray-project/ray into fix_…

4cd54f8

…torch_c51_dqn

Renamed the value head name

bd8cd95

richardliaw assigned michaelzhiluo Jul 10, 2021

Merge branch 'master' of https://github.com/ray-project/ray into fix_…

c90d028

…torch_c51_dqn

sven1977 approved these changes Jul 13, 2021

View reviewed changes

sven1977 merged commit d553d4d into ray-project:master Jul 13, 2021

gbartyzel deleted the fix_torch_c51_dqn branch July 13, 2021 20:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[rllib] Fix torch c51 dqn #16716

[rllib] Fix torch c51 dqn #16716

gbartyzel commented Jun 28, 2021 •

edited

Loading

richardliaw commented Jul 10, 2021

gbartyzel commented Jul 11, 2021

sven1977 commented Jul 13, 2021

gbartyzel commented Jul 13, 2021 •

edited

Loading

sven1977 commented Jul 13, 2021

sven1977 commented Jul 13, 2021

sven1977 left a comment

[rllib] Fix torch c51 dqn #16716

[rllib] Fix torch c51 dqn #16716

Conversation

gbartyzel commented Jun 28, 2021 • edited Loading

Why are these changes needed?

Related issue number

Checks

richardliaw commented Jul 10, 2021

gbartyzel commented Jul 11, 2021

sven1977 commented Jul 13, 2021

gbartyzel commented Jul 13, 2021 • edited Loading

sven1977 commented Jul 13, 2021

sven1977 commented Jul 13, 2021

sven1977 left a comment

Choose a reason for hiding this comment

gbartyzel commented Jun 28, 2021 •

edited

Loading

gbartyzel commented Jul 13, 2021 •

edited

Loading