Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About the readability #3

Open
familyld opened this issue Nov 12, 2020 · 2 comments
Open

About the readability #3

familyld opened this issue Nov 12, 2020 · 2 comments

Comments

@familyld
Copy link

Hi Aviral,

In the paper, you claim CQL can be implemented with less than 20 lines of code, but it's really difficult to identify these "20 lines of code" from the current version of your project which is built upon other projects. Would you please point out which part of code exactly corresponds to the core of CQL? I really like your idea of CQL, both the theoretical part and its simplicity, but currently, it seems very hard to follow.

Best,
Zhi-Hong

@familyld
Copy link
Author

familyld commented Nov 12, 2020

For the Atari experiments, it seems like line 221 ~ 243 in quantile_agent.py are of interest.

For the D4RL experiments, it seems like line 233 ~ 300 in cql.py are of interest, but more lines of code are modified or added, comparing to sac.py.

Since the two experiments are based on different methods (QR-DQN and SAC) and different libraries (dopamine and rlkit), the detailed implementations are quite different. It would be great if the author can add more comments and explicitly correspond the code to the equations in the paper, e..g., Equation (4).

@aviralkumar2907
Copy link
Owner

Hello @familyld,

Yes, I hope to add some more comments in about a month timeline. Sorry for the delay in doing that.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants