Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE] DreamerV3 implementation? #113

Open
MRiabov opened this issue Sep 4, 2024 · 3 comments
Open

[FEATURE] DreamerV3 implementation? #113

MRiabov opened this issue Sep 4, 2024 · 3 comments
Labels
enhancement New feature or request Roadmap On the roadmap and will be addressed in time

Comments

@MRiabov
Copy link

MRiabov commented Sep 4, 2024

Please describe the purpose of the feature. Is it related to a problem?

Create a DreamerV3 implementation - there is no pure Jax implementation to date.

Describe the solution you'd like

Pure Jax Anakin/Sebulba implementation of DreamerV3, including pairing with native Jax environments.

Describe alternatives you've considered

Current implementation of DreamerV3 has half numpy half jax.numpy in code. Which is suboptimal.

How do we know when implementation of this feature is complete?

Checklist:

  • The Dreamer implementation is present fully in Jax.

Additional context

DreamerV3 is currently the smartest algorithm out there, and was able to collect diamonds in Minecraft with fixed hyperparameters and no human data involved. See https://arxiv.org/pdf/2301.04104v2, this is the current implementation: https://github.com/danijar/dreamerv3/tree/main

@MRiabov MRiabov added the enhancement New feature or request label Sep 4, 2024
@EdanToledo
Copy link
Owner

Hey. This is in my roadmap at some point but it's quite an involved algorithm. I have written it before but some thought will need to go into how to do it cleanly. I unfortunately have no estimate as to when I can get around to doing this.

@MRiabov
Copy link
Author

MRiabov commented Sep 4, 2024 via email

@EdanToledo
Copy link
Owner

my code is quite messy right now - it was for a paper I submitted a while ago and it was for a multi-agent use case using dreamer and graph neural networks. It's not that the code is difficult, it's just that there are a lot of details to incorporate if you want to accurately represent the paper. I can try push this up on my todo list but I can't say it's a priority right now. I'll leave the issue open though to remind me.

@EdanToledo EdanToledo added the Roadmap On the roadmap and will be addressed in time label Sep 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request Roadmap On the roadmap and will be addressed in time
Projects
None yet
Development

No branches or pull requests

2 participants