You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hey. This is in my roadmap at some point but it's quite an involved algorithm. I have written it before but some thought will need to go into how to do it cleanly. I unfortunately have no estimate as to when I can get around to doing this.
I'm using Dreamer myself, and looking at the code - a lot can be simply
replaced by Jax alternatives like scan and vmap.
Would you mind sharing the jax implementation, by the way?
my code is quite messy right now - it was for a paper I submitted a while ago and it was for a multi-agent use case using dreamer and graph neural networks. It's not that the code is difficult, it's just that there are a lot of details to incorporate if you want to accurately represent the paper. I can try push this up on my todo list but I can't say it's a priority right now. I'll leave the issue open though to remind me.
Please describe the purpose of the feature. Is it related to a problem?
Create a DreamerV3 implementation - there is no pure Jax implementation to date.
Describe the solution you'd like
Pure Jax Anakin/Sebulba implementation of DreamerV3, including pairing with native Jax environments.
Describe alternatives you've considered
Current implementation of DreamerV3 has half numpy half jax.numpy in code. Which is suboptimal.
How do we know when implementation of this feature is complete?
Checklist:
Additional context
DreamerV3 is currently the smartest algorithm out there, and was able to collect diamonds in Minecraft with fixed hyperparameters and no human data involved. See https://arxiv.org/pdf/2301.04104v2, this is the current implementation: https://github.com/danijar/dreamerv3/tree/main
The text was updated successfully, but these errors were encountered: