Skip to content

Issues: DLR-RM/stable-baselines3

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[Bug]: How to avoid saving an external LLM model while saving a cutomized dqn policy question Further information is requested
#2025 opened Oct 22, 2024 by chrisgao99
5 tasks done
[Question] Batch Size Selection for a Finite MDP question Further information is requested
#2024 opened Oct 22, 2024 by DavidLudl
4 tasks done
[Feature Request] When are you planning to upgrade to Gymnasium v1.0.0 duplicate This issue or pull request already exists enhancement New feature or request
#2023 opened Oct 18, 2024 by drulye
2 tasks done
[Question] TD3 algorithm, During training,why limit the next_actions custom gym env Issue related to Custom Gym Env question Further information is requested RTFM Answer is the documentation
#2022 opened Oct 15, 2024 by Danny551
4 tasks done
[Feature Request] request title jax API and gymnax enhancement New feature or request more information needed Please fill the issue template completely
#2020 opened Oct 10, 2024 by quanmissq
2 tasks done
[Question] Manually Controlling Actions During PPO Training check the checklist You have checked the required items in the checklist but you didn't do what is written... custom gym env Issue related to Custom Gym Env more information needed Please fill the issue template completely question Further information is requested
#2014 opened Sep 25, 2024 by wayne-weiwei
4 tasks done
[bug] Adaptive SAC: using logarithm of entropy coefficient to compute temperature objective instead of entropy coefficient documentation Improvements or additions to documentation duplicate This issue or pull request already exists question Further information is requested
#2013 opened Sep 24, 2024 by Mattia-sony
[Bug]: Episode start flag is never set for off policy algorithms question Further information is requested
#2011 opened Sep 20, 2024 by josndan
5 tasks done
[Bug]: Possible inconsistencies with the PPO implementation question Further information is requested
#1986 opened Aug 2, 2024 by hexonfox
5 tasks done
[Feature Request] Temporal Convolutional network enhancement New feature or request
#1984 opened Jul 30, 2024 by tty666
2 tasks done
[Bug]: Higher memory usage on sequential training runs bug Something isn't working
#1966 opened Jul 9, 2024 by NickLucche
5 tasks done
[Question] How do I correctly manually reset the episode on a rollout_end? question Further information is requested
#1961 opened Jul 4, 2024 by npit
4 tasks done
[Bug]: RunningMeanStd overflowing bug Something isn't working check the checklist You have checked the required items in the checklist but you didn't do what is written...
#1953 opened Jun 29, 2024 by spiglerg
4 of 5 tasks
[Bug]: evaluate_policy called multiple times vor vectorized environments documentation Improvements or additions to documentation help wanted Help from contributors is welcomed
#1912 opened Apr 26, 2024 by LukasFehring
5 tasks done
Exporting MultiInputActorCriticPolicy as ONNX more information needed Please fill the issue template completely question Further information is requested
#1873 opened Mar 18, 2024 by MaximCamilleri
4 tasks done
torch.load without weights_only parameter is unsafe enhancement New feature or request
#1852 opened Feb 27, 2024 by kit1980
[Bug]: Procgen check the checklist You have checked the required items in the checklist but you didn't do what is written... documentation Improvements or additions to documentation duplicate This issue or pull request already exists good first issue Good for newcomers help wanted Help from contributors is welcomed
#1823 opened Feb 1, 2024 by BurgerAndreas
5 tasks done
Wrong types for policy parameters in Documentation documentation Improvements or additions to documentation
#1815 opened Jan 25, 2024 by mspils
2 tasks done
Deadlock when running model.learn on a SubprocVecEnv check the checklist You have checked the required items in the checklist but you didn't do what is written... custom gym env Issue related to Custom Gym Env
#1814 opened Jan 24, 2024 by 1-Bart-1
5 tasks done
[Bug]: Reset options ignored when resetting due to termination / truncation from within wrapper's step bug Something isn't working documentation Improvements or additions to documentation help wanted Help from contributors is welcomed
#1790 opened Dec 20, 2023 by npit
5 tasks done
IsaacGym Preview4 with Stable Baselines3? custom gym env Issue related to Custom Gym Env documentation Improvements or additions to documentation help wanted Help from contributors is welcomed
#1768 opened Nov 27, 2023 by LyuJZ
5 tasks done
[Question] User needs to reset gSDE noise when using learned model? documentation Improvements or additions to documentation help wanted Help from contributors is welcomed question Further information is requested
#1767 opened Nov 26, 2023 by gmarkkula
4 tasks done
ProTip! Mix and match filters to narrow down what you’re looking for.