New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

[RLlib] Cleanup examples folder (vol 30): BC pretraining, then PPO finetuning (new API stack with RLModule checkpoints). #47838

Merged

sven1977 merged 5 commits into ray-project:master from sven1977:cleanup_examples_folder_30_bc_train_ppo_finetune

Sep 28, 2024

+331 −198

DCO / DCO succeeded Sep 28, 2024 in 1s

DCO

All commits are signed off!

View more details on DCO