- Add `EvalCallback` for proper evaluation using a separate env (`--eval-freq`, `--eval-episodes`)
- Add `--save-freq` argument to save periodic checkpoints of the model
- Add `--load-best` argument to enjoy script to load the best model according to evaluation
- Better seeding and fix possible race condition in folder creation
- Use new callback for hyperparameter tuning
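
For context on the folder-creation race mentioned above, here is a minimal sketch of one common way to avoid it in Python. It is not necessarily the change made in this release: the helper `ensure_dir` and the log path are hypothetical and used for illustration only. The idea is to replace a check-then-create sequence with a single `os.makedirs(..., exist_ok=True)` call, so that several training runs started at the same time can all request the same folder safely.

```python
import os
import tempfile
from concurrent.futures import ThreadPoolExecutor


def ensure_dir(path: str) -> str:
    """Create `path` if needed, tolerating concurrent creation by other workers."""
    # A check-then-create pattern (os.path.exists followed by os.makedirs) can
    # raise FileExistsError when another worker creates the folder in between.
    # exist_ok=True makes the call safe to repeat.
    os.makedirs(path, exist_ok=True)
    return path


# Hypothetical log folder; the name is illustrative only.
log_dir = os.path.join(tempfile.mkdtemp(), "logs", "ppo", "CartPole-v1_1")

# Several workers asking for the same folder at once all succeed.
with ThreadPoolExecutor(max_workers=8) as pool:
    results = list(pool.map(ensure_dir, [log_dir] * 8))
```

Note that this only makes creation of a known folder idempotent. If each run must get its own numbered folder, picking the number still needs separate care.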