You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
OS Platform and Distribution (e.g., Linux Ubuntu 16.04): n/a
Ray installed from (source or binary): n/a
Ray version: n/a
Python version: n/a
Exact command to reproduce:
Describe the problem
rollout.py does not keep track of previous actions and previous observed rewards and pass them to compute_action.py. This obviously is an issue if the policy was trained with prev. observed rewards and actions.
Source code / logs
The text was updated successfully, but these errors were encountered:
System information
Describe the problem
rollout.py does not keep track of previous actions and previous observed rewards and pass them to compute_action.py. This obviously is an issue if the policy was trained with prev. observed rewards and actions.
Source code / logs
The text was updated successfully, but these errors were encountered: