Prev_action, prev_reward not passed to rollout #4573

eugenevinitsky · 2019-04-06T23:55:57Z

System information

OS Platform and Distribution (e.g., Linux Ubuntu 16.04): n/a
Ray installed from (source or binary): n/a
Ray version: n/a
Python version: n/a
Exact command to reproduce:

Describe the problem

rollout.py does not keep track of previous actions and previous observed rewards and pass them to compute_action.py. This obviously is an issue if the policy was trained with prev. observed rewards and actions.

Source code / logs

ericl · 2019-04-07T00:17:06Z

Ah yeah @vladfi1 has a fix here: #4565

eugenevinitsky · 2019-04-08T02:49:45Z

Cool, resolved!

vladfi1 mentioned this issue Apr 7, 2019

[rllib] Support prev_state/prev_action in rollout and fix multiagent #4565

Merged

1 task

eugenevinitsky closed this as completed Apr 8, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prev_action, prev_reward not passed to rollout #4573

Prev_action, prev_reward not passed to rollout #4573

eugenevinitsky commented Apr 6, 2019

ericl commented Apr 7, 2019

eugenevinitsky commented Apr 8, 2019

Prev_action, prev_reward not passed to rollout #4573

Prev_action, prev_reward not passed to rollout #4573

Comments

eugenevinitsky commented Apr 6, 2019

System information

Describe the problem

Source code / logs

ericl commented Apr 7, 2019

eugenevinitsky commented Apr 8, 2019