[Bug; RLlib]: Error in `SampleBatch.get_single_step_input_dict()` #20216

sven1977 · 2021-11-10T15:00:41Z

Search before asking

I searched the issues and found no similar issues.

Ray Component

RLlib

What happened + What you expected to happen

There is a problem in SampleBatch.get_single_step_input_dict when having a complex traj. view setup for attention nets.
If a view-column is built from an underlying data column from a range of timesteps (e.g. state_in[-1] == state_out[-10:-1]), the method returns a wrong state_in_0.

The following repro test case should pass:

from gym.spaces import Box, Discrete
import numpy as np

from ray.rllib.policy.sample_batch import SampleBatch
from ray.rllib.policy.view_requirement import ViewRequirement
from ray.rllib.utils.test_utils import check

space = Box(-1.0, 1.0, ())

# With batch-repeat-value > 1: state_in_0 is only built every n
# timesteps.
view_reqs = {
    "state_in_0": ViewRequirement(
        data_col="state_out_0",
        shift="-5:-1",
        space=space,
        batch_repeat_value=5,
    ),
    "state_out_0": ViewRequirement(
        space=space, used_for_compute_actions=False),
}

# Trajectory of 1 ts (0) (we would like to compute the 1st).
batch = SampleBatch({
    "state_in_0": np.array([
        [0, 0, 0, 0, 0],  # ts=0
    ]),
    "state_out_0": np.array([1]),
})
input_dict = batch.get_single_step_input_dict(
    view_requirements=view_reqs, index="last")
check(
    input_dict,
    {
        "state_in_0": [[0, 0, 0, 0, 1]],  # ts=1
        "seq_lens": [1],
    })

Versions / Dependencies

ray=master
py=3.8
OSS=MacOS

Reproduction script

see above

Anything else

No response

Are you willing to submit a PR?

Yes I am willing to submit a PR!

The text was updated successfully, but these errors were encountered:

sven1977 added bug Something that is supposed to be working; but isn't triage Needs triage (eg: priority, bug/not-bug, and owning component) labels Nov 10, 2021

sven1977 self-assigned this Nov 10, 2021

sven1977 added rllib RLlib related issues P2 Important issue, but not time-critical and removed triage Needs triage (eg: priority, bug/not-bug, and owning component) labels Nov 10, 2021

sven1977 mentioned this issue Nov 10, 2021

[RLlib] Issue: Get single step input dict incorrect. #20217

Merged

6 tasks

sven1977 closed this as completed in #20217 Nov 12, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug; RLlib]: Error in `SampleBatch.get_single_step_input_dict()` #20216

[Bug; RLlib]: Error in `SampleBatch.get_single_step_input_dict()` #20216

sven1977 commented Nov 10, 2021

[Bug; RLlib]: Error in SampleBatch.get_single_step_input_dict() #20216

[Bug; RLlib]: Error in SampleBatch.get_single_step_input_dict() #20216

Comments

sven1977 commented Nov 10, 2021

Search before asking

Ray Component

What happened + What you expected to happen

Versions / Dependencies

Reproduction script

Anything else

Are you willing to submit a PR?

[Bug; RLlib]: Error in `SampleBatch.get_single_step_input_dict()` #20216

[Bug; RLlib]: Error in `SampleBatch.get_single_step_input_dict()` #20216