[RLlib] Fix SampleBatch to_device() #27572

kouroshHakha · 2022-08-05T19:14:15Z

Why are these changes needed?

Related issue number

closes #26593

Checks

I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

This reverts commit 74686a8.

Signed-off-by: Kourosh Hakhamaneshi <[email protected]>

gjoliver

a couple of minor questions.
looks cleeeaaaan!

gjoliver · 2022-08-07T17:32:49Z

rllib/utils/torch_utils.py

        # Floatify all float64 tensors.
-        if tensor.dtype == torch.double:
+        if tensor.is_floating_point():


any chance this is a new api? if yes, maybe make sure it works with the version we pin for core.

Good point. I checked quickly and it's available even in torch==1.8.0. Where do we specify the pinned torch version in Ray?
https://pytorch.org/docs/1.8.0/search.html?q=IS_FLOATING_POINT&check_keywords=yes&area=default
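
To illustrate the difference discussed in this thread, here is a minimal sketch comparing the old `dtype == torch.double` check with `Tensor.is_floating_point()`. The helper name `floatify` is hypothetical and is not the code in rllib/utils/torch_utils.py; the cast to float32 is assumed from the "Floatify" comment in the diff.

```python
import torch


def floatify(tensor: torch.Tensor) -> torch.Tensor:
    """Hypothetical helper: cast floating-point tensors to float32."""
    # is_floating_point() is True for float16, float32, float64 (and bfloat16),
    # whereas `tensor.dtype == torch.double` only matches float64.
    if tensor.is_floating_point():
        return tensor.float()
    # Integer and bool tensors are returned unchanged.
    return tensor


for dtype in (torch.float16, torch.float32, torch.float64, torch.int64):
    t = torch.zeros(2, dtype=dtype)
    old_check = t.dtype == torch.double
    new_check = t.is_floating_point()
    print(dtype, "old:", old_check, "new:", new_check, "->", floatify(t).dtype)
```

With the old check, a float16 tensor would have skipped the cast; with the new check, every floating-point dtype ends up as float32.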

gjoliver · 2022-08-07T17:34:04Z

rllib/utils/torch_utils.py

        # Numpy arrays.
-        if isinstance(item, np.ndarray):


what were we doing before ... didn't we check these conditions before we get in here :)

The else was added because I moved some logic around to clean up the code, e.g. the torch.is_tensor() check now comes after the handling of the RepeatedValues type.
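
The ordering described in this reply can be sketched roughly as follows. This is an illustration under stated assumptions, not the merged code: `RepeatedValuesStandIn` is a hypothetical stand-in for RLlib's `RepeatedValues`, `to_torch` is a hypothetical name, and handling of object-dtype arrays is left out of this sketch.

```python
from dataclasses import dataclass
from typing import Any

import numpy as np
import torch


@dataclass
class RepeatedValuesStandIn:
    """Hypothetical stand-in for RLlib's RepeatedValues container."""
    values: Any
    lengths: Any
    max_len: int


def to_torch(item, device="cpu"):
    """Hypothetical converter showing the branch order discussed above."""
    if item is None:
        return None
    # 1) Handle the container type first: convert its payload, keep the wrapper.
    if isinstance(item, RepeatedValuesStandIn):
        return RepeatedValuesStandIn(
            to_torch(item.values, device), item.lengths, item.max_len
        )
    # 2) Only then check whether the item is already a tensor.
    if torch.is_tensor(item):
        tensor = item
    # 3) Numpy arrays.
    elif isinstance(item, np.ndarray):
        tensor = torch.from_numpy(item)
    # 4) The final else: anything remaining (lists, scalars) goes through numpy.
    else:
        tensor = torch.from_numpy(np.asarray(item))
    # Cast floating-point tensors to float32, then move to the target device.
    if tensor.is_floating_point():
        tensor = tensor.float()
    return tensor.to(device)


rv = RepeatedValuesStandIn(np.array([[1, 2, 0, 0]]), lengths=[2], max_len=4)
converted = to_torch(rv)
print(type(converted).__name__, converted.values.shape)
```

Because every branch assigns `tensor` or returns early, the trailing dtype cast and device move run exactly once per leaf value.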

gjoliver · 2022-08-07T17:36:45Z

rllib/policy/tests/test_sample_batch.py

+                "f": RepeatedValues(np.array([[1, 2, 0, 0]]), lengths=[2], max_len=4),
+                SampleBatch.SEQ_LENS: np.array([2, 3, 1]),
+                "state_in_0": np.array([1.0, 3.0, 4.0]),
+                SampleBatch.INFOS: np.array([{"a": 1}, {"b": 2}, {"c": 3}]),


INFOS's dtype is object right? and we basically don't do anything to it if I understand correctly.

Yes. correct.
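
For context on the INFOS question, a small sketch of why such an array can simply be passed through: NumPy stores a list of dicts as an object-dtype array, which has no numeric representation to convert. The helper `convert_or_passthrough` is hypothetical and only illustrates the "don't do anything to it" behavior confirmed above.

```python
import numpy as np


def convert_or_passthrough(arr: np.ndarray, convert):
    """Hypothetical guard: apply `convert` only to non-object arrays."""
    if arr.dtype == object:
        # Object arrays (e.g. per-step info dicts) are returned untouched.
        return arr
    return convert(arr)


infos = np.array([{"a": 1}, {"b": 2}, {"c": 3}])
state_in = np.array([1.0, 3.0, 4.0])


def to_float32(a):
    return a.astype(np.float32)


print(infos.dtype)                                       # object
print(convert_or_passthrough(infos, to_float32) is infos)
print(convert_or_passthrough(state_in, to_float32).dtype)
```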

kouroshHakha added 9 commits July 19, 2022 17:20

fixed crr flakeyness on crr

74686a8

Revert "fixed crr flakeyness on crr"

5ab160c

This reverts commit 74686a8.

Merge branch 'master' of github.com:ray-project/ray

5019f2d

Merge branch 'master' of github.com:ray-project/ray

a5c07d9

Merge branch 'master' of github.com:ray-project/ray

d3bdfd1

Merge branch 'master' of https://github.com/ray-project/ray

38d3eb4

Merge branch 'master' of https://github.com/ray-project/ray

e15f48d

fixed the bug and added a unittest

a6a1ed0

added repeated value test case as well

794bb69

kouroshHakha requested review from sven1977, gjoliver, avnishn, ArturNiederfahrenhorst, smorad, maxpumperla and krfricke as code owners August 5, 2022 19:14

kouroshHakha assigned sven1977 Aug 5, 2022

kouroshHakha added 4 commits August 5, 2022 12:32

fixed gpu tests and added gpu tests to BUILD

6867105

wip

42e13fe

Signed-off-by: Kourosh Hakhamaneshi <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

f6b8ccc

Merge branch 'master' into fix-samplebatch-todevice

092abac

gjoliver approved these changes Aug 7, 2022

sven1977 merged commit 3b2a842 into ray-project:master Aug 8, 2022

scottsun94 pushed a commit to scottsun94/ray that referenced this pull request Aug 9, 2022

[RLlib] Fix SampleBatch to_device(). (ray-project#27572)

316501c

Signed-off-by: Huaiwei Sun <[email protected]>

Stefan-1313 pushed a commit to Stefan-1313/ray_mod that referenced this pull request Aug 18, 2022

[RLlib] Fix SampleBatch to_device(). (ray-project#27572)

d2cfb54

Signed-off-by: Stefan van der Kleij <[email protected]>
