[Train] Add support for handling multiple batch data types for prepare_data_loader #26386

VishDev12 · 2022-07-08T08:32:17Z

Why are these changes needed?

When working with Ray Train, using the ray.train.torch.prepare_data_loader method with a dataset that returns a dictionary instead of a tuple from its __getitem__ method causes issues.

Related issue number

Closes #26385

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

…oader in ray.train.torch

python/ray/train/torch/train_loop_utils.py

Co-authored-by: matthewdeng <[email protected]>

amogkam

Thanks @VishDev12! Do you think you can also add a test for this? There's already a test_torch_prepare_dataloader test in test_gpu.py. We can just use that same test but try with DataLoaders that also output dictionaries instead of just tuples.

python/ray/train/torch/train_loop_utils.py

VishDev12 · 2022-07-09T06:30:25Z

Hey @amogkam, I've added a dict case in the test_torch_prepare_dataloader test.

Please let me know if something doesn't look quite right.

amogkam

Thanks @VishDev12 this lgtm! Just left one minor comment

amogkam · 2022-07-11T17:54:31Z

python/ray/train/examples/torch_linear_example.py

@@ -24,6 +24,11 @@ def __len__(self):
        return len(self.x)


+class LinearDatasetDict(LinearDataset):


If this is only used in the test and not part of the example, can we move this to test_gpu directly?

That makes sense, I've moved it just under the fixtures defined on test_gpu, just to keep the separation between the test_* functions and the rest. But if you want to have it defined just above test_torch_prepare_dataloader, I could do that too.

amogkam · 2022-07-12T17:15:55Z

Awesome, thanks @VishDev12!

…r prepare_data_loader (#26386)" This reverts commit 36229d1.

…r prepare_data_loader (#26386)" (#26483) This reverts commit 36229d1.

…types for prepare_data_loader (#26386)" (#26483)" This reverts commit e6c0403.

…types for prepare_data_loader"" (#26491) Signed-off-by: Amog Kamsetty <[email protected]> * Revert "Revert "[Train] Add support for handling multiple batch data types for prepare_data_loader (#26386)" (#26483)" This reverts commit e6c0403.

…types for prepare_data_loader"" (ray-project#26491) Signed-off-by: Amog Kamsetty <[email protected]> * Revert "Revert "[Train] Add support for handling multiple batch data types for prepare_data_loader (ray-project#26386)" (ray-project#26483)" This reverts commit e6c0403. Signed-off-by: Rohan138 <[email protected]>

…e_data_loader (ray-project#26386) When working with Ray Train, using the ray.train.torch.prepare_data_loader method with a dataset that returns a dictionary instead of a tuple from its __getitem__ method causes issues. Co-authored-by: matthewdeng <[email protected]> Signed-off-by: Stefan van der Kleij <[email protected]>

…r prepare_data_loader (ray-project#26386)" (ray-project#26483) This reverts commit 36229d1. Signed-off-by: Stefan van der Kleij <[email protected]>

…types for prepare_data_loader"" (ray-project#26491) Signed-off-by: Amog Kamsetty <[email protected]> * Revert "Revert "[Train] Add support for handling multiple batch data types for prepare_data_loader (ray-project#26386)" (ray-project#26483)" This reverts commit e6c0403. Signed-off-by: Stefan van der Kleij <[email protected]>

VishDev12 changed the title ~~Add support for handling multiple batch data types for prepare_data_loader in Ray Train~~ [Train] Add support for handling multiple batch data types for prepare_data_loader Jul 8, 2022

add support for handling multiple batch data types for prepare_data_l…

b8542d9

…oader in ray.train.torch

VishDev12 force-pushed the master branch from 2f6030d to b8542d9 Compare July 8, 2022 08:44

matthewdeng reviewed Jul 8, 2022

View reviewed changes

python/ray/train/torch/train_loop_utils.py Outdated Show resolved Hide resolved

matthewdeng assigned amogkam Jul 8, 2022

Support nested move to device on tuples and lists

cb7d26a

Co-authored-by: matthewdeng <[email protected]>

amogkam reviewed Jul 8, 2022

View reviewed changes

python/ray/train/torch/train_loop_utils.py Outdated Show resolved Hide resolved

python/ray/train/torch/train_loop_utils.py Outdated Show resolved Hide resolved

VishDev12 added 2 commits July 9, 2022 11:47

add a dict case to test_torch_prepare_dataloader

4866623

change debug to info

efdd178

amogkam approved these changes Jul 11, 2022

View reviewed changes

shift LinearDatasetDict to test_gpu

6f9b17c

amogkam merged commit 36229d1 into ray-project:master Jul 12, 2022

amogkam mentioned this pull request Jul 12, 2022

[WIP] Adding dict handling to torch prepare data loader #26430

Closed

6 tasks

amogkam added a commit that referenced this pull request Jul 12, 2022

Revert "[Train] Add support for handling multiple batch data types fo…

ba3636c

…r prepare_data_loader (#26386)" This reverts commit 36229d1.

amogkam mentioned this pull request Jul 12, 2022

Revert "[Train] Add support for handling multiple batch data types for prepare_data_loader" #26483

Merged

amogkam added a commit that referenced this pull request Jul 13, 2022

Revert "[Train] Add support for handling multiple batch data types fo…

e6c0403

…r prepare_data_loader (#26386)" (#26483) This reverts commit 36229d1.

amogkam added a commit that referenced this pull request Jul 13, 2022

Revert "Revert "[Train] Add support for handling multiple batch data …

495c35b

…types for prepare_data_loader (#26386)" (#26483)" This reverts commit e6c0403.

brucez-anyscale mentioned this pull request Jul 14, 2022

Fix test dashboard flaky by catch an expected exception #26555

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Train] Add support for handling multiple batch data types for prepare_data_loader #26386

[Train] Add support for handling multiple batch data types for prepare_data_loader #26386

VishDev12 commented Jul 8, 2022

amogkam left a comment

VishDev12 commented Jul 9, 2022

amogkam left a comment

amogkam Jul 11, 2022

VishDev12 Jul 12, 2022

amogkam commented Jul 12, 2022

		@@ -24,6 +24,11 @@ def __len__(self):
		return len(self.x)


		class LinearDatasetDict(LinearDataset):

[Train] Add support for handling multiple batch data types for prepare_data_loader #26386

[Train] Add support for handling multiple batch data types for prepare_data_loader #26386

Conversation

VishDev12 commented Jul 8, 2022

Why are these changes needed?

Related issue number

Checks

amogkam left a comment

Choose a reason for hiding this comment

VishDev12 commented Jul 9, 2022

amogkam left a comment

Choose a reason for hiding this comment

amogkam Jul 11, 2022

Choose a reason for hiding this comment

VishDev12 Jul 12, 2022

Choose a reason for hiding this comment

amogkam commented Jul 12, 2022