
[Data] Optimize block prefetching #35568

Merged: 15 commits merged into ray-project:master on Jun 1, 2023
Conversation

raulchen (Contributor) commented May 19, 2023

Why are these changes needed?

`WaitBlockPrefetcher` blocks while waiting for the first block. When the prefetch size is small, this adds latency on the critical path. This PR moves the wait to a background thread.
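For illustration, here is a minimal sketch of the idea, assuming a condition-variable design; the class and attribute names are hypothetical, not the actual Ray Data implementation:

```python
import threading

import ray


class WaitPrefetcherSketch:
    """Sketch: run the blocking ray.wait() on a background thread."""

    def __init__(self):
        self._pending = []  # ObjectRefs most recently queued for prefetch.
        self._condition = threading.Condition()
        self._stopped = False
        self._thread = threading.Thread(
            target=self._wait_loop, name="prefetcher", daemon=True
        )
        self._thread.start()

    def prefetch_blocks(self, blocks):
        # Called on the iterator's critical path; returns immediately.
        with self._condition:
            self._pending = list(blocks)
            self._condition.notify()

    def _wait_loop(self):
        while True:
            with self._condition:
                while not self._pending and not self._stopped:
                    self._condition.wait()
                if self._stopped:
                    return
                blocks = self._pending
                self._pending = []
            # The blocking wait happens here, off the critical path.
            ray.wait(blocks, num_returns=1, fetch_local=True)

    def stop(self):
        # Wake the thread so it can exit promptly.
        with self._condition:
            self._stopped = True
            self._condition.notify()
```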

Related issue number

closes #35521

Checks

  • I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
    • I've added any new APIs to the API Reference. For example, if I added a
      method in Tune, I've added it in doc/source/tune/api/ under the
      corresponding .rst file.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

amogkam (Contributor) commented May 22, 2023

kicking off the release test here: https://buildkite.com/ray-project/release-tests-pr/builds/39284

with stats.iter_wait_s.timer() if stats else nullcontext():
-    prefetcher.prefetch_blocks(list(sliding_window))
+    prefetcher.prefetch_blocks([next_block])
A reviewer commented on this diff:
why is this change required instead of triggering the fetch of the entire sliding window?

raulchen (Contributor, Author) replied:

This was a bug previously; there is no need to redundantly prefetch the same blocks (for both the wait-based and actor-based prefetchers).
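As a toy illustration of the redundancy (purely illustrative Python, not Ray code): with a sliding window of size 4 over 10 blocks, re-prefetching the whole window each step issues far more requests than prefetching only the newly entering block.

```python
from collections import deque

# Count the requests issued under the two call patterns discussed above.
blocks = list(range(10))
window = deque(maxlen=4)

whole_window_requests = 0
new_block_requests = 0
for block in blocks:
    window.append(block)
    whole_window_requests += len(window)  # prefetch_blocks(list(sliding_window))
    new_block_requests += 1               # prefetch_blocks([next_block])

print(whole_window_requests, new_block_requests)  # 34 vs 10
```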

raulchen (Contributor, Author) commented:

> kicking off the release test here: https://buildkite.com/ray-project/release-tests-pr/builds/39284

@amogkam How can I check the benchmark results from this?

amogkam (Contributor) commented May 22, 2023

iter-torch-batches-bs-32-prefetch-0-shuffleNone = {'time': 92.82726407200005}
iter-torch-batches-bs-32-prefetch-0-shuffle64 = {'time': 94.301909722}
iter-torch-batches-bs-32-prefetch-1-shuffleNone = {'time': 92.12948181000002}
iter-torch-batches-bs-32-prefetch-1-shuffle64 = {'time': 91.936336773}
iter-torch-batches-bs-32-prefetch-4-shuffleNone = {'time': 92.30885751599999}
iter-torch-batches-bs-32-prefetch-4-shuffle64 = {'time': 92.08876677199999}

Looks like we still see the regression; with prefetching, I would expect us to be down to about 70 seconds. Running it again to confirm.

if len(blocks_to_wait) > 0:
    ray.wait(blocks_to_wait, num_returns=1, fetch_local=True)
else:
    self._condition.wait()
A reviewer commented on this diff:
Catch and log exceptions from this loop?
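Something along these lines would do it; this is only a sketch, and the `_pending`, `_stopped`, and `_condition` attribute names are assumptions rather than the exact names in the PR:

```python
import logging

import ray

logger = logging.getLogger(__name__)


def _wait_loop(self):
    """Sketch: the background wait loop with exceptions caught and logged."""
    while not self._stopped:
        try:
            with self._condition:
                if not self._pending:
                    # Nothing queued yet; sleep until prefetch_blocks() notifies.
                    self._condition.wait()
                    continue
                blocks_to_wait, self._pending = self._pending, []
            # Block off the critical path until at least one block is local.
            ray.wait(blocks_to_wait, num_returns=1, fetch_local=True)
        except Exception:
            # Log instead of letting an unexpected error silently kill the thread.
            logger.exception("Error in prefetcher wait loop")
```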

@ericl added the @author-action-required label (The PR author is responsible for the next step. Remove tag to send back to the reviewer.) on May 30, 2023
raulchen (Contributor, Author) commented Jun 1, 2023

Fixed a few bugs. Now prefetch_batches=1 is more performant. But weirdly, prefetch_batches=4 performs the same as no prefetching.

iter-torch-batches-bs-32-prefetch-0-shuffleNone = {'time': 94.05068665299996}
iter-torch-batches-bs-32-prefetch-0-shuffle64 = {'time': 88.09606919100008}
iter-torch-batches-bs-32-prefetch-1-shuffleNone = {'time': 65.197580313}
iter-torch-batches-bs-32-prefetch-1-shuffle64 = {'time': 64.72387075799998}
iter-torch-batches-bs-32-prefetch-4-shuffleNone = {'time': 89.3722670200001}
iter-torch-batches-bs-32-prefetch-4-shuffle64 = {'time': 86.70470300799991}

ericl (Contributor) commented Jun 1, 2023

Hmm, previously we waited for all queued blocks; that might turn out to be important. I think we should follow the previous call pattern of passing all blocks to be prefetched to the wait call, while still using num_returns=1.
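Roughly, the suggested call pattern; this is a sketch, and `queued_blocks` is a stand-in name for whatever ObjectRefs are currently queued for prefetch:

```python
import ray

# Hand ray.wait() every block queued for prefetch so the object manager is
# hinted to start fetching all of them, but only block until one is local.
ready, _ = ray.wait(
    queued_blocks,     # all queued block ObjectRefs, not just the newest one
    num_returns=1,     # return as soon as any single block is available
    fetch_local=True,  # pull the object data to this node
)
```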

raulchen (Contributor, Author) commented Jun 1, 2023

After reverting to prefetching the entire sliding window every time, prefetch_batches=4 is now effective as well. I haven't figured out why, though.

iter-torch-batches-bs-32-prefetch-0-shuffleNone = {'time': 86.87967626900002}
iter-torch-batches-bs-32-prefetch-0-shuffle64 = {'time': 86.52172062299996}
iter-torch-batches-bs-32-prefetch-1-shuffleNone = {'time': 63.789988204}
iter-torch-batches-bs-32-prefetch-1-shuffle64 = {'time': 63.34117246599999}
iter-torch-batches-bs-32-prefetch-4-shuffleNone = {'time': 60.843809012000065}
iter-torch-batches-bs-32-prefetch-4-shuffle64 = {'time': 59.433647494999946}

@ericl Another question: currently, each prefetcher instance creates a new thread (I added a `Prefetcher.stop` method to make sure the thread stops as soon as possible). Do you think this will create too many threads in practice? If so, we may want to use a static thread pool. I think it should be fine, because users are unlikely to run multiple `iter_batches` calls simultaneously.
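For comparison, the static-thread-pool alternative mentioned above could look roughly like this; an illustrative sketch, not the merged design:

```python
from concurrent.futures import ThreadPoolExecutor

import ray

# A single shared pool instead of one thread per prefetcher instance, so
# concurrent iterators share a bounded set of worker threads.
_PREFETCH_POOL = ThreadPoolExecutor(max_workers=4, thread_name_prefix="prefetch")


def submit_prefetch(blocks):
    # Each blocking ray.wait() runs on a pooled thread; the caller returns
    # immediately and can ignore or inspect the returned Future.
    return _PREFETCH_POOL.submit(
        ray.wait, blocks, num_returns=1, fetch_local=True
    )
```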

ericl (Contributor) commented Jun 1, 2023

Interesting. It could be that we auto-cancel previous wait requests. Unfortunately, the code here is very old.

Also agree a thread per active iterator is totally fine.

amogkam (Contributor) left a review comment:
awesome, thanks @raulchen!

raulchen (Contributor, Author) commented Jun 1, 2023

Per offline discussions, some possible explanations are:

  1. The blocks that have been prefetched get evicted again; the second `ray.wait` either prevents them from being evicted or fetches them again.
  2. A second `ray.wait` somehow cancels the previous `ray.wait`.

Either way, we can merge this PR first to fix the regression.

@raulchen merged commit aa0d07e into ray-project:master on Jun 1, 2023
@raulchen deleted the wait-prefetch branch on Jun 1, 2023 at 22:35
arvind-chandra pushed a commit to lmco/ray that referenced this pull request Aug 31, 2023
Labels: @author-action-required

Successfully merging this pull request may close these issues: [Data] Performance regression in iter_batches prefetching

3 participants