[core] Cleanup handling for nondeterministic object size during transfer #22639
Conversation
Nice! While this doesn't block this PR, I'm wondering if we may also see cases where the data size is the same but data contents differ across versions of the object.
I believe that can't happen since we currently stream object data from a single source (never re-using chunks), but we may want to add a random version / checksum of the object data to reject these cases as well in the future.
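For illustration, a minimal sketch of the checksum idea — not Ray's actual API (`Fnv1a` and `VerifyObject` are hypothetical names): the receiver records a checksum when the transfer starts and rejects received data whose bytes differ, even when the size matches.

```cpp
#include <cstdint>
#include <iostream>
#include <string>

// FNV-1a, used here only as a stand-in for a real checksum.
uint64_t Fnv1a(const std::string &data) {
  uint64_t h = 1469598103934665603ULL;
  for (unsigned char c : data) {
    h ^= c;
    h *= 1099511628211ULL;
  }
  return h;
}

// Two versions of a nondeterministic object can have the same size but
// different bytes; a checksum taken when the transfer begins lets the
// receiver reject data from a different version.
bool VerifyObject(const std::string &received, uint64_t expected_checksum) {
  return Fnv1a(received) == expected_checksum;
}

int main() {
  std::string v1 = "nondeterministic-output-A";
  std::string v2 = "nondeterministic-output-B";  // same size, different bytes
  uint64_t expected = Fnv1a(v1);
  std::cout << std::boolalpha << VerifyObject(v2, expected) << "\n";  // false
}
```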
```cpp
  RAY_CHECK_OK(store_client_.Connect(store_socket_name_.c_str(), "", 0, 300));
}
ObjectBufferPool::ObjectBufferPool(
    std::shared_ptr<plasma::PlasmaClientInterface> store_client, uint64_t chunk_size)
```
👍
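As an aside, a minimal sketch (with hypothetical, slimmed-down types, not Ray's actual interface) of why taking the store client as a `plasma::PlasmaClientInterface` helps: a fake client can stand in for the plasma store in unit tests, which is what enables the receiver-side tests this PR adds.

```cpp
#include <cstdint>
#include <iostream>
#include <memory>

// Hypothetical slimmed-down version of the injected interface.
class PlasmaClientInterface {
 public:
  virtual ~PlasmaClientInterface() = default;
  virtual bool Create(uint64_t object_size) = 0;
};

// Fake client for unit tests: no plasma store process required.
class FakePlasmaClient : public PlasmaClientInterface {
 public:
  bool Create(uint64_t object_size) override {
    last_created_size_ = object_size;
    return true;
  }
  uint64_t last_created_size_ = 0;
};

// Sketch of a pool that takes the client by interface, as in the diff above.
class ObjectBufferPool {
 public:
  ObjectBufferPool(std::shared_ptr<PlasmaClientInterface> store_client,
                   uint64_t chunk_size)
      : store_client_(std::move(store_client)), chunk_size_(chunk_size) {}

  bool CreateObject(uint64_t object_size) {
    return store_client_->Create(object_size);
  }

 private:
  std::shared_ptr<PlasmaClientInterface> store_client_;
  uint64_t chunk_size_;
};

int main() {
  auto fake = std::make_shared<FakePlasmaClient>();
  ObjectBufferPool pool(fake, /*chunk_size=*/64);
  pool.CreateObject(1024);
  std::cout << fake->last_created_size_ << "\n";  // 1024
}
```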
```cpp
  const int64_t object_size =
      static_cast<int64_t>(data_size) - static_cast<int64_t>(metadata_size);
  std::shared_ptr<Buffer> data;
  RAY_LOG(INFO) << "store_client_ " << store_client_;
```
Stray log?
Oops, thanks...
Yeah, I was thinking this as well; a version number would be good, and I think it would use pretty much the same codepath. By the way, it is actually possible to get chunks from different sources right now if a transfer fails midway through, or if pull retries are close enough together that they overlap. That also means it's possible to get liveness issues if this happens repeatedly, but I figured it's fine for now.
Ah, is this if there are two concurrent pushers to the same pull requester? That does sound problematic.
Yup, adding a unique version number (like a randomized UUID) should help us eliminate the same-size, different-contents case.
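A minimal sketch of the version-tag idea, assuming hypothetical names (`ObjectVersion`, `ChunkHeader`, and `AcceptChunk` are not Ray's actual API): each (re)creation of an object gets a fresh random version, and the receiver drops chunks tagged with a stale version even when the size matches.

```cpp
#include <cstdint>
#include <iostream>
#include <random>

// Hypothetical per-object version tag, regenerated whenever the object is
// (re)created, e.g. after lineage reconstruction.
using ObjectVersion = uint64_t;

ObjectVersion GenerateVersion() {
  static std::mt19937_64 rng{std::random_device{}()};
  return rng();
}

struct ChunkHeader {
  ObjectVersion version;  // version of the object this chunk belongs to
  uint64_t chunk_index;
};

// Receiver-side check: drop chunks from a stale version of the object even
// when the data size happens to match.
bool AcceptChunk(const ChunkHeader &header, ObjectVersion expected_version) {
  if (header.version != expected_version) {
    std::cerr << "Rejecting chunk " << header.chunk_index
              << " from stale object version\n";
    return false;
  }
  return true;
}

int main() {
  const ObjectVersion v1 = GenerateVersion();
  const ObjectVersion v2 = GenerateVersion();  // object was reconstructed
  ChunkHeader stale{v1, 3};
  AcceptChunk(stale, v2);  // rejected: same size, possibly different contents
}
```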
Why are these changes needed?
Currently object transfers assume that the object size is fixed. This is a bad assumption during failures, especially with lineage reconstruction enabled and tasks with nondeterministic outputs.
This PR cleans up the handling and hopefully guards against two cases where the object size may change during a transfer:
1. The object manager's size information does not match the object in the local plasma store (due to async notifications). --> The object manager overwrites its own information if it finds that the physical object has a different size.
2. The receiver's created buffer size does not match the sender's object size. --> The receiver destroys the previous buffer and creates a new buffer with the correct size (see the sketch below). This might cause some transient errors, but eventually the object transfer should succeed.
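For illustration, a minimal sketch of case 2's receiver-side handling, with hypothetical types (`CreateBuffer` and `HandleChunk` are not Ray's actual API): when a chunk advertises a different object size than the buffer created earlier, the stale buffer is dropped and a new one is created at the advertised size.

```cpp
#include <cstdint>
#include <iostream>
#include <memory>
#include <vector>

// Hypothetical stand-in for an in-progress create buffer; not Ray's type.
struct CreateBuffer {
  uint64_t object_size;
  std::vector<uint8_t> data;
};

// Case 2 from the description: if a chunk advertises a different object size
// than the buffer we already created, drop the stale buffer and restart at
// the new size. Progress so far is lost, but the transfer can then succeed.
void HandleChunk(std::unique_ptr<CreateBuffer> &buffer,
                 uint64_t advertised_size) {
  if (buffer && buffer->object_size != advertised_size) {
    std::cout << "Object size changed from " << buffer->object_size << " to "
              << advertised_size << "; recreating buffer\n";
    buffer.reset();  // previously received chunks are discarded
  }
  if (!buffer) {
    buffer.reset(new CreateBuffer{advertised_size,
                                  std::vector<uint8_t>(advertised_size)});
  }
}

int main() {
  std::unique_ptr<CreateBuffer> buffer;
  HandleChunk(buffer, 1024);  // first chunk creates the buffer
  HandleChunk(buffer, 2048);  // sender's object size changed mid-transfer
}
```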
Unfortunately I couldn't trigger this from Python because it depends on some pretty specific timing conditions. However, I did add some unit tests for case 2 (this is the majority of the PR).
Checks
- I've run scripts/format.sh to lint the changes in this PR.