[WIP] Speed up deserialization of object refs #17882

mwtian · 2021-08-16T23:49:41Z

Why are these changes needed?

In #17803 it is reported that calling ray.get() on object refs contained within an object can be expectedly slow.

Cache Python call site when creating object refs contained in the same deserializing object. Getting Python call site is the most expensive operation when creating object refs. Also move some deserialization logic into cython.
Add a microbenchmark for ray.get() on objects containing 10k object refs. On m5.8xlarge,
Before: single client get object containing 10k refs per second 6.68 +- 0.01
After: single client get object containing 10k refs per second 17.52 +- 0.8

Related issue number

#17803

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

bveeramani · 2022-01-30T05:45:36Z

‼️ ACTION REQUIRED ‼️

We've switched our code formatter from YAPF to Black (see #21311).

To prevent issues with merging your code, here's what you'll need to do:

Install Black

pip install -I black==21.12b0

Format changed files with Black

curl -o format-changed.sh https://gist.githubusercontent.com/bveeramani/42ef0e9e387b755a8a735b084af976f2/raw/7631276790765d555c423b8db2b679fd957b984a/format-changed.sh
chmod +x ./format-changed.sh
./format-changed.sh
rm format-changed.sh

Commit your changes.

git add --all
git commit -m "Format Python code with Black"

Merge master into your branch.

git pull upstream master

Resolve merge conflicts (if necessary).

After running these steps, you'll have the updated format.sh.

kfstorm · 2022-03-13T09:20:42Z

‼️ ACTION REQUIRED ‼️

We've updated our formatting configuration for C++ code. (see #22725)

This PR includes C++ code change. To prevent issues with merging your code, here's what you'll need to do:

Merge the latest changes from upstream/master branch into your branch.

git pull upstream master
git merge upstream/master

Resolve merge conflicts (if necessary).

After running these steps, you'll have the updated C++ formatting configuration.

Format changed files.

scripts/format.sh

Commit your changes.

git add --all
git commit -m "Format C++ code"

stale · 2022-04-16T15:57:06Z

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.

If you'd like to keep this open, just leave any comment, and the stale label will be removed.

Cache call site

676bfa0

mwtian force-pushed the ref-count branch from 847cdd0 to 676bfa0 Compare August 17, 2021 07:13

mwtian added 2 commits August 17, 2021 07:18

fix

48255fc

benchmark

8853bdc

This was referenced Aug 23, 2021

[Core] Slow Embedded Object Ref Counting #17803

Closed

[Core][ObjectRef] Change default to not record call stack during ObjectRef creation #18078

Merged

stale bot added the stale The issue is stale. It will be closed within 7 days unless there are further conversation label Apr 16, 2022

mwtian closed this Jul 20, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Speed up deserialization of object refs #17882

[WIP] Speed up deserialization of object refs #17882

mwtian commented Aug 16, 2021 •

edited

Loading

bveeramani commented Jan 30, 2022

kfstorm commented Mar 13, 2022

stale bot commented Apr 16, 2022

[WIP] Speed up deserialization of object refs #17882

[WIP] Speed up deserialization of object refs #17882

Conversation

mwtian commented Aug 16, 2021 • edited Loading

Why are these changes needed?

Related issue number

Checks

bveeramani commented Jan 30, 2022

‼️ ACTION REQUIRED ‼️

kfstorm commented Mar 13, 2022

‼️ ACTION REQUIRED ‼️

stale bot commented Apr 16, 2022

mwtian commented Aug 16, 2021 •

edited

Loading