[Pubsub] reduce memory usage for channels that do not require total memory cap #23985

mwtian · 2022-04-18T21:01:26Z

Why are these changes needed?

In a1e06f6, memory bound was added for each subscribed entity in the publisher. It adds two extra std::deque per subscribed entity, which turns out to cost a lot more memory when there are a large number of ObjectRefs: #23853 (comment)

This PR avoids the extra memory usage for entities in channels unlikely to grow too large, i.e. all channels except those for logs and error info. Subscribed entity memory usage no longer shows up in the memory profile when there are 1M object refs:

Raw data: profile006.pb.gz

Related issue number

#23604

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

stephanie-wang · 2022-04-19T16:51:36Z

src/ray/pubsub/publisher.h

+
+/// State for an entity that streams published messages to subscribers, with total size
+/// cap.
+class StreamEntityState : public EntityState {


I think "Stream" is not really the right word here since they both send streams to the subscribers. Maybe something like "CappedEntityState" vs "BufferedEntityState"?

Also, could you update the comment to explain what happens when we exceed the cap and to explain when Basic vs Streamed should be used/why we have two different kinds?

Good point. Renamed to CappedEntityState and added comment on their differences.

stephanie-wang

Nice find! It looks good, I just left some comments about naming and documentation.

fishbone · 2022-04-20T00:14:27Z

src/ray/pubsub/publisher.cc

@@ -90,24 +101,32 @@ const absl::flat_hash_map<SubscriberID, SubscriberState *> &EntityState::Subscri
  return subscribers_;
 }

+SubscriptionIndex::SubscriptionIndex(rpc::ChannelType channel_type)


nit: I feel it's a little bit wired to put channel_type into SubscriptionIndex. it seems that it's only used to construct EntityState. We don't need to store it inside channel_type_.

Another thing I feel bad about is that it's an application layer decision, but here it's hardcoded in the infra layer. I'm wondering whether we can make it better?

synced offline. I'm good with this right now since it's still maintainable.

basic entity state

0f9376c

mwtian assigned stephanie-wang Apr 18, 2022

mwtian marked this pull request as ready for review April 18, 2022 22:43

mwtian added the tests-ok The tagger certifies test failures are unrelated and assumes personal liability. label Apr 18, 2022

mwtian assigned fishbone Apr 19, 2022

stephanie-wang reviewed Apr 19, 2022

View reviewed changes

stephanie-wang approved these changes Apr 19, 2022

View reviewed changes

update

48de848

fishbone reviewed Apr 20, 2022

View reviewed changes

fishbone approved these changes Apr 20, 2022

View reviewed changes

fishbone merged commit 34fb092 into ray-project:master Apr 20, 2022

mwtian deleted the pubsub-state branch April 20, 2022 01:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Pubsub] reduce memory usage for channels that do not require total memory cap #23985

[Pubsub] reduce memory usage for channels that do not require total memory cap #23985

mwtian commented Apr 18, 2022 •

edited

Loading

stephanie-wang Apr 19, 2022

mwtian Apr 19, 2022

stephanie-wang left a comment

fishbone Apr 20, 2022

fishbone Apr 20, 2022

fishbone Apr 20, 2022

[Pubsub] reduce memory usage for channels that do not require total memory cap #23985

[Pubsub] reduce memory usage for channels that do not require total memory cap #23985

Conversation

mwtian commented Apr 18, 2022 • edited Loading

Why are these changes needed?

Related issue number

Checks

stephanie-wang Apr 19, 2022

Choose a reason for hiding this comment

mwtian Apr 19, 2022

Choose a reason for hiding this comment

stephanie-wang left a comment

Choose a reason for hiding this comment

fishbone Apr 20, 2022

Choose a reason for hiding this comment

fishbone Apr 20, 2022

Choose a reason for hiding this comment

fishbone Apr 20, 2022

Choose a reason for hiding this comment

mwtian commented Apr 18, 2022 •

edited

Loading