Lazy-allocate pusher memory. #397

ryzhyk · 2021-06-25T02:54:43Z

The current implementation of channel pushers preallocates
Message::default_length entries per pusher. For very large graphs this
adds up to a lot of memory being allocated even when the system is idle
with no outstanding messages. This patch changes the allocation policy
to only allocate channel memory when there are messages to send and to
deallocate it when there are no messages, reducing the memory footprint
to 0 in idle state at the cost of some potential slow-down due to a
larger number of allocations.

See #394.

frankmcsherry · 2021-06-25T12:03:54Z

A few quick thoughts:

Lazy allocation seems great.
The approach to de-allocation will de-allocate on every call to Message::push_at, rather than at the end of a stream of messages. The Push trait that gets used everywhere communicates "I have no more data" with a None message, and de-allocating on that seems better. So, once every burst of data, rather than once for every message.
The above approach might mean that this can be implemented by e.g. Exchange and then just be another pact rather than a system-wide change to resource management. If nothing else, it's probably worth tracking down where these buffers get stashed.

ryzhyk · 2021-06-26T03:44:38Z

Thanks for the feedback!

So, once every burst of data, rather than once for every message.

Makes sense! Instead of always deallocating inside push_at, I will modify pusher implementations that maintain internal buffers (Exchange and Tee) to deallocate the buffer on a None message.

The above approach might mean that this can be implemented by e.g. Exchange and then just be another pact rather than a system-wide change to resource management.

Sorry, I'm not sure how to implement this. Are you suggesting that pushers::Exchange should be configurable to optionally deallocate at the end of a burst, and then we create a pact similar to ExchangePact but with this option enabled? If so, how do I get timely to use this new pact when constructing the dataflow?

The current implementation of channel pushers preallocates `Message::default_length` entries per pusher. For very large graphs this adds up to a lot of memory being allocated even when the system is idle with no outstanding messages. This patch changes the allocation policy to only allocate channel memory when there are messages to send and to deallocate it at the end of a burst of messages (signaled by pushing a `None` message), reducing the memory footprint to 0 in idle state at the cost of some potential slow-down due to a larger number of allocations. See TimelyDataflow#394.

frankmcsherry · 2021-06-30T14:18:57Z

Re-reading things, I think the Exchange implementation is missing an opportunity to blank out its vector on lines 44-46, where for single-worker instantiations it just passes the message through (including a None indicating a flush).

ryzhyk · 2021-06-30T14:23:58Z

I thought in this case the buffer would never be allocated in the first place?

frankmcsherry · 2021-06-30T14:26:00Z

Ah good point. That may be true!

ryzhyk · 2021-06-30T18:40:11Z

I confirmed using out internal benchmarks that the fixed up version that only deallocates on None yields memory savings comparable to the version that aggressively deallocated on every push.

@frankmcsherry, do you see this being merged as is or do you feel we should work on making this an optional pact?

frankmcsherry · 2021-07-01T12:57:02Z

I'd need to think a bit before merging it as is, due to the impact on others. If it were an optional pact, it would be easy to accept. On the other hand, I'm not sure that the Tee stuff is as easy to make optional, which is too bad. I do like the idea, and we did something like this in Naiad, but I do want to shake out the performance penalty before landing it.

ryzhyk mentioned this pull request Jun 25, 2021

Lazy memory allocation in channel pushers. #394

Open

ryzhyk force-pushed the lazy_allocate_channels branch from b8aeed0 to d29fcf9 Compare June 28, 2021 06:52

ryzhyk force-pushed the lazy_allocate_channels branch from d29fcf9 to 9f0571f Compare June 28, 2021 07:06

Kixiron mentioned this pull request Jul 12, 2021

Exchange, input, tee: Only allocate buffers as needed #400

Closed

antiguru mentioned this pull request Aug 23, 2021

Exchange: bench, do not preallocate buffers #416

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lazy-allocate pusher memory. #397

Lazy-allocate pusher memory. #397

ryzhyk commented Jun 25, 2021

frankmcsherry commented Jun 25, 2021

ryzhyk commented Jun 26, 2021 •

edited

Loading

frankmcsherry commented Jun 30, 2021

ryzhyk commented Jun 30, 2021

frankmcsherry commented Jun 30, 2021

ryzhyk commented Jun 30, 2021

frankmcsherry commented Jul 1, 2021

Lazy-allocate pusher memory. #397

Are you sure you want to change the base?

Lazy-allocate pusher memory. #397

Conversation

ryzhyk commented Jun 25, 2021

frankmcsherry commented Jun 25, 2021

ryzhyk commented Jun 26, 2021 • edited Loading

frankmcsherry commented Jun 30, 2021

ryzhyk commented Jun 30, 2021

frankmcsherry commented Jun 30, 2021

ryzhyk commented Jun 30, 2021

frankmcsherry commented Jul 1, 2021

ryzhyk commented Jun 26, 2021 •

edited

Loading