Salvage mplex in the age of resource management #99

vyzo · 2022-02-26T17:59:12Z

Instead of blindly pushing packets till we run out of memory and die because the resource manager throttled us, we limit in-flight in and out buffers.
As a side effect, we no longer crash and burn when the reader is slow, we just block.

multiplex.go

marten-seemann · 2022-02-28T07:33:30Z

multiplex.go

+
+	return mp.getBuffer(length)
+}
+
 func (mp *Multiplex) getBuffer(length int) ([]byte, error) {
 	if err := mp.memoryManager.ReserveMemory(length, 128); err != nil {


Can we reserve all the memory mplex could possibly consume up front, and get rid of this call?

we could, but thats 16M a pop!!!
16 buffers x 1M max message size.

But you are right, we should.
I guess we can rsvp incrementally to find how much we can wiggle, up to 16 bufs (maybe less?)

ok done; I also cut it to 8 buffers total (4 each direction) so that it's only up to 8M for the reservation.

- memory is reserved up front - number of buffers in flight is limited - we remove that insane behaviour of resetting conns with slow readers; now we just block.

vyzo · 2022-02-28T08:25:45Z

squashed some commits.

Stebalien · 2022-02-28T13:38:31Z

Why are we reserving memory up-front instead of on-demand? That and assuming that every message is at the "max message size" will basically make mplex useless.

Stebalien · 2022-02-28T13:39:18Z

As a side effect, we no longer crash and burn when the reader is slow, we just block.

We didn't crash and burn, we killed the stream so the entire system didn't deadlock. Please revert that!

marten-seemann · 2022-02-28T13:42:32Z

We didn't crash and burn, we killed the stream so the entire system didn't deadlock.

The fact that mplex was randomly resetting streams is what made mplex unusable, as every transfer larger than a few MB would run into this limit.

Stebalien · 2022-02-28T13:44:15Z

The fact that mplex was randomly resetting streams is what made mplex unusable, as every transfer larger than a few MB would run into this limit.

Only if the reader didn't read from the stream fast enough.
I should have read the code, it still does this: https://github.com/libp2p/go-mplex/pull/99/files#diff-1b51c99a200935510d1b391b27ba28918fa265dffdeaab00d4a9b393f0e3e520R449

So I'm now very confused.

marten-seemann · 2022-02-28T13:48:24Z

Sorry for the confusion, we didn't reset the stream, we killed the connection when a memory allocation failed:
https://github.com/libp2p/go-mplex/blob/v0.4.0/multiplex.go#L570-L578

vyzo · 2022-02-28T13:53:47Z

the reservation must be for max message size because we really send that long messages (and must be prepared to receive).

we could theoretically try to precommit memory and track actual usage every time we get a buffer, but it gets complicated and the only reasonable way to deal with the condition and the timeouts/cancellations is to spawn a goroutine every time we block.

vyzo · 2022-02-28T13:57:26Z

note that mplex was effectively unusable with dynamic memory reervations coz we dont know who the buffer belongs to and thus must kill the conn when it fails.

Stebalien · 2022-02-28T14:13:19Z

Sorry for the confusion, we didn't reset the stream, we killed the connection when a memory allocation failed:

Ah, I thought you were referring to what mplex did before. That makes more sense.

the reservation must be for max message size because we really send that long messages (and must be prepared to receive).

I'm still confused. Why can't we call ReserveMemory right before we allocate a buffer instead of keeping it around?

note that mplex was effectively unusable with dynamic memory reervations coz we dont know who the buffer belongs to and thus must kill the conn when it fails.

This cannot be correct. Like, literally cannot.

We know exactly what we know in yamux.
We know how big the message is before we try to read it.
We know how big the write buffer is before we try to send.

vyzo · 2022-02-28T14:35:14Z

I'm still confused. Why can't we call ReserveMemory right before we allocate a buffer instead of keeping it around?

Because it can fail and we have to kill the conn. If your concern is that we are not very effectively using the reserved memory, then we can do more detailed memory accounting at the cost of a goroutine for every blocked allocation. But that just pushes the problem a bit further, it doesn't solve it.

We know how big the message is before we try to read it

yes, and if we can't get memory for it we must die because we don't know what it is.
I think it is strictly better to block until a buffer is available instead of failing to get one.

Stebalien · 2022-02-28T14:40:31Z

Because it can fail and we have to kill the conn.

Can we not block?

If your concern is that we are not very effectively using the reserved memory, then we can do more detailed memory accounting at the cost of a goroutine for every blocked allocation

Why? I assume we'd just block the receive until we have some memory.

That's partially my concern. My main concern is that mplex is going to sit on massive memory allocations without using them.

yes, and if we can't get memory for it we must die because we don't know what it is.
I think it is strictly better to block until a buffer is available instead of failing to get one.

Yes, then why aren't we doing this?

vyzo · 2022-02-28T14:42:43Z

Because it can fail and we have to kill the conn.

Can we not block?

how?

If your concern is that we are not very effectively using the reserved memory, then we can do more detailed memory accounting at the cost of a goroutine for every blocked allocation

Why? I assume we'd just block the receive until we have some memory.

That's partially my concern. My main concern is that mplex is going to sit on massive memory allocations without using them.

If that's your concern, yes, I can fix that; it's just more complexity for marginal benefit imo.

yes, and if we can't get memory for it we must die because we don't know what it is.
I think it is strictly better to block until a buffer is available instead of failing to get one.

Yes, then why aren't we doing this?

That's what we do, we block until we can get a buffer!

Stebalien · 2022-02-28T15:04:22Z

Can we not block?

how?

It looks like we need a way to block on resource allocation. This isn't the first time, and won't be the last.

If that's your concern, yes, I can fix that; it's just more complexity for marginal benefit imo.

Reserving multiple megabytes per connection seems like a pretty big problem, right?

That's what we do, we block until we can get a buffer!

No, we're reserving multiple megabytes of memory up-front. That's my concern.

BigLep · 2022-03-01T21:54:04Z

@vyzo : is there any followup work that needs to happen? I'm asking because saw comments/discussion after this was merged.

vyzo · 2022-03-02T05:47:23Z

yes, we decided to drastically reduce the memory precommit.

vyzo requested review from Stebalien and marten-seemann February 26, 2022 17:59

vyzo force-pushed the fix/salvage branch from d4cf0b4 to 9145b19 Compare February 26, 2022 18:08

vyzo mentioned this pull request Feb 27, 2022

Intermittent connection drops with Edgevpn 0.10.0/libp2p 0.18.0-rc5 leaves disconnected peers mudler/edgevpn#12

Closed

marten-seemann reviewed Feb 28, 2022

View reviewed changes

vyzo added 2 commits February 28, 2022 10:24

fix mplex behaviour in terms of resource management

7461f15

- memory is reserved up front - number of buffers in flight is limited - we remove that insane behaviour of resetting conns with slow readers; now we just block.

fix tests

ab1db53

vyzo force-pushed the fix/salvage branch from 239b975 to ab1db53 Compare February 28, 2022 08:25

marten-seemann approved these changes Feb 28, 2022

View reviewed changes

marten-seemann merged commit a03888c into master Feb 28, 2022

vyzo deleted the fix/salvage branch February 28, 2022 14:35

vyzo mentioned this pull request Feb 28, 2022

adjust dataIn buffer size for the actual allotment of buffers #101

Closed

BigLep mentioned this pull request Mar 1, 2022

go-libp2p v0.18.0 libp2p/go-libp2p#1267

Closed

69 tasks

vyzo mentioned this pull request Mar 2, 2022

Mplex salvage operations, part II #102

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Salvage mplex in the age of resource management #99

Salvage mplex in the age of resource management #99

vyzo commented Feb 26, 2022

marten-seemann Feb 28, 2022

vyzo Feb 28, 2022

vyzo Feb 28, 2022

vyzo commented Feb 28, 2022

Stebalien commented Feb 28, 2022

Stebalien commented Feb 28, 2022

marten-seemann commented Feb 28, 2022

Stebalien commented Feb 28, 2022

marten-seemann commented Feb 28, 2022

vyzo commented Feb 28, 2022 •

edited

Loading

vyzo commented Feb 28, 2022 •

edited

Loading

Stebalien commented Feb 28, 2022

vyzo commented Feb 28, 2022

Stebalien commented Feb 28, 2022

vyzo commented Feb 28, 2022

Stebalien commented Feb 28, 2022

BigLep commented Mar 1, 2022

vyzo commented Mar 2, 2022

Salvage mplex in the age of resource management #99

Salvage mplex in the age of resource management #99

Conversation

vyzo commented Feb 26, 2022

marten-seemann Feb 28, 2022

Choose a reason for hiding this comment

vyzo Feb 28, 2022

Choose a reason for hiding this comment

vyzo Feb 28, 2022

Choose a reason for hiding this comment

vyzo commented Feb 28, 2022

Stebalien commented Feb 28, 2022

Stebalien commented Feb 28, 2022

marten-seemann commented Feb 28, 2022

Stebalien commented Feb 28, 2022

marten-seemann commented Feb 28, 2022

vyzo commented Feb 28, 2022 • edited Loading

vyzo commented Feb 28, 2022 • edited Loading

Stebalien commented Feb 28, 2022

vyzo commented Feb 28, 2022

Stebalien commented Feb 28, 2022

vyzo commented Feb 28, 2022

Stebalien commented Feb 28, 2022

BigLep commented Mar 1, 2022

vyzo commented Mar 2, 2022

vyzo commented Feb 28, 2022 •

edited

Loading

vyzo commented Feb 28, 2022 •

edited

Loading