
Respect message list limits when creating messages to send #218

Merged — 5 commits into libp2p:develop on Feb 2, 2022

Conversation

ajsutton
Contributor

@ajsutton ajsutton commented Feb 1, 2022

A somewhat naive approach to ensuring that created messages stay within the message list limits. Before each message part is added to the new message, all list limits are checked. If the limits would be exceeded, the message part is re-added to the pending pool.

Some concerns:

  1. Is it too expensive to perform these checks for every part we add?
  2. Is it possible to deduplicate the checks in validateMessageListLimits (single message) and validateMergedMessageListLimits (two messages)?
  3. Is there a better approach to handling parts that don't fit than just adding them back to the queue?
  4. Will something actually trigger sending the remaining parts in a later message or do they get stuck?
  5. How do we test this properly?
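The per-part check and re-queueing described above can be sketched roughly as follows. This is a minimal illustration, not the PR's code: `buildMessage`, the plain string parts, and the single `MAX_PARTS_PER_MESSAGE` limit are stand-ins for the actual message-part types and the several list limits the PR validates.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.List;
import java.util.Queue;

// Hypothetical sketch of the "check limits before adding each part" approach.
public class NaiveMessageBuilder {
    static final int MAX_PARTS_PER_MESSAGE = 3; // stand-in for the real list limits

    // Drains parts from the pending pool into one outgoing message,
    // re-queueing any part that would exceed the limit (concern #3 above).
    static List<String> buildMessage(Queue<String> pendingPool) {
        List<String> message = new ArrayList<>();
        List<String> deferred = new ArrayList<>();
        while (!pendingPool.isEmpty()) {
            String part = pendingPool.poll();
            if (message.size() + 1 > MAX_PARTS_PER_MESSAGE) {
                deferred.add(part); // doesn't fit: defer to a later message
            } else {
                message.add(part);
            }
        }
        // Concern #4 above: something must later trigger sending these.
        pendingPool.addAll(deferred);
        return message;
    }

    public static void main(String[] args) {
        Queue<String> pool = new ArrayDeque<>(List.of("a", "b", "c", "d", "e"));
        System.out.println(buildMessage(pool)); // first message, capped at 3 parts
        System.out.println(pool.size());        // 2 parts remain queued
    }
}
```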

@Nashatyrev
Collaborator

Some background from slack:
@ajsutton wrote:

I’ve been investigating a problem with gossip messages not making it through in a custom network for a while now.
Turns out that at startup Teku is activating 3 different forks (phase0, altair and bellatrix) because they're all activating within 2 epochs. Gossip topics are fork-specific, so Teku is subscribing to 3 times as many topics as it would normally follow all at once at startup. Since it's told to subscribe to all subnets, that's 64 individual attestation subnets plus the blocks, aggregates etc. subnets, times 3 forks. In total we're subscribing to 217 topics.
To avoid DOS attacks, Teku limits the number of topics you can ask to subscribe to in a single message and the total number of topics a peer can be subscribed to at once. We set them to absurdly high values that were more than twice what you'd normally need: 100 topics per subscription request and a total of 200 topics that can be subscribed to at once. Because the limits were so high, we never actually got Teku to split up requests, so the only subscription message that's sent gets ignored on both sides, and hence the two Teku instances aren't actually subscribed to any gossip at all.
It also violates the maxSubscriptions and maxGraft settings for libp2p gossip (both set to 200). libp2p probably should automatically split the messages it sends to stay under those limits, but isn't.
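The arithmetic behind the failure can be checked directly. The numbers (217 topics, 100 topics per subscribe request, 200 total) come from the thread; the class and method names below are purely illustrative.

```java
// Back-of-envelope check of the limits quoted above.
public class SubscriptionLimits {
    static final int MAX_TOPICS_PER_SUBSCRIBE = 100; // per-request limit
    static final int MAX_SUBSCRIBED_TOPICS = 200;    // total per-peer limit

    // Minimum number of subscribe messages needed to stay under the per-request cap.
    static int messagesNeeded(int totalTopics) {
        return (totalTopics + MAX_TOPICS_PER_SUBSCRIBE - 1) / MAX_TOPICS_PER_SUBSCRIBE;
    }

    public static void main(String[] args) {
        int totalTopics = 217; // three concurrently active forks at startup
        System.out.println(totalTopics > MAX_TOPICS_PER_SUBSCRIBE); // true: one request can't carry them all
        System.out.println(totalTopics > MAX_SUBSCRIBED_TOPICS);    // true: even split, the total cap is exceeded
        System.out.println(messagesNeeded(totalTopics));            // 3
    }
}
```

Splitting alone is not enough here: even three valid subscribe messages would push the peer past the 200-topic total, which is why the discussion below also touches on raising the limits themselves.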

@Nashatyrev wrote:

Did recently something slightly similar with mplex frames here: #205

@Nashatyrev
Collaborator

Basically the approach looks good.
Yep, I also feel the urge to get rid of the duplicate validate method. I'll check whether clone() followed by validating a single message is fast enough.
I would optimistically build the whole message and then validate it. If validation fails, handle it with the proposed method. Overly large messages look like exceptional cases, mostly.
I could handle testing as well.

@Nashatyrev
Collaborator

Nashatyrev commented Feb 1, 2022

BTW, does it make sense to simply increase the subscription limits as soon as we hit such a case?

@Nashatyrev
Collaborator

What do you think about this variant? 965636b

  • Optimistically builds and validates the whole message first (expected to be the vast majority of cases, so the performance cost can probably be neglected)
  • Uses the single validate method
  • Stops filling a single message once any list hits its limit. This looks safer: if we tried to keep filling all the lists, the following invalid case could occur: the topic subscription is left out of the first message (due to the limit) while a publication for that topic is included
  • Sends all messages when a split occurs. There is a potential message rate limit issue, but I believe it is far from happening with the current setup
  • Added an initial test. Going to add more once we agree on the approach
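The build-then-validate-then-split flow in the bullets above can be sketched as follows. This is an illustration under assumptions, not the code in 965636b: it uses a single size-based limit and made-up names, whereas the real change validates several gossip list limits via the one validate method.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch: build the full message optimistically, validate once,
// and split only when validation fails (the expected-rare case).
public class OptimisticMessageBuilder {
    static final int MAX_PARTS_PER_MESSAGE = 100; // stand-in for the real limits

    // The single validate method, applied to a fully built message.
    static boolean validate(List<String> message) {
        return message.size() <= MAX_PARTS_PER_MESSAGE;
    }

    // Returns one or more messages, each within the limit. Filling stops as
    // soon as the limit is hit, so part order is preserved across messages
    // (a publish never ends up ahead of its topic's subscription).
    static List<List<String>> buildMessages(List<String> parts) {
        List<String> whole = new ArrayList<>(parts);
        if (validate(whole)) {
            return List.of(whole); // the common case: no split needed
        }
        List<List<String>> messages = new ArrayList<>();
        List<String> current = new ArrayList<>();
        for (String part : parts) {
            if (current.size() == MAX_PARTS_PER_MESSAGE) {
                messages.add(current); // limit hit: start a new message
                current = new ArrayList<>();
            }
            current.add(part);
        }
        if (!current.isEmpty()) {
            messages.add(current);
        }
        return messages; // all split messages are then sent immediately
    }

    public static void main(String[] args) {
        List<String> parts = new ArrayList<>();
        for (int i = 0; i < 217; i++) parts.add("topic" + i);
        buildMessages(parts).forEach(m -> System.out.println(m.size())); // 100, 100, 17
    }
}
```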

@ajsutton
Contributor Author

ajsutton commented Feb 1, 2022

Yeah, I really like that approach. You're right that it's quite unusual to exceed these message limits (the only case I know of is when three forks are scheduled for subsequent epochs, which only happens in devnets).

In terms of increasing the subscription limits, I'm assuming that at the libp2p level the current limits must be somewhat agreed between clients, though I'm not really sure of that. At the Teku level, yeah, I think we can just increase the limits to make things fit without opening up any real risks — there is still a limited number of "relevant" topics that we allow subscriptions to, and it won't require that big an increase on the current limits.

@Nashatyrev
Collaborator

Fixed an issue here: 9d39652. When a new peer connected, onPeerActive didn't use the collectPeerMessages() method.
Also added some more tests.

@Nashatyrev Nashatyrev marked this pull request as ready for review February 2, 2022 10:17
Collaborator

@Nashatyrev Nashatyrev left a comment


LGTM

Contributor Author

@ajsutton ajsutton left a comment


Thanks Anton, looks good.

@ajsutton ajsutton merged commit 6a8de4c into libp2p:develop Feb 2, 2022
@ajsutton ajsutton deleted the limit-list-lengths branch February 2, 2022 22:26