[consensus] Update proposer metrics #19655

arun-koshy · 2024-10-02T06:38:20Z

Description

Adding a few metrics that will help with the smart ancestor selection investigations

Set leader timestamp for the parent round of the current threshold clock round in threshold clock. This will allow for us to get better block_proposal_leader_wait_ms metric values
Update block_proposal_leader_wait_count whenever we hit the case where leaders don't exist during proposal
Add metric for the interval between propsals.

Test plan

Release notes

Check each box that your changes affect. If none of the boxes relate to your changes, release notes aren't required.

For each box you select, include information after the relevant heading that describes the impact of your changes that a user might notice and any actions they must take to implement updates.

vercel · 2024-10-02T06:38:24Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

3 Skipped Deployments

Name	Status	Updated (UTC)
multisig-toolkit	⬜️ Ignored (Inspect)	Oct 2, 2024 6:38am
sui-kiosk	⬜️ Ignored (Inspect)	Oct 2, 2024 6:38am
sui-typescript-docs	⬜️ Ignored (Inspect)	Oct 2, 2024 6:38am

arun-koshy · 2024-10-02T18:01:06Z

All of the changes to the metrics can be seen reflected on the left side of the graphs and the right side is "main". Let me know what you think.

We can see the rate at which we have to call back in to try new block because leaders were missing.
We can see the the average wait time for a leader AFTER a quorum has been reached which is around 3ms. With this we will have the quorum receive latency + leader wait time separated to show us which is taking most of the time.
And this is what block proposal interval will look like.

mwtian · 2024-10-02T18:25:02Z

consensus/core/src/core.rs

-            .add_blocks(accepted_blocks.iter().map(|b| b.reference()).collect())
+        // Get max round of accepted blocks. This will be equal to the threshold
+        // clock round, either by advancing the threshold clock round by being
+        // greater than current clock round or by equaling the current clock round.


Is this the case? Blocks older than current threshold clock round can get accepted as well.

I only added the case in the comment for greater and equal but blocks less than the clock round are essentially ignored by threshold clock

mwtian · 2024-10-02T18:31:23Z

consensus/core/src/core.rs

+                self.context
+                    .metrics
+                    .node_metrics
+                    .block_proposal_leader_wait_count


I think we should use a separate metric for counting the number of times leader is not found. block_proposal_leader_wait_count is tied to block_proposal_leader_wait_ms, so when the average wait is ~250ms, we know the leader is missing.

I think the confusion for me with these metrics is that it doesn't just include leader wait time, it includes the quorum receive wait time which can make this metric a little misleading. Separating them brings more clarity. Though I guess we could always subtract this metric from quorum receive latency.

add proposer metrics

75e9363

arun-koshy requested review from akichidis and mwtian October 2, 2024 06:38

mwtian reviewed Oct 2, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[consensus] Update proposer metrics #19655

[consensus] Update proposer metrics #19655

arun-koshy commented Oct 2, 2024 •

edited

Loading

vercel bot commented Oct 2, 2024

arun-koshy commented Oct 2, 2024 •

edited

Loading

mwtian Oct 2, 2024

arun-koshy Oct 2, 2024

mwtian Oct 2, 2024

arun-koshy Oct 2, 2024

[consensus] Update proposer metrics #19655

Are you sure you want to change the base?

[consensus] Update proposer metrics #19655

Conversation

arun-koshy commented Oct 2, 2024 • edited Loading

Description

Test plan

Release notes

vercel bot commented Oct 2, 2024

arun-koshy commented Oct 2, 2024 • edited Loading

mwtian Oct 2, 2024

Choose a reason for hiding this comment

arun-koshy Oct 2, 2024

Choose a reason for hiding this comment

mwtian Oct 2, 2024

Choose a reason for hiding this comment

arun-koshy Oct 2, 2024

Choose a reason for hiding this comment

arun-koshy commented Oct 2, 2024 •

edited

Loading

arun-koshy commented Oct 2, 2024 •

edited

Loading