-
Notifications
You must be signed in to change notification settings - Fork 11.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[consensus] Update proposer metrics #19655
base: main
Are you sure you want to change the base?
Conversation
The latest updates on your projects. Learn more about Vercel for Git ↗︎ |
.add_blocks(accepted_blocks.iter().map(|b| b.reference()).collect()) | ||
// Get max round of accepted blocks. This will be equal to the threshold | ||
// clock round, either by advancing the threshold clock round by being | ||
// greater than current clock round or by equaling the current clock round. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this the case? Blocks older than current threshold clock round can get accepted as well.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I only added the case in the comment for greater and equal but blocks less than the clock round are essentially ignored by threshold clock
self.context | ||
.metrics | ||
.node_metrics | ||
.block_proposal_leader_wait_count |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we should use a separate metric for counting the number of times leader is not found. block_proposal_leader_wait_count
is tied to block_proposal_leader_wait_ms
, so when the average wait is ~250ms, we know the leader is missing.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think the confusion for me with these metrics is that it doesn't just include leader wait time, it includes the quorum receive wait time which can make this metric a little misleading. Separating them brings more clarity. Though I guess we could always subtract this metric from quorum receive latency.
Description
Adding a few metrics that will help with the smart ancestor selection investigations
block_proposal_leader_wait_ms
metric valuesblock_proposal_leader_wait_count
whenever we hit the case where leaders don't exist during proposalTest plan
private-testnet
Release notes
Check each box that your changes affect. If none of the boxes relate to your changes, release notes aren't required.
For each box you select, include information after the relevant heading that describes the impact of your changes that a user might notice and any actions they must take to implement updates.