-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Errant GTID Counts metric in VTOrc #16829
base: main
Are you sure you want to change the base?
Conversation
Signed-off-by: Manan Gupta <[email protected]>
Signed-off-by: Manan Gupta <[email protected]>
Signed-off-by: Manan Gupta <[email protected]>
Review ChecklistHello reviewers! 👋 Please follow this checklist when reviewing this Pull Request. General
Tests
Documentation
New flags
If a workflow is added or modified:
Backward compatibility
|
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #16829 +/- ##
==========================================
+ Coverage 69.51% 69.53% +0.02%
==========================================
Files 1569 1569
Lines 202517 202531 +14
==========================================
+ Hits 140780 140833 +53
+ Misses 61737 61698 -39 ☔ View full report in Codecov by Sentry. |
@@ -137,3 +138,6 @@ Currently many of the configuration options for VReplication Workflows are vttab | |||
requires restarts of vttablets. We now allow these to be overridden while creating a workflow or dynamically once | |||
the workflow is in progress. See https://github.com/vitessio/vitess/pull/16583 for details. | |||
|
|||
### <a id="errant-gtid-metric"/>Errant GTIDs Count Metric | |||
A new metric called `ErrantGTIDCounts` has been added to the `VTOrc` component. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A count[er] stat is an ever increasing metric that indicates the number of times X has occurred. So I would assume this is the number of times it's had an errant GTID. I would suggest calling this ~ CurrentErrantGTIDCount
(which is a gauge vs a counter).
@@ -61,6 +61,7 @@ var forgetAliases *cache.Cache | |||
var ( | |||
readTopologyInstanceCounter = stats.NewCounter("InstanceReadTopology", "Number of times an instance was read from the topology") | |||
readInstanceCounter = stats.NewCounter("InstanceRead", "Number of times an instance was read") | |||
errantGTIDCounts = stats.NewGaugesWithSingleLabel("ErrantGTIDCounts", "Number of errant GTIDs in a vttablet", "TabletAlias") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
errantGTIDCounts = stats.NewGaugesWithSingleLabel("ErrantGTIDCounts", "Number of errant GTIDs in a vttablet", "TabletAlias") | |
errantGTIDCounts = stats.NewGaugesWithSingleLabel("ErrantGTIDCounts", "Number of errant GTIDs a vttablet currently has", "TabletAlias") |
Description
As pointed out in #16828, it would be a good idea to also publish the number of errant GTIDs in each tablet as a metric. This PR accomplishes this goal.
The new metric looks like -
Related Issue(s)
Checklist
Deployment Notes