tikvclient: add metrics for gRPC connection transient failure (#12084) #12093

Merged
merged 1 commit into from
Sep 9, 2019

@lonng (Contributor) commented Sep 9, 2019

Signed-off-by: Lonng [email protected]

Cherry-pick #12084 for release-3.0


What problem does this PR solve?

TiDB accesses TiKV through gRPC requests. If the underlying socket is disconnected, gRPC tries to reconnect, which may cause request latency to jitter. We need a way to monitor state changes of the low-level socket.

What is changed and how it works?

This PR adds a metric to monitor the gRPC connection state. The metric records the connection state before each request is sent to TiKV. Once this PR is merged, we can diagnose latency jitter with rate(tidb_grpc_connection_state).
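
For readers who want a picture of the mechanism described above, here is a minimal sketch of the idea, not the code in this PR. It assumes the check is "inspect the connection state just before sending, and bump a counter when the connection is in transient failure". To stay runnable with the standard library only, `ConnState`, `stateCounter`, `conn`, and `sendRequest` are hypothetical stand-ins for the gRPC connectivity state, a labeled Prometheus counter, the client connection, and the request path.

```go
package main

import (
	"fmt"
	"sync"
)

// ConnState is a hypothetical stand-in for a gRPC connectivity state.
type ConnState int

const (
	Idle ConnState = iota
	Connecting
	Ready
	TransientFailure
	Shutdown
)

// stateCounter is a hypothetical stand-in for a counter metric labeled by
// store address. A real client would register a Prometheus counter instead.
type stateCounter struct {
	mu     sync.Mutex
	counts map[string]int
}

func newStateCounter() *stateCounter {
	return &stateCounter{counts: make(map[string]int)}
}

// Inc adds one observation for the given address.
func (c *stateCounter) Inc(addr string) {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.counts[addr]++
}

// Get returns the number of observations recorded for the given address.
func (c *stateCounter) Get(addr string) int {
	c.mu.Lock()
	defer c.mu.Unlock()
	return c.counts[addr]
}

// conn is a hypothetical connection that exposes its current state.
type conn struct {
	addr  string
	state ConnState
}

// sendRequest records a transient failure, if one is in progress, before
// handing the request to the send callback. The request is still sent;
// the counter only makes the reconnect visible to monitoring.
func sendRequest(c *conn, m *stateCounter, send func() error) error {
	if c.state == TransientFailure {
		m.Inc(c.addr)
	}
	return send()
}

func main() {
	m := newStateCounter()
	c := &conn{addr: "tikv-1:20160", state: Ready}
	noop := func() error { return nil }

	_ = sendRequest(c, m, noop) // healthy connection: nothing recorded
	c.state = TransientFailure
	_ = sendRequest(c, m, noop) // recorded
	_ = sendRequest(c, m, noop) // recorded

	fmt.Println(m.Get("tikv-1:20160"))
}
```

Because the counter only grows while requests hit a connection that is reconnecting, its rate over time lines up with the latency jitter the PR description aims to diagnose.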

Check List

Tests

  • No code
  • Manual test
    image

Related changes

  • Need to cherry-pick to the release branch

Release note

  • Write release note for bug-fix or new feature.

@crazycs520 (Contributor) left a comment

LGTM

@winkyao (Contributor) left a comment

LGTM

@winkyao winkyao added the status/can-merge Indicates a PR has been approved by a committer. label Sep 9, 2019
@sre-bot (Contributor) commented Sep 9, 2019

Your auto merge job has been accepted, waiting for #12092, #12038

@sre-bot (Contributor) commented Sep 9, 2019

/run-all-tests

@sre-bot sre-bot merged commit eb62dae into pingcap:release-3.0 Sep 9, 2019
Labels
status/can-merge Indicates a PR has been approved by a committer.
4 participants