kvprober: implement a prober that sends queries to KV in order to generate signal on the healthiness of KV client & below #61074
Labels
A-kv-observability
C-enhancement
Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception)
O-sre
For issues SRE opened or otherwise cares about tracking.
T-kv
KV Team
https://docs.google.com/document/d/1NqsIgizseyMxUBimpE10m6sSnbebQ25VyJRUwxdmJyM/edit
Is your feature request related to a problem? Please describe.
The problem to solve it provide operators & others better signals of the healthiness of kvclient & below.
Operators & others can alert on these signals. CRL can use them to think about how "big" outages are & to understand where in CRDB stack (is KV broken or not?) & keyspace the impact is.
Describe the solution you'd like
Implement a prober that sends queries to KV in order to generate signal on the healthiness of KV client & below. For a POC that does reads only, see #58124.
Describe alternatives you've considered
More discussion in design doc linked above about alternatives.
By 21.2:
Gets
to the start key of a randomly selected ranges in loop: kvprober: introduce a package for probing the KV layer that does reads only #58124timeutil.NewTimer
per comments from Tobias here: kvprober: rate limit the planner #61275Later:
kvprober
pkg/util/log/eventpb
Epic CC-5241
The text was updated successfully, but these errors were encountered: