Add pint check to disallow `topk` or `bottomk` in recording rules #820

wbollock · 2023-12-15T21:46:08Z

I believe a new check to warn users about the use of topk or bottomk in recording rules would be useful, as those rules will typically churn quite often and create many new time series.

This is fairly opinionated, and how much it matters could vary a lot depending on the nature of the metric being aggregated, so I'm interested in discussion. I definitely think it fits as a non-default rule.
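To make the churn concern concrete, here is a toy simulation, not anything taken from pint or Prometheus. The sample values, the `instance` label, and the helper names `topk` and `recorded_series` are all made up for illustration. It counts how many distinct series a recording rule would end up writing across a few evaluations.

```python
# Toy model: each evaluation maps an `instance` label value to a sample value
# for a hypothetical metric `foo`. All numbers here are invented.
EVALUATIONS = [
    {"a": 10, "b": 9, "c": 1, "d": 2},
    {"a": 3, "b": 9, "c": 8, "d": 2},
    {"a": 3, "b": 1, "c": 8, "d": 7},
]


def topk(k, samples):
    """Return the k (instance, value) pairs with the highest values."""
    return sorted(samples.items(), key=lambda kv: kv[1], reverse=True)[:k]


def recorded_series(evaluations, k, strip_labels=False):
    """Collect the distinct label sets a recording rule would have written.

    strip_labels=False models recording `topk(k, foo)`, which keeps labels.
    strip_labels=True models recording `min(topk(k, foo))`, which drops them.
    """
    seen = set()
    for samples in evaluations:
        picked = topk(k, samples)
        if strip_labels:
            if picked:
                seen.add(())  # one output series with an empty label set
        else:
            for instance, _ in picked:
                seen.add((("instance", instance),))
    return seen


print(len(recorded_series(EVALUATIONS, 2)))                     # 4
print(len(recorded_series(EVALUATIONS, 2, strip_labels=True)))  # 1
```

In this made-up data only two series are selected at any one time, but four different series get written over three evaluations. Whether that matters in practice depends on how quickly the ranking of the real metric changes.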

wbollock · 2024-04-29T20:07:55Z

cc: @prymitive, let me know what you think. Happy to contribute this.

This adds a new check that warns against using `topk` or `bottomk` in recording rules. This is an anti-pattern: the set of time series selected by `topk`/`bottomk` changes as the underlying values change, so the recording rule keeps generating new series, which leads to high churn. The check is enabled by default with warning severity and only fires for recording rules, not alerting rules. Resolves cloudflare#820

prymitive · 2024-06-04T09:46:34Z

As mentioned in #985 (comment) I don't believe that just throwing warnings every time someone uses topk is a valid approach.
Churn in recording rules might be undesired but there's nothing fundamentally wrong with it.
With alerting rules it's different, I agree that if someone has a rule like:

- alert: ...
  expr: topk(10, foo)

then it is likely to cause flapping alerts, since topk() might return different time series on each evaluation. But one can stabilise the results by stripping churning labels, for example by stripping all labels and reporting min or max value: min(topk(10, foo)).
So this would need somewhat more advanced logic: find only the queries that "leak" labels from topk() into the final results with no constraints, and that don't use for: or keep_firing_for: to avoid churn.
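A rough sketch of what that kind of logic could look like is below. It is not pint's implementation and it does not parse PromQL. It works on a tiny hand-built expression tree, and every name in it (`Selector`, `Aggregation`, `keeps_labels`, `leaks_sampled_labels`, `should_warn`) is hypothetical. The only point is the idea of tracking whether labels picked by topk()/bottomk() survive into the final result.

```python
from dataclasses import dataclass
from typing import Tuple, Union


@dataclass
class Selector:
    """A plain series selector such as `foo`."""
    name: str


@dataclass
class Aggregation:
    """An aggregation such as `min(...)`, `sum by (job) (...)` or `topk(10, ...)`.

    The `k` parameter of topk/bottomk is omitted: it does not affect label flow.
    """
    op: str
    expr: "Node"
    by: Tuple[str, ...] = ()
    without: Tuple[str, ...] = ()


Node = Union[Selector, Aggregation]

SAMPLING_OPS = {"topk", "bottomk"}


def keeps_labels(node: Node) -> bool:
    """True if the result of `node` can still carry labels."""
    if isinstance(node, Selector):
        return True
    if node.op in SAMPLING_OPS:
        # topk/bottomk pass through the labels of the series they pick.
        return keeps_labels(node.expr)
    # Other aggregations keep labels only with by (...) or without (...).
    return bool(node.by or node.without)


def leaks_sampled_labels(node: Node) -> bool:
    """True if labels chosen by topk/bottomk can reach the final result."""
    if isinstance(node, Selector):
        return False
    if node.op in SAMPLING_OPS:
        return keeps_labels(node.expr)
    # e.g. min(topk(10, foo)) strips every label, so nothing leaks.
    return bool(node.by or node.without) and leaks_sampled_labels(node.expr)


def should_warn(node: Node, has_for: bool = False,
                has_keep_firing_for: bool = False) -> bool:
    """Warn only when labels leak and the rule sets neither for: nor keep_firing_for:."""
    return leaks_sampled_labels(node) and not (has_for or has_keep_firing_for)


topk_foo = Aggregation("topk", Selector("foo"))
print(should_warn(topk_foo))                      # True
print(should_warn(Aggregation("min", topk_foo)))  # False
print(should_warn(topk_foo, has_for=True))        # False
```

This treats any by (...) or without (...) around topk() as a potential leak, which is deliberately conservative. A real check would need the actual PromQL AST and would have to handle binary operators, functions and label joins as well.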

prymitive · 2024-10-22T12:27:10Z

I need a function that gives me the source of labels in a query and this check is a good use case for testing it - so expect it to be added in the next release.

wbollock mentioned this issue May 24, 2024

feat: topk check #985

Closed

prymitive added a commit that referenced this issue Oct 22, 2024

Warn about topk, bottomk and other sampling functions

e45f978

Fixes #820.

prymitive mentioned this issue Oct 22, 2024

Warn about topk, bottomk and other sampling functions #1157

Merged

prymitive added a commit that referenced this issue Oct 22, 2024

Warn about topk, bottomk and other sampling functions

b4490ca

Fixes #820.

prymitive closed this as completed in #1157 Oct 22, 2024
