v2: selfReporting generates incomplete metric grafana_kubernetes_monitoring_build_info #875

Open
marcomusso opened this issue Nov 8, 2024 · 5 comments

Comments

@marcomusso

Starting from a vanilla values file, I enabled selfReporting to ensure all metrics checks are green in Grafana Cloud (Home => Infrastructure => Kubernetes => Configuration => Metrics status).

I defined 3 destinations (prometheus, loki and otlp) and deployed.
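
Roughly, the values I used look like this (the destination names and URLs are placeholders, and the keys are from memory, so they may not match the chart's values.yaml exactly):

  cluster:
    name: my-cluster

  destinations:
    - name: metricsService
      type: prometheus
      url: https://prometheus.example.com/api/prom/push
    - name: logsService
      type: loki
      url: https://loki.example.com/loki/api/v1/push
    - name: otlpService
      type: otlp
      url: https://otlp.example.com/otlp

  selfReporting:
    enabled: true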

In the resulting Alloy configs, only the singleton shows the reporting blocks added by the chart templates (which is probably correct since it's the $chosenCollector, even though I would expect the receiver to have higher priority if the "list" is ordered), and it specifically generates this metric:

# TYPE grafana_kubernetes_monitoring_build_info gauge
grafana_kubernetes_monitoring_build_info{version="2.0.0-rc.2", namespace="grafana-k8s-monitoring-v2", platform=""} 1

which lacks the cluster label (i.e. it isn't added even in later relabeling blocks), thus failing the check:

[Screenshot: failing Metrics status check in Grafana Cloud]

Here we can query for that metric and see its labels:

[Screenshot: querying the metric and inspecting its labels]

Please note: the cluster label is missing from all grafana_kubernetes.* metrics; is that expected/correct?

As a side note: the selfReporting.scrapeInterval key can be set, but it's overridden by the global one, so I don't see the point in setting it (and the comment/description in the values file doesn't help clarify how to use it).
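
For reference, this is roughly how I set it (assuming the key takes a duration string like the other interval settings):

  selfReporting:
    enabled: true
    scrapeInterval: 5m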

@petewall
Collaborator

petewall commented Nov 8, 2024

The scrape interval for self-reporting isn't overridden by the global one:

  scrape_interval = {{ .Values.selfReporting.scrapeInterval | default "1h" | quote}}

If you're seeing otherwise, let me know.

The cluster label should be set by the destination, not the data source (self-reporting). I'll investigate why it isn't showing up here.
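
In other words, once the destination attaches it, the sample should end up looking roughly like this (the cluster value below is just a placeholder):

  grafana_kubernetes_monitoring_build_info{cluster="my-cluster", version="2.0.0-rc.2", namespace="grafana-k8s-monitoring-v2", platform=""} 1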

@marcomusso
Author

marcomusso commented Nov 8, 2024

The fact is (what I tried): I set that scrape interval to 5m and yet got samples every minute; that's why I said it was overridden (maybe a poor choice of words; I didn't investigate much, but trusted the comment in the values file).

@petewall
Collaborator

petewall commented Nov 8, 2024

Cool. I'll try it out

@marcomusso
Author

marcomusso commented Nov 8, 2024

BTW: the cluster label is now present in all grafana_kubernetes_* metrics. It probably wasn't added as an "external label" before because it wasn't being sent to a prometheus destination (if I read that line correctly)?

PS: is it wrong to assume that an OTLP-only destination should be able to carry/relabel everything correctly?

@petewall
Collaborator

petewall commented Nov 8, 2024

Yeah, I just fixed an issue where it would request prometheus-ecosystem metrics destinations, but then try to use otlp-ecosystem metrics destinations. I fixed it to be consistent (prefer prometheus ecosystem).

I also need to fix the otlp destination to set cluster as well as k8s.cluster.name, which matches the behavior of the loki and prometheus destinations.
