prometheus-node-exporter: declare cpu/mem requests and limits #2450

consideRatio · 2023-03-31T18:46:44Z

We've seen the node-exporter pod get evicted because it declares no resource requests or limits, when other pods are the ones that should be evicted to resolve the situation. Let's not allow that to happen!

This relates to Set limits & resource requests on all core pods #90, which is about setting requests and limits for the support chart's pods.

CPU (leap, last 30 days)

Note that the usage dropped at some point. That was me upgrading the GKE-based k8s cluster in #2337.

Memory (leap, last 30 days)

Why I chose these requests and limits

We don't want this pod to starve other pods of CPU if it bursts, but it seems to burst to only ~3m CPU. A limit of 5m CPU seemed fine.
We want typical operation, which seems to be ~0-3m CPU, to be guaranteed CPU-wise. A request of 5m CPU seemed fine.
We want the memory request and limit to be the same, so the pod isn't evicted because of memory use and instead crashes reliably if the value is too low. Memory has peaked at ~21Mi, so I went for 30Mi to be safe. A few hundred Mi of memory are still available, so this shouldn't cause issues with the requests/limits planned in New default machine types and profile list options - sharing nodes is great! #2121.
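
For reference, the choices above might translate into a values fragment like the one below. This is only a sketch: the key path (`prometheus` > `prometheus-node-exporter` > `resources`) is an assumption about how the support chart nests the node-exporter subchart and is not copied from the PR diff. The numbers are the ones discussed above.

```yaml
# Sketch only: the nesting below is assumed, not taken from the diff.
# Check helm-charts/support/values.yaml for the actual structure.
prometheus:
  prometheus-node-exporter:
    resources:
      requests:
        cpu: 5m       # typical use is ~0-3m CPU
        memory: 30Mi  # observed peak is ~21Mi
      limits:
        cpu: 5m       # observed bursts reach only ~3m CPU
        memory: 30Mi  # kept equal to the memory request
```

With requests equal to limits for both CPU and memory on every container, Kubernetes places the pod in the Guaranteed QoS class, which should make it one of the last candidates for eviction under node pressure.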

pnasrat

Minor changes for comments/docs

This generally makes sense though.

helm-charts/support/values.yaml

consideRatio · 2023-03-31T21:32:22Z

Thank you for reviewing @pnasrat! I realized that I had copy-pasted incorrect Prometheus queries, so I drew the wrong conclusions. It turns out this pod peaks at very low CPU (~3m) and memory (~21Mi), so I've updated the queries, screenshots, and requests/limits.

pnasrat

lgtm

consideRatio · 2023-04-04T13:08:39Z

Thank you @pnasrat for reviewing!!

github-actions · 2023-04-04T13:08:56Z

🎉🎉🎉🎉

Monitor the deployment of the hubs here 👉 https://github.com/2i2c-org/infrastructure/actions/runs/4608120591

consideRatio self-assigned this Mar 31, 2023

pnasrat suggested changes Mar 31, 2023


prometheus-node-exporter: declare cpu/mem requests and limits

09c8f12

consideRatio force-pushed the pr/prom-node-exporter-reqs branch from 3839cf7 to 09c8f12 Compare March 31, 2023 21:14

consideRatio requested a review from pnasrat March 31, 2023 21:19

pnasrat approved these changes Apr 4, 2023

consideRatio merged commit 113baf9 into 2i2c-org:master Apr 4, 2023
