Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding prometheus dashboard support #1

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open

Conversation

timkpaine
Copy link
Member

@timkpaine timkpaine commented Apr 15, 2024

Adding an option to periodically scrape the ray metrics actor, and publish to appropriate perspective tables.

I tested this mostly low scale (2-5 second scraping logic, running for a few hours, adding unit test, that kind of thing). It's possible we want this to be more of an "experimental" feature, and/or restrict the tables and metrics we actually care to scrape/publish, but I think this is cool enough for now that we should publish this as an option.

leaving for a few hours, my local cluster cpu/memory was largely the same throughout, and looked like this:
image
which also seems to reflect closely the default option for our cluster, which looks like this:

image

(Note this is all running on my local, head node)

With minimal effort, we suddenly get access to all these (and more!) perspective metrics:

image

and the metrics are pretty expansive,

image

@timkpaine timkpaine force-pushed the tbg/prometheus branch 2 times, most recently from 8dc5377 to 67493ec Compare April 15, 2024 15:47
Copy link

github-actions bot commented Apr 15, 2024

Test Results

3 tests  +1   3 ✅ +1   1m 23s ⏱️ +38s
1 suites ±0   0 💤 ±0 
1 files   ±0   0 ❌ ±0 

Results for commit ad17147. ± Comparison against base commit f689ff8.

♻️ This comment has been updated with latest results.

@gauglertodd gauglertodd linked an issue May 6, 2024 that may be closed by this pull request
@timkpaine timkpaine added the type: feature Feature requests label Jun 11, 2024
@gauglertodd gauglertodd force-pushed the tbg/prometheus branch 7 times, most recently from 46f55f9 to 1abc30b Compare September 9, 2024 16:26
Co-authored-by: Todd Gaugler <[email protected]>
Signed-off-by: Tim Paine <[email protected]>

Adding prometheus dashboard support

Co-authored-by: Todd Gaugler <[email protected]>
Signed-off-by: Tim Paine <[email protected]>

.

.

...

...

...

...

...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type: feature Feature requests
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add Support for Prometheus Metrics
1 participant