Add relation and expose metrics from tensorboard-controller #131

Open
rgildein opened this issue Aug 7, 2024 · 1 comment
Labels
enhancement New feature or request

Comments

@rgildein
Contributor

rgildein commented Aug 7, 2024

Context

Implement a metrics-endpoint for the tensorboard-controller once #7633 is fixed upstream and the fix is included in a release (e.g. 1.10).

We need to add a relation using the prometheus_scrape interface and, if any dashboard is provided, a second relation using the grafana_dashboard interface.

What needs to get done

  1. Expose the metrics port as a ServicePort
  2. Implement the metrics-endpoint endpoint
  3. [Optional] Implement the grafana-dashboard endpoint
  4. [Optional] Find and add alert rules from upstream for the metrics
  5. [Optional] Find and add Grafana dashboards from upstream for the metrics
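
As a rough illustration of steps 1 to 3, the sketch below builds the plain data a charm might hand to the Kubernetes service definition and to the COS integration libraries. It is only a sketch under assumptions: the port `8080`, the path `/metrics`, and the helper names `build_service_port` and `build_scrape_jobs` are hypothetical and not taken from the charm or from upstream; the real values depend on how #7633 is resolved. The scrape job uses the `static_configs` / `targets` layout with a `*:<port>` wildcard target, which is the shape commonly passed to a Prometheus scrape provider.

```python
from typing import Dict, List

# Assumption: placeholder values, the real port and path depend on the upstream fix.
METRICS_PORT = 8080
METRICS_PATH = "/metrics"

# Relation endpoints named in this issue: endpoint name -> interface.
# This mirrors what a `provides` section in the charm metadata could declare.
PROVIDES: Dict[str, Dict[str, str]] = {
    "metrics-endpoint": {"interface": "prometheus_scrape"},
    "grafana-dashboard": {"interface": "grafana_dashboard"},
}


def build_service_port(port: int = METRICS_PORT) -> Dict[str, object]:
    """Return the fields of a Kubernetes ServicePort exposing the metrics port."""
    return {"name": "metrics", "port": port, "targetPort": port, "protocol": "TCP"}


def build_scrape_jobs(
    port: int = METRICS_PORT, path: str = METRICS_PATH
) -> List[Dict[str, object]]:
    """Return a single scrape job that targets every unit on the metrics port."""
    return [
        {
            "metrics_path": path,
            "static_configs": [{"targets": [f"*:{port}"]}],
        }
    ]


if __name__ == "__main__":
    print(build_service_port())
    print(build_scrape_jobs())
```

Keeping the port in one constant means the ServicePort and the scrape job cannot drift apart once the real value is known.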

Definition of Done

  1. tensorboard-controller has metrics-endpoint and grafana-dashboard endpoints
  2. Integration with COS is tested
@rgildein rgildein added the enhancement New feature or request label Aug 7, 2024

Thank you for reporting your feedback to us!

The internal ticket has been created: https://warthogs.atlassian.net/browse/KF-6107.

This message was autogenerated
