Explore Vector as a log and metric exporter #2018

Open
achimnol opened this issue Apr 12, 2024 · 0 comments
Labels
comp:common Related to Common component urgency:2 With time limit, it should be finished within it; otherwise, resolve it when no other chores.
Comments


achimnol commented Apr 12, 2024

Vector (made by Datadog) is a high-performance observability pipeline tool that can collect and process events, logs, and metrics. It's written in Rust, and some of my friends at big tech companies report that it works well in very large-scale deployments.

https://vector.dev/docs/about/concepts/

Let's investigate how we can integrate this into our platform, particularly at on-premise customer sites. Since there are multiple "vector sink" implementations, we could attach Datadog in the cloud while attaching other open-source dashboards in on-prem setups.
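
To make the "different sinks per deployment" idea concrete, here is a rough sketch that builds a Vector configuration as a Python dict and dumps it as JSON (Vector accepts TOML, YAML, and JSON configs). Everything specific is an assumption for illustration: the `build_vector_config` helper, the log path, the Loki URL, and the label values are hypothetical, and the option names for the `file` source and the `datadog_logs` and `loki` sinks should be checked against the Vector reference for the version we end up using.

```python
import json


def build_vector_config(deployment: str) -> dict:
    """Return a Vector config dict for a given deployment flavor (hypothetical helper)."""
    config: dict = {
        "sources": {
            # Assumption: service daemons write plain log files under this path.
            "daemon_logs": {
                "type": "file",
                "include": ["/var/log/backend.ai/*.log"],
            },
        },
        "sinks": {},
    }
    if deployment == "cloud":
        # Ship logs to Datadog; the API key is taken from the environment by Vector.
        config["sinks"]["datadog"] = {
            "type": "datadog_logs",
            "inputs": ["daemon_logs"],
            "default_api_key": "${DATADOG_API_KEY}",
        }
    elif deployment == "on-prem":
        # Ship logs to a self-hosted Grafana Loki (hypothetical endpoint).
        config["sinks"]["loki"] = {
            "type": "loki",
            "inputs": ["daemon_logs"],
            "endpoint": "http://loki.internal:3100",
            "labels": {"source": "backend.ai"},
            "encoding": {"codec": "json"},
        }
    else:
        raise ValueError(f"unknown deployment flavor: {deployment!r}")
    return config


if __name__ == "__main__":
    print(json.dumps(build_vector_config("on-prem"), indent=2))
```

The point is only that the source side stays identical while the sink block is swapped per site, so the same daemons could report to Datadog in the cloud and to an open-source stack on-prem.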

  • We could provide some "vector source" configuration samples (e.g., capturing the stderr stream of service daemons).
  • We could write additional logging backends in ai.backend.common.logging to feed structured logs to the vector pipeline directly.
  • We could extend our stats subsystem in ai.backend.agent.stats to feed metrics to the vector pipeline. If we write our own Prometheus exporter interfaces, we could reuse them as vector sources.
  • We could extend #1639 ("Add a init/health check for mountpoints") and our event bus to feed health check events to the vector pipeline.
  • We could provide some "vector transform" configuration samples to detect anomalies in the manager and agents.
  • We could update our internal dashboard project to take the role of "vector sinks" to accept and display the processed observability data.
  • Q: Could we use Graylog as a sink? What about Grafana Loki as the log viewer?
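
For the logging-backend item in the list above, a minimal sketch of what such a backend might look like, using only the standard library. The class names `JSONLineFormatter` and `VectorSocketHandler`, the host, and the port are all hypothetical; nothing like this exists in ai.backend.common.logging today. It assumes Vector is running a `socket` source in TCP mode that decodes newline-delimited JSON, which would need to be confirmed against the Vector docs.

```python
import json
import logging
import socket
from datetime import datetime, timezone
from typing import Optional


class JSONLineFormatter(logging.Formatter):
    """Serialize a LogRecord as one single-line JSON object."""

    def format(self, record: logging.LogRecord) -> str:
        payload = {
            "timestamp": datetime.fromtimestamp(
                record.created, tz=timezone.utc
            ).isoformat(),
            "level": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
        }
        if record.exc_info:
            payload["exception"] = self.formatException(record.exc_info)
        return json.dumps(payload, ensure_ascii=False)


class VectorSocketHandler(logging.Handler):
    """Hypothetical handler: sends one JSON object per line over TCP."""

    def __init__(self, host: str = "127.0.0.1", port: int = 9000) -> None:
        super().__init__()
        self._addr = (host, port)  # assumed address of a Vector socket source
        self._sock: Optional[socket.socket] = None
        self.setFormatter(JSONLineFormatter())

    def _close_socket(self) -> None:
        if self._sock is not None:
            try:
                self._sock.close()
            finally:
                self._sock = None

    def emit(self, record: logging.LogRecord) -> None:
        try:
            if self._sock is None:
                # Connect lazily so that creating the handler never blocks.
                self._sock = socket.create_connection(self._addr, timeout=1.0)
            line = self.format(record).encode("utf-8") + b"\n"
            self._sock.sendall(line)
        except OSError:
            # Drop the broken connection; the next emit() will reconnect.
            self._close_socket()
            self.handleError(record)

    def close(self) -> None:
        self._close_socket()
        super().close()
```

Usage would be something like `logging.getLogger("ai.backend").addHandler(VectorSocketHandler())`. A real backend would probably need buffering and non-blocking sends so that a slow or absent Vector instance cannot stall the daemons; this sketch skips that.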
[Image, taken from https://vector.dev/docs/setup/deployment/]
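
On the metrics item in the list above (reusing a Prometheus exporter interface as a vector source), a small sketch of rendering gauges in the Prometheus text exposition format. The function name `render_prometheus_text` and the metric and label names are made up for illustration and are not part of ai.backend.agent.stats. The assumption is that Vector would pull this output over HTTP with its Prometheus scrape source, which needs checking against the Vector docs.

```python
from typing import Mapping


def _escape_label_value(value: str) -> str:
    """Escape backslash, double quote, and newline as the text format requires."""
    return value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def render_prometheus_text(
    metrics: Mapping[str, float],
    labels: Mapping[str, str],
) -> str:
    """Render gauge metrics sharing one label set (hypothetical helper)."""
    label_str = ",".join(
        f'{key}="{_escape_label_value(val)}"' for key, val in sorted(labels.items())
    )
    lines = []
    for name in sorted(metrics):
        lines.append(f"# TYPE {name} gauge")
        if label_str:
            lines.append(f"{name}{{{label_str}}} {metrics[name]}")
        else:
            lines.append(f"{name} {metrics[name]}")
    # The exposition format expects the final line to end with a newline.
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(
        render_prometheus_text(
            {"backendai_agent_cpu_util": 0.5},
            {"agent_id": "i-001"},
        ),
        end="",
    )
```

Serving this text from a small HTTP endpoint in the agent would let both a regular Prometheus server and Vector consume the same interface.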

@achimnol achimnol added type:feature Add new features comp:common Related to Common component urgency:2 With time limit, it should be finished within it; otherwise, resolve it when no other chores. labels Apr 12, 2024
@achimnol achimnol added this to the 24.09 milestone Apr 12, 2024
@achimnol achimnol removed the type:feature Add new features label Oct 18, 2024
@achimnol achimnol changed the title Support Vector as a log and metric exporter Explore Vector as a log and metric exporter Oct 31, 2024

4 participants