Update buildkite-metrics to use the agent metrics api #40

sj26 · 2018-04-06T05:24:34Z

This is super rough, but it works.

lox

This is a good reduction in code bulk. Will probably needs some explanation in the README

lox · 2018-04-09T23:08:50Z

Random idea, but what if we didn't pass the agent registration token over the wire and instead used a HMAC'd value with the account_id? Still vulnerable to reply attacks, but at least you can't then use that token to register an agent.

sj26 · 2018-04-10T03:47:46Z

@lox I'd love to look into a better authentication exchange, especially I'd love to homogenize our tokens so that a cluster token can do some limited agent api stuff, and an agent token can do limited rest api stuff, etc. It'd be a little interesting, trying to create tokens which included enough information to do a mac exchange. But I think we're sufficiently protected against replay by TLS for the moment:

https://www.ssllabs.com/ssltest/analyze.html?d=agent.buildkite.com

lox · 2018-04-10T03:59:56Z

I wasn't proposing any sort of key exchange @sj26, just rather than passing in BUILDKITE_AGENT_TOKEN, we'd pass in hmac($BUILDKITE_AGENT_TOKEN, $ACCOUNT_ID) and use that as the secret.

It means we aren't encouraging people to put the BUILDKITE_AGENT_TOKEN in env anywhere.

sj26 · 2018-04-10T05:56:53Z

Right, right — for the agent setup and env, not the request channel. Yeah, that also kinda gels with what I mean. I was proposing opaque authorization tokens which actually pack information, like our graphql IDs, e.g. Authorization: Token base64(pack($TOKEN_UUID+$NONCE+hmac($TOKEN_SECRET,$TOKEN_UUID+$NONCE))), so BK can unpack the uuid to lookup the agent registration token and then validate the hmac — and you could generate that on the buildkite.com side to feed in as an env. I was misinterpreting and thinking the metrics agent would also then perform hmac for the request channel, but that's probably superfluous.

But yeah, step 2.

lox · 2018-04-10T06:00:43Z

Gotcha! That makes sense. Reckon getting it working like it is makes sense for now anyway.

lox · 2018-04-16T05:00:43Z

Tests are looking awesome @sj26

sj26 · 2018-04-16T05:05:54Z

@lox thanks! They're rough, but they work. I think maybe some README updates and this is good for release, with some published caveats that it drops a bunch of metrics from the previous version around builds and historical — this is purely for job/agent workload metrics. Do you know many folks using it beyond the elastic stack?

lox · 2018-04-16T06:04:21Z

Happy to merge with some README changes!

lox · 2018-04-17T04:37:16Z

🚢

Update buildkite-metrics to use the agent metrics api

93aa8d2

sj26 added the wip label Apr 6, 2018

sj26 self-assigned this Apr 6, 2018

sj26 requested a review from lox April 6, 2018 05:24

lox approved these changes Apr 9, 2018

View reviewed changes

sj26 added 2 commits April 10, 2018 16:38

Fix lambda build

1392909

Add basic tests for collector

eb8f5c8

sj26 removed the wip label Apr 17, 2018

Update README for v3

dd5b117

sj26 force-pushed the agent-metrics-api branch from 9b0e4b6 to dd5b117 Compare April 17, 2018 05:57

sj26 merged commit 0b884b1 into master Apr 17, 2018

lox mentioned this pull request Apr 22, 2018

AutoScaling on ScheduledJobsCount has some undesired results #31

Closed

lox mentioned this pull request Apr 29, 2018

Add support for new agent metrics endpoint #15

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update buildkite-metrics to use the agent metrics api #40

Update buildkite-metrics to use the agent metrics api #40

sj26 commented Apr 6, 2018

lox left a comment

lox commented Apr 9, 2018

sj26 commented Apr 10, 2018

lox commented Apr 10, 2018

sj26 commented Apr 10, 2018 •

edited

Loading

lox commented Apr 10, 2018

lox commented Apr 16, 2018

sj26 commented Apr 16, 2018

lox commented Apr 16, 2018

lox commented Apr 17, 2018

Update buildkite-metrics to use the agent metrics api #40

Update buildkite-metrics to use the agent metrics api #40

Conversation

sj26 commented Apr 6, 2018

lox left a comment

Choose a reason for hiding this comment

lox commented Apr 9, 2018

sj26 commented Apr 10, 2018

lox commented Apr 10, 2018

sj26 commented Apr 10, 2018 • edited Loading

lox commented Apr 10, 2018

lox commented Apr 16, 2018

sj26 commented Apr 16, 2018

lox commented Apr 16, 2018

lox commented Apr 17, 2018

sj26 commented Apr 10, 2018 •

edited

Loading