Add time based partitioning to store component #957

claytono · 2019-03-21T14:15:41Z

Changes

Add --min-time and --max-time command-line flags to the store component. With these set, the store component only loads blocks whose start time lies within the given range. The two flags can be given independently; the defaults are effectively "forever ago" and "forever in the future", respectively.
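As a rough illustration of the filtering described above, here is a minimal sketch. The names `blockMeta`, `timeRange`, and `filterBlocks` are hypothetical and are not taken from this PR's diff. It assumes block times are Unix milliseconds and treats both bounds as inclusive, which may differ from the actual implementation.

```go
package main

import (
	"fmt"
	"math"
)

// blockMeta is a hypothetical, trimmed-down stand-in for a block's metadata.
// Times are assumed to be Unix milliseconds.
type blockMeta struct {
	ULID    string
	MinTime int64
	MaxTime int64
}

// timeRange holds the bounds that --min-time and --max-time would set.
type timeRange struct {
	Min int64
	Max int64
}

// defaultTimeRange covers all representable time: "forever ago" through
// "forever in the future".
func defaultTimeRange() timeRange {
	return timeRange{Min: math.MinInt64, Max: math.MaxInt64}
}

// contains reports whether the block's start time lies within [Min, Max].
// Inclusive bounds are an assumption of this sketch.
func (r timeRange) contains(b blockMeta) bool {
	return b.MinTime >= r.Min && b.MinTime <= r.Max
}

// filterBlocks keeps only the blocks whose start time falls inside r.
func filterBlocks(blocks []blockMeta, r timeRange) []blockMeta {
	var out []blockMeta
	for _, b := range blocks {
		if r.contains(b) {
			out = append(out, b)
		}
	}
	return out
}

func main() {
	blocks := []blockMeta{
		{ULID: "a", MinTime: 0, MaxTime: 100},
		{ULID: "b", MinTime: 100, MaxTime: 200},
		{ULID: "c", MinTime: 200, MaxTime: 300},
	}
	r := defaultTimeRange()
	r.Min = 100 // only a minimum is given; the maximum keeps its default
	for _, b := range filterBlocks(blocks, r) {
		fmt.Println(b.ULID)
	}
}
```

Because each bound has its own default, setting only one of them leaves the other side of the range open.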

Verification

I've tested this against a copy of one of our production buckets and I've updated the e2e tests to test this functionality.

cc: #814

bwplotka · 2019-03-21T15:12:23Z

CC @povilasv PTAL

bwplotka

Awesome, will do a more detailed review soon!

bwplotka · 2019-03-21T15:13:41Z

cmd/thanos/flags.go

@@ -145,6 +145,31 @@ func modelDuration(flags *kingpin.FlagClause) *model.Duration {
 	return value
 }

+type flagTime struct {


So I believe this is quite fixed... We essentially need a duration, right? Like now-3months - now-2h style.

Yeah, I agree. I need relative time as well; it would essentially replace the functionality I have in #930.

Yes, that would be amazing!

I think both would be useful. We're intending to use this for horizontal scale out, and we want the ability to specify exact time ranges in order to size the blocks served to the capacity of the thanos store host.

👍 I think both are useful

👍

One question: I don't get why someone would need a specific time. I don't think it makes sense. What's the use case?

We're intending to use this for horizontal scale out, and we want the ability to specify exact time ranges in order to size the blocks served to the capacity of the thanos store host.

@claytono hmm are you sure you want fixed time ranges for that? Why?

We intend to have a handful of Thanos store nodes, each serving a portion of a shared bucket, with one of them having an open ended time range for all new metrics. For now, we intend to have an outside process periodically do analysis of the bucket and generate time ranges for each thanos store process based on index size for each block. We want to provision these nodes such that they're all fairly full from a memory standpoint, but that we're not over-provisioning. For us, the major expense of running Thanos is the memory on compute instances, and the S3 storage is nearly free in comparison.

With metric ingest rates changing over time (new apps, seasonality, etc.) and the activity of the compactor, I think partitioning the bucket time ranges with relative times is going to be error-prone and/or lead to inefficient usage of the hardware.
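To make the planning idea above concrete, here is a minimal sketch of how an outside process might derive absolute time ranges from per-block index sizes. Everything in it is an assumption for illustration: the `block` and `partition` types, the `planPartitions` function, and the greedy strategy are hypothetical and not part of this PR. It assumes blocks have distinct start times and that a single memory budget applies to every store node.

```go
package main

import (
	"fmt"
	"sort"
)

// block is a hypothetical summary of one bucket block.
type block struct {
	MinTime    int64 // block start time, Unix milliseconds
	IndexBytes int64 // index size, used here as a proxy for memory cost
}

// partition is the range of block start times one store node would serve.
type partition struct {
	MinTime    int64
	MaxTime    int64
	IndexBytes int64
}

// planPartitions walks the blocks in order of start time and opens a new
// partition whenever adding the next block would exceed the budget.
// A block larger than the budget still gets a partition of its own.
func planPartitions(blocks []block, budget int64) []partition {
	sorted := append([]block(nil), blocks...)
	sort.Slice(sorted, func(i, j int) bool { return sorted[i].MinTime < sorted[j].MinTime })

	var parts []partition
	for _, b := range sorted {
		n := len(parts)
		if n == 0 || parts[n-1].IndexBytes+b.IndexBytes > budget {
			parts = append(parts, partition{MinTime: b.MinTime, MaxTime: b.MinTime, IndexBytes: b.IndexBytes})
			continue
		}
		parts[n-1].MaxTime = b.MinTime
		parts[n-1].IndexBytes += b.IndexBytes
	}
	return parts
}

func main() {
	blocks := []block{
		{MinTime: 200, IndexBytes: 40},
		{MinTime: 0, IndexBytes: 60},
		{MinTime: 100, IndexBytes: 30},
		{MinTime: 300, IndexBytes: 50},
	}
	for i, p := range planPartitions(blocks, 100) {
		fmt.Printf("store %d: min-time=%d max-time=%d index=%d bytes\n", i, p.MinTime, p.MaxTime, p.IndexBytes)
	}
}
```

In the setup described above, the newest partition would be left without an upper bound so that it picks up all new blocks. Re-running the plan after compaction is what makes absolute ranges attractive here, since block boundaries and sizes shift over time.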

It's kind of odd from my perspective, but if you find this useful, sure (: happy to accept that.

It's what we've come up with for horizontal scaling of the Thanos store nodes. I'd love to hear how other people are managing scaling out.

For us, the major expense of running Thanos is the memory on compute instances

It seems you are trying to solve a separate problem with absolute time ranges.

I'd like to have relative time as well.

povilasv · 2019-04-25T03:56:30Z

FYI, I've continued this work in a different PR: #1077

claytono added 5 commits March 21, 2019 10:12

Add time filter flags for store component

cfaba07

Fixup tests for mintime/maxtime

6135bd3

Add timerange test to bucket e2e test

a40baed

Fix unchecked error

96dbc20

Update for min-time and max-time flags

856697d

claytono force-pushed the time-filters branch from 6919cb5 to 856697d Compare March 21, 2019 14:43

claytono marked this pull request as ready for review March 21, 2019 15:09

bwplotka requested review from GiedriusS and domgreen March 21, 2019 15:12

bwplotka added feature request/improvement component: store labels Mar 21, 2019

bwplotka reviewed Mar 21, 2019


povilasv mentioned this pull request Mar 22, 2019

WIP store: Add --skip-window functionality #930

Closed

povilasv mentioned this pull request Mar 29, 2019

[feature request] store: distributed queries against objstore backends #992

Closed

povilasv closed this Apr 25, 2019
