[dbnode] Emit aggregate usage metrics #3123

arnikola · 2021-01-26T16:19:15Z

Emits additional aggregate metrics around fields and terms used; also reduces change that made aggregate query doc limit too aggressive

Also makes aggregate docs limit less aggressive

wesleyk

are there any tests we can update to ensure the global limit revert works as expected?

wesleyk · 2021-01-26T16:25:52Z

src/dbnode/storage/index/block_test.go

@@ -1933,6 +1947,12 @@ func TestBlockAggregate(t *testing.T) {
 	spans := mtr.FinishedSpans()
 	require.Len(t, spans, 2)
 	require.Equal(t, tracepoint.BlockAggregate, spans[0].OperationName)
+
+	for _, v := range scope.Snapshot().Counters() {


can use tallytest.AssertCounterValue

👍 nice, did not know about this one

wesleyk · 2021-01-26T16:29:14Z

src/dbnode/storage/index/aggregate_results.go

+func (*noopCounter) Inc(_ int64) {}
+
+func newUsageMetrics(ns ident.ID, iOpts instrument.Options) usageMetrics {
+	if ns == nil {


hmm can we have a catch-all metric tag for these instead?

I was thinking that, but figured that potentially having a bunch of metrics under namespace="undefined" would end up kinda confusing and not add much additional information; can add it if you think it would be worth having though 👍

gotcha, when is ns not set? Mostly curious if we want to account for these or not

Mostly when we clear out or finalize the results, or if incorrectly initializing when taking an AggregateResults out of the pool, also happened a lot in tests so had to have some sensible defaults for it without updating all the tests.

i'd rather not drop metrics. I don't think an undefined namespace is that confusing. it would allow you to sum across the metrics without namespace, which is probably the common case.

…kola/limit-metrics

wesleyk · 2021-01-26T18:24:43Z

src/dbnode/storage/index/aggregate_results.go

+	dedupedFields tally.Counter
+}
+
+func newUsageMetrics(ns ident.ID, iOpts instrument.Options) usageMetrics {


do we need to reset this with every aggregate result? Is the namespace important enough?

codecov · 2021-01-27T14:24:22Z

Codecov Report

Merging #3123 (5c1b8d7) into master (827796c) will decrease coverage by 0.0%.
The diff coverage is 91.8%.

@@            Coverage Diff            @@
##           master    #3123     +/-   ##
=========================================
- Coverage    72.2%    72.2%   -0.1%     
=========================================
  Files        1084     1084             
  Lines      100219   100251     +32     
=========================================
+ Hits        72397    72420     +23     
- Misses      22774    22781      +7     
- Partials     5048     5050      +2

Flag	Coverage Δ
aggregator	`75.8% <ø> (ø)`
cluster	`84.8% <ø> (-0.1%)`	⬇️
collector	`84.3% <ø> (ø)`
dbnode	`78.6% <91.8%> (-0.1%)`	⬇️
m3em	`74.4% <ø> (ø)`
m3ninx	`73.1% <ø> (ø)`
metrics	`20.0% <ø> (ø)`
msg	`74.1% <ø> (-0.2%)`	⬇️
query	`67.2% <ø> (ø)`
x	`80.3% <ø> (+<0.1%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 827796c...4561410. Read the comment docs.

ryanhall07 · 2021-01-27T14:39:17Z

src/dbnode/storage/index/aggregate_results.go

+func (*noopCounter) Inc(_ int64) {}
+
+func newUsageMetrics(ns ident.ID, iOpts instrument.Options) usageMetrics {
+	if ns == nil {


i'd rather not drop metrics. I don't think an undefined namespace is that confusing. it would allow you to sum across the metrics without namespace, which is probably the common case.

ryanhall07 · 2021-01-27T14:41:57Z

src/dbnode/storage/index/aggregate_results.go

+	// will have one field.
+	totalCount := len(batch)
+	for idx := 0; idx < len(batch); idx++ {
+		totalCount += len(batch[idx].Terms)


how is total different than total terms?

Removed this

ryanhall07

i'd personally drop the total, or make term/field a label so you can sum across. not a fan of composite metrics that might skew in the future.

eb5dfac

* master: [dbnode] Add aggregate term limit regression test (#3135) [DOCS] Adding Prometheus steps to quickstart (#3043) [dbnode] Revert AggregateQuery changes (#3133) Fix TestSessionFetchIDs flaky test (#3132) [dbnode] Alter multi-segments builder to order by size before processing (#3128) [dbnode] Emit aggregate usage metrics (#3123) [dbnode] Add Shard.OpenStreamingReader method (#3119) [dtests] Docker tests integration with docker-compose (#3031) [dbnode] Comments / remove unused var (#3124) [query] Handle context.Canceled and map to 499 http status (#3069) [dbnode] Use StreamingReadMetadata for bootstrapping (#2938) [dbnode] Use DefaultTestOptions in test code (#3113) # Conflicts: # src/dbnode/storage/bootstrap/bootstrapper/fs/source.go

* master: [dtest] endpoint to fetch tagged (#3138) Refactor FetchTagged to return an Iterator of results (#3141) [dbnode] Add aggregate term limit regression test (#3135) [DOCS] Adding Prometheus steps to quickstart (#3043) [dbnode] Revert AggregateQuery changes (#3133) Fix TestSessionFetchIDs flaky test (#3132) [dbnode] Alter multi-segments builder to order by size before processing (#3128) [dbnode] Emit aggregate usage metrics (#3123) [dbnode] Add Shard.OpenStreamingReader method (#3119)

arnikola added 2 commits January 26, 2021 11:18

[dbnode] Emit aggregate usage metrics

eccfccf

Also makes aggregate docs limit less aggressive

Merge branch 'master' into arnikola/limit-metrics

5055182

wesleyk reviewed Jan 26, 2021

View reviewed changes

arnikola added 3 commits January 26, 2021 12:21

PR + lint

ce2a4a3

Merge branch 'arnikola/limit-metrics' of github.com:m3db/m3 into arni…

abfc590

…kola/limit-metrics

Delete unused type

56de59b

wesleyk approved these changes Jan 26, 2021

View reviewed changes

wesleyk reviewed Jan 26, 2021

View reviewed changes

arnikola added 6 commits January 26, 2021 14:03

PR response

30a9fba

Lint + response

d8962fa

Merge branch 'master' into arnikola/limit-metrics

f46d2f0

Revert client mock

da2fb32

Lint

5c1b8d7

Merge branch 'master' into arnikola/limit-metrics

9fd9a18

rallen090 approved these changes Jan 27, 2021

View reviewed changes

ryanhall07 reviewed Jan 27, 2021

View reviewed changes

ryanhall07 approved these changes Jan 27, 2021

View reviewed changes

Merge branch 'master' into arnikola/limit-metrics

4561410

wesleyk merged commit eb5dfac into master Jan 27, 2021

wesleyk deleted the arnikola/limit-metrics branch January 27, 2021 19:36

arnikola added a commit that referenced this pull request Jan 28, 2021

Revert "[dbnode] Emit aggregate usage metrics (#3123)"

033ef9c

eb5dfac

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[dbnode] Emit aggregate usage metrics #3123

[dbnode] Emit aggregate usage metrics #3123

arnikola commented Jan 26, 2021

wesleyk left a comment

wesleyk Jan 26, 2021

arnikola Jan 26, 2021

wesleyk Jan 26, 2021

arnikola Jan 26, 2021

wesleyk Jan 26, 2021

arnikola Jan 26, 2021

ryanhall07 Jan 27, 2021

wesleyk Jan 26, 2021

codecov bot commented Jan 27, 2021 •

edited

Loading

ryanhall07 Jan 27, 2021

ryanhall07 Jan 27, 2021

arnikola Jan 27, 2021

ryanhall07 left a comment

[dbnode] Emit aggregate usage metrics #3123

[dbnode] Emit aggregate usage metrics #3123

Conversation

arnikola commented Jan 26, 2021

wesleyk left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codecov bot commented Jan 27, 2021 • edited Loading

Codecov Report

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ryanhall07 left a comment

Choose a reason for hiding this comment

codecov bot commented Jan 27, 2021 •

edited

Loading