[dbnode] Adaptive WriteBatch allocations #3429
Conversation
Codecov Report
@@ Coverage Diff @@
## master #3429 +/- ##
=========================================
+ Coverage 72.1% 72.3% +0.2%
=========================================
Files 1100 1100
Lines 103565 102612 -953
=========================================
- Hits 74734 74268 -466
+ Misses 23664 23234 -430
+ Partials 5167 5110 -57
// defaultWritePoolMaxBatchSize is the default maximum size for a writeBatch that the pool
// will allow to remain in the pool. Any batches larger than that will be discarded to prevent
// excessive memory use forever in the case of an exceptionally large batch write.
- defaultMaxBatchSize = 100000
+ defaultMaxBatchSize = 10240
Reducing this default, as 100000 seems like a crazy lot (it would result in a ~39MB batch).
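The arithmetic behind that "39MB" figure can be reconstructed from the PR description, which states that a default batch of 128 entries allocates 49152 bytes (i.e. 384 bytes per `BatchWrite`). A small sketch, assuming that per-entry size holds:

```go
package main

import "fmt"

func main() {
	// Assumption from the PR description: 128 entries = 49152 bytes,
	// so each BatchWrite occupies 384 bytes.
	const bytesPerWrite = 49152 / 128 // 384

	fmt.Println(100000 * bytesPerWrite) // old default cap: 38400000 bytes (~38.4 MB)
	fmt.Println(10240 * bytesPerWrite)  // new default cap: 3932160 bytes (~3.9 MB)
}
```

So the old ceiling allowed a single pooled batch to pin roughly 38MB indefinitely, while the new 10240 ceiling caps it at around 4MB.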
src/dbnode/ts/writes/write_batch.go
Outdated
if batchSize > cap(b.writes) {
	writes = make([]BatchWrite, 0, batchSize)
	batchCap := batchSize
	if cap(b.writes) == 0 {
Why do this only if the cap of writes is zero?
I thought you'd always want to allocate 1.2x the size you actually need, since the next request will likely be a little bigger or a little smaller.
The reason for this was to preserve the original behaviour when InitialBatchSize is set in config (so that if you have an initial batch size of 128 in config and a batch of size 200 arrives, exactly 200 would be allocated, not 200*1.2).
I'll add an explicit writeBatch.adaptiveSize flag, which will help me handle every case better.
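The two code paths described in this exchange can be sketched as follows. This is an illustration of the intent, not the PR's exact code; the function name `batchCap` is an assumption:

```go
package main

import "fmt"

// batchCap sketches the capacity-selection logic discussed above.
// When the pool runs in adaptive mode, capacity grows to 1.2x the
// requested size to absorb slightly larger follow-up batches; when
// InitialBatchSize is configured, exactly the requested size is
// allocated, preserving the original behaviour.
func batchCap(requested int, adaptiveSize bool) int {
	if !adaptiveSize {
		return requested // configured InitialBatchSize path
	}
	return requested * 120 / 100 // adaptive path: 20% headroom
}

func main() {
	fmt.Println(batchCap(200, false)) // 200: exactly what was requested
	fmt.Println(batchCap(200, true))  // 240: requested size plus 20%
}
```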
LGTM other than minor comment
LGTM with one nit
if b.adaptiveSize {
	batchCap = adaptiveBatchCap
}
b.writes = make([]BatchWrite, 0, batchCap)
nit: I think it would be very useful to have a gauge for the currently allocated batch capacity. It could be valuable in cases where, e.g., we receive one huge batch and subsequent batches are much smaller: memory usage will likely stay elevated even though the incoming batches are small, and this gauge would help us identify such cases.
How would this gauge work? I guess we would increment it on line 139, but when would we decrement it?
It could also be a counter for now; I was thinking that maybe in the future we will update this to also shrink batch capacity, so a gauge makes sense from that point of view.
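A minimal sketch of the gauge idea being discussed, using a plain atomic value instead of a real metrics client (all names here are illustrative, not from the PR): the gauge is set whenever the backing slice is reallocated, so a one-off huge batch that inflates pooled memory stays visible even while later batches are small.

```go
package main

import (
	"fmt"
	"sync/atomic"
)

// allocatedBatchCap stands in for a metrics gauge tracking the current
// allocated capacity of a pooled write batch.
var allocatedBatchCap atomic.Int64

type writeBatch struct{ writes []int }

// ensureCap grows the backing slice when needed and records the new
// capacity; smaller requests reuse the existing buffer unchanged.
func (b *writeBatch) ensureCap(n int) {
	if n > cap(b.writes) {
		b.writes = make([]int, 0, n)
		allocatedBatchCap.Store(int64(cap(b.writes)))
	}
}

func main() {
	var b writeBatch
	b.ensureCap(100000) // one huge batch inflates the buffer
	b.ensureCap(50)     // later small batches keep the large buffer
	fmt.Println(allocatedBatchCap.Load()) // 100000: the inflation stays visible
}
```

This also illustrates the counter-vs-gauge point: a counter of allocations would show that a reallocation happened, but only a gauge shows that the capacity stayed at 100000 afterwards.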
* master: [dbnode] Adaptive WriteBatch allocations (#3429)
What this PR does / why we need it:
In some cases we have to handle write batches that are an order of magnitude smaller than the default expected size (128).
Thus, always preallocating the default size (which results in 49152 bytes allocated) is really wasteful.
This PR avoids preallocating write batches of a fixed size and instead defers allocation until the actual batch size is known. Slices of the needed capacity, plus 20%, are then allocated. Allocating 20% more than currently needed should help avoid further allocations when the same write batch is taken from the pool and used for a slightly bigger batch.
Special notes for your reviewer:
The original behaviour is preserved in case InitialBatchSize is defined in the config.
Does this PR introduce a user-facing and/or backwards incompatible change?:
NONE
Does this PR require updating code package or user-facing documentation?:
NONE