
core/state, eth, trie: stabilize memory use, fix memory leak #21491

Merged · 1 commit · Aug 27, 2020

Conversation


@karalabe karalabe commented Aug 26, 2020

We've seen for a while that the bootnodes go OOM while fast syncing. Lately this became more and more pronounced, so it was clear that we had some memory issues in state sync. This PR addresses 3 independent issues:

  • State sync expands the account and storage tries depth first. This ensures that we keep as little of the trie in memory as possible and flush completed subtries to disk quickly. The bootnodes, however, run with 250 full node peers. Even though trie.Sync is depth first, with so many peers available to retrieve from, the trie also got expanded too heavily breadth wise. This resulted in notable memory usage.
    • The PR fixes this by limiting each state trie depth to a maximum of 16K active fetches. This way, at worst, we can have 128 (64 account + 64 storage) depths * 16K requests. In practice the tries are saturated only up to depth 8-9, so the depths below that will be missing altogether. On mainnet this currently limits pending trie retrieval tasks to about 600K, which is a reasonable amount.
  • State sync previously expanded same-depth trie nodes in random order. This was a deliberate decision so that we scatter the state randomly across the network: even if a seed node goes down, there's a chance that others pulled enough to reconstruct it between each other. It does, however, make state sync a bit non-deterministic.
    • This PR changes the undefined ordering to a ~lexicographic one by modifying the trie.Sync priority queue to take into account not only the depth of a node, but also its path prefix in the trie (only the first 14 nibbles fit into the int64 priority beside the depth). This enables a further optimization where we can detect if a peer we're syncing from doesn't have the state we need yet, so we can throttle requests going to them (and coming back with an empty response).
  • Every time the pivot moved, we cancelled the currently running state sync and restarted it from scratch with a new head. Unfortunately, we also called a defer sync.Close() on the new object. This caused all the partially completed and discarded sync objects to be kept referenced in memory until sync fully completed or failed, leaking memory and leading to crashes.
    • The PR fixes this by removing the defer from within the loop and doing just 1 lazy defer at the top where the original sync object is created. If the object is recreated, the loop closes the old one before replacing it, so there's no leak there.
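The defer leak in the third point follows from Go's defer semantics: deferred calls run only when the enclosing function returns, not at the end of a loop iteration. A minimal sketch of the leaky vs. fixed pattern (with a hypothetical `syncObj` standing in for the real state sync object — names are illustrative, not geth's API):

```go
package main

import "fmt"

// syncObj is a hypothetical stand-in for the state sync object;
// Close marks its resources as released.
type syncObj struct{ closed bool }

func newSync() *syncObj   { return &syncObj{} }
func (s *syncObj) Close() { s.closed = true }

// Leaky pattern (sketch):
//
//	for pivotMoved() {
//	    s := newSync()
//	    defer s.Close() // defers pile up; nothing is released until return
//	}
//
// Fixed pattern: one lazy defer on whatever object is current, and the
// loop explicitly closes the stale object before replacing it.
func run(pivotMoves int) []*syncObj {
	var objs []*syncObj
	s := newSync()
	objs = append(objs, s)
	defer func() { s.Close() }() // single lazy defer: closes the final object
	for i := 0; i < pivotMoves; i++ {
		s.Close() // release the stale sync before restarting from the new pivot
		s = newSync()
		objs = append(objs, s)
	}
	return objs
}

func main() {
	for i, o := range run(3) {
		fmt.Printf("sync %d closed: %v\n", i, o.closed)
	}
}
```

Because the defer wraps `s` in a closure, it closes whichever object is current at return time, so no discarded object stays open.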

A quirk in the PR is that I needed to modify the LeafCallback to pass the path too, but that callback was shared by both the syncer and the committer. I hacked a nil into the committer, but maybe it would be nicer to split the callback into a SyncLeafCallback and a CommitLeafCallback (I don't want to track the path for commit since it's a useless burden).
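The suggested split could look roughly like the sketch below — the signatures here are hypothetical illustrations, not geth's actual types; only the idea (sync gets the path, commit doesn't) comes from the comment above:

```go
package main

import "fmt"

type (
	// SyncLeafCallback receives the trie path so the syncer can
	// prioritize and throttle requests. (Hypothetical signature.)
	SyncLeafCallback func(path []byte, leaf []byte, parent [32]byte) error

	// CommitLeafCallback omits the path, sparing the committer the
	// burden of tracking it. (Hypothetical signature.)
	CommitLeafCallback func(leaf []byte, parent [32]byte) error
)

// adapt lets code expecting a SyncLeafCallback reuse a commit-style
// callback by simply dropping the path argument.
func adapt(cb CommitLeafCallback) SyncLeafCallback {
	return func(path []byte, leaf []byte, parent [32]byte) error {
		return cb(leaf, parent)
	}
}

func main() {
	commit := CommitLeafCallback(func(leaf []byte, parent [32]byte) error {
		fmt.Printf("commit leaf %x\n", leaf)
		return nil
	})
	sync := adapt(commit)
	_ = sync([]byte{0x1, 0x2}, []byte{0xde, 0xad}, [32]byte{})
}
```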

[Screenshot from 2020-08-26 15-09-20]

@karalabe karalabe added this to the 1.9.21 milestone Aug 26, 2020
@karalabe karalabe requested a review from holiman August 26, 2020 10:19

@holiman holiman left a comment


LGTM!

Comment on lines +306 to +310
prio := int64(len(req.path)) << 56 // depth >= 128 will never happen, storage leaves will be included in their parents
for i := 0; i < 14 && i < len(req.path); i++ {
	prio |= int64(15-req.path[i]) << (52 - i*4) // 15-nibble => lexicographic order
}
s.queue.Push(req.hash, prio)

I wrote a little playground gist to check how this worked: https://play.golang.org/p/QKezpVe3ZX7 . Might be worth taking the gist and making a test out of it, to verify that paths are indeed prioritized correctly?
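In the spirit of that gist, a minimal self-contained check of the encoding (the `prio` helper below just reproduces the five lines quoted above; the test data is made up): deeper paths always get a larger priority, and among same-depth paths the lexicographically smallest gets the largest priority, so a pop-largest-first queue drains them in ~lexicographic order.

```go
package main

import (
	"fmt"
	"sort"
)

// prio mirrors the quoted encoding: path depth in the top byte, then up
// to 14 nibbles of the path, complemented (15-nibble) so that
// lexicographically smaller paths receive larger priorities.
func prio(path []byte) int64 {
	p := int64(len(path)) << 56
	for i := 0; i < 14 && i < len(path); i++ {
		p |= int64(15-path[i]) << (52 - i*4)
	}
	return p
}

func main() {
	// Same-depth nibble paths, deliberately out of order.
	paths := [][]byte{{0x3, 0x1}, {0x0, 0xf}, {0x2, 0x2}}
	// Popping the largest priority first should yield lexicographic order.
	sort.Slice(paths, func(i, j int) bool { return prio(paths[i]) > prio(paths[j]) })
	for _, p := range paths {
		fmt.Printf("%x\n", p)
	}
}
```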

@karalabe (Member, Author) replied:

I'm going to postpone adding this into a followup PR. I want to expose the path and code/trie node separation into the Missing and that will make this test a lot simpler. Otherwise we'd need to mess around with hard coding hashes into the tester, which we could definitely do, but it will be kind of like a black magic test with random values nobody knows where they originate from.


holiman commented Aug 26, 2020

This PR is great. Just look at that mem consumption:
[Screenshot_2020-08-26 Dual Geth - Grafana (2)]

And that steady trickle of state

[Screenshot_2020-08-26 Dual Geth - Grafana (3)]
