
[consensus] Enable Urkel Tree compaction #669

Merged Jun 3, 2022 (23 commits)

Conversation

@pinheadmz (Member) commented Dec 9, 2021

Closes #660

Adds a new RPC method compacttree, which deletes from disk an enormous amount of historical Urkel Tree data that is only needed if a chain reorganization occurs deeper than ~288 blocks. Note that this same limitation already applies to blockchain-pruning nodes: in the event of such a deep reorg, a pruned node fails into an unrecoverable state. Pruning is available as a configuration setting or can be executed on a running node with the pruneblockchain RPC.

What does compacting the Urkel Tree mean?

Since the tree is append-only, when an existing name is updated (anything from a REVEAL to a REGISTER to an UPDATE) the old data still remains on disk. A new tree node is written and some file pointers are shuffled around to determine the new root hash of the tree (which is committed to in a block header).

If a chain reorg crosses back over a tree interval (36 blocks on mainnet) then an old tree root is pulled from the unzipped chain and used to revert the entire Urkel Tree to its old state, which is possible since all the historical data and pointers are still there on disk.

To compact the Urkel Tree means to take a given tree root and write a fresh Urkel Tree (in flat-file format on disk) containing ONLY the data connected directly to the root, then rename that directory to ~/.hsd/tree, wiping out all the historical data that is no longer part of the current state.
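Conceptually, the operation looks something like the sketch below (illustrative JavaScript only, not hsd's or urkel's actual API; the store here is just a Map from node hash to node):

  // Illustrative sketch of compaction: copy only the nodes reachable from the
  // chosen root into a fresh store, dropping all of the stale historical
  // nodes that the append-only store has accumulated.
  function compact(oldStore, rootHash) {
    const freshStore = new Map();
    const stack = [rootHash];

    while (stack.length > 0) {
      const hash = stack.pop();
      const node = oldStore.get(hash);    // node = { children: [hashes], data }
      freshStore.set(hash, node);         // keep only what the root reaches
      for (const child of node.children)  // internal nodes reference children;
        stack.push(child);                // leaf nodes have an empty list
    }

    return freshStore; // in hsd, the fresh flat files replace ~/.hsd/tree
  }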

Strategy in this PR

Since compacting means the tree can no longer be reverted, a chain reorg only one block deep that crosses a tree interval would permanently annihilate the full node. Therefore, what we must do is (sketched in code after the list):

  1. Use the historical data we still have on disk to rewind the Urkel Tree to an old state (~288 blocks ago / about 7-8 tree intervals ago)
  2. Run the compacting process on that historical tree root, creating an unrecoverable reorg boundary two days in the past
  3. Replay the blockchain back into the Urkel Tree, all ~288 blocks, including all namestate updates, crossing back over those 7-8 tree commitments until the Urkel Tree is back in sync with the chain
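Roughly, the flow looks like this (a sketch only; the method names are illustrative assumptions, not hsd's actual internals):

  // Sketch of the rewind -> compact -> replay strategy (illustrative names).
  async function compactTree(chain) {
    // 1. Rewind: pick the tree root committed ~288 blocks ago
    //    (about 7-8 tree intervals on mainnet).
    const height = chain.height - 288;
    const entry = await chain.getEntryByHeight(height);

    // 2. Compact: rewrite the tree keeping only data reachable from that old
    //    root. Anything older becomes an unrecoverable reorg boundary.
    await chain.tree.compactTo(entry.treeRoot);

    // 3. Replay: reconnect the namestate updates from the last ~288 blocks
    //    so the tree catches back up with the chain tip.
    for (let h = height + 1; h <= chain.height; h++)
      await chain.replayNamestate(h);
  }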

End result

Whether or not a full node is pruning the blockchain, executing this RPC applies the same restriction at the moment it completes: a 288-block reorg is the end of the game. Of course, UNLIKE a pruning node, this compacting process is NOT ongoing, so over time the Urkel Tree will bloat back up again (for better or for worse) until the RPC is called again.

Mainnet testing

I created a second branch of this PR with the WIP rpc dumpzone applied. I already had hsd synced to height 97555 and started it in an isolated mode:

hsd --no-dns --no-wallet --only=127.0.0.1

Then executed hsd-rpc dumpzone hns-97555-full.zone to get a snapshot of the Urkel Tree state.

Then executed hsd-rpc compacttree to run the pruning process.

The size of the directory ~/.hsd/tree dropped from 6.8 GB down to ~500 MB 🥳 and the process took about 2.5 minutes total 🎉

[I:2021-12-09T00:46:06Z] (net) Removed loader peer (127.0.0.1:12038).
[D:2021-12-09T00:46:06Z] (node-rpc) Handling RPC call: compacttree.
[D:2021-12-09T00:46:06Z] (node-rpc) []
[I:2021-12-09T00:46:06Z] (chain) Compacting Urkel Tree...
...
[I:2021-12-09T00:48:13Z] (chain) Synchronizing Tree with block history...
...
[I:2021-12-09T00:48:32Z] (chain) Synchronized Tree Root: 9d8eefb2af6c7ab640826f245040c693738ad7cd629b86f32ba040b3ad7a5f8c.

Finally, I ran hsd-rpc dumpzone hns-97555-compacted.zone and compared the two output zone files which matched exactly 🚀

TODO:

  • add tests
  • consider edge cases like executing too frequently (?)
  • determine best deployment (should it be automatic?)

@coveralls commented Dec 9, 2021

Coverage Status

Coverage increased (+0.2%) to 66.97% when pulling 74cc006 on pinheadmz:prunetree into a53f877 on handshake-org:master.

@pinheadmz (Member, Author)

Maybe include #675

@pinheadmz (Member, Author)

I'm marking this as ready for review now by solving the last problem:

What is the best method for deployment?

There are now two ways to do this:

  1. hsd-rpc compacttree can be run at any time while the node is alive, but this is not recommended because the node is locked for several minutes, which is not great for your peers (who may even time out if you take too long).

  2. hsd --compact-tree: this argument can be added at launch on the command line or left in the config file, and the node will run the compaction process when it launches, before it connects to the network (see the example below).

These options give node runners and integrated apps like Bob Wallet the flexibility to compact the tree either automatically or manually, either way with the node operator knowing when it will happen. I think this is a better method than setting some disk-size threshold, etc., which would run the compactor at unexpected times.
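For example (the config-file syntax below is an assumption about a typical hsd.conf; the flag itself is the one added in this PR):

  # One-off, at launch on the command line:
  hsd --compact-tree

  # Or persistently, in the config file (e.g. ~/.hsd/hsd.conf):
  compact-tree: true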

@brandondees left a comment

LGTM

Didn't read through all the tests extremely carefully but they do seem like good starting coverage and documentation overall.

Might be nice to add a little more log reporting about how well it's working when applied. Might help node operators with health monitoring or to detect efficiency regressions etc.

@brandondees

This looks like a great improvement, and I don't see any issues with the diff as written (take that with a grain of salt due to my unfamiliarity with the overall codebase and code conventions).

Another tangential question came to mind while reviewing: Are tree updates computed incrementally per block or is it an all-at-once tree update process once every interval? Is that something to consider further optimizations on in either direction?

@pinheadmz (Member, Author)

Another tangential question came to mind while reviewing: Are tree updates computed incrementally per block or is it an all-at-once tree update process once every interval? Is that something to consider further optimizations on in either direction?

Changes to the Urkel Tree are stored in memory between tree intervals and written to disk every 36 blocks. This is an optimization and described in the white paper. If you shut down your node between tree intervals the data on disk will actually be behind. That's why we call syncTree() on startup to refill the tree data in memory from the block data on disk.
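In other words (a minimal sketch of the rule described above; the exact condition is an assumption based on this description, not a quote of hsd's code):

  // Namestate changes accumulate in memory between intervals; the tree is
  // only flushed to disk when the height crosses an interval boundary.
  const treeInterval = 36; // mainnet

  function shouldCommitTree(height) {
    return height % treeInterval === 0;
  }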

@brandondees

Gotcha, so there's already not much else to be done there unless you wanted to have hyper-fine configuration of what goes into memory vs disk at what times.

@pinheadmz (Member, Author) commented Jan 12, 2022

Worth pointing out that 0ddd228 makes this PR NOT backwards compatible. After running my own local hsd node with this branch and compacting the tree, I then switched back to the master branch to work on something else and got a Database Inconsistency error. We can somewhat alleviate this by reverting this change:

- this.put(layout.s.encode(), this.tree.rootHash());

so we would still write the treeRoot to the DB even though we no longer use it. I think this would alleviate some cases but we should still consider this a major breaking change.

@pinheadmz added this to the 4.0.0 milestone Mar 31, 2022
@pinheadmz (Member, Author) commented Mar 31, 2022

TODO:

  • remove RPC command

@pinheadmz (Member, Author)

@nodech I removed the RPC compacttree and cleaned up the tests.

@nodech (Contributor) commented Apr 11, 2022

General

The first thing I was thinking about is whether we are still a full node after
this. Do we want to only support this for pruned nodes? We could of course
have two options for this: pruned blocks and pruned chain. Even for a full node
it is possible to unprune the node by reindexing the chain.

So I would say, initially support this for pruned nodes only, and if someone
knows what they are doing they could use --force-tree-compact, which could
be used for full nodes as well?

I think it's better to support --force-tree-compact for full-node
compaction only after we have the ability to reindex the tree.

Atomicity and layout.s

The chaindb and the tree are separate stores and we need to keep them
logically synchronized. The only way to control where we are in the tree-hash
history is the inject. The last tree root hash is not useful information by
itself. It needs to come from the chaindb.

The issues layout.s solves come up when we crash during connect/disconnect.
layout.s is the only way we synchronize right now. Note that it's not useful
to use the tip block hash, because the tree is only committed once every 36
blocks, unless you go to the last commit height and grab that block.
Let's go through the issues that can occur and how layout.s solves them.

Let's consider connect failing. What connect does is: first it commits
data to the tree, and then it atomically adds all changes to the chaindb
and also writes layout.s as the new tree root. If for some reason the node
crashes (or gets shut down) between Tree.commit and chaindb.save,
we just end up with "extra" data in the tree, but there will be
no inconsistency when the node is rerun, because layout.s will properly
inject the previous tree hash instead of the last one (which was
not saved in the chaindb because of the failure) and try to connect the same
block again. Because writes to the chaindb's LevelDB are atomic,
once we have written successfully, we are assured there's no inconsistency
between the tree and the chain. It's also good to note that this would
not work in the chaindb.save -> tree.commit order, because in case of
failure we would not have the tree data, even though the chaindb would think
it's time to move on (on the next rerun).
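A minimal sketch of that ordering (illustrative names, not the actual hsd code):

  // Commit the tree first, then atomically write the chaindb batch that
  // records layout.s. A crash in between only leaves unreferenced tree data;
  // the chaindb (and layout.s) can be behind the tree, but never ahead.
  async function connect(chaindb, tree, block) {
    const root = await tree.commit();  // 1. append new tree data to disk

    const batch = chaindb.batch();     // 2. one atomic LevelDB batch
    batch.put('s', root);              //    layout.s = committed tree root
    batch.put('tip', block.hash);      //    ...plus the rest of the block state
    await batch.write();               //    all-or-nothing

    // On restart, syncTree() injects the root stored under layout.s and
    // reconnects any blocks whose tree data was written but never recorded.
  }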

This problem becomes more pronounced when you consider a reset or reorg.
When a reorg happens, the chaindb will revert its state to the past and then move
forward to the new chain. If the node crashes or gets shut down after it has reverted to
the past, layout.s is the only way to tell the tree that we are actually on the
older tree and the latest tree.rootHash is not relevant.

If you consider someone restarting the node with --compact-tree after reverting
to the past (on a reorg), you can easily see why compact tree should also depend
on layout.s instead of tree.rootHash.

So to sum up: tree.rootHash is never relevant for atomicity/consistency
between the chaindb and the tree. We always trust the chaindb to provide the tree root hash.
The chaindb root hash can be behind the tree root hash, but NEVER ahead.

Another note: the issue with blockstore was similar in a way, but it would
result in the node not starting and failing to sync. This issue, on the other hand,
won't get noticed, because it won't result in a crash. It will just leave wrong
information in the tree.

Atomicity with the compaction.

The issue is related to the above-mentioned chaindb root hash (layout.s).
We need to rely only on the chaindb to give us information about the tree situation.

In order to keep the compacting code working with layout.s, we need to
move the layout.s pointer to the past FIRST and then compact. In this
case we make sure that if compaction fails, syncTree will just redo the same
thing and in the worst case end up with wasted space. We can't compact first,
because if compaction succeeds and then the chaindb write fails, the chaindb will be AHEAD.
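Sketched out, that ordering looks like this (illustrative names only):

  // Move layout.s back to the historical root BEFORE rewriting tree files.
  // If compaction then fails, syncTree() simply replays blocks again; the
  // worst case is wasted disk space, never a chaindb that is AHEAD.
  async function compactSafely(chaindb, tree, oldRoot) {
    await chaindb.put('s', oldRoot);  // 1. rewind the chaindb pointer first
    await tree.compactTo(oldRoot);    // 2. now rewrite the tree from oldRoot
    // 3. syncTree() then replays blocks from that height back up to the tip,
    //    advancing layout.s as each tree interval is committed.
  }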

syncTree needs to use layout.s as well. As mentioned above, the
tree root hash should never be used for anything related to the chaindb.
Because of syncTree's nature, it's okay even if compaction failed
and we restart the node without the compact flag and the chaindb is behind: syncTree
will update the chaindb from saveNames until it's caught up.

We can continue the above-mentioned example to show the issue of not using
layout.s. If the chaindb went into the past on a reorg/reset but then crashed/closed,
syncTree would actually be AHEAD and would continue appending to the wrong
tree hash.

To summarize:

  • The chaindb root hash can be behind the tree root hash, but NEVER ahead.
  • Within compaction we need to move the chain root hash (layout.s) back first, even
    if we then fail to compact and never run it again.
  • syncTree should use the chain root hash (layout.s).

Nits

You can move syncTree out of compactTree and have it like this:

  if (!this.options.spv) {
    if (this.options.compactTree)
      await this.compactTree();
      
    await this.syncTree();
  }


@pinheadmz (Member, Author)

You can move syncTree out of compactTree and have it like this.

This will call syncTree() an unnecessary second time if the option is set. (compactTree() calls syncTree() automatically already)

But I fixed that with an else in d656335
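So the startup logic ends up looking roughly like this (a sketch of the fix, not the exact diff):

  if (!this.options.spv) {
    if (this.options.compactTree)
      await this.compactTree(); // compactTree() already ends with syncTree()
    else
      await this.syncTree();
  }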

@pinheadmz (Member, Author)

@nodech thank you so much for your brilliant review and great catch regarding layout.s. I rebased the branch to revert the layout.s deprecation and added an extra write of that value in compactTree(); that is in commit 6cfd96b

I also added a test in commit 80baf34 to cover the "failed connect()" case as you describe. You were right, the code totally failed that test until layout.s was restored.

@pinheadmz (Member, Author)

Mainnet integration testing is going well, working with pruned and full nodes and combinations of the new options. The Urkel Tree reliably drops from 15GB to ~1GB. I even synced a fresh mainnet chain and restarted every 1000 blocks to keep the tree size low throughout the entire IBD.

The only issue so far is the migration in SPV mode.

@smcki012 commented Jun 1, 2022

Fantastic work on this. Benchmarks are very promising. HSD is getting really lean for node operators!

@nodech force-pushed the prunetree branch 2 times, most recently from 4fe79bc to 3e570fc on June 2, 2022
@nodech added labels on Jun 3, 2022: advanced (review difficulty), blockchain (part of the codebase), dns (part of the codebase), protocol/consensus (part of the codebase), breaking-major (backwards incompatible, release version), feature (adding a feature)
@pinheadmz (Member, Author)

ACK 6e3a113

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

ACK 6e3a113b8cf038a0554aa3cca5e485d196f4e0be
-----BEGIN PGP SIGNATURE-----

iQIzBAEBCAAdFiEE5hdzzW4BBA4vG9eM5+KYS2KJyToFAmKaKMQACgkQ5+KYS2KJ
yTryyg//ax9DOb5unr1CNhwCNLaUDA0hS01tvzMRnrEYEmcWtyOC6eSFlnLnuO/V
/9wUYaoGyeJBBZx6QKW//zjUv8D0Xyc/46wtDdqmG9Oo0N29CUMTSjJKkKsjRLWG
mQNWCG0ai/wilp/x+p32usQYQLeD6p58DkkUhbXugKeJgcSKC98VOXj3o1Z4/r6K
z3glTLRoltihWFsVTVp648EC3+QxjiAxn4pOXJxyIPwOm6MyZSubmF3pVVw4Cybm
G5U2+rtTgXyPY0BFC5t74SYSRLccmur40F/f+fH5xSw8TYYd24dkwLuwMup01z3r
OkHV78XMaQ8IfuIlTjImWQmEHPUCsnJ10nWaATa6HN1kOqp8TzV05S8CNVKOJOW9
7lf9D0MTw3NHSiFDKoilmWm+V4gFWuEx6qHiVL8VOI6QOsDnYYbPBVwPMxyT5GZJ
blTw8CyGWRROyEJ1L69k3UUMZ32u7lLt+G3guc0Vj87xqmc0XLhYG/LI+GxOsath
0fh4EQZxdMaWAVukZgdUaHAea0zRPWVG2Kj1OwDxXgFSVIzcLhMKJEYYsd1zLm88
IJ2DdIoaF3yEFdZedrfSVE1wDsso+QUFITu1KQD6iCiW/Gj533p5vFUFI55xO66z
+BMgLu1wjf6h8Yj1oUX86hNI8lKlfVrIcdlgQ92Y7ZCYlCpp8/E=
=qIZu
-----END PGP SIGNATURE-----

pinheadmz's public key is on keybase

@pinheadmz (Member, Author)

ACK 74cc006

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

ACK 74cc006ef089208a8a715746c9bed470a24a0b70
-----BEGIN PGP SIGNATURE-----

iQIzBAEBCAAdFiEE5hdzzW4BBA4vG9eM5+KYS2KJyToFAmKaLQIACgkQ5+KYS2KJ
yToirxAAypbhBjbCYVgyqCg4y80e46mC/hTueIUOxZHqMYctKRxFhFiXmBn3dk9X
0X0RyzwWQ1IOv1L+CrrPWoqITVBlRaq/pEhvKyzkIvHlbul69ekZsWpVH6iPjl4c
joSK0MeNXjKJJKO+I7JH6fShR/QLY8RNGfcO1jo2Ce70rg7+sw3mbf8ikp4sBfgr
YeATGgLuSysGeluSfrYS+bfn+KwdwCxp58b2/toDZkkvLjtHqd4t/zu9HG+0rSg3
pYM3HFbEb8+O7H8HhHjJEFraCduRFar6SgDRH2UP/dcWMxS739zFOBbK673Cw/6e
vyIUm0lNXYEIERxAhJqwtDn9885rnQwMJTlxem7odCDQYizGtQb/0Ym6Sq2RKpfD
+qAWRAhxh1TcJln/K6J2T87e7u1PZp+dHM0GP3+4Bbld5xQU/DU/tllYWtDM7btK
nNegA4f0NfDVmamL3QdG35cWm9z9vfA3K7c0AeS5aenpV84ezCpBpa7aBlQ1M1V/
y6WO0qa0DyHdF4oau75WeYFB9wpTF7RLpgwGPQVdhWePsicMX8W9AoIGOzvLbnk0
nhBxGR+7a/hayD0ovGZKmM4VXeBFqDGnc+hmgFj/quEAeX+5MoHDSjgGFxeKOsz0
xRzcg0ROxXpfwubkk3jrQW4bm4ZugQ0zUPBuSwvIH98+lp5/Qmw=
=dtxM
-----END PGP SIGNATURE-----

pinheadmz's public key is on keybase

@pinheadmz merged commit ba949f3 into handshake-org:master on Jun 3, 2022
@handshake-org deleted a comment from satoshi-MAM on Jan 10, 2023
@nodech deleted the prunetree branch on June 11, 2024
Linked issue: syncTree() could use one more log message