Add support for block pruning #116

lukechampine · 2024-11-09T02:47:04Z

Been a long time coming. 😅

The strategy here is quite naive, but I think it will be serviceable. Basically, when we apply a block N, we delete block N-P. P is therefore the "prune target," i.e. the maximum number of blocks you want to store.

In practice, this isn't exhaustive: it only deletes blocks from the best chain. It also won't dramatically shrink the size of an existing database. I think this is acceptable, because pruning is most important during the initial sync, and during the initial sync, you'll only be receiving blocks from one chain at a time. Also, we don't want to make pruning too easy; after all, we need a good percentage of nodes to be storing the full chain, so that others can sync to them.

I tested this out locally with a prune target of 1000, and after syncing 400,000 blocks, my consensus.db was around 18 GB. This is disappointing; it should be much smaller. With some investigation, I found that the Bolt database was only storing ~5 GB of data (most of which was the accumulator tree, which we can't prune until after v2). I think this is a combination of a) Bolt grows the DB capacity aggressively in response to writes, and b) Bolt never shrinks the DB capacity. So it's possible that we could reduce this number by tweaking our DB batching parameters. Alternatively, we could provide a tool that copies the DB to a new file. Not the most user-friendly, but again, I think I'm okay with that for now.

Depends on SiaFoundation/core#228

n8maninger · 2024-11-11T17:39:27Z

chain/db.go

@@ -13,43 +13,32 @@ import (
 )

 type supplementedBlock struct {
-	Block      types.Block
+	Header     *types.BlockHeader


I'd like to see a way to detect and trigger a resync for changes like this. Currently, we panic and require the user to manually delete the consensus database. That's really bad UX, particularly for users that are doing automatic updates. Simplest solution would probably be to try to decode the supplementedBlock when we open the database. If that fails: log an error, close, erase, and reopen.

Longer term, it would be nice to have an actual migration path like the other databases. However, that may be less important if we don't need to store any of this junk after the v2 require height.

agreed. In this case, though, compatibility is preserved: we always encode as v3, but we can decode either v2 or v3. That's possible because the Header field is optional and gets filled in lazily as needed.

lukechampine added 2 commits November 8, 2024 21:35

chain,syncer: Add support for block pruning

c84969c

testutil: Rename MemPeerStore to EphemeralPeerStore

5c549f3

n8maninger reviewed Nov 11, 2024

View reviewed changes

ChrisSchinnerl assigned lukechampine Nov 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for block pruning #116

Add support for block pruning #116

lukechampine commented Nov 9, 2024

n8maninger Nov 11, 2024 •

edited

Loading

lukechampine Nov 12, 2024

Add support for block pruning #116

Are you sure you want to change the base?

Add support for block pruning #116

Conversation

lukechampine commented Nov 9, 2024

n8maninger Nov 11, 2024 • edited Loading

Choose a reason for hiding this comment

lukechampine Nov 12, 2024

Choose a reason for hiding this comment

n8maninger Nov 11, 2024 •

edited

Loading