core/rawdb, triedb/pathdb: introduce trienode history #32596

rjl493456442 · 2025-09-12T03:00:05Z

It's a pull request based on the #32523 , implementing the structure of trienode history.

core/rawdb/schema.go

triedb/pathdb/history_trienode.go

MariusVanDerWijden · 2025-09-26T11:56:21Z

triedb/pathdb/history_trienode.go

+			value := h.nodes[owner][path]
+
+			// key section
+			n := binary.PutUvarint(buf[0:], uint64(prefixLen))          // key length shared (varint)


I don't really get why this is needed

Inside of the restart, these rules are applied:

the first entry is always encoded with full key

for all the subsequent entries, the key is encoded in a "compressed" format.
In which, only the difference between the key and preceding one is stored. Given
that we store the trie nodes here and the key is essentially the node path. By storing
the diff can effectively compress the entry key.

Therefore, a few additional metadata are tracked in the key section:

shared key length

unshared key length

value length

These information can support us to recover the key from the byte stream.

MariusVanDerWijden · 2025-09-26T11:57:22Z

triedb/pathdb/history_trienode.go

+		)
+		for i, path := range h.nodeList[owner] {
+			key := []byte(path)
+			if i%trienodeDataBlockRestartLen == 0 {


Same with this, I don't understand why we need the block restarts and why the sharedPrefix will be 0 if we chunk

triedb/pathdb/history_trienode.go

MariusVanDerWijden · 2025-09-29T09:10:19Z

triedb/pathdb/history_trienode.go

+	if len(keySection) < int(8*nRestarts)+4 {
+		return nil, fmt.Errorf("key section too short, restarts: %d, size: %d", nRestarts, len(keySection))
+	}
+	for i := 0; i < int(nRestarts); i++ {
+		o := len(keySection) - 4 - (int(nRestarts)-i)*8
+		keyOffset := binary.BigEndian.Uint32(keySection[o : o+4])
+		if i != 0 && keyOffset <= keyOffsets[i-1] {
+			return nil, fmt.Errorf("key offset is out of order, prev: %v, cur: %v", keyOffsets[i-1], keyOffset)


There are so many magic numbers here... its hard to comprehend

The nodes from different tries are aggregated and concatenated within the key and value sections.
The offsets of the keys and values belonging to each trie are recorded in the header section.

For each trie, a list of internal chunks, called as restarts, is maintained.
At the end of the key section corresponding to a given trie, the offsets of these restarts are recorded.
These codes are for resolving the offsets of these restarts.

There are two main reasons for maintaining these restarts:

(1) compress the entry key
Given that the key length of the entry (trie node) is not negligible. By maintaining the difference with the preceding one (usually parent node) can compress the key length effectively. Usually 1 byte diff is sufficient.

(2) enhance the lookup efficiency
The first entry in the restart is always stored with full key (no shared part with the preceding one). Therefore the binary search can be performed at the boundary of restarts.

triedb/pathdb/history_trienode.go

MariusVanDerWijden

Generally LGTM, small nits and a few questions

triedb/pathdb/history_trienode_test.go

MariusVanDerWijden

SGTM

It's a pull request based on the ethereum#32523 , implementing the structure of trienode history.

rjl493456442 force-pushed the trie-archive-p3 branch 2 times, most recently from d4b1023 to ca6e68c Compare September 17, 2025 06:31

MariusVanDerWijden self-assigned this Sep 17, 2025

rjl493456442 force-pushed the trie-archive-p3 branch 2 times, most recently from 4129367 to 60dbb64 Compare September 22, 2025 06:19

MariusVanDerWijden reviewed Sep 26, 2025

View reviewed changes

MariusVanDerWijden reviewed Sep 29, 2025

View reviewed changes

triedb/pathdb/history_trienode.go Show resolved Hide resolved

MariusVanDerWijden reviewed Sep 29, 2025

View reviewed changes

triedb/pathdb/history_trienode.go Show resolved Hide resolved

MariusVanDerWijden reviewed Sep 29, 2025

View reviewed changes

rjl493456442 added 3 commits October 9, 2025 11:52

core/rawdb, triedb/pathdb: introduce trienode history

e8b2d89

triedb/pathdb: include metadata in header section

507c50d

core, triedb: address comments from marius

6ab502d

rjl493456442 force-pushed the trie-archive-p3 branch from 60dbb64 to 6ab502d Compare October 9, 2025 05:25

MariusVanDerWijden approved these changes Oct 9, 2025

View reviewed changes

rjl493456442 added this to the 1.16.5 milestone Oct 10, 2025

rjl493456442 merged commit de24450 into ethereum:master Oct 10, 2025
7 of 9 checks passed

ethereumorg092-arch mentioned this pull request Oct 10, 2025

200 #32870

Closed

Sahil-4555 pushed a commit to Sahil-4555/go-ethereum that referenced this pull request Oct 12, 2025

core/rawdb, triedb/pathdb: introduce trienode history (ethereum#32596)

eb8528f

It's a pull request based on the ethereum#32523 , implementing the structure of trienode history.

atkinsonholly pushed a commit to atkinsonholly/ephemery-geth that referenced this pull request Nov 24, 2025

core/rawdb, triedb/pathdb: introduce trienode history (ethereum#32596)

cdb40dc

It's a pull request based on the ethereum#32523 , implementing the structure of trienode history.

prestoalvarez pushed a commit to prestoalvarez/go-ethereum that referenced this pull request Nov 27, 2025

core/rawdb, triedb/pathdb: introduce trienode history (ethereum#32596)

0764ea6

It's a pull request based on the ethereum#32523 , implementing the structure of trienode history.

core/rawdb, triedb/pathdb: introduce trienode history #32596

core/rawdb, triedb/pathdb: introduce trienode history #32596

Uh oh!

Conversation

rjl493456442 commented Sep 12, 2025

Uh oh!

Uh oh!

Uh oh!

MariusVanDerWijden Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

rjl493456442 Oct 9, 2025

Choose a reason for hiding this comment

Uh oh!

MariusVanDerWijden Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

MariusVanDerWijden Sep 29, 2025

Choose a reason for hiding this comment

Uh oh!

rjl493456442 Oct 9, 2025

Choose a reason for hiding this comment

Uh oh!

rjl493456442 Oct 9, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

MariusVanDerWijden left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

MariusVanDerWijden left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants