trie: reduce the memory allocation in trie hashing #31902
Conversation
```diff
@@ -176,8 +182,8 @@ func (h *hasher) encodedBytes() []byte {
 }

 // hashData hashes the provided data
-func (h *hasher) hashData(data []byte) hashNode {
-	n := make(hashNode, 32)
+func (h *hasher) hashData(data []byte) []byte {
```
Could we have made the return type here, and `hashNode`, a `[32]byte`? It probably wouldn't save much on the total size of allocations, but it would reduce the number of individual allocations.
Sounds reasonable to me, but I don't want to do it in this PR; otherwise it will end up as a giant PR, I think.
```diff
-func (h *hasher) hashFullNodeChildren(n *fullNode) *fullNode {
-	var children [17]node
+func (h *hasher) encodeFullNode(n *fullNode) []byte {
+	fn := fnEncoderPool.Get().(*fullnodeEncoder)
```
This introduces a synchronization point for all parallel hashers, which is probably why performance degraded slightly. I think we can do better.
Really? I think `sync.Pool` should be efficient for concurrent usage.
Well, it might be efficient in what it's doing but it is certainly not free.
LGTM
My 2c is: optimizing to reduce GC churn at the cost of increased runtime is not really an optimization. We've just moved the cost of the GC churn somewhere else (the `sync.Pool`, in this case).
This pull request optimizes trie hashing by reducing memory allocation overhead. Specifically:
Benchmark results
Memory profile