feat: new mpt implementation #482

shayanh · 2025-08-26T23:42:12Z

This PR introduces a new MPT implementation to optimize node reference calculation and data copying. The implementation is in a new crate named mptnew, though it is mostly based on the existing MPT implementation from #427.

The main contribution of this PR is a new serialization technique. MPT nodes are RLP-encoded and serialized recursively. In the guest program, deserialization is performed by traversing the serialized byte array. Since the nodes are already RLP-encoded, we avoid re-encoding them and re-storing their references. As a result, MPT deserialization and state root calculation become purely zero-copy, with node data and references maintained as pointers to the original input data. Additionally, this implementation reduces data copies by writing data directly into the bump area from the start whenever possible.

Important note: I have not yet implemented the build_mpt feature that builds an MPT from MPT proofs. This feature is only used on the host to generate witness data. For now, I generate witness data for the new MPT implementation using the old MPT: on the host, I first build an old MPT and then serialize it in a format compatible with the new MPT. I will implement the build_mpt feature next and then remove the existing MPT implementation.

Results on block number 23100006:

Proof Time (s): 211.40 -- before: 226.85
Parallel Proof Time (s): 14.75 -- before: 15.53
Instruction count: 132,792,482 -- before: 146,988,654

# Conflicts: # bin/client-eth/Cargo.lock

jonathanpwang · 2025-08-27T05:22:44Z

crates/mptnew/src/trie.rs

+        //
+        // More advanced improvement: either pre-execute block at guest to know exact allocations in
+        // advance, or allocate a separate arena specifically for updates.
+        let capacity = num_nodes + num_nodes / 10;


we should follow up about this

jonathanpwang · 2025-08-27T05:23:44Z

crates/mptnew/src/trie.rs

+    nodes: Vec<NodeData<'a>>,
+
+    /// Cache. Hashing/encoding often needs "what would this node look like in its parent"
+    cached_references: Vec<RefCell<Option<NodeRef<'a>>>>,


btw even in Valery's PR, I wondered if we could get away using raw pointers for a little gain

IIRC I tried using Cell instead and it was very slow for unknown reason. Raw pointers maybe can work

jonathanpwang

I followed the overall logic since you had explained it to me verbally.

After you implement the new serialization, we should delete the old crate to clean up and lessen the code.

shayanh added 13 commits August 26, 2025 17:21

feat: initial newmpt impl

cb09349

mptnew: wip

43c6524

wip

843eb8c

snapshot

65bcd1b

# Conflicts: # bin/client-eth/Cargo.lock

snapshot

95e321f

null node opt

40b6c7b

cleanups

4448e4f

# Conflicts: # bin/client-eth/Cargo.lock

chore: remove unintended committed files

f62a6f1

small fixes and cleanups

afddd36

chores

3941645

remove unused

caed0f3

chores

0bae751

update cargo.lock

1a4bae4

shayanh force-pushed the shayanh/mptnew branch from 7d37496 to 1a4bae4 Compare August 27, 2025 00:27

shayanh marked this pull request as ready for review August 27, 2025 00:28

shayanh requested review from jonathanpwang and Qumeric August 27, 2025 00:28

jonathanpwang reviewed Aug 27, 2025

View reviewed changes

jonathanpwang approved these changes Aug 27, 2025

View reviewed changes

jonathanpwang merged commit 5830cca into main Aug 27, 2025
5 checks passed

jonathanpwang deleted the shayanh/mptnew branch August 27, 2025 05:27

jonathanpwang restored the shayanh/mptnew branch August 27, 2025 05:27

jsign mentioned this pull request Sep 8, 2025

Investigate potential optimization about MPT node deserialization eth-act/zkevm-benchmark-workload#169

Open

jonathanpwang added the input-format The input format of the host binary changed label Sep 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: new mpt implementation #482

feat: new mpt implementation #482

Uh oh!

shayanh commented Aug 26, 2025 •

edited

Loading

Uh oh!

jonathanpwang Aug 27, 2025

Uh oh!

jonathanpwang Aug 27, 2025

Uh oh!

Qumeric Aug 27, 2025

Uh oh!

jonathanpwang left a comment

Uh oh!

Uh oh!

Uh oh!

feat: new mpt implementation #482

feat: new mpt implementation #482

Uh oh!

Conversation

shayanh commented Aug 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jonathanpwang Aug 27, 2025

Choose a reason for hiding this comment

Uh oh!

jonathanpwang Aug 27, 2025

Choose a reason for hiding this comment

Uh oh!

Qumeric Aug 27, 2025

Choose a reason for hiding this comment

Uh oh!

jonathanpwang left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

shayanh commented Aug 26, 2025 •

edited

Loading