Replace `wasmparser-nostd` fork with upstream `wasmparser` #1141

Robbepop · 2024-07-26T12:05:15Z

Thread discussing the performance regressions in wasmparser:
bytecodealliance/wasm-tools#1701

Unfortunately benchmarks conducted locally show significant performance regressions for Wasm validation:

We can see this clearly in the posted screenshot since translate/tiny_keccak/checked/eager regresses massively while translate/tiny_keccak/unchecked/eager only regresses by 3% which could be noise.

The only difference between these two cases is that translate/tiny_keccak/checked/eager validates the Wasm via wasmparser while translate/tiny_keccak/unchecked/eager skips Wasm validation and only translates the Wasm via Wasmi.

The huge 48% performance regression for the checked/lazy test case indicates that Wasm validation has a much higher constant cost bump for validation of non-function bodies, too. OR it indicates that the setup of the function validators became a lot more expensive.

Reproduce benchmarks locally:

On main branch: cargo bench --bench benches -- --save-baseline main
On PR branch: cargo bench --bench benches -- --baseline main

codecov · 2024-07-26T12:09:59Z

Codecov Report

Attention: Patch coverage is 59.86395% with 59 lines in your changes missing coverage. Please review.

Project coverage is 81.45%. Comparing base (e427d0f) to head (8ee3373).

Files with missing lines	Patch %	Lines
crates/wasmi/src/engine/translator/mod.rs	18.18%	27 Missing ⚠️
crates/wasmi/src/module/utils.rs	62.96%	10 Missing ⚠️
crates/wasmi/src/module/init_expr.rs	33.33%	8 Missing ⚠️
crates/wasmi/src/engine/mod.rs	12.50%	7 Missing ⚠️
crates/wasmi/src/module/element.rs	77.77%	4 Missing ⚠️
crates/wasmi/src/engine/config.rs	92.85%	2 Missing ⚠️
crates/wasmi/src/module/parser.rs	90.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1141      +/-   ##
==========================================
- Coverage   81.57%   81.45%   -0.13%     
==========================================
  Files         306      306              
  Lines       25271    25349      +78     
==========================================
+ Hits        20616    20648      +32     
- Misses       4655     4701      +46

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

The new WasmFeatures type requires a bit less stack space due to bitfield usage. Ideally we use our own bitfield based type but this is more work.

Robbepop · 2024-07-26T19:04:27Z

I just found one reason why it regressed: I benchmarked without setting the no-hash-maps feature. Generally BTreeMap is faster than HashMap for Wasmi uses and the old wasmparser-nostd fork alsways uses BTreeMaps. With this feature set local benchmarks still indicate regressions but they are less severe, ranging between 10-25% instead of 20-35%. The unchecked (no validation) case now even has slightly improved performance which indicates very strongly to me that the regressions that we see come from the Wasm validation.

The biggest regression we see now is lazy-translation which further indicates that regressions are in the Wasm validation since lazy-translation eagerly validates the Wasm input and avoids translating it via Wasmi, thus more of the total time spent in the benchmark comes from wasmparser's parsing and validation.

Robbepop · 2024-07-29T09:40:54Z

Running yet another benchmark because I forgot to use no-hash-maps on main. 🤦

This is the slowdown from main to this PR:

Robbepop · 2024-10-06T14:34:29Z

After improving translation benchmarks and updating wasmparser to v0.218.0 I reran benchmarks:

tl;dr Conclusions

Benchmarks with validation (checked) are affected a lot worse.
- Wasm validation seems to have the largest regressions overall.
Benchmarks that perform validation only on non-code section parts are particularly affected. (lazy/checked)
- Thus regressions are also outside of the code section validation.
Even benchmarks that perform no Wasm validation whatsoever are affected. (unchecked)
- Thus parsing Wasm has also regressed.
- Parsing outside of the code section has regressed more. (eager/unchecked vs lazy/unchecked)
Small benchmarks (tiny_keccak and erc20) are affected a lot worse than large ones (spidermonkey).
- This implies that the base overhead for parsing and validation has been lifted/regressed.

Tiny-Keccak

Spidermonkey

ERC-20

Robbepop · 2024-10-11T16:18:21Z

With wasmparser v0.219.0 there is a new component-model crate feature which disables Wasm component-model support. Wasmi can disable this for some nice compile time improvements:

v0.218.0:

cargo build -p wasmi --profile bench  22.52s user 1.16s system 201% cpu 11.778 total

v0.219.0:

cargo build -p wasmi --profile bench  18.95s user 1.18s system 178% cpu 11.248 total

Robbepop · 2024-10-19T10:26:51Z

I just found out that wasmparser v0.219.0 has this API:
https://docs.rs/wasmparser/0.219.1/wasmparser/struct.FunctionBody.html#method.as_bytes

This will likely fix a workaround in Wasmi:

wasmi/crates/wasmi/src/module/parser/buffered.rs

Lines 159 to 170 in 02621ad

    
           match payload { 
        
               Payload::CodeSectionEntry(func_body) => { 
        
                   // Note: Unfortunately the `wasmparser` crate is missing an API 
        
                   //       to return the byte slice for the respective code section 
        
                   //       entry payload. Please remove this work around as soon as 
        
                   //       such an API becomes available. 
        
                   let bytes = Self::consume_buffer(consumed, buffer); 
        
                   let remaining = func_body.get_binary_reader().bytes_remaining(); 
        
                   let start = consumed - remaining; 
        
                   let bytes = &bytes[start..]; 
        
                   self.process_code_entry(func_body, bytes, &header)?; 
        
               }

wasmi/crates/wasmi/src/engine/translator/driver.rs

Lines 19 to 31 in 02621ad

    
           pub fn new( 
        
               offset: impl Into<Option<usize>>, 
        
               bytes: &'parser [u8], 
        
               translator: T, 
        
           ) -> Result<Self, Error> { 
        
               let offset = offset.into().unwrap_or(0); 
        
               let func_body = FunctionBody::new(offset, bytes); 
        
               Ok(Self { 
        
                   func_body, 
        
                   bytes, 
        
                   translator, 
        
               }) 
        
           }

This allows us to avoid a nasty work around.

danielstuart14 · 2024-11-01T20:37:36Z

#1197 requires this change.

Robbepop · 2024-11-16T15:23:51Z

With bytecodealliance/wasm-tools#1906 merged we can also merge this PR once wasmparser v0.221 has been released which might happen next week. 🎉

cc @danielstuart14

Robbepop added 2 commits May 19, 2024 15:20

update wasmparser dependency to upstream

cd00ad1

Merge branch 'main' into rf-update-wasmparser

0829922

Robbepop added 2 commits July 26, 2024 17:04

update to wasmparser v0.214.0

1839745

use WasmFeatures in Config

181eba6

The new WasmFeatures type requires a bit less stack space due to bitfield usage. Ideally we use our own bitfield based type but this is more work.

Robbepop mentioned this pull request Jul 26, 2024

wasmparser: Performance regressions since v0.100.2 bytecodealliance/wasm-tools#1701

Closed

Robbepop mentioned this pull request Oct 4, 2024

Implement the Wasm gc proposal #775

Open

Robbepop added 8 commits October 6, 2024 13:07

Merge branch 'main' into rf-update-wasmparser

05165cd

adjustments after merging with main

fe664ba

reinstate type check in TableEntity::init

b70b16d

update to wasmparser v0.218.0

08d7f61

enable GC_TYPES when REFERENCE_TYPES is enabled

7958bc1

shift method to reduce diff

956471e

remove outdated comment

f93b112

refactor: shrink LazyFuncTranslator stack size

2ba92d3

Robbepop mentioned this pull request Oct 6, 2024

Improve and extend translation benchmarks #1227

Merged

Merge branch 'main' into rf-update-wasmparser

c785870

Robbepop changed the title ~~Replace wasmparser-nostd fork with upstream wasmparser as dependency~~ Replace wasmparser-nostd fork with upstream wasmparser Oct 6, 2024

Robbepop added 3 commits October 6, 2024 20:07

Merge branch 'main' into rf-update-wasmparser

8e9267d

Merge branch 'main' into rf-update-wasmparser

8b7bbfd

update wasmparser to v0.219.0

47bf45a

Robbepop mentioned this pull request Oct 9, 2024

Add a component-model feature to wasmparser bytecodealliance/wasm-tools#1845

Merged

Merge branch 'main' into rf-update-wasmparser

dfbb0d3

Robbepop added 2 commits October 19, 2024 13:14

Merge branch 'main' into rf-update-wasmparser

9184417

use new FuncBody::as_bytes API where useful

fb21283

This allows us to avoid a nasty work around.

Robbepop mentioned this pull request Oct 29, 2024

Refactor crate features for more control over hash-maps usage #1265

Merged

Robbepop mentioned this pull request Nov 1, 2024

Implement the Wasm custom-page-sizes proposal #1197

Open

Robbepop added 4 commits November 13, 2024 00:34

Merge branch 'main' into rf-update-wasmparser

2389118

leftovers from merge

514c150

update to wasmparser v0.220 and propagate crate features

45593bb

update Cargo.lock

ade9d7a

Robbepop mentioned this pull request Nov 15, 2024

wasmparser: optimize type section valiadation by specializing over Wasm feature subset bytecodealliance/wasm-tools#1906

Merged

Robbepop added 6 commits November 16, 2024 16:57

Merge branch 'main' into rf-update-wasmparser

3a797d4

fixes after merge

d681825

update wasm-tools dependencies

e8739b8

update cargo deps

e68c2fc

disable memory64 for wasm-smith

e0a5adf

Merge branch 'main' into rf-update-wasmparser

8ee3373

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace `wasmparser-nostd` fork with upstream `wasmparser` #1141

Replace `wasmparser-nostd` fork with upstream `wasmparser` #1141

Robbepop commented Jul 26, 2024 •

edited

Loading

codecov bot commented Jul 26, 2024 •

edited

Loading

Robbepop commented Jul 26, 2024 •

edited

Loading

Robbepop commented Jul 29, 2024

Robbepop commented Oct 6, 2024 •

edited

Loading

Robbepop commented Oct 11, 2024

Robbepop commented Oct 19, 2024 •

edited

Loading

danielstuart14 commented Nov 1, 2024 •

edited

Loading

Robbepop commented Nov 16, 2024 •

edited

Loading

Replace wasmparser-nostd fork with upstream wasmparser #1141

Are you sure you want to change the base?

Replace wasmparser-nostd fork with upstream wasmparser #1141

Conversation

Robbepop commented Jul 26, 2024 • edited Loading

codecov bot commented Jul 26, 2024 • edited Loading

Codecov Report

Robbepop commented Jul 26, 2024 • edited Loading

Robbepop commented Jul 29, 2024

Robbepop commented Oct 6, 2024 • edited Loading

tl;dr Conclusions

Tiny-Keccak

Spidermonkey

ERC-20

Robbepop commented Oct 11, 2024

Robbepop commented Oct 19, 2024 • edited Loading

danielstuart14 commented Nov 1, 2024 • edited Loading

Robbepop commented Nov 16, 2024 • edited Loading

Replace `wasmparser-nostd` fork with upstream `wasmparser` #1141

Replace `wasmparser-nostd` fork with upstream `wasmparser` #1141

Robbepop commented Jul 26, 2024 •

edited

Loading

codecov bot commented Jul 26, 2024 •

edited

Loading

Robbepop commented Jul 26, 2024 •

edited

Loading

Robbepop commented Oct 6, 2024 •

edited

Loading

Robbepop commented Oct 19, 2024 •

edited

Loading

danielstuart14 commented Nov 1, 2024 •

edited

Loading

Robbepop commented Nov 16, 2024 •

edited

Loading