A reasonablely good fuzzer for libfuzzer
-style WebAssembly harnesses
and a suite of C/C++/Rust projects for WASM fuzzer evals.
Our WebAssembly fuzzer prototype builds on LibAFL and a custom WASM JIT. We insert instrumentation during the fuzzing campaign, and thus can run different combinations of feedback kinds on-demand.
wasmfuzz
implements fast reset and runs harnesses that are designed for libfuzzer
's persistent mode APIs.
Compatible modules export LLVMFuzzerTestOneInput
and malloc
in order to be able to allocate space for the input data and run each fuzz case. Our implementation resets the module's state after each execution.
Based on a preliminary experiment comparing wasmfuzz
and cargo-fuzz
(libfuzzer), fuzzing efficacy should be on-par with native fuzzers.
wasmfuzz
currently requires Linux hosts for its snapshot-restore mechanism.
To build form source you need a somewhat recent stable or nightly Rust toolchain.
- From source:
cargo install --force --git https://github.com/CISPA-SysSec/wasmfuzz
- From a git checkout:
cargo install --force --path .
- Pre-built binaries: https://github.com/CISPA-SysSec/wasmfuzz/releases/download/initial-commit/wasmfuzz-x86_64-unknown-linux-gnu.2.31
Pre-built binaries are dynamically linked against an old glibc
version and should be compatible with most Linux environments.
Run a simple fuzzing campaign:
wasmfuzz fuzz --timeout=1h --cores 8 --dir corpus/ the-module.wasm
Create a line coverage report:
wasmfuzz cov-html --corpus corpus/ --output report/ the-module.wasm
Other options are available via wasmfuzz --help
and wasmfuzz fuzz --help
.
- Custom WebAssembly JIT with the Cranelift code generator as its backend.
- Input mutation is handled by LibAFL.
- Fully deterministic execution with fast-ish snapshot restore via CoW memory mappings.
- Pluggable instrumentation passes that can be configured on-the-fly.
- Simple ensemble fuzzing strategy that periodically swaps out instrumentation options.
- Coverage visualization via DWARF line debug information and DWARFv5 Embedded Source Code. We adapt Coverage.py's HTML template for our reports.
- Function-level, basic-block-level, edge-level code coverage:
src/instrumentation/code_coverage.rs
- Arithmetic comparison distance coverage:
src/instrumentation/cmpcov.rs
- Function all parameters and return values:
src/instrumentation/call_params.rs
- Path coverage:
src/instrumentation/path_hash.rs
- Function / loop hitcounts:
src/instrumentation/perffuzz.rs
- Execution trace instruction limits:
src/instrumentation/instruction_limit.rs
- Memory access values, pointer ranges:
src/instrumentation/mem.rs
- Shortest input that reaches each function / basic block:
src/instrumentation/input_size.rs
Note: We have not evaluated the effectiveness of these passes yet.
- Exits if a crash is found
- No core pinning
- Does not support LibAFL-style network scaling (we use simple ad-hoc in-process message passing and structure sharing, no
llmp
) - No support for the WebAssembly 2.0 Component Model.
Harness Suite (./harness-suite/)
Building these requires a Docker/Podman and Python installation. Use make -C harness-suite
to build all projects.
-
Standard
libfuzzer
-compatible harness API (LLVMFuzzerTestOneInput
). -
Mix of C/C++ and Rust targets. C/C++ projects require porting while Rust targets have drop-in support.
-
Currently builds 18 C/C++ and 29 Rust projects. This includes 6 out of the 19 fuzzbench targets at the moment.
-
Optimized builds with full debug info: Harness modules also contain their source code so we can emit coverage reports without any additional files.
We target the Lime1 series of WebAssembly, which is WebAssembly 1.0 in combination with a few standardized post-1.0 features like the bulk-memory-opt
extension.
WASM Variants of Other Fuzzers (./wasm-fuzzers/)
We retrofit support for WebAssembly to native fuzzers via a KISS recompilation strategy:
wasm2c
translates WASM modules to plain C files, which we then run through standard source-level fuzzing setups.
Currently, we include the following fuzzers in our evaluation scripts:
wasmfuzz
- WAFL (a WASM harness fuzzer, adapted for libfuzzer harnesses)
- libfuzzer-wasm2c
- aflpp-wasm2c
- libafl_libfuzzer-wasm2c
Symbols get renamed during the wasm2c process for native fuzzers. Due to this, hardcoded support for input-to-state mutations might be impacted due to symbol names and/or non-standard unrolled translated implementations. We have not confirmed the impact of this specific issue though.
A FuzzM container is included but we're not using it since it doesn't work with our harnesses due to WebAssembly spec incompatibilities.
-
Our JIT currently depends on a minor patch to Cranelift's JIT setup: [Upstream issue]
-
rustc
support for-Zembed-source
for embedded sources in harness binaries: [Upstream PR] -
symbolic
only supports DWARF embedded sources at exactly v12.10.0-v12.11.1: [Upstream PR]