Skip to content

Some research on efficient interpreter instruction dispatch in Rust.

Notifications You must be signed in to change notification settings

Robbepop/interpreter-dispatch-research

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Interpreter Dispatch Research

A collections of different instruction dispatching techniques that can be used in Rust code to drive interpreters.

You can "benchmark" the techniques that have been implemented so far using the following command:

time cargo test --release {name}::counter_loop

Where {name} is one of

  • switch
  • switch_tail
  • closure_loop
  • closure_tree
  • closure_tail

Architectures

All benchmark results are performed on my personal machine. If not stated otherwise benchmarks have been performed using Cargo profile:

[profile.release]
lto = "fat"
codegen-units = 1

switch Technique

This is the usual switch based instruction dispatch that is probably most commonly used amonst simple interpreters. It defines an enum with all the bytecodes and their parameters. For execution it simply loops over the instructions and matches on the kind of the next instruction that is about to be executed.

Benchmark result: 461.57ms

switch_tail Technique

This defines an enum for all bytecodes just like the switch technique. For the execution every instruction calls the next instruction instead of using a central loop. This way every single instruction contains the switch over the instruction kind which may result in bloated compiled source code.

Note: This technique is pretty much unusable in current Rust since Rust is missing guaranteed tail call elimination.

Benchmark result: 472.79ms

closure_loop Technique

Similar to the switch technique it uses a central loop to dispatch the next instruction, however, it replaces the central match with a indirect call to boxed closures that contain everything that is required for the instruction execution.

Benchmark result: 836.79ms

closure_tree Technique

This one has a very interesting instruction structure. Similar to the closure_loop technique it uses closures to model instructions. However, for every instruction that may be followed by other instructions we allow the internal closure to directly call the next instruction indirectly.

This way we can model the control flow according to basic blocks whereas all basic blocks of the source input have exactly one root closure that calls the rest via tail calls.

Note: Doing tail calls in this technique is fine with current Rust since we only do tail calls within basic blocks and not throughout loops since we escape upon branches.

Benchmark result: 944.12ms

closure_tail Technique

This is very similar to the switch_tail technique. The only notable difference is that instructions are now organized as closures as in closure_loop and closure_tree techniques.

Note: This technique is pretty much unusable in current Rust since Rust is missing guaranteed tail call elimination.

Benchmark result: 867.14ms (default profile.release)

About

Some research on efficient interpreter instruction dispatch in Rust.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages