We define an instruction to be an opcode combined with the operands for the opcode. Running the instrumented runtime for an opcode is encapsulated in the following trait:
pub trait InstructionExecutor<F> {
/// Runtime execution of the instruction, if the instruction is owned by the
/// current instance. May internally store records of this call for later trace generation.
fn execute(
&mut self,
instruction: &Instruction<F>,
from_state: ExecutionState<u32>,
) -> Result<ExecutionState<u32>>;
}
There is a struct VmOpcode(usize)
to protect the global opcode usize
, which must be globally unique for each opcode
supported in a given VM.
Opcodes are partitioned into groups, each of which is handled by a single chip. A chip should be a struct of
type C
and associated Air of type A
which satisfy the following trait bounds:
C: Chip<SC> + InstructionExecutor<F>
A: Air<AB> + BaseAir<F> + BaseAirWithPublicValues<F>
Together, these provide the following functionalities:
- Keygen: Performed via the
Air::<AB>::eval()
function. - Trace Generation: This is done by calling
InstructionExecutor::<F>::execute()
which computes and stores execution records and thenChip::<SC>::generate_air_proof_input()
which generates the trace using the corresponding records.
At the AIR-level, for an AIR to integrate with the OpenVM architecture (constrain memory, read the instruction from the program, etc.), the AIR
communicates over different (virtual) buses. There are three main system buses: the memory bus, program bus, and the
execution bus. The memory bus is used to access memory, the program bus is used to read instructions from the program,
and the execution bus is used to constrain the execution flow. These buses are derivable from the SystemPort
struct,
which is provided by the VmInventoryBuilder
.
The buses have very low-level APIs and are not intended to be used directly. "Bridges" are provided to provide a cleaner interface for
sending interactions over the buses and enforcing additional constraints for soundness. The two system bridges are
MemoryBridge
and ExecutionBridge
, which should respectively be used to constrain memory accesses and execution flow.
You can specify phantom sub-instruction executors by implementing the trait:
pub trait PhantomSubExecutor<F> {
fn phantom_execute(
&mut self,
memory: &MemoryController<F>,
streams: &mut Streams<F>,
discriminant: PhantomDiscriminant,
a: F,
b: F,
c_upper: u16,
) -> eyre::Result<()>;
}
pub struct PhantomDiscriminant(pub u16);
The PhantomChip<F>
internally maintains a mapping from PhantomDiscriminant
to Box<dyn PhantomSubExecutor<F>>>
to
handle different phantom sub-instructions.
Each specific instantiation of a modular VM is defined by the following struct:
pub struct VirtualMachine<SC: StarkGenericConfig, E, VC> {
pub engine: E,
pub executor: VmExecutor<Val<SC>, VC>,
}
The engine type E
should be openvm_stark_backend::engine::StarkEngine<SC>
and the VM config type VC
is
openvm_circuit::arch::config::VmConfig<Val<SC>>
, shown below.
pub trait VmConfig<F: PrimeField32>: Clone + Serialize + DeserializeOwned {
type Executor: InstructionExecutor<F> + AnyEnum + ChipUsageGetter;
type Periphery: AnyEnum + ChipUsageGetter;
/// Must contain system config
fn system(&self) -> &SystemConfig;
fn system_mut(&mut self) -> &mut SystemConfig;
fn create_chip_complex(
&self,
) -> Result<VmChipComplex<F, Self::Executor, Self::Periphery>, VmInventoryError>;
}
A VmConfig
has two associated types: Executor
and Periphery
. The Executor
is typically an enum over chips that
are instruction executors, while Periphery
is an enum for the chips that are not.
See VM Extensions for more details.
Key generation is computed from the VmConfig
describing the VM. The VmConfig
is used to create the VmChipComplex
,
which in turn provides the list of AIRs that are used in the proving and verification process.
Trace generation proceeds from:
VirtualMachine::execute_and_generate_with_cached_program()
with subsets of functionality offered by VirtualMachine::execute()
and VirtualMachine::execute_and_generate()
. The
following struct tracks each continuation segment:
pub struct ExecutionSegment<F: PrimeField32, VC: VmConfig<F>> {
pub chip_complex: VmChipComplex<F, VC::Executor, VC::Periphery>,
pub final_memory: Option<Equipartition<F, CHUNK>>,
pub air_names: Vec<String>,
pub since_last_segment_check: usize,
}
This will:
- Split the execution into
ExecutionSegment
s usingExecutionSegment.execute_from_pc()
, which callsExecutionSegment.should_segment()
to segment online. Note that this creates aVmChipComplex
for each segment fromVmConfig.create_chip_set()
, where each segment contains each chip. It also passes all streams to all segments and runs the generation in serial. - Generate traces for each segment by calling
VmChipSet.generate_proof_input()
, which iterates through all chips in order and callsgenerate_proof_input()
.
Prove generation is performed by calling StarkEngine.prove()
on ProofInput<SC>
created from each segment in
generate_proof_input()
. There is no SDK-level API for this in VirtualMachine
at present.
The integration API provides a way to create chips where the following conditions hold:
- a single instruction execution corresponds to a single row of the trace matrix
- rows of all 0's satisfy the constraints
Most chips in the VM satisfy this, with notable exceptions being Keccak and Poseidon2.
VmAdapterInterface<T>
VmAdapterChip<F>
VmAdapterAir<AB>
VmCoreChip<F, I: VmAdapterInterface<F>>
VmCoreAir<AB, I: VmAdapterInterface<AB::Expr>>
Warning
The word core will be banned from usage outside of this context.
Main idea: each VM chip is created from an AdapterChip
and a CoreChip
. Analogously, the VM AIR is created from an
AdapterAir
and CoreAir
so that the columns of the VM AIR are formed by concatenating the columns from the
AdapterAir
followed by the CoreAir
.
The AdapterChip
is responsible for all interactions with the VM system: it owns interactions with the memory bus,
program bus, execution bus. It will read data from memory and expose the data (but not intermediate pointers, address
spaces, etc.) to the CoreChip and then write data provided by the CoreChip back to memory.
The AdapterAir
does not see the CoreAir
, but the CoreAir
is able to see the AdapterAir
, meaning that the same
AdapterAir
can be used with several CoreAir
's. The AdapterInterface provides a way for CoreAir
to provide expressions to be
included in AdapterAir
constraints -- in particular AdapterAir
interactions can still involve CoreAir
expressions.
Traits with their associated types and functions:
/// The interface between core AIR and adapter AIR.
pub trait VmAdapterInterface<T> {
type Reads;
type Writes;
type ProcessedInstruction;
}
pub trait VmAdapterChip<F: Field> {
/// Records generated by adapter before main instruction execution
type ReadRecord: Send;
/// Records generated by adapter after main instruction execution
type WriteRecord: Send;
/// `AdapterAir` should not have public values
type Air: BaseAir<F> + Clone;
type Interface: VmAdapterInterface<F>;
fn preprocess(
&mut self,
memory: &mut MemoryController<F>,
instruction: &Instruction<F>,
) -> Result<(<Self::Interface as VmAdapterInterface<F>>::Reads, Self::ReadRecord)>;
fn postprocess(
&mut self,
memory: &mut MemoryController<F>,
instruction: &Instruction<F>,
from_state: ExecutionState<u32>,
ctx: AdapterRuntimeContext<F, Self::Interface<F>>,
read_record: &Self::ReadRecord,
) -> Result<(ExecutionState<u32>, Self::WriteRecord)>;
/// Populates `row_slice` with values corresponding to `record`.
/// The provided `row_slice` will have length equal to `self.air().width()`.
/// This function will be called for each row in the trace which is being used, and all other
/// rows in the trace will be filled with zeroes.
fn generate_trace_row(
&self,
row_slice: &mut [F],
read_record: Self::ReadRecord,
write_record: Self::WriteRecord,
aux_cols_factory: &MemoryAuxColsFactory<F>,
);
fn air(&self) -> &Self::Air;
}
pub trait VmAdapterAir<AB: AirBuilder>: BaseAir<AB::F> {
type Interface: VmAdapterInterface<AB::Expr>;
fn eval(
&self,
builder: &mut AB,
local: &[AB::Var],
interface: AdapterAirContext<AB::Expr, Self::Interface>,
);
}
pub trait VmCoreChip<F, I: VmAdapterInterface<F>> {
type Record: Send;
type Air: BaseAirWithPublicValues<F> + Clone;
fn execute_instruction(
&self,
instruction: &Instruction<F>,
from_pc: F,
reads: Reads<F, A::Interface<F>>,
) -> Result<(AdapterRuntimeContext<F, A::Interface<F>>, Self::Record)>;
/// Populates `row_slice` with values corresponding to `record`.
/// The provided `row_slice` will have length equal to `self.air().width()`.
/// This function will be called for each row in the trace which is being used, and all other
/// rows in the trace will be filled with zeroes.
fn generate_trace_row(&self, row_slice: &mut [F], record: Self::Record);
}
pub trait VmCoreAir<AB, I>: BaseAirWithPublicValues<AB::F>
where
AB: AirBuilder,
I: VmAdapterInterface<AB::Expr>,
{
fn eval(
&self,
builder: &mut AB,
local_core: &[AB::Var],
from_pc: AB::Var,
) -> AdapterAirContext<AB::Expr, I>;
}
// For passing from CoreChip to AdapterChip
pub struct AdapterRuntimeContext<T, I: VmAdapterInterface<T>> {
/// Leave as `None` to allow the adapter to decide the `to_pc` automatically.
pub to_pc: Option<T>,
pub writes: I::Writes,
}
// For passing from `CoreAir` to `AdapterAir` with T = AB::Expr
pub struct AdapterAirContext<T, I: VmAdapterInterface<T>> {
/// Leave as `None` to allow the adapter to decide the `to_pc` automatically.
pub to_pc: Option<T>,
pub reads: I::Reads,
pub writes: I::Writes,
pub instruction: I::ProcessedInstruction,
}
Warning
You do not need to implement Air
on the struct you implement VmAdapterAir
or VmCoreAir
on.
To create a chip used to support a set of opcodes in the VM, we start with types
A: VmAdapterChip
C: VmCoreChip
A::Air: VmAdapterAir
C::Air: VmCoreAir
where A::Air
and C:Air
are implemented on all relevant AirBuilder
required by the backend. We can then create VmChipWrapper
and VmAirWrapper
types below:
pub struct VmChipWrapper<F, A: VmAdapterChip<F>, C: VmCoreChip<F, A>> {
pub adapter: A,
pub core: C,
pub records: Vec<(A::ReadRecord, A::WriteRecord, C::Record)>,
// For accessing memory
offline_memory: Arc<Mutex<OfflineMemory<F>>>,
}
pub struct VmAirWrapper<A, C> {
pub adapter: A,
pub core: C,
}
They implement the following traits:
InstructionExecutor<F>
is implemented onVmChipWrapper<F, A, C>
, where theexecute()
function:- calls
preprocess()
onA
withmemory
and the rawinstruction
- calls
execute_instruction()
onC
with the rawinstruction
,from_pc
, andreads
frompreprocess()
- calls
postprocess()
onA
with the rawinstruction
,from_state
, theoutput: AdapterRuntimeContext
fromexecute_instruction()
, and theread_record
- stores the resulting
(read_record, write_record, core_record)
- calls
Air<AB>
,BaseAir<F>
, andBaseAirWithPublicValues<F>
are implemented onVmAirWrapper<A::Air, C::Air>
, where theeval()
function implements constraints via:- calls
eval()
onC::Air
- calls
eval()
onA::Air
- calls
Chip<SC>
is implemented onVmChipWrapper<F, A, C>
with associated AirVmAirWrapper<A::Air, C::Air>
, wheregenerate_air_proof_input()
iterates through all records from instruction execution and generates one row of the trace from each record. Importantly, rows which do not correspond to an instruction execution are not affected and left to be identically zero. Each used row in the trace is created via:- calls
generate_trace_row()
onA
with theadapter_row
,read_record
,write_record
- calls
generate_trace_row()
onC
with thecore_row
,core_record
- calls
Convention: If you have a new Foo
functionality you want to support, make structs FooCoreChip, FooCoreAir
. Either use existing BarAdapterChip, BarAdapterAir
or make your own. Then typedef
pub type FooChip<F> = VmChipWrapper<F, BarAdapterChip<F>, FooCoreChip<F>>;
pub type FooAir = VmAirWrapper<BarAdapterAir, FooCoreAir>;
If there is a risk of ambiguity, use name BarFooChip
instead of just FooChip
.
pub struct BasicAdapterInterface<
T,
PI,
const NUM_READS: usize,
const NUM_WRITES: usize,
const READ_SIZE: usize,
const WRITE_SIZE: usize,
>(PhantomData<T, PI>);
impl<..> VmAdapterInterface for BasicAdapterInterface<..> {
type Reads = [[T; READ_SIZE]; NUM_READS];
type Writes = [[T; WRITE_SIZE]; NUM_WRITES];
type ProcessedInstruction = PI;
}
pub struct MinimalInstruction<T> {
pub is_valid: T,
/// Absolute opcode number
pub opcode: T,
}
pub struct ImmInstruction<T> {
pub is_valid: T,
/// Absolute opcode number
pub opcode: T,
pub imm: T
}