Cranelift: alias analysis: track each individual table/heap separately #4166

cfallin · 2022-05-19T23:17:57Z

In #4163 we are introducing an alias analysis and redundant-load elimination / store-to-load-forwarding transform.

This initial implementation categorizes all memory accesses as one of four kinds: to a "heap", to a "table", to the "vmctx", or to everything else. These four categories can be optimized independently of each other; e.g., a store to a table does not prevent a load from a heap from being merged with an earlier load of the same address.

This is correct, and simple, and allows us to keep just four bits in MemFlags and four u32s for the "last store" vector, per instruction. However, it is somewhat more imprecise than we would like, especially in the future when we expect multiple modules, memories, tables, etc. to become more common.
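As a rough illustration of the scheme described above, here is a minimal sketch of per-category "last store" tracking over a straight-line sequence of accesses. All names (`Category`, `Op`, `redundant_loads`) are hypothetical and are not Cranelift's actual types; instructions are identified by their position in the slice, `addr` stands in for a symbolic address, and `Option<usize>` stands in for the packed u32s mentioned above.

```rust
use std::collections::HashMap;

/// The four coarse categories of abstract state (hypothetical names).
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
enum Category {
    Heap,
    Table,
    Vmctx,
    Other,
}

/// A simplified memory operation; `addr` is a symbolic address, not a real one.
#[derive(Clone, Copy, Debug)]
enum Op {
    Load { cat: Category, addr: u32 },
    Store { cat: Category },
}

/// Returns `(load, earlier_load)` pairs where `load` can reuse the value of
/// `earlier_load`: same category, same address, and no intervening store
/// to that category.
fn redundant_loads(ops: &[Op]) -> Vec<(usize, usize)> {
    // One "last store" slot per category.
    let mut last_store: [Option<usize>; 4] = [None; 4];
    // Maps (category, address, last store seen in that category) to the
    // first load observed under that key.
    let mut seen: HashMap<(Category, u32, Option<usize>), usize> = HashMap::new();
    let mut redundant = Vec::new();

    for (i, op) in ops.iter().enumerate() {
        match *op {
            // A store only invalidates loads in its own category.
            Op::Store { cat } => last_store[cat as usize] = Some(i),
            Op::Load { cat, addr } => {
                let key = (cat, addr, last_store[cat as usize]);
                match seen.get(&key) {
                    Some(&earlier) => redundant.push((i, earlier)),
                    None => {
                        seen.insert(key, i);
                    }
                }
            }
        }
    }
    redundant
}
```

In this sketch, a store to a table between two heap loads of the same address leaves the second load redundant, while a store to the heap does not. The imprecision discussed above shows up because every heap shares the single `Heap` slot.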

Thus, we should investigate ways of efficiently representing an arbitrary number of heaps or tables as separate categories of abstract state. This may require an extended MemFlags, or indirection of some kind, or some limit (first 16, 32, ... memories are privileged).


fitzgen · 2022-05-20T17:58:43Z

One possibility is that we have "heap0", "heap1", "heap2", and finally "heap_other" (or even just heap0 and heap_other).

The CG has talked about using hints to mark which memories need to be fast and should use virtual memory tricks, for browsers that can't use those tricks for every memory. Maybe we could use those same hints to map onto heap0/1/2 vs other.
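A minimal sketch of the heap0/1/2 vs. heap_other idea, assuming the simplest possible policy where the first three memory indices get their own category. The names `HeapCategory`, `heap_category`, and `may_alias` are hypothetical; a hint-driven assignment, as suggested above, would replace the index-based `match`.

```rust
/// A fixed number of privileged heaps plus one catch-all (hypothetical names).
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum HeapCategory {
    Heap0,
    Heap1,
    Heap2,
    HeapOther,
}

/// Assumed policy: the first three memories are tracked individually and
/// every other memory shares the catch-all category.
fn heap_category(memory_index: u32) -> HeapCategory {
    match memory_index {
        0 => HeapCategory::Heap0,
        1 => HeapCategory::Heap1,
        2 => HeapCategory::Heap2,
        _ => HeapCategory::HeapOther,
    }
}

/// Accesses to two memories must be treated as possibly aliasing whenever
/// they land in the same category, even if the memories are distinct.
fn may_alias(memory_a: u32, memory_b: u32) -> bool {
    heap_category(memory_a) == heap_category(memory_b)
}
```

With this mapping, accesses to memories 0 and 1 are kept apart, while memories 3 and 4 both fall into `HeapOther` and are conservatively assumed to alias.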

fitzgen · 2022-05-20T17:59:07Z

> or some limit (first 16, 32, ... memories are privileged).

Ah I think this is the same thing I was getting at with heap0/1/2 vs heap_other.

bjorn3 · 2022-05-20T19:24:42Z

> One possibility is that we have "heap0", "heap1", "heap2", and finally "heap_other" (or even just heap0 and heap_other).

That won't help for stack slots though. Those are really important for cg_clif. Maybe we could have a side table recording for each instruction which alias set it is part of?

cfallin · 2022-05-20T19:28:30Z

@bjorn3 yes, that could work, as long as it is optional (for memory-overhead reasons). The advantage of MemFlags now is that it's a u8 (or maybe extended to 16 or 32 bits) that can ride along in the InstructionData.
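One way the optional side table could look, as a sketch only: the map from instruction to alias set is allocated lazily, so functions that never record an alias set pay no memory for it. `Inst`, `AliasSet`, and `AliasSideTable` are hypothetical stand-ins, not Cranelift types, and the fallback here (assume aliasing when either instruction has no entry) is an assumption; a real implementation would presumably fall back to the coarse MemFlags categories instead.

```rust
use std::collections::HashMap;

/// Hypothetical instruction id.
type Inst = u32;

/// Hypothetical alias-set id, e.g. one per heap, table, or stack slot.
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
struct AliasSet(u32);

/// Optional per-instruction alias-set table; `None` until first use.
#[derive(Default)]
struct AliasSideTable {
    sets: Option<HashMap<Inst, AliasSet>>,
}

impl AliasSideTable {
    /// Records the alias set of `inst`, allocating the table on first use.
    fn set(&mut self, inst: Inst, set: AliasSet) {
        self.sets.get_or_insert_with(HashMap::new).insert(inst, set);
    }

    /// Looks up the alias set of `inst`, if one was recorded.
    fn get(&self, inst: Inst) -> Option<AliasSet> {
        self.sets.as_ref()?.get(&inst).copied()
    }

    /// Conservative query: two instructions are proven not to alias only
    /// when both have recorded alias sets and the sets differ.
    fn may_alias(&self, a: Inst, b: Inst) -> bool {
        match (self.get(a), self.get(b)) {
            (Some(x), Some(y)) => x == y,
            _ => true,
        }
    }
}
```

A frontend such as cg_clif could, under this sketch, assign one `AliasSet` per stack slot, while frontends that never call `set` leave the table unallocated.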

cfallin added the enhancement, cranelift, and cranelift:goal:optimize-speed labels May 19, 2022

cfallin mentioned this issue May 19, 2022

Add a basic alias analysis with redundant-load elim and store-to-load fowarding opts. #4163

Merged
