We (@nchataing as intern, @gasche as advisor) implemented a variant of @yallop's constructor-unboxing specification as an experimental branch that we would now like to discuss and consider for upstreaming.
Our intent was to implement the simplest possible form of unboxing in presence of several constructors, and leave more advanced aspects -- anything that could be left off -- to further work.
We support a per-constructor [@unboxed]
attribute, that can be used in a variant type as long as the set of values corresponding to each constructor (boxed or unboxed) remain disjoint.
For example:
type bignum =
| Short of int [@unboxed] (* represented directly by an integer *)
| Long of Gmp.t (* Block of tag 0 (first non-unboxed constructor) *)
We define the head of an OCaml value as follows:
- the head of an immediate value
v
is the pair(Imm, v)
- the head of a heap block with tag
t
is the pair(Block, t)
.
(In other words, the head tracks whether a value is immediate or a block, and for blocks only keeps the tag.)
The "head shape" of a type is a (slight over-approximation of) the set of heads of all possible values of this type.
Now consider a variant type declaration containing one or several constructors annotated with [@unboxed]
:
type ('a, 'b, ...) t =
| Const0 (* some constant constructors *)
| Const1
| ...
| Const{m}
| NonConst0 of t00 * t01 * ...
| Nonconst1 of t10 * t11 * ...
| ...
| NonConst{n} of t{n}0 * t{n}1 * ...
| Unboxed1 of u0 [@unboxed]
| Unboxed2 of u1 [@unboxed]
| ...
| Unboxed{l} of u{l} [@unboxed]
(For simplicity we wrote above all constant constructors first, then all non-constant constructors then all unboxed constructors. But in fact we support arbitrary interleaving of these categories, and the representation is exactly the same as long as the ordering within constant constructors and within non-constant constructors is preserved.)
The compiled representation of this type is as follows:
- as before, constant constructors
Const{k}
are represented by the immediate numberk
- as before, non-constant constructors
Nonconst{k} of ...
are represented by a heap block with tagk
- unboxed constructors
Unboxed{k} of u{k}
are represented directly by the value of typeu{k}
, without any boxing
This definition is rejected statically if the unboxed constructors overlap with the other values of the type, in the following precise sense:
-
We compute the "boxed head shape"
BS
of this type without the unboxed constructors; by definition of the head shape, this is the set{(Imm, 0), (Imm, 1), ..., (Imm, m)} ∪ {(Block, 0), (Block, 1), ,.., (Block, n)}
. -
Then we compute the "unboxed shapes"
US{k}
of each unboxed constructor, that is the head shape ofu{k}
. -
The type is accepted if and only if the shapes
BS, US0, US1, ..., US{l}
are disjoint from each other. The head shape of the whole shape is then the disjoint unionBS ⊎ US0 ⊎ US1 ⊎ ... ⊎ US{l}
.
Unknown/abstract types are assumed to have a "top" shape with containing potentially all heads. (This should be refined when the abstract type is used to represent an FFI type with a precise shape implemented in C; supporting head shape assertions on abstract types is future work.)
(* rejected *)
type t =
| Int of int [@unboxed] (* shape: (Imm, _) *)
| Unit (* shape: (Imm, 0), conflicts with Int above *)
(* accepted *)
type t =
| Int of int [@unboxed] (* shape: (Imm, _) *)
| Box of t (* shape: (Block, 0), as the first non-constant non-unboxed constructor *)
(* shape(t): (Imm, _) ∪ {(Block, 0)} *)
(* accepted *)
type prod = t * t
and t =
| Int of int [@unboxed] (* shape: (Imm, _): any immediate *)
| String of string [@unboxed] (* shape: (Block, Obj.string_tag) (Obj.string_tag is 252) *)
| Prod of prod [@unboxed] (* shape: (Block, 0) *)
(* shape(t): (Imm, _) ∪ {(Block, 0), (Block, Obj.string_tag)} *)
(** With abstract types *)
type abstract
(* accepted *)
type t =
| Int of int [@unboxed] (* shape: (Imm, _) *)
| Abs of abstract (* shape: (Block, 0) *)
(* shape(t): (Imm, _) ∪ {(Block, 0)} *)
(* rejected *)
type t =
| Int of int (* shape: (Block, 0) *)
| Abs of abstract [@unboxed] (* any shape, conflicts with Int *)
(** Nested unboxing *)
type t1 =
| Int of int [@unboxed]
| Block of unit
(* shape(t1): (Imm, _) ∪ {(Block, 0)} *)
(* rejected *)
type t2 =
| T1 of t1 [@unboxed] (* shape: (Imm, _) ∪ {(Block, 0)} *)
| S of string (* shape: (Block, 0), conflicts with T1 *)
(* accepted *)
type t3 =
| T1 of t1 [@unboxed] (* shape: {(Imm, _), (Block, 0)} *)
| S of string [@unboxed] (* shape: (Block, Obj.stringₜag) *)
(* shape(t3): (Imm, _) ∪ {(Block, 0)} ∪ {(Block, Obj.string_tag)} *)
Comparison with Yallop's proposal RFC#14
Jeremy Yallop's proposal uses a global annotation [@@unboxed]
on all constructors at once, we use a per-constructor annotation [@unboxed]
. (The RFC mentions this as a possible extension in "Extension: partial unboxing".) It would be easy to interpret [@@unboxed]
as just "[@unboxed] on all constructors", but we have not implemented this yet.
A major difference is that the RFC#14 specification suggests renumbering constructors in some cases, where the representation of C of foo [@unboxed]
is taken to be different from the representation of foo
, in order to avoid conflicts with other constructors at this type. We do not support any such renumbering:
- the representation of
Unboxed of foo [@unboxed]
is always the representation offoo
- the representation of
Boxed of foo
always uses the block tag consecutive/next/succedent to the previous boxed-constructor tag in the declaration (filtering out unboxed constructors).
(Note: @stedolan calls this aspect of RFC#14 "conflating inlining and disjointness". We only deal with disjointness.)
When the compiler is in flat-float-array
mode, soundness relies on the property that all OCaml types are "separated": they contain either (1) only float
values, or (2) no float
value. New forms of unboxing must preserve this property.
We can track separatedness as part of the head-shape computation for unboxed type declaration, by adding to head-shape data a "separated" bit (see the details in HEAD_SHAPE.impl.md). We reject type declarations whose head-shape is not separated (when in flat-float-array
mode).
It may be that this tracking is precise enough to entirely replace the pre-existing "separability analysis" of the type-checker. We have not implemented it yet, and have not evaluated this possibility.
We define static accept/reject decisions for partially-unboxed types using "head shapes", which are defined in terms of the value-representation strategy of the main OCaml implementation. Should we have a more abstract definition, that leaves more room to other representations in alternative implementations?
We have not studied this question yet and we believe it is a pressing question. In particular, any choice that would end up being merged in the language probably MUST support the js_of_ocaml value representation. (Do you know of a reference document that describes the js_of_ocaml value representation? Pointers are welcome are we are not jsoo experts ourselves. cc @hhugo.)
Our intuition is that we could fine a "weakening" of our current specification that distinguishes less different sort of shapes -- thus rejects more definitions -- and gives good flexibility for alternative implementations. Here are some things we could do:
- We could stop making assumptions about the shape of function closures (currently: {Closure_tag, Infix_tag}), preventing the unboxing of closure-holding constructors.
- We could also weaken our assumptions about built-in containers (string/byte, arrays, double-arrays, etc.)
- We could stop distinguishing "float" from immediates (ouch!) if jsoo does this. What about Int32, Int64, should they be known as custom values?
In other words: what amount of runtime type information should we require from OCaml implementations?
At the limit, one extreme choice would be to only reason on the tag of variant constructors (constant or not), which are distinguishable from each other in any OCaml implementation, and not make any other assumption about head shapes (map all types except variants to the "top" shape). This would reject most unboxing definitions, leave maximal freedom for language implementations. Unfortunately this would also prevent the actually-interesting uses of the feature we know about, which mostly resolve around unboxing an int
-carrying constructor.
This is an aspect of our design on which we need external feedback from people with a good taste for these matters. (cc @xavierleroy, @damiendoligez, @yallop, @stedolan, @lpw25, @let-def, etc.).