Unboxed types #606

damiendoligez · 2016-06-07T12:08:37Z

This PR introduces an annotation and an optimization for concrete types that have only one constructor with one argument, and immutable records with only one field. In both cases, the default representation is a memory block with header and one field. This memory block represents no useful information and we can get rid of it.
Example:

type t = A of string
let x = A "toto"
match x with A s -> s

In this case, the pattern-matching does not even read the header of the block that represents A s because it contains no useful information. With this patch, if we add a [@@unboxed] annotation on the type definition, the compiler will suppress the indirection block and represent the value directly as the string:

type t = A of string [@@unboxed]
let x = A "toto"
assert (Obj.repr x == Obj.repr (match x with A s -> s))

This is useful (for example):

when defining type aliases that we don't want to mix up
when introducing polymorphic values by defining a record type with a single polymorphic field
when using a single-constructor, single-field GADT to introduce an existential type

Some questions

About the name: I called these "unboxed types". Is there any objection and/or better idea?
About the annotation: unboxed is already used for a different (but related) purpose. Is it a good idea to reuse it?
About the annotation: currently you have to activate this optimization with an annotation. Should it be the default instead? See below for compatibility considerations.
About annotations: there is a trap in the current version: if you add a [@unboxed] annotation on the constructor or on the record field, it is ignored. Should it trigger a warning or error, or even just trigger the optimization anyway? (In fact, the same problem already exists with [@immediate]).

Future work

Some more work is needed to make it work nicely with the float array and float record optimizations: currently, if you write:

type t = A of {f : float} [@@unboxed]
type r = {f1 : t; f2 : t; f3: float}
type s = {f4 : float; f5 : float; f6: float}

then a value of type t is represented as a float, but a value of type r is not optimized into a float array, unlike type s.

A related problem is the optimization of array access for t array and the interference with the [@@immediate] annotation (type t = A of int [@@unboxed] [@@immediate] currently fails).

[update: all the above are now implemented]

Compatibility

If activated by default, this optimization will break the FFI because it changes the representation of values. It will also break the compatibility with old marshalled values.

A more subtle incompatibility is with let rec:

type t = A of t [@@unboxed]
let rec x = A x

This must be rejected because it is (compiled as) the same as let rec x = x. The current compiler rejects it without any need for a patch but #556 will be made slightly more complex by this PR.

jhjourdan · 2016-06-07T12:15:10Z

About the let rec issue, another solution would be to make x equal to Val_unit. It actually has the intended semantics.

lpw25 · 2016-06-07T12:17:07Z

About annotations: there is a trap in the current version: if you add a [@unboxed] annotation on the constructor or on the record field, it is ignored. Should it trigger a warning or error, or even just trigger the optimization anyway? (In fact, the same problem already exists with [@Immediate]).

A warning or error might be appropriate, but it should not just trigger the optimisation because there is a related optimisation: unboxing the fields and constructor arguments themselves. For example,

type t = {
  foo: int;
  bar: int;
}

type s = T of t [@unboxed]

could eventually mean that the t should be unboxed within the T constructor, rather than unboxing the T constructor itself.

lpw25 · 2016-06-07T12:21:43Z

From a quick scan, it is not clear to me whether the issue of the float array hack is handled properly. For example, what happens in the following case:

type ext = E : 'a -> t [@@unboxed]

let _ = [| E 1.0; E 1 |]

Drup · 2016-06-07T12:30:41Z

I really whish this could be made the default. Putting aside backward compat, there are no downside at all of this optimization (the let rec issue is inconsequential for most users ..).

If I understand correctly, this will break FFI only if the C side construct/destruct values from a datatype with one constructor, is that right ? Does that even happen in practice ?

damiendoligez · 2016-06-07T13:02:28Z

About the let rec issue, another solution would be to make x equal to Val_unit. It actually has the intended semantics.

It's a bit more complex than just let rec x = x : you need to handle chains of aliases of the form:

let rec x = y
and y = z
and z = x

Anyway, that's more of a discussion for #556 as it's pretty orthogonal to the present optimization.

damiendoligez · 2016-06-07T13:07:17Z

From a quick scan, it is not clear to me whether the issue of the float array hack is handled properly.

Indeed. Your example is not enough to trigger the problem but if I do:

type t = E : 'a -> t [@@unboxed];;
let a = Array.make 10 (E 1.0);;

then I get a (flat-allocated) float array where I can store ints. As far as I can tell, that should segfault, but for some reason it doesn't.

[edit: it does segfault after fixing another bug that was hiding this one]

damiendoligez · 2016-06-07T13:09:57Z

the let rec issue is inconsequential for most users

Indeed, I only noticed the let rec issue because it broke a test in the test suite. I tried on OPAM and none of the OPAM packages I could compile had such a let rec.

lpw25 · 2016-06-07T14:05:54Z

Indeed.

I think what is needed is a check for whether the argument can be either a float or something else. It is hard to define this property precisely. I think a rule allowing only the following three cases would be sufficient, but may be overly conservative:

The argument type has no existential type variables
The argument type is incompatible with float.
The argument type is equal to float.

yminsky · 2016-06-07T16:23:35Z

Is there any hope/plan of making this the default in the future?

damiendoligez · 2016-06-08T13:54:31Z

Is there any hope/plan of making this the default in the future?

I'd like to get some feedback on this. The incompatibilities don't seem to be really problematic, so right now I'm on the fence.

yminsky · 2016-06-08T14:25:08Z

My view is that, except for the c binding issue, it's a pretty clear win. One could imagine a reasonable transition story: start with a flag to turn it on by default, and over time migrate to making it opt out instead of opt in, and then finally remove the old behavior.

If that was available, I believe we'd use that and maybe never bother with the annotation.

alainfrisch · 2016-06-08T14:26:42Z

I'm personally in favor of making this the default but I've a general tendency to be rather liberal in terms of breaking backward compatibility when useful.

Do we have a way to asses the impact of making this the default? I guess there aren't so many sum/record types with a single constructor/field, so perhaps a pass on public OPAM packages, excluding pure packages without C bindings, could give a good indication.

I'd like to point out also that the impact could be in theory larger than C bindings. For instance we have code at LexiFi that processes our runtime type representations and does some (Obj.)magic with the concrete representation of values. The same could happen with code-generators. Of course people using this are on their own (even though they make the same assumptions as in C bindings). Another potential issue is with alternative backends such as js_of_ocaml or Bucklescript (some Javascript code assuming a specific representation of OCaml values).

About the feature itself: this could come later, but I'd also love to be able to unbox specific constructors. Typically it should be possible to unbox at most one (in a given sum type) constructor taking a "string" argument. One could also support "unboxing" a constructor whose argument is itself a sum type (allocating tags properly to avoid clashes).

damiendoligez · 2016-06-08T14:27:03Z

I was thinking of simply making it the default and providing an annotation to turn it off on a specific type.

lpw25 · 2016-06-08T14:31:24Z

I'd be a little hesitant about making it the default whilst the float array hack still exists, because then some existing type definitions will have to become errors until annotated with [@@boxed].

damiendoligez · 2016-06-08T14:32:25Z

Typically it should be possible to unbox at most one (in a given sum type) constructor taking a "string" argument. One could also support "unboxing" a constructor whose argument is itself a sum type (allocating tags properly to avoid clashes).

This would need major changes to the compilation of pattern-matching. Indeed it will come later, if ever.

DemiMarie · 2016-06-09T19:19:53Z

I would like to be able to unbox float, int32, int64, and nativeint within records and algebraic datatypes. But that requires changes to the runtime (specifically the GC).

bluddy · 2016-06-09T20:06:33Z

Not just the GC, but also generic comparison and generic serialization.

nojb · 2016-06-09T20:13:29Z

Also for reference:
http://caml.inria.fr/pub/ml-archives/caml-list/2001/01/2be66fbcb6844de11cac665cd28fbf0d.en.html

DemiMarie · 2016-06-10T16:13:43Z

My solution is to have all pointers contiguous at the start of the object.
That massively reduces the overhead — the GC only looks at the first part
of the object.
On Jun 9, 2016 4:13 PM, "Nicolas Ojeda Bar" notifications@github.com
wrote:

Also for reference:

http://caml.inria.fr/pub/ml-archives/caml-list/2001/01/2be66fbcb6844de11cac665cd28fbf0d.en.html

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
#606 (comment), or mute
the thread
https://github.com/notifications/unsubscribe/AGGWB4Gbs79D_6DVjVnfTlICydLLThdLks5qKHPxgaJpZM4Iv2df
.

alainfrisch · 2016-06-10T16:30:18Z

One needs somehow to mark which prefix of the block must be scanned (i.e. contain normal values). This could be done in several ways:

Using a special marker value within the block (hence "wasting" one word per such block, but this is still better than boxing multiple fields). The advantage is that computing the size of the block is unchanged.
Storing separately in the head the number of scanned words in addition to the total size. At least on 32-bit, we don't have enough bits in the header in general, but we could split the current size field in two only when the tag is between 1 and no_scan_tag. The make computing the actual block size a bit more complex (but still looking only at the header, so it should remain cheap). The rationale is that objects with such tag are never big (arrays have tag 0), since they correspond to sum type constructors. If this representation is used for records (with unboxed fields), one would use a non-0 tag for them.

Generic operations would need to be adapted as well. Without keeping more layout information about these mixed blocks, it will be impossible to preserve the exact same behavior (e.g. inlined floats cannot simply be compared bit-wise), but this is probably fine as long as it is documented (at least if unboxing fields is explicit).

DemiMarie · 2016-06-11T05:13:19Z

My thought is to have the first word in such a "mixed block" have the 10 in the two lower bits, and the number of additional unscanned words in the higher bits. Since OCaml values will always have both of the low-order bits 0 (if they are pointers) or the low order bit 1 (otherwise) this is currently invalid, so it can be used for a special case.

My preference is for unboxing of fields to be the default at some points. Yes, this will break C stubs, but I also think that OCaml should move away from C stubs towards an FFI integrated with the compiler, in which OCaml – not C – is responsible for marshaling of data, either inline within generated code (for ocamlopt) or by compiler-generated C code (for ocamlc).

As for structural comparison, hashing, etc, I think that a bitwise comparison would be enough. I actually think that structural comparison/hashing of mutable objects is a misfeature, and that mutable objects would be better compared/hashed by object identity. But this is the wrong discussion for that. In any case, the only alternative that I can think of is to either (1) do a type lookup for each and every type (using even more metadata) or (2) do mandatory specialization of comparison and hashing (but what about polymorphic recursion?

damiendoligez · 2016-06-15T14:26:43Z

My thought is to have the first word in such a "mixed block" have the 10 in the two lower bits, and the number of additional unscanned words in the higher bits. Since OCaml values will always have both of the low-order bits 0 (if they are pointers) or the low order bit 1 (otherwise) this is currently invalid, so it can be used for a special case.

Unfortunately, that breaks compaction because of infix closure pointers.

DemiMarie · 2016-06-16T03:43:25Z

My proposal would still treat closures specially (or not, with GPR #203). The first word (which points to the compiled code for the closure) would always be treated as a non-pointer (since compiled code never moves).

damiendoligez · 2016-06-20T15:07:37Z

The consensus at the latest developer meeting (2016-06-16) was that this should not be the default at first, until users of the FFI (especially the likes of ctypes and camlidl) have adapted. Also, it would be nice to design a tool that can help with the transition.

yminsky · 2016-06-20T21:39:39Z

Would it make sense to add a flag that flips the default? That way, we could try it out inside of our walls, and perhaps learn more about the FFI issues. It would be nice, at least within our walled garden, to get the performance benefits without needing to litter our code with annotations.

I suppose an alternate approach would be for us to write a PPX that automatically adds the annotation to every single-entry variant.

damiendoligez · 2016-06-29T14:46:15Z

Are you thinking of a configuration flag or a compiler flag?

One problem with making it the default: when checking GATDs for unboxability, the conditions are rather complex, so I'd rather make [@@unboxed] the default for all single-entry variants (and records) and then you get an error if it's not unboxable (because of GADTs + float array optimization). In that (rare) case, you will have to add a [@@ocaml.boxed] annotation to the type declaration. This is rather ugly, but I don't think it would be a good idea to make the representation depend on the details of other type declarations.

yminsky · 2016-06-30T00:06:13Z

I was thinking of a compiler flag, since it would allow you to change the behavior in a library by library way. For us, we'd likely leave the default behavior as is for externally developed libraries.

@lpw25 @diml : what do you guys think about the wisdom of having a compiler flag to flip the default, versus a PPX to determine the behavior? If we do the PPX, it will be easy for us to work out in practice what seems most convenient, and then maybe we can use that to inform what the API should look like in the compiler longer term.

damiendoligez · 2016-07-15T14:02:59Z

@alainfrisch I think it's cleaner like this. Could you review again?

Also, we need to agree on the name.

alainfrisch · 2016-07-15T14:24:54Z

bytecomp/typeopt.ml

+        match Env.find_type p env with
+        | {type_unboxed = {unboxed = true; _}; _} ->
+          Misc.Stdlib.Option.value_default (fun x -> x.desc) ~default:sty
+            (Typedecl.get_unboxed_type_representation env ty)


Shouldn't this logic be moved to Typeopt.scrape in order to benefit to other functions that depend on it (e.g. is_base_type?

(Having is_base_type covered is useful for Translcore.specialize_comparison, for instance.)

damiendoligez · 2016-07-19T14:29:01Z

@alainfrisch I've implemented both of your suggestions. Can we merge now? Do you still want to discuss the name?

alainfrisch · 2016-07-19T14:33:47Z

Can we merge now?

I'm not fully confident with the restrictions related to unboxed float arrays, but I don't think that more code review will address that on my side. So, for me: "yes".

Do you still want to discuss the name?

Nope!

alainfrisch · 2016-07-26T12:57:25Z

utils/warnings.ml

@@ -150,6 +151,7 @@ let number = function
  | No_cmx_file _ -> 58
  | Assignment_to_non_mutable_value -> 59
  | Unused_module _ -> 60
+  | Unboxable_type_in_prim_decl _ -> 61
 ;;

 let last_warning_number = 60


@damiendoligez This should be bumped.

Good catch, I bumped it in 7fee1ea .

mmottl · 2016-09-21T15:54:52Z

I have just run into an issue with the OCaml 4.04 beta2 using this feature. Consider the following code:

type ('a, 'kind) tree =
  | Root : { mutable value : 'a; mutable rank : int } -> ('a, [ `root ]) tree
  | Inner : { mutable parent : 'a node } -> ('a, [ `inner ]) tree

and 'a node = Node : ('a, _) tree -> 'a node  [@@ocaml.unboxed]

type 'a t = ('a, [ `inner ]) tree

The above will fail with:

Error: This type cannot be unboxed because
       it might contain both float and non-float values.
       You should annotate it with [@@ocaml.boxed].

Here is a kludgy workaround:

type ('a, 'kind, 'parent) tree =
  | Root : { mutable value : 'a; mutable rank : int } -> ('a, [ `root ], 'parent) tree
  | Inner : { mutable parent : 'parent } -> ('a, [ `inner ], 'parent) tree

type 'a node = Node : ('a, _, 'a node) tree -> 'a node  [@@ocaml.unboxed]

type 'a t = ('a, [ `inner ], 'a node) tree

I suspect that the type tree is still unavailable when the compiler handles type node in the first version, because it is defined within the same recursive definition. After breaking the recursion with the help of a type variable, the compiler can apparently see that tree is populated with non-floats. It's probably just a matter of preparing lookup tables for type definitions before handling the attribute.

gasche · 2016-09-21T21:46:44Z

Thanks for the catch, I submitted a bug report to make sure we track this properly: PR#7364.

Unboxed types

Co-authored-by: tmattio <tmattio@users.noreply.github.com>

mshinwell changed the title ~~[WIP] Unboxed types~~ Unboxed types Jun 10, 2016

mshinwell added the work-in-progress label Jun 10, 2016

gasche added the caml-weekly-news label Jun 15, 2016

damiendoligez force-pushed the unboxed-types branch 2 times, most recently from b69bc5e to 1c52075 Compare July 15, 2016 14:02

alainfrisch reviewed Jul 15, 2016
View reviewed changes

damiendoligez force-pushed the unboxed-types branch from 1c52075 to 2a5c1c6 Compare July 19, 2016 10:59

damiendoligez force-pushed the unboxed-types branch 2 times, most recently from 3c6eddb to 9cc74e7 Compare July 21, 2016 09:48

GPR#606: add unboxed types

d5a6e50

damiendoligez force-pushed the unboxed-types branch from 9cc74e7 to d5a6e50 Compare July 21, 2016 11:52

alainfrisch mentioned this pull request Jul 25, 2016

Small improvements to type-based optimizations (array, lazy) #712

Merged

damiendoligez merged commit faed766 into ocaml:trunk Jul 25, 2016

alainfrisch reviewed Jul 26, 2016
View reviewed changes

bobzhang mentioned this pull request Sep 2, 2016

optimize immutable block with size one rescript-lang/rescript#381

Closed

damiendoligez deleted the unboxed-types branch September 5, 2016 22:27

yallop mentioned this pull request Jan 2, 2017

A new check that 'let rec' bindings are well formed #556

Merged

damiendoligez mentioned this pull request Jun 9, 2017

PR#7511: Unboxed type with unboxed argument should not be accepted #1133

Merged

camlspotter pushed a commit to camlspotter/ocaml that referenced this pull request Oct 17, 2017

Merge pull request ocaml#606 from damiendoligez/unboxed-types

77195ca

Unboxed types

damiendoligez mentioned this pull request Nov 8, 2017

Make use of the [@@immediate] information when checking [@@unboxed] #1469

Merged

bobzhang mentioned this pull request Oct 26, 2018

to-do list for upgrade rescript-lang/rescript#3000

Closed

37 tasks

This was referenced Mar 14, 2019

Optimization of types #3978

Closed

Inflexibility of unboxed types in recursive declarations #7364

Closed

bobzhang mentioned this pull request Dec 17, 2019

Ideal unboxed support rescript-lang/rescript#4044

Closed

yallop mentioned this pull request Mar 16, 2020

Proposal: constructor unboxing ocaml/RFCs#14

Open

stedolan pushed a commit to stedolan/ocaml that referenced this pull request May 24, 2022

Export Flambda2.classic_mode_types (ocaml#606)

f8645fc

EmileTrotignon pushed a commit to EmileTrotignon/ocaml that referenced this pull request Jan 12, 2024

[create-pull-request] automated change (ocaml#606)

1afda7b

Co-authored-by: tmattio <tmattio@users.noreply.github.com>

Unboxed types #606

Unboxed types #606

Conversation

damiendoligez commented Jun 7, 2016 • edited Loading

Some questions

Future work

Compatibility

jhjourdan commented Jun 7, 2016

lpw25 commented Jun 7, 2016 • edited Loading

lpw25 commented Jun 7, 2016

Drup commented Jun 7, 2016

damiendoligez commented Jun 7, 2016

damiendoligez commented Jun 7, 2016 • edited Loading

damiendoligez commented Jun 7, 2016 • edited Loading

lpw25 commented Jun 7, 2016

yminsky commented Jun 7, 2016

damiendoligez commented Jun 8, 2016

yminsky commented Jun 8, 2016

alainfrisch commented Jun 8, 2016

damiendoligez commented Jun 8, 2016

lpw25 commented Jun 8, 2016

damiendoligez commented Jun 8, 2016

DemiMarie commented Jun 9, 2016

bluddy commented Jun 9, 2016

nojb commented Jun 9, 2016

DemiMarie commented Jun 10, 2016

alainfrisch commented Jun 10, 2016

DemiMarie commented Jun 11, 2016

damiendoligez commented Jun 15, 2016

DemiMarie commented Jun 16, 2016

damiendoligez commented Jun 20, 2016

yminsky commented Jun 20, 2016

damiendoligez commented Jun 29, 2016

yminsky commented Jun 30, 2016

damiendoligez commented Jul 15, 2016

alainfrisch Jul 15, 2016

Choose a reason for hiding this comment

alainfrisch Jul 15, 2016

Choose a reason for hiding this comment

damiendoligez commented Jul 19, 2016

alainfrisch commented Jul 19, 2016

alainfrisch Jul 26, 2016

Choose a reason for hiding this comment

gasche Jul 26, 2016

Choose a reason for hiding this comment

mmottl commented Sep 21, 2016

gasche commented Sep 21, 2016

damiendoligez commented Jun 7, 2016 •

edited

Loading

lpw25 commented Jun 7, 2016 •

edited

Loading

damiendoligez commented Jun 7, 2016 •

edited

Loading

damiendoligez commented Jun 7, 2016 •

edited

Loading