|
| 1 | +# OCamlrun WebAssembly |
| 2 | + |
| 3 | +This is a script for building the OCaml bytecode interpreter (`ocamlrun`) for WebAssembly using Emscripten. |
| 4 | + |
| 5 | +## Why a VM in a VM? |
| 6 | + |
| 7 | +### Why not compile OCaml code straight to WebAssembly? |
| 8 | + |
| 9 | +Currently this is not easy because, AFAIK, there is no maintained, high-fidelity LLVM output target for OCaml, nor any other WebAssembly-compatible output target. Writing one would also be difficult because WebAssembly currently doesn't provide the hooks needed to scan its stack for GC pointers. Maybe OCaml already maintains a manual stack or can be modified to do so; I don't know. |
| 10 | + |
| 11 | +The interpreter is written in portable C and maintains its own stack, which makes it easy to target WebAssembly. |
| 12 | + |
| 13 | +However, a big downside of WebAssembly code right now is that VMs implement it as an AOT compilation toolchain. That means that you spend a lot of time upfront compiling code. |
| 14 | + |
| 15 | +If you have a lot of code paths that are not hot, you're wasting a lot more cycles compiling it than it would take to run it in the interpreter. When you run it in the interpreter you can start executing code instantly. |
| 16 | + |
| 17 | +It might be the case that browsers change to an interpreter mode in the future which might change this equation. |
| 18 | + |
| 19 | +### Why not compile OCaml code to JavaScript like js_of_ocaml or BuckleScript? |
| 20 | + |
| 21 | +This is probably still the best technique if you want full GC interop, and it gives you JIT capabilities for hot paths. |
| 22 | + |
| 23 | +However, if you have a lot of code, that means a lot of JavaScript to parse and compile on the client during startup. The benefit of running bytecode is that it can start executing immediately. |
| 24 | + |
| 25 | +Additionally, by using the GC built specifically for OCaml, you take advantage of more predictable and consistent GC behavior across browsers. |
| 26 | + |
| 27 | +Multicore OCaml is coming, and so are `SharedArrayBuffer` and Atomics for the web. By using a custom memory model we'll be able to take advantage of parallelism in the browser. |
| 28 | + |
| 29 | +## Installation |
| 30 | + |
| 31 | +I have only tried this on OS X so far and haven't polished any build scripts yet. |
| 32 | + |
| 33 | +### 1. Browser |
| 34 | + |
| 35 | +To test this you'll need a browser with WebAssembly enabled such as [Chrome Canary](https://www.google.com/chrome/browser/canary.html). |
| 36 | + |
| 37 | +### 2. Emsdk |
| 38 | + |
| 39 | +First you need to install the Emscripten SDK. According to the [WebAssembly Developer's Guide](http://webassembly.org/getting-started/developers-guide/), you currently need to build the toolchain from source. The guide says to include `binaryen-master-64bit`, but that didn't work for me and I don't currently need it; `sdk-incoming-64bit` should be enough. (Note: Rebuilding clang from source needs a lot of disk space.) |
| 40 | + |
| 41 | +``` |
| 42 | +git clone https://github.com/juj/emsdk.git |
| 43 | +cd emsdk |
| 44 | +./emsdk install sdk-incoming-64bit |
| 45 | +./emsdk activate sdk-incoming-64bit |
| 46 | +``` |
| 47 | + |
| 48 | +After these steps, the installation is complete. To enter an Emscripten compiler environment in the current shell, run: |
| 49 | + |
| 50 | +``` |
| 51 | +source ./emsdk_env.sh |
| 52 | +``` |
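After sourcing the environment script, a quick sanity check can confirm that the Emscripten tools are actually on `PATH`. This is a minimal sketch, not part of emsdk: `check_tool` is a hypothetical helper name, and the only assumption is that `emcc` and `emrun` are the commands you need.

```shell
# Hypothetical helper (not part of emsdk): report whether a command is on PATH.
check_tool() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "$1: found at $(command -v "$1")"
  else
    echo "$1: not found (was emsdk_env.sh sourced in this shell?)"
    return 1
  fi
}

# Check the two Emscripten commands used later in this README.
missing=0
for tool in emcc emrun; do
  check_tool "$tool" || missing=1
done
[ "$missing" -eq 0 ] || echo "Some Emscripten tools are missing from PATH."
```

If either tool is reported missing, re-run `source ./emsdk_env.sh` from the emsdk folder in the same shell before building.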
| 53 | + |
| 54 | +Return to the project folder. |
| 55 | + |
| 56 | +### 3. Check out the OCaml source code |
| 57 | + |
| 58 | +The OCaml source is included as a Git submodule of this project. |
| 59 | + |
| 60 | +``` |
| 61 | +git submodule update --init --recursive |
| 62 | +``` |
| 63 | + |
| 64 | +### 4. Build |
| 65 | + |
| 66 | +In the root of this repo run the build script. |
| 67 | + |
| 68 | +``` |
| 69 | +./build.sh |
| 70 | +``` |
| 71 | + |
| 72 | +It will compile the `example.ml` file into OCaml bytecode. The bytecode is then embedded into Emscripten's virtual file system. |
| 73 | + |
| 74 | +It will also build the OCaml bytecode interpreter and GC into a WebAssembly file. |
| 75 | + |
| 76 | +`example.html` contains the bootstrapping script. |
| 77 | + |
| 78 | +You can try it out in your browser using `emrun` (or just serve it over HTTP yourself). |
| 79 | + |
| 80 | +``` |
| 81 | +emrun --browser=chrome_canary example.html |
| 82 | +``` |
| 83 | + |
| 84 | +With the default example, it should print its output to the browser console. |
| 85 | + |
| 86 | +### Next Steps |
| 87 | + |
| 88 | +This is just the easiest setup to get started. However, Emscripten and ocamlrun have a lot of features, such as a virtual file system and dynamic linking, that are often not applicable in a web context. We'll want to strip that down into the smallest possible library that can fit neatly into an existing web toolchain. |
| 89 | + |
| 90 | +This script also compiles with the `-O2` flag. We should see if we can cut down on file size with one of the other optimization flags (for example `-Os` or `-Oz`). |
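To compare optimization flags, it helps to measure the output after each build. The following is a minimal sketch, assuming the build leaves one or more `.wasm` files in the current directory (this README does not name the output file); `report_size` is a hypothetical helper, not part of the build script.

```shell
# Hypothetical helper: print the raw and gzip-compressed size of a file in bytes.
report_size() {
  raw=$(wc -c < "$1" | tr -d ' ')
  gz=$(gzip -c "$1" | wc -c | tr -d ' ')
  echo "$1: $raw bytes raw, $gz bytes gzipped"
}

# Report every .wasm file in the current directory, if there are any.
# The glob is an assumption about where the build output lands.
for f in ./*.wasm; do
  if [ -e "$f" ]; then
    report_size "$f"
  fi
done
```

The gzipped number is included because the file would normally be served compressed, so it is closer to what a browser actually downloads.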