This is a fast and memory-efficient interpreter of the Unlambda programming language.
Compared to unl.c by Emil Jeřábek, which itself is 50-100 times faster than the official c-refcnt interpreter, this interpreter is about 2 times faster and uses about 1/2 of the memory.
Benchmark | unl.c | This interpreter |
---|---|---|
adventure1 | 0.72s | 0.41s |
lisp2 | 2.05s | 1.28s |
elvm-8cc3 | 44.3s | 20.1s |
To achieve this performance, this interpreter introduces several new
combinators (B
, C
, T
, and V
) used only internally to substitute
expressions under evaluation by pattern matching. The following substitution
rules are implemented:
`S`Kf -> `Bf where ```Bfgx = `f`gx
``Sf`Kg -> ``Cfg where ```Cfgx = ``fxg
``SI`Kx -> `Tx where ``Txy = `yx
``S`Tx`Ky -> ``Vxy where ```Vxyz = ``zxy
(Note that V
is the "pair" combinator (also known as "cons") and unrelated to
the Unlambda's v
("black hole" function) builtin.)
For example, when the first argument is given to S
, if it is a partial
application of K
(with one argument f
given), it is replaced by `Bf
.
These auxiliary combinators use less memory and evaluate faster than the original SKI-only combinator expressions.
The object graph of Unlambda execution does not cycle, so memory management can be done using reference counting. In fact, the c-refcnt interpreter and unl.c both use reference counting.
However, since Unlambda frequently creates and destroys objects, reference counting can be quite an overhead. Also, optimizing things like omitting reference counter operations where possible, or overwriting and reusing objects when the counter is 1, as unl.c does, can make the code more complicated.
Therefore, this interpreter adopted a generational garbage collector. For new generation it uses two regions for 256k objects and performs copying GC. Objects that have survived this minor GC twice are moved to the old generation region. When the old generation area is full, a mark-sweep GC is performed on the entire heap as a major GC.
Generational GC is very effective in Unlambda, often collecting more than 99% of objects in minor GC. In benchmark measurements, GC accounted for about 1% of the overall execution time.
In general, Generational GC requires a write barrier to keep track of references from the old generation area to the new generation area. But in this interpreter, once an object is created, it is never rewritten, so references from the old generation to the new generation do not occur. Since no write barrier is needed, the evaluator can be written without worrying too much about GC (although copy GC changes object addresses).
$ make
$ unlambda [options] [program-file]
If program-file is not specified, Unlambda program is read from the standard input.
Options:
-h
: Print help and exit.-v
: Print version and exit.-v0
(default): Do not print any debug information.-v1
: Print some statistics after execution.-v2
: Print logs for major GCs.-v3
: Print logs for minor GCs.
This software is released under the MIT License.
Footnotes
-
Compute
(fib 16)
in Unlambda Lisp. ↩ -
Compile a simple C program with
8cc.c.eir.unl
generated by ELVM (make unl
). ↩