Experimental Tapir support #31086

vchuravy · 2019-02-15T23:38:07Z

Introduction -- What is Tapir

Tapir is a parallel IR extension to LLVM. For the interested I recommend
perusing the Tapir paper. The key takeaway is that parallel (non-concurrent) programs, can be effectively model with cilk-style task parallelism and that given the serial-projection property (serial execution is always a valid execution), it is possible to reason about parallelism in the LLVM compiler.

By doing so Tapir solves one primary problem: Traditionally introducing parallelism into a program, inhibits compiler optimisations. This is due to a variety of reasons, but chiefly that most implementations of parallelism choose to do early-outlining of parallel thunks. Causing the optimizer to only see calls into the runtime/program thunks without context. A classical optimisation that is inhibited by this is loop-invariant-code-movement. In Julia we encounter a different problem (#15276) in which using a closure to outline a thunk can cause performance issues.

Tapir concepts

Syncregion

An opaque token that is used to associate the various parallel IR statements with each other, so that during sync only synchronizes tasks that it is responsible for. Important for nested parallelism and inlining of functions containing parallel constructs.

Detach

Think of this as a "function call" to the parallel region.
detach within %syncregion, %label, %reattach. The label points to the basic-block that starts-off the parallel region and the reattach label points past a reattach statement and represents the execution on the task that is spawning the parallel region.

Reattach

This is the "return" of a parallel region. It reattaches the parallel region to the original code and the label should point to the same basic-block that the reattach label in detach is pointing to.

Sync

Synchronises all tasks with the same syncregion

Goal of this PR

This is very much ongoing research on how to best integrate the ideas from Tapir and the technology behind it into Julia. I want to lay a foundation on which we can build and experiment in the future. While the full-benefits will only be realised if one uses a Tapir enabled LLVM build, one
of my goals is to bring the concepts of tapir into the Julia IR and thereby enable us to do optimizations on parallel code in the Julia IR even on a LLVM that doesn't have the Tapir extension. Right now we are in the very early stages of supporting Tapir in Julia.

It is important to note that the semantics of this representation are parallel and not concurrent,
by this extent this will not and cannot replace Julia Tasks. In order to exemplify this issue see the following Julia task code:

@sync begin
    ch1 = Channel(0)
    ch2 = Channel(0)
    @async begin
        take!(ch1)
        put!(ch2, 1)
    end
    @async begin
        put!(ch1, 1)
        take!(ch2)
    end
end

Doing a serial projection of this code leads to a deadlock.

User interface

In test/tapir.jl I have placed some functions that I have been experimenting with. I do not expect users to directly use @syncregion, @spawn and @sync_end, but rather I think the prototype implementation of a parallel for loop and @sync, @spawn.

@par for i in 1:10
    ...
end

function fib(N)
    if N <= 1
        return N
    end
    x = Ref{Int64}()
    @sync begin # different sync than Tasks
        @spawn begin
            x[] = fib(N-2)
        end
        y = fib(N-1)
    end
    return x[] + y
end

Changes/Current Status

Buildsystem support for Tapir/LLVM
New expr nodes:
- syncregion: Obtain a token to synchronize spawned tasks
- spawn: Spawn a block in a task
- sync: Synchronize all tasks using the same token
New IR nodes:
- detach: Detach a parallel region
- reattach: Join a parallel region
Codegen support for syncregion, detach, reattach, sync

Examples

TODO:

Notes

Make.user

LLVM_VER=svn
USE_TAPIR=1
BUILD_LLVM_CLANG=1
LLVM_GIT_VER="WIP-taskinfo"
LLVM_GIT_VER_CLANG="WIP-csi-tapir-exceptions"
LLVM_GIT_VER_COMPILER_RT="WIP-cilksan-bugfixes"
override CC=gcc-7
override CXX=g++-7

Acknowledgments

Many thanks to T.B. Schardl (@neboat) for the many discussions around Tapir and LLVM.

vchuravy · 2019-02-16T04:51:30Z

Some fun numbers with the fib example. (Note that the overhead of setting up the tasks is the main cost here, serial runtime without tasks is 0.41s, the same version with Julia tasks OOMs my machine)

function fib(N)
    if N <= 1
        return N
    end
    token = @syncregion()
    x1 = Ref{Int64}()
    @spawn token begin
        x1[]  = fib(N-1)
    end
    x2 = fib(N-2)
    @sync_end token
    return x1[] + x2
end

1 Workers

julia> @time fib(40)
  4.883457 seconds (5.16 k allocations: 384.174 KiB)

2 Workers (Note my machine has 2 Cores, SMT-2)

julia> @time fib(40)
  2.448542 seconds (5.16 k allocations: 384.174 KiB)
102334155

4 Workers (Note my machine has 2 Cores, SMT-2)

julia> @time fib(40)
  1.952545 seconds (5.16 k allocations: 384.174 KiB)
102334155

datnamer · 2019-02-19T20:32:34Z

How is this positioned with regards to partr?

StefanKarpinski · 2019-02-19T20:36:40Z

Technically, it's independent of partr. It does impact considerations for the design of the threading API, however, so there's some interaction there. Still mostly independent though.

c42f · 2019-02-26T01:11:21Z

This looks really interesting. How does it relate to the structured concurrency ideas expressed in Trio and libdill et al.? (Described, for example in https://trio.discourse.group/t/structured-concurrency-resources/21 and https://vorpus.org/blog/notes-on-structured-concurrency-or-go-statement-considered-harmful)

This reverts commit 121b6bf.

… should maintain

vchuravy · 2021-02-21T14:58:58Z

Superseded by #39773

vchuravy force-pushed the vc/tapir2 branch 2 times, most recently from 4156e00 to 1755d4c Compare February 16, 2019 04:11

vchuravy mentioned this pull request Feb 16, 2019

Refactor Expr(:simdloop) to Expr(:loopinfo, ...) #31095

Merged

vchuravy force-pushed the vc/tapir2 branch from 530c0b7 to 6a19de7 Compare February 17, 2019 04:21

vchuravy force-pushed the vc/tapir2 branch 2 times, most recently from bc956d1 to 472b26d Compare March 28, 2019 14:54

vchuravy force-pushed the vc/tapir2 branch from 472b26d to 2505046 Compare June 10, 2019 16:39

vchuravy added 19 commits July 6, 2019 13:37

build systems changes for experimental tapir and cilkrts support

db64838

add tapir indvars patch

6dfd4b3

add parallel IR nodes

777ee2e

add Tapir LLVM passes

5f01220

add codegen for parallel IR

2ae4452

add parallel IR examples in test/tapir.jl

bb5b029

switch to cilkrts provided by CilkHub

437b519

handle DetachNode pointing to dead BB

0329583

use loopspawning

844b548

add vecadd_err example

62d0063

add some utility routines

a393be9

Revert "handle DetachNode pointing to dead BB"

15c4457

This reverts commit 121b6bf.

add reattach edge in abstractinterpretation

b3e627d

exception within parallel region causes dce to kill cfg edges that we…

e6d523d

… should maintain

track parallel regions

3b59068

fix Base.Meta

0557f88

begin work on Julia Task Tapir ABI

a0dad0e

treat sync-node more as a terminator

077243d

remove non-sensical code from abstract interpretation

7317f61

vchuravy added 2 commits July 6, 2019 13:37

fixup! treat sync-node more as a terminator

5543fb2

remove usage of mark_parallel region

9e630e7

vchuravy force-pushed the vc/tapir2 branch from 2505046 to 9e630e7 Compare July 6, 2019 17:37

c42f mentioned this pull request Aug 29, 2019

asyncmap: Include original backtrace in rethrown exception #32749

Closed

tkf mentioned this pull request Sep 17, 2019

Taking Structured Concurrency Seriously #33248

Open

tkf mentioned this pull request Oct 15, 2020

Fix a possible typo in renumber_ir_elements! #38033

Closed

tkf mentioned this pull request Feb 21, 2021

RFC: Experimental API for may-happen in parallel parallelism #39773

Open

3 tasks

vchuravy closed this Feb 21, 2021

DilumAluthge deleted the vc/tapir2 branch February 28, 2021 05:53

DilumAluthge added experimental multithreading Base.Threads and related functionality parallelism Parallel or distributed computation labels Mar 4, 2021

MasonProtter mentioned this pull request Mar 29, 2024

New package: Tapir v0.1.0 JuliaRegistries/General#103806

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Experimental Tapir support #31086

Experimental Tapir support #31086

vchuravy commented Feb 15, 2019 •

edited

Loading

vchuravy commented Feb 16, 2019

datnamer commented Feb 19, 2019

StefanKarpinski commented Feb 19, 2019

c42f commented Feb 26, 2019

vchuravy commented Feb 21, 2021

Experimental Tapir support #31086

Experimental Tapir support #31086

Conversation

vchuravy commented Feb 15, 2019 • edited Loading

Introduction -- What is Tapir

Tapir concepts

Syncregion

Detach

Reattach

Sync

Goal of this PR

User interface

Changes/Current Status

Examples

TODO:

Notes

Make.user

Acknowledgments

vchuravy commented Feb 16, 2019

1 Workers

2 Workers (Note my machine has 2 Cores, SMT-2)

4 Workers (Note my machine has 2 Cores, SMT-2)

datnamer commented Feb 19, 2019

StefanKarpinski commented Feb 19, 2019

c42f commented Feb 26, 2019

vchuravy commented Feb 21, 2021

vchuravy commented Feb 15, 2019 •

edited

Loading