upgrade threads from experimental to stable-with-caveats #35686

Merged: 2 commits into master, May 5, 2020

Conversation

@JeffBezanson (Member, PR author):

Any more warnings we should list?

@JeffBezanson added the `docs` (This change adds or pertains to documentation) and `multithreading` (Base.Threads and related functionality) labels on May 1, 2020.
@ViralBShah (Member):

Darn, the failures look like they are from the old doc PRs. I'm looking into fixing them.

Comment on lines +232 to +235
* After a task starts running on a certain thread (e.g. via `@spawn`), it
will always be restarted on the same thread after blocking. In the future
this limitation will be removed, and tasks will migrate between threads.
@tkf (Member), May 1, 2020:

Can you avoid thread migration by `@spawn @async f()`? If so, maybe mention that this is the forward-compatible way to rely on this property? (Not sure if we should be encouraging this pattern, though.)

Reply (Member):

Hm, curious. Probably not; instead it seems like `@async` should inherit the sticky bit from the parent in the future.

Reply (Member):

A function using `@async` internally may be called via `@spawn`. If `@async` inherits the sticky bit, this internal use of `@async` effectively becomes equivalent to `@spawn`, right?

I think it's possible to use `@async` in a way that is concurrency-safe but not thread-safe. If so, isn't it dangerous to inherit the sticky bit?
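
For concreteness, a minimal sketch of the pattern under discussion (not from the thread); it assumes the undocumented `sticky` field on `Task` keeps its current meaning, and `work` is a placeholder:

    using Base.Threads: @spawn, threadid

    work() = threadid()

    # @spawn creates a non-sticky task (free to migrate between threads in a
    # future Julia version); wrapping the body in @async creates a sticky task
    # that stays on whichever thread the enclosing @spawn task started on.
    t = @spawn begin
        inner = @async work()
        wait(inner)
        (inner.sticky, threadid())   # (true, id of the thread the work ran on)
    end
    fetch(t)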

@tkf (Member) commented May 1, 2020:

I had hoped that we would get structured concurrency (#33248) before `@spawn` becomes stable...

@JeffBezanson (Member, PR author):

The purpose here is to define what "stable" means in this context. It obviously can't mean that we will never change or add anything ever again. `@spawn` is a primitive, and we want to say that you can use it, and we will not delete it in 1.x. Is it possible to document what it means in a way that is useful but allows the changes you want? For example, would we entirely disallow `@spawn` with no `@sync`? If so, I really think the value of that is dubious. To me it seems like we have "goto" now, and might add "while" loops in the future. What's the problem?
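
Not part of the PR text, but to make the `@sync`/`@spawn` shape being debated concrete, a minimal sketch:

    using Base.Threads: @spawn

    # Structured use: @sync waits for every lexically enclosed @spawn task
    # before returning, so no task outlives the function.
    function sum_of_squares(xs)
        partial = zeros(Int, length(xs))
        @sync for (i, x) in enumerate(xs)
            @spawn begin
                partial[i] = x^2   # each task writes only its own slot
            end
        end
        return sum(partial)
    end

    sum_of_squares(1:10)   # 385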

@JeffBezanson (Member, PR author):

Also, I believe structured concurrency applies as soon as you have Tasks. The only thing `@spawn` adds is that Tasks can run on multiple threads, and structured concurrency is not about threads (i.e. it makes sense even on 1 thread). We've had Tasks since v0.1, so if you need structured concurrency before Tasks we are definitely a bit late.

@tkf (Member) commented May 2, 2020:

Sorry, my comment was a bit too rant-y and not good for starting a constructive discussion.

I hate to say this, but, realistically speaking, I don't think introducing structured concurrency with/before stabilization of `@spawn` is doable or appropriate. I think there is simply too much work to do to introduce structured concurrency even in a non-strict form. Also, we can't have a strict form of structured concurrency without breaking `@async`/`schedule` anyway.

I'll reply to the technical parts of your comments in #33248.

See also [Synchronization](@ref lib-task-sync).

## Atomic operations

@vtjnash (Member), May 2, 2020:

I think we should mark these as subject to probable change.

@JeffBezanson (Member, PR author):

Ok. Do you think we'd need to delete them entirely, or just add a better interface?

Reply (Member):

Yeah, just don’t want to commit exactly to these yet
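
For context (not part of the diff), a minimal example of the `Threads.Atomic` interface these comments refer to, as it exists at the time of this PR and subject to the change discussed:

    using Base.Threads: Atomic, atomic_add!, @threads

    # A shared counter updated atomically from several threads; without
    # Atomic, the increments would be a data race.
    acc = Atomic{Int}(0)
    @threads for i in 1:1000
        atomic_add!(acc, 1)
    end
    acc[]   # 1000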

## Caveats

At this time, most operations in the Julia runtime and standard libraries
are thread safe.
Reviewer (Member):

Suggested change:
- are thread safe.
+ are thread safe, except for `eval`.

Reply (Member):

If we’re going to specifically call out one thing here, I think it should be the data structures. `eval` is covered by the top-level bullet below (but could probably be specifically called out).

@JeffBezanson (Member, PR author):

Right, the idea is for the subsequent list to cover all the exceptions to this. I called out `include` because that will generally involve many kinds of top-level expressions and so probably isn't safe, but I don't think it's useful to say that `eval` is unsafe. Is `eval(:(1+2))` unsafe?

Reply (Member):

I think the general idea is that if it can't be done without `eval`, it's unsafe right now, whereas something that could have been written as a closure or similar is fine.

@JeffBezanson (Member, PR author):

Ok, that's a clearer rule. Maybe we could be even more specific and list some things like method definitions, global assignments, ... ?
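
As a rough illustration of the rule described above (an added sketch, not wording from the thread):

    # Evaluating a pure expression touches no shared global state:
    eval(:(1 + 2))

    # These are the kinds of top-level effects the caveat is about, and the
    # cases to avoid running concurrently from multiple threads:
    eval(:(global counter = 0))   # global assignment
    eval(:(double(x) = 2x))       # method definition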

running in other threads that are allocating memory. In these cases it may
be necessary to insert a manual call to `GC.safepoint()` to allow GC to run.
This limitation will be removed in the future.
* Avoid using finalizers in conjunction with threads, particularly if they
Reviewer (Member):

This item feels like it sticks out in the list as being unclear and unreasonable to state. I think we need to be more explicit here about the fact that finalizers run concurrently with other code (and kind of always have). I can work on some wording and sample code if that helps.

@JeffBezanson (Member, PR author):

Yes, that would be good, thanks.

Reviewer (Member):

Suggested change:
- * Avoid using finalizers in conjunction with threads, particularly if they
+ * Be aware that some libraries' finalizers may sometimes break badly if
+   threads are enabled (though they are likely already broken). This may require
+   some transitional work across the ecosystem before threading can be widely
+   adopted with confidence. Further details on this are below.

## Safe use of Finalizers

Because finalizers can interrupt any code, they must be very careful in how
they interact with any global state. Unfortunately, the main reason finalizers
are used is to update global state (a pure function is generally rather
pointless as a finalizer). This leads to a bit of a conundrum, but there are a
few approaches to dealing with it:

1. When single-threaded, code could call the internal
`jl_gc_enable_finalizers` function to prevent finalizers from being scheduled
inside a critical region. Internally, this is used inside some functions (such
as our C locks) to prevent recursion when doing certain operations (incremental
package loading, codegen, etc.). Note that the combination of an (inefficient)
spinlock and setting this flag can be used to make finalizers safe.

2. A second strategy, employed by Base in a couple of places, is to explicitly
try to acquire the lock non-recursively. For example, `WeakKeyDict` and
`Distributed.finalize_ref` take this approach (though both only handle the
single-threaded case correctly). The example below is updated to work in the
multi-threaded case:

        function finalize_ref(r::AbstractRemoteRef)
            if r.where > 0 # Nothing to do if cleanup has already run
                if islocked(client_refs) || !trylock(client_refs)
                    # delay finalizer for later if we aren't free to acquire the lock
                    finalizer(finalize_ref, r)
                    return nothing
                end
                try # `lock` should always be followed by `try` in correct code
                    if r.where > 0 # Must check again here
                        # Do actual cleanup here
                        r.where = 0
                    end
                finally
                    unlock(client_refs)
                end
            end
            nothing
        end

3. A related third strategy is to use a yield-free queue. We don't currently
have a lock-free queue implemented in Base, but
`Base.InvasiveLinkedListSynchronized{T}` is suitable. This can frequently be a
good strategy to use for code with event loops. For example, this strategy is
employed by `Gtk.jl` to manage lifetime ref-counting. In this approach, we
don't do any explicit work inside the `finalizer`, and merely add it to a queue
to be dealt with at a safer time. In fact, `schedule` already works this way,
so defining the finalizer as `x -> @spawn do_cleanup(x)` is one example of this
approach. Note however that this doesn't control which thread `do_cleanup`
runs on, so `do_cleanup` would still need to acquire a lock. That doesn't need
to be true if you implement your own queue, since you can explicitly drain that
queue only from your own thread. Additionally, instead of the `@spawn`
convenience macro, you could manually schedule it on a specific thread:

        t = @task do_cleanup(x)
        ccall(:jl_set_task_tid, Cvoid, (Any, Cint), t, 0) # pin the task to thread 1 (the C API numbers threads from 0)
        schedule(t)

@JeffBezanson (Member, PR author):

That's excellent, thanks. I'm not sure we want to mention `jl_set_task_tid`, since then it might be considered "documented" and we will have to keep it.

Reply (Member):

I’m okay with that, though we might want to make it look less like a hack
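
To make item 3 of the suggestion above concrete, here is a sketch of the `x -> @spawn do_cleanup(x)` style; `LibHandle`, `release_handle`, and `do_cleanup` are placeholders for illustration, not APIs in Base:

    using Base.Threads: @spawn

    # Stand-ins for a real external resource and its cleanup routine.
    release_handle(ptr) = nothing
    do_cleanup(h) = release_handle(h.ptr)

    mutable struct LibHandle
        ptr::Ptr{Cvoid}
        function LibHandle(ptr)
            h = new(ptr)
            # The finalizer does no real work itself; it only schedules a task
            # (safe, since `schedule` uses a yield-free queue), and the actual
            # cleanup runs later, acquiring any locks it needs at that point.
            finalizer(x -> @spawn(do_cleanup(x)), h)
            return h
        end
    end

    LibHandle(C_NULL)   # cleanup will run via a spawned task after GC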

NEWS.md: review comment marked outdated and resolved.
@JeffBezanson merged commit d07fadf into master on May 5, 2020, and deleted the jb/threadstatus branch.