
Conversation

@mhauru mhauru commented Nov 3, 2025

Closes #202

I tried to come up with a version of the MWE in that issue that could be put into tests, but failed. My hope was to write a slow hash method for particular CacheKeys only, not for all of them. But Base.hash falls back on this:

hash(@nospecialize(x), h::UInt) = hash_uint(3h - objectid(x))  # On 1.12.1
hash(@nospecialize(data), h::UInt) = hash(objectid(data), h)  # On main

which doesn't give me much surface to hold onto. Hence no tests added.
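To illustrate why that approach fails: `Base.hash` dispatches on the type of its argument, not on its value, so a slow method can only target a whole type. Here is a minimal sketch with a made-up `SlowKey` type (not part of Libtask) standing in for CacheKey:

```julia
# Hypothetical sketch: hash dispatches on types, so slowness can only be
# attached to a whole type, never to particular values of it. All Libtask
# CacheKeys share one concrete type, so a "slow hash for some keys only"
# test method can't be written this way.
struct SlowKey
    id::Int
end

# This slows hashing for *every* SlowKey, not just particular ids.
function Base.hash(k::SlowKey, h::UInt)
    sleep(0.001)  # simulate a slow hash
    return hash(k.id, h)
end

d = Dict(SlowKey(1) => "a")
d[SlowKey(1)]  # -> "a"; every Dict operation now pays the slow hash
```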

That implementation of hash makes me extra confused as to how we can reliably hit this in Turing.jl tests. That hash should take no time at all.

@mhauru mhauru requested a review from penelopeysm November 3, 2025 18:14

@penelopeysm penelopeysm left a comment


Nice! And yeah, this is pretty much impossible to test (much like how I couldn't get an MWE without monkey-patching hash). Quite happy to sign off on it and forget about it.


penelopeysm commented Nov 3, 2025

> That implementation of hash makes me extra confused as to how we can reliably hit this in Turing.jl tests. That hash should take no time at all.

On the original Turing issue I claimed that

> I assume that there's some performance regression on hash in 1.12 that causes this error to manifest, and we were lucky enough that 1.11 was fast enough to not error, but 1.12 wasn't.

so I decided to put it to the test:

using Libtask, Chairmarks
f(x) = x
sig = Tuple{typeof(f),Int}
key = Libtask.CacheKey(Base.get_world_counter(), sig)
@be hash(key)

1.11:

julia> @be hash(key)
Benchmark: 3162 samples with 1013 evaluations
 min    27.229 ns (1 allocs: 16 bytes)
 median 27.724 ns (1 allocs: 16 bytes)
 mean   29.536 ns (1 allocs: 16 bytes, 0.03% gc time)
 max    4.827 μs (1 allocs: 16 bytes, 98.61% gc time)

1.12:

julia> @be hash(key)
Benchmark: 3038 samples with 778 evaluations
 min    35.882 ns (1 allocs: 16 bytes)
 median 36.204 ns (1 allocs: 16 bytes)
 mean   39.878 ns (1 allocs: 16 bytes, 0.06% gc time)
 max    5.440 μs (1 allocs: 16 bytes, 98.54% gc time)

Idk, I guess maybe that's enough to cause problems. Also, the Base dictionary code hashes multiple keys in a loop, so the difference in time gets scaled multiplicatively as well (and I suppose there could be performance regressions in other parts of the Dict code too, although my naive guess is that hashing is the most time-consuming part of the Dict code(?)).
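The scaling point above can be made concrete: each Dict insertion or lookup calls `hash` at least once, so the ~8 ns per-call regression measured above is paid once per key operation. A toy illustration, using a made-up `CountedKey` type (not from Libtask) to count hash calls:

```julia
# Sketch: count how often Dict code calls hash. Each insertion hashes the
# key at least once (rehashing on growth may hash it again), so any
# per-call slowdown in hash scales with the number of Dict operations.
const HASH_CALLS = Ref(0)

struct CountedKey
    x::Int
end

function Base.hash(k::CountedKey, h::UInt)
    HASH_CALLS[] += 1
    return hash(k.x, h)
end

d = Dict{CountedKey,Int}()
for i in 1:100
    d[CountedKey(i)] = i  # at least one hash call per insertion
end
HASH_CALLS[] >= 100  # hash was called at least once per operation
```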


mhauru commented Nov 4, 2025

Nice, good check. I thought about checking the performance effects on Libtask too, but then decided that there's little point, because this clearly just needs to be done. If we start optimising Libtask there's other fruit hanging way lower. I can tell you that the totality of the test suite runs noticeably, but not horribly, slower (7.something seconds vs 8.something, I forget now).


github-actions bot commented Nov 4, 2025

Libtask.jl documentation for PR #203 is available at:
https://TuringLang.github.io/Libtask.jl/previews/PR203/


mhauru commented Nov 4, 2025

I changed the docstring of GlobalMCCache to a comment, because Documenter was complaining that something in copyable_task had a docstring that wasn't included in the docs. This feels like a perverse incentive created by Documenter, but I couldn't reproduce the error locally and don't have time to dig into it, so whateva imma merge this thing.

The other two failures are Turing.jl/Mooncake issues and not new.

@mhauru mhauru merged commit c5dd2bc into main Nov 4, 2025
12 of 14 checks passed
@mhauru mhauru deleted the mhauru/fix-concurrency-cache branch November 4, 2025 10:14


Development

Successfully merging this pull request may close these issues.

Writing to mc_cache is not thread-safe