Make PyClassBorrowChecker thread safe #4544
Conversation
Hmm, not sure I follow why taking an …
I realized today that if I make …
CodSpeed Performance Report: Merging #4544 will not alter performance.
(force-pushed from ab32156 to fb19488)
Ping @colesbury - I'd appreciate a code review from you if you have some spare cycles.
I feel quite strongly that we shouldn't merge this PR. The semantics in #4265 (comment) make much more sense to me.
@mejrs I'm very much in agreement with you that keeping these "refcell-like" semantics on the free-threaded build is likely to be completely unusable in real-world conditions. Would you be prepared to accept this PR merging in 0.23, with the understanding that we will follow up in 0.24 with something akin to #4265 (comment), i.e. a more realistic design? I think this allows us to make PyO3 sound for downstream testing of free-threading without needing to introduce breaking changes (of which there are already enough collected for this release).
I disagree strongly that these are unusable in real-world situations. I did a review of pyca/cryptography, and these semantics would be appropriate for all of our usage of non-frozen pyclasses. The reason is that while these types require mutable access, there's no particularly coherent behavior for concurrent multi-threaded usage. These types are, for example, iterators and hashers; there's no real-world use case for concurrent mutation.
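To make the kind of type being described concrete, here is a minimal sketch (hypothetical, not code from pyca/cryptography or this PR) of an "algorithm helper" pyclass: each call needs `&mut self`, but there is no meaningful behaviour for two threads mutating it at once, so a runtime borrow error on a racing mutable access is an acceptable outcome.

```rust
use pyo3::prelude::*;

// Hypothetical "algorithm helper" pyclass; names are illustrative only.
#[pyclass]
struct StreamingHasher {
    state: u64,
}

#[pymethods]
impl StreamingHasher {
    #[new]
    fn new() -> Self {
        StreamingHasher { state: 0 }
    }

    /// `&mut self` goes through the pyclass borrow checker. Under the
    /// refcell-like semantics, a second thread racing into `update` gets a
    /// runtime "already borrowed" error rather than silently racing on `state`.
    fn update(&mut self, data: &[u8]) {
        for byte in data {
            self.state = self.state.wrapping_mul(31).wrapping_add(u64::from(*byte));
        }
    }

    fn finalize(&self) -> u64 {
        self.state
    }
}
```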
I'm OK with that.
They may work in your situation, but not in general. Imagine a List-like pyclass with a …
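For contrast, a hypothetical sketch of the list-like case (the comment above is truncated, so the details here are assumed): with refcell-like semantics, two Python threads calling `append` at the same time do not block each other; one of them fails the mutable borrow and the call raises at runtime.

```rust
use pyo3::prelude::*;

// Hypothetical list-like pyclass; not code from the PR.
#[pyclass]
struct MyList {
    items: Vec<i64>,
}

#[pymethods]
impl MyList {
    #[new]
    fn new() -> Self {
        MyList { items: Vec::new() }
    }

    /// On the free-threaded build, a racing second call to `append` fails the
    /// mutable borrow and raises, which is surprising for an ordinary container.
    fn append(&mut self, item: i64) {
        self.items.push(item);
    }

    fn __len__(&self) -> usize {
        self.items.len()
    }
}
```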
Thanks, that's interesting to see. I guess that makes a lot of sense: if most of your types are either frozen or "algorithm" helpers, then these semantics are workable. I'm still quite worried that the runtime crashes will trip up users, but maybe that's better than silently blocking and degrading throughput. Still, I think a lot of PyO3 users might not be so careful in designing their types! 👍 Thanks all for the agreement; it seems we at least have a plan for 0.23 and a rough roadmap of where we're going after that.
I've finally read the code & test - very concise change in the end. I have just a few thoughts, and then once we've discussed those points let's move ahead with a merge. Thanks!
(force-pushed from d92e733 to 27eaf02)
So I've reviewed the replacement without `SeqCst` and have a different opinion on what the correct implementation is. 🙈
Agreed on the reviews, thanks! I missed the …
I think there are still possible changes warranted for the … Also, now that we're venturing more out of my comfort zone with the less strict memory orderings, it would be great to come up with some tests which give us confidence the implementation is correct. I came up with this one, which fails on …

```rust
#[test]
fn test_thread_safety() {
    #[pyclass]
    struct MyClass {
        x: u64,
    }

    Python::with_gil(|py| {
        let inst = Py::new(py, MyClass { x: 0 }).unwrap();

        let total_modifications = py.allow_threads(|| {
            std::thread::scope(|s| {
                // Spawn a bunch of threads all racing to write to
                // the same instance of `MyClass`.
                let threads = (0..10)
                    .map(|_| {
                        s.spawn(|| {
                            Python::with_gil(|py| {
                                // Each thread records its own view of how many writes it made
                                let mut local_modifications = 0;
                                for _ in 0..100 {
                                    if let Ok(mut i) = inst.try_borrow_mut(py) {
                                        i.x += 1;
                                        local_modifications += 1;
                                    }
                                }
                                local_modifications
                            })
                        })
                    })
                    .collect::<Vec<_>>();

                // Sum up the total number of writes made by all threads
                threads.into_iter().map(|t| t.join().unwrap()).sum::<u64>()
            })
        });

        // If the implementation is free of data races, the total number of writes
        // should match the final value of `x`.
        assert_eq!(total_modifications, inst.borrow(py).x);
    });
}
```
src/pycell/impl_.rs (outdated)

```rust
// On success the read is synchronized to ensure other
// threads don't get a reference before this thread checks
// that it can get one
Ordering::Acquire,
```
I'm not totally convinced it's necessary to establish a happens-before relationship with other threads when they observe the successfully stored flag (as they won't be allowed to read the accompanying data), but I think this also cannot hurt.
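For orientation on where the orderings matter, here is a minimal RwLock-style sketch of an atomic borrow flag. It is illustrative only and is not the actual `BorrowFlag`/`BorrowChecker` implementation in this PR: the writer publishes its data with a Release store, and a reader's Acquire on a successful compare-exchange is what gives it a happens-before edge to those writes.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};

// Illustrative only: a minimal borrow flag, not PyO3's actual code.
const UNUSED: usize = 0;
const HAS_MUTABLE_BORROW: usize = usize::MAX;

struct SimpleBorrowFlag(AtomicUsize);

impl SimpleBorrowFlag {
    fn try_borrow(&self) -> Result<(), ()> {
        let mut value = self.0.load(Ordering::Relaxed);
        loop {
            if value == HAS_MUTABLE_BORROW {
                return Err(());
            }
            // Acquire on success: pairs with the Release in `release_borrow_mut`,
            // so this reader sees every write made under the previous mutable borrow.
            match self.0.compare_exchange_weak(
                value,
                value + 1,
                Ordering::Acquire,
                Ordering::Relaxed,
            ) {
                Ok(_) => return Ok(()),
                Err(observed) => value = observed,
            }
        }
    }

    fn release_borrow(&self) {
        // Release: pairs with the Acquire in `try_borrow_mut`, so a later writer
        // only starts after this reader's reads are complete.
        self.0.fetch_sub(1, Ordering::Release);
    }

    fn try_borrow_mut(&self) -> Result<(), ()> {
        self.0
            .compare_exchange(UNUSED, HAS_MUTABLE_BORROW, Ordering::Acquire, Ordering::Relaxed)
            .map(|_| ())
            .map_err(|_| ())
    }

    fn release_borrow_mut(&self) {
        // Release: publishes the data written under the mutable borrow.
        self.0.store(UNUSED, Ordering::Release);
    }
}
```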
Co-authored-by: David Hewitt <mail@davidhewitt.dev>
Ok, here's yet another test, which fails if the atomic orderings are all relaxed:

```rust
#[test]
fn test_thread_safety_2() {
    struct SyncUnsafeCell<T>(UnsafeCell<T>);
    unsafe impl<T> Sync for SyncUnsafeCell<T> {}

    impl<T> SyncUnsafeCell<T> {
        fn get(&self) -> *mut T {
            self.0.get()
        }
    }

    let data = SyncUnsafeCell(UnsafeCell::new(0));
    let data2 = SyncUnsafeCell(UnsafeCell::new(0));
    let borrow_checker = BorrowChecker(BorrowFlag(AtomicUsize::new(BorrowFlag::UNUSED)));

    std::thread::scope(|s| {
        s.spawn(|| {
            for _ in 0..1_000_000 {
                if borrow_checker.try_borrow_mut().is_ok() {
                    // thread 1 writes to both values during the mutable borrow
                    unsafe { *data.get() += 1 };
                    unsafe { *data2.get() += 1 };
                    borrow_checker.release_borrow_mut();
                }
            }
        });
        s.spawn(|| {
            for _ in 0..1_000_000 {
                if borrow_checker.try_borrow().is_ok() {
                    // if the borrow checker is working correctly, it should be impossible
                    // for thread 2 to observe a difference in the two values
                    assert_eq!(unsafe { *data.get() }, unsafe { *data2.get() });
                    borrow_checker.release_borrow();
                }
            }
        });
    });
}
```
Added that test; I can also reproduce the failure if I make the orderings relaxed.
Thanks for all the work here; I'm now reasonably convinced that what we have is correct. We might yet dream up new tests later, though I don't think we need to wait for those to merge.
Ah, we probably need to gate the new tests with …
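The comment above is truncated, but since the new tests rely on `#[pyclass]`, the gate presumably looks something like the sketch below; the exact cfg attribute is an assumption, not taken from the PR.

```rust
// Hypothetical gating (the exact attribute used in the PR is not shown above):
// only compile the test in configurations where the macros it needs exist.
#[cfg(feature = "macros")]
#[test]
fn gated_thread_safety_test() {
    // test body using `#[pyclass]` / the free-threaded borrow checker goes here
}
```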
Ref #4265 (comment) and replies from @alex.
I tried doing this with an `AtomicUsize`, but because `try_borrow` accepts an immutable reference, I couldn't figure out a way to get that to work without keeping the `Cell`. A mutex seemed like a more natural choice for the existing code structure. I think in principle we could use a mutex on the GIL-enabled build as well, since the lock is only held very briefly in Rust to update the borrow checker state.
Is the new test that only triggers on the free-threaded build OK? I could also write it to test that there isn't an exception on the GIL-enabled build.
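As a rough sketch of the mutex-based shape described in this description (field names, constants, and error types are assumed for illustration and are not the PR's exact code), the borrow state can live behind a `std::sync::Mutex` that is locked only long enough to check and update the counter:

```rust
use std::sync::Mutex;

// Rough sketch of a mutex-protected borrow counter; illustrative only.
const HAS_MUTABLE_BORROW: isize = -1;

struct MutexBorrowChecker {
    // -1 = mutably borrowed, 0 = unused, n > 0 = n shared borrows
    flag: Mutex<isize>,
}

impl MutexBorrowChecker {
    fn try_borrow(&self) -> Result<(), ()> {
        let mut flag = self.flag.lock().unwrap();
        if *flag == HAS_MUTABLE_BORROW {
            Err(())
        } else {
            *flag += 1;
            Ok(())
        }
    }

    fn release_borrow(&self) {
        *self.flag.lock().unwrap() -= 1;
    }

    fn try_borrow_mut(&self) -> Result<(), ()> {
        let mut flag = self.flag.lock().unwrap();
        if *flag == 0 {
            *flag = HAS_MUTABLE_BORROW;
            Ok(())
        } else {
            Err(())
        }
    }

    fn release_borrow_mut(&self) {
        *self.flag.lock().unwrap() = 0;
    }
}
```

The lock is held only for the duration of the check-and-update, which is why the description argues this would also be cheap enough for the GIL-enabled build.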