gh-120496: Use CAS approach for rangeiter_next #120534

corona10 · 2024-06-15T01:08:14Z

Issue: Sequence iterator thread-safety #120496

Fidget-Spinner · 2024-06-15T05:58:42Z

Thanks for doing this! Do you have any benchmark results on how much the perf impact is, versus, say, using Py_BEGIN_CRITICAL_SECTION?

corona10 · 2024-06-15T06:04:37Z

Thanks for doing this! Do you have any benchmark results on how much the perf impact is, versus, say, using Py_BEGIN_CRITICAL_SECTION?

I will try to prepare :)

corona10 · 2024-06-15T06:27:49Z

@Fidget-Spinner @colesbury

I am not sure this will be a fair benchmark, but it shows different strengths in terms of technique and size of N.

Single thread

import pyperf

runner = pyperf.Runner()
runner.timeit(name="bench iter single",
              stmt="""
data = list(it)
""",
              setup = """
N = 100
it = iter(range(N))
"""
              )

pyperf compare_to single_base.json single_cas.json
Benchmark hidden because not significant (1): bench iter single
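One caveat when reading the single-thread result, sketched here under the assumption that pyperf's timeit, like the standard-library timeit, runs the setup string once and then executes the statement repeatedly: the iterator created in setup is consumed by the first execution, so later executions would time list() on an already exhausted iterator. The plain-Python snippet below only illustrates that behaviour; it is not part of the benchmark.

```python
N = 100
it = iter(range(N))   # what the setup string does, once

first = list(it)      # first execution of the statement drains the iterator
second = list(it)     # any later execution sees an exhausted iterator

print(len(first), len(second))
```

Creating the iterator inside the statement, as the multi-thread benchmark does, avoids this.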

Multi thread

import pyperf

runner = pyperf.Runner()
runner.timeit(name="bench iter multi",
              stmt="""
for _ in range(100):
    it = iter(range(N))
    with concurrent.futures.ThreadPoolExecutor() as executor:
        data = list(executor.map(lambda _: next(it), range(N)))
""",
              setup = """
import concurrent.futures
N = 100
"""
              )

Mean +- std dev: [multi_baseline] 284 ms +- 11 ms -> [multi_cas] 277 ms +- 20 ms: 1.03x faster
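As a quick sanity check on the reported figure, the arithmetic can be redone from the numbers in the pyperf line above. This is only a sketch: the variable names are illustrative, and comparing the gap with a single standard deviation is a rough rule of thumb, not the significance test pyperf itself applies.

```python
# Values copied from the pyperf output above (milliseconds).
baseline_mean, baseline_sd = 284.0, 11.0
cas_mean, cas_sd = 277.0, 20.0

speedup = baseline_mean / cas_mean   # ratio pyperf reports as "x faster"
gap = baseline_mean - cas_mean       # absolute difference of the means

print(f"speedup: {speedup:.2f}x, gap: {gap:.0f} ms")
print("gap smaller than either std dev:", gap < min(baseline_sd, cas_sd))
```

The 7 ms gap is smaller than both standard deviations, so the measured difference is small relative to the run-to-run spread.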

Fidget-Spinner · 2024-06-15T06:31:56Z

Hmmm the perf difference is not that large. That might imply that we should just use critical section for simplicity, and hope that someone on PyPI publishes a "prange()" or something like that.

corona10 · 2024-06-15T06:32:25Z

That might imply that we should just use critical section for simplicity, and hope that someone on PyPI publishes a "prange()" or something like that.

Yeah, I prefer this one.
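For readers unfamiliar with the critical-section idea, here is a minimal pure-Python analogue. It is a sketch under stated assumptions, not CPython's implementation: LockedRangeIter is a hypothetical class, and a threading.Lock stands in for Py_BEGIN_CRITICAL_SECTION. The point is that the length check and the update of start happen under one lock, so no other thread can run between them.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class LockedRangeIter:
    """Hypothetical analogue of a range iterator guarded by a lock."""

    def __init__(self, r):
        self._start = r.start
        self._step = r.step
        self._len = len(r)
        self._lock = threading.Lock()

    def __iter__(self):
        return self

    def __next__(self):
        # Check and update are one indivisible step under the lock.
        with self._lock:
            if self._len <= 0:
                raise StopIteration
            result = self._start
            self._start += self._step
            self._len -= 1
            return result


# Same access pattern as the multi-thread benchmark: N calls to next().
N = 1000
it = LockedRangeIter(range(N))
with ThreadPoolExecutor() as executor:
    data = list(executor.map(lambda _: next(it), range(N)))
```

Every value of the range is handed out exactly once, in whatever order the worker threads happen to run.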

eendebakpt · 2024-06-15T06:32:42Z

Objects/rangeobject.c

+        if (len <= 0) {
+            return NULL;
+        }
+        long result = _Py_atomic_load_long_relaxed(&r->start);


What if we have a range object with len=1 and many threads simultaneously at this point? The check on len has already been done, so the threads each increment the value of start and we end up with a value for result larger than end.

corona10 · 2024-06-15

Maybe fail at _Py_atomic_compare_exchange_long?

Ah, yes, I understand what you pointed out.

There will be a slight timing issue. Then yes, let's just use the critical section that @Fidget-Spinner also preferred.

To guard against the over-increment, we could validate the result by comparing it with the end value.
But let's just use the critical section approach.
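The race raised in this review thread comes from checking the length once and then updating start separately. One way to close that window, sketched here as an illustration and not as the code from this PR, is to move the bounds check inside the compare-and-swap retry loop, so the check and the update always refer to the same observed value. AtomicLong and cas_next are hypothetical names, and a lock merely emulates the atomic operations that C would provide.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class AtomicLong:
    """Toy stand-in for a C atomic long; a lock emulates the atomicity."""

    def __init__(self, value=0):
        self._value = value
        self._lock = threading.Lock()

    def load(self):
        with self._lock:
            return self._value

    def compare_exchange(self, expected, desired):
        """Store `desired` only if the current value is `expected`."""
        with self._lock:
            if self._value != expected:
                return False
            self._value = desired
            return True


def cas_next(index, r):
    """Claim the next item of range `r`, or return None when exhausted."""
    while True:
        i = index.load()
        if i >= len(r):  # bound is re-checked on every retry
            return None
        if index.compare_exchange(i, i + 1):
            return r[i]  # only the thread that won slot i gets here


# The len=1 case from the review: eight callers, one item.
r = range(1)
index = AtomicLong()
with ThreadPoolExecutor(max_workers=8) as pool:
    results = list(pool.map(lambda _: cas_next(index, r), range(8)))
```

Exactly one caller receives 0 and the other seven see exhaustion, so the counter never moves past the end of the range.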

corona10 · 2024-06-15T06:40:22Z

See: #120540

bedevere-app bot mentioned this pull request Jun 15, 2024

Sequence iterator thread-safety #120496

Closed

corona10 requested review from colesbury and Fidget-Spinner June 15, 2024 01:08

corona10 added the DO-NOT-MERGE label Jun 15, 2024

corona10 added 2 commits June 15, 2024 10:18

pythongh-120496: Use CAS approach for rangeiter_next

eb7674b

nit

2d9ac70

corona10 added the skip news label Jun 15, 2024

nit

a547e9d

corona10 force-pushed the gh-120496 branch from ef235dd to a547e9d Compare June 15, 2024 01:22

corona10 added 5 commits June 15, 2024 10:23

fix

6ee317a

More optimization

fd95922

Update atomic package

901ea19

fix

827d38b

fix

805a90e

corona10 removed the DO-NOT-MERGE label Jun 15, 2024

corona10 marked this pull request as ready for review June 15, 2024 03:12

bedevere-app bot added the awaiting core review label Jun 15, 2024

corona10 added needs backport to 3.13 bugs and security fixes and removed awaiting core review labels Jun 15, 2024

eendebakpt reviewed Jun 15, 2024

corona10 closed this Jun 18, 2024
