any/all short circuiting #426

sbrugman · 2022-05-16T17:04:28Z

Description

Passing a list comprehension to any/all postpones their evaluation until the entire list has been built, which defeats short-circuiting and can hurt performance for larger iterables. Surprisingly, this was not yet included here as a rule.

Example:

def hi():
    print('hi')
    return True

>>> any(hi() for num in [1, 2, 3, 4])
hi

>>> any([hi() for num in [1, 2, 3, 4]])
hi
hi
hi
hi

From this answer

Proposal

Extend the current set of rules with C418 and C419 (any, all), similar to C403 and C404
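
A rough sketch of what such a check could look like, using only the standard-library ast module. The function name find_listcomp_in_any_all is hypothetical, and this is an illustration under that assumption, not the plugin's actual implementation.

```python
from __future__ import annotations

import ast


def find_listcomp_in_any_all(source: str) -> list[tuple[int, str]]:
    """Return (line number, builtin name) for each any([...]) / all([...]) call.

    Hypothetical helper for illustration only.
    """
    hits = []
    for node in ast.walk(ast.parse(source)):
        if (
            isinstance(node, ast.Call)
            and isinstance(node.func, ast.Name)
            and node.func.id in {"any", "all"}
            and len(node.args) == 1
            and not node.keywords
            # Only a list comprehension is flagged; a generator
            # expression is an ast.GeneratorExp and passes.
            and isinstance(node.args[0], ast.ListComp)
        ):
            hits.append((node.lineno, node.func.id))
    return sorted(hits)


example = (
    "x = any([n > 1 for n in nums])\n"
    "y = all(n > 1 for n in nums)\n"
    "z = all([n for n in nums])\n"
)
flagged = find_listcomp_in_any_all(example)
```

Here only the first and third lines of the example source are flagged; the generator form on the second line is left alone.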


adamchainz · 2022-05-19T08:02:39Z

Thanks for the suggestion.

In the past I removed some rules that suggested using generators over comprehensions, because the increase in laziness is not always a 100% compatible change: https://github.com/adamchainz/flake8-comprehensions/blob/main/HISTORY.rst#340-2021-03-18 . In the case of any/all, any side effects from the comprehension would be lost. I prefer rules that don't have any false positives.
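
To make the side-effect concern concrete, here is a minimal sketch. The names check and calls are made up for illustration; the point is only that the generator form stops calling check once any() has its answer.

```python
calls = []


def check(n):
    # Side effect: record every value that was actually examined.
    calls.append(n)
    return n > 0


# List comprehension: the whole list is built first, so check()
# runs for every element before any() sees anything.
calls.clear()
any([check(n) for n in [1, 2, 3, 4]])
list_calls = list(calls)

# Generator expression: any() stops at the first truthy result,
# so the remaining check() calls (and their side effects) never happen.
calls.clear()
any(check(n) for n in [1, 2, 3, 4])
gen_calls = list(calls)

print(list_calls)  # [1, 2, 3, 4]
print(gen_calls)   # [1]
```

Code that relies on check() being called for every element would change behaviour under the rewrite.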

Also generator expressions are not always faster. For small-ish collections, list comprehensions often win.

I'd be open to having a set of default-disabled rules that one opts into though. Perhaps there could be a setting ("suggest-generators" or something) that users add with the understanding that semantics could change...

Thoughts?

sbrugman · 2022-05-19T09:55:53Z

Thanks for the useful pointers.

The scope of C407 used to be much broader, considering all builtins - frequently resulting in false positives. For any/all, there are clear cases where the generator expression is preferable.

The choice for generator or comprehension should indeed be made consciously (and flake8-comprehensions should help with pointing that out). Opting-in and having a proper disclaimer seems like a great solution.

Notes on performance, for future reference:

A fully exhausted generator is slower than an equivalent comprehension. However, any and all terminate early at the first truthy or falsy value, respectively. As a rough estimate, if a fully exhausted generator is 50% slower than the equivalent comprehension (reasonable based on benchmarks), then the generator is preferable whenever the early-terminating value is expected within the first two thirds of the sequence. Note that the generator variant is also more memory efficient.
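
The break-even point in that estimate can be written out as a small calculation. This is a sketch under a simplifying assumption: cost is proportional to the number of elements processed, and the generator pays a constant per-element overhead. The function name break_even_fraction is hypothetical.

```python
def break_even_fraction(slowdown: float) -> float:
    """Fraction of the sequence at which the generator's cost equals the list's.

    Assumed cost model: the list comprehension always costs n units, while
    the generator costs (1 + slowdown) * k units when it stops after k
    elements. Setting the two equal gives k / n = 1 / (1 + slowdown).
    """
    return 1.0 / (1.0 + slowdown)


# With a 50% slowdown, the generator wins when the terminating
# element is expected within the first two thirds of the sequence.
fraction = break_even_fraction(0.5)
```

A larger measured slowdown moves the break-even point earlier; for example, a 100% slowdown gives one half.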

Skylion007 · 2023-02-18T22:42:21Z

@adamchainz Here is some performance bench-marking specifically for any / all: https://www.katrin-affolter.ch/Programming/performance_of_all_and_any

Skylion007 · 2023-02-21T18:21:54Z

FYI for anyone who wants to use this rule, it's implemented in flake8-pie as PIE802.

adamchainz · 2023-03-18T11:46:26Z

Thanks @Skylion007, since it's implemented elsewhere I think we can close this. I think I'll avoid adding rules that change laziness to this plugin; it'll just be easier to maintain and understand.

adamchainz · 2023-03-19T08:43:54Z

Actually there's a PR open, I think we can review and add this.

Skylion007 · 2023-03-19T17:31:51Z

There is one counterexample here, but the speedups are potentially worth it: astral-sh/ruff#3259 (comment)

adamchainz · 2023-03-27T14:00:02Z

The counterexample isn't for any/all?

sbrugman mentioned this issue May 16, 2022

Add rule for list comprehensions passed to any()/all() #427

Merged

matthewlloyd mentioned this issue Feb 27, 2023

Unnecessary list comprehension in sum, min, max astral-sh/ruff#3259

Closed

adamchainz closed this as completed Mar 18, 2023

adamchainz reopened this Mar 19, 2023

adamchainz closed this as completed in #427 Apr 13, 2023

Adrien-LUDWIG mentioned this issue May 1, 2023

Replace last os.path occurrencies by pathlib ManimCommunity/manim#3224

Merged

3 tasks
