
Type inference performance issue #579

Closed
7omb opened this issue Jul 2, 2018 · 8 comments

7omb commented Jul 2, 2018

In pylint 1.9.2, astroid's type inference performs very badly on the code below:

args = []

if True:
    args += []

if True:
    args += []

if True:
    args += []

if True:
    args += []

if True:
    args += []

if True:
    args += []

if True:
    args += []

if True:
    args += []

if True:
    args += []

if True:
    args += []

if True:
    args += []

if True:
    args += []

if True:
    args += []

if True:
    args += []

if True:
    args.append('')

If the args.append('') appears in the first if statement, pylint finishes almost immediately. If you move it to the end, it already takes more than 7 s. Profiling shows that the running time and the number of calls to the function _infer_binary_operation grow exponentially.

The issue can be reproduced in a virtual environment with only pylint installed and an empty pylintrc.
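To time the slowdown for different sizes, a small sketch that generates the reproduction file for any number of guarded augmented assignments (the helper name is made up, not part of the report):

```python
def make_repro(n: int) -> str:
    """Build source with n `if True: args += []` blocks and a final append."""
    blocks = ["args = []\n"]
    blocks += ["if True:\n    args += []\n"] * n
    blocks.append("if True:\n    args.append('')\n")
    return "\n".join(blocks)

if __name__ == "__main__":
    # Write repro_14.py, then run `pylint repro_14.py` to observe the blow-up;
    # compare against smaller n (e.g. 4 or 8) to see the exponential growth.
    with open("repro_14.py", "w") as fh:
        fh.write(make_repro(14))
```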


brycepg commented Jul 3, 2018

The inference result of args at the end of the above code:

  • has 16384 items
  • includes multiple duplicates with lines closer to the bottom being referenced more

log2(16384) is 14, which equals the number of AugAssigns in the above code, so this is definitely a 2^n perf issue: the number of inferred values doubles with every AugAssign.


It looks like inference is (correctly?) inferring every possible combination of additions to the list:

import astroid

code = """
args = []

if True:
    args += ['a']

if True:
    args += ['b']

if True:
    args += ['c']

args #@
"""

results = astroid.extract_node(code).inferred()
[result.as_string() for result in results]
['[]',
 "['a']",
 "['b']",
 "['a', 'b']",
 "['c']",
 "['a', 'c']",
 "['b', 'c']",
 "['a', 'b', 'c']"]
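The doubling is easy to see combinatorially. A quick sketch (plain Python, no astroid) that enumerates which of the n `if True:` branches ran, mirroring the inference result above:

```python
from itertools import product

def possible_values(additions):
    """Every value `args` can hold if each guarded `+=` may or may not run."""
    values = []
    # One combination per subset of branches taken: 2**n in total.
    for taken in product((False, True), repeat=len(additions)):
        value = []
        for took, addition in zip(taken, additions):
            if took:
                value = value + addition
        values.append(value)
    return values

print(len(possible_values([['a'], ['b'], ['c']])))  # 8 results, as above
print(len(possible_values([[]] * 14)))              # 16384 == 2**14
```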

Not too sure what to do about this

@PCManticore
Contributor

Is this an actual example of something that can happen in the wild, or just an artificial example? I wonder if we'd need to introduce some sort of recursion guard in the inference that stops after a certain number of recursively inferred branches. This should prevent a couple of performance issues where we'd try to infer all the potential values that a complex function could generate, although I'm not yet exactly sure how the mechanics of that guard would work.


7omb commented Jul 3, 2018

Yes, this is an actual example which happened a few days ago. It appeared in a function which takes a settings dictionary and constructs the command line arguments for a monitoring plugin. Basically, one commit increased the pylint running time of our project from about 0:30 h to 5:30 h.

@PCManticore
Contributor

Ouch, that's quite the jump! Curious how big your project is in terms of LOC if it already takes half an hour without this bug.


7omb commented Jul 3, 2018

According to cloc roughly 2000 Python files with 250000 lines of code. Without this issue and a bit of tuning it's now down to 0:10 h again.

@svenpanne

Just for clarification: spell checking in pylint seems to take ages, so we took out the checks wrong-spelling-in-comment and wrong-spelling-in-docstring. This brought the pylint runtime down from roughly 30-40 min to 11 min. Just in case anybody wonders...
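For anyone hitting the same thing, one way to switch those two checks off in a pylintrc (these are pylint's real spelling-checker message IDs):

```ini
[MESSAGES CONTROL]
disable=wrong-spelling-in-comment,
        wrong-spelling-in-docstring
```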


brycepg commented Jul 3, 2018

I think we could open an issue on pylint for that performance problem

@PCManticore
We could limit the number of possible inferences in cache_generator by counting the number of yielded inferences and, if it hits some limit (like 1000), just appending Uninferable and returning from the generator. The limit could even be an environment variable, set lower for performance on bigger projects.
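A minimal sketch of that idea (the names here are assumptions, not astroid's actual internals): wrap the inference generator, count what it yields, and cut it off with a stand-in for Uninferable once the limit is reached.

```python
import os

UNINFERABLE = object()  # stand-in for astroid's Uninferable sentinel

def limit_inference(results, limit=None):
    """Yield at most `limit` inference results, then an Uninferable marker."""
    if limit is None:
        # Hypothetical knob, per the proposal above.
        limit = int(os.environ.get("ASTROID_MAX_INFERABLE", "1000"))
    for count, result in enumerate(results):
        if count >= limit:
            yield UNINFERABLE
            return
        yield result

print(list(limit_inference(iter(range(5)), limit=3)))
```

Callers that already handle Uninferable results would then degrade gracefully instead of walking all 2^n values.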

@PCManticore
Contributor

@brycepg Good catch, that might work!

brycepg added a commit to brycepg/astroid that referenced this issue Jul 6, 2018
spot performance issues.

Add a new environment variable called ASTROID_MAX_INFERABLE to tune
the maximum number of inferable values at a time.

Close pylint-dev#579
Close pylint-dev/pylint#2251
brycepg added a commit that referenced this issue Jul 6, 2018
spot performance issues.

Add a new environment variable called ASTROID_MAX_INFERABLE to tune
the maximum number of inferable values at a time.

Close #579
Close pylint-dev/pylint#2251
4 participants