[red-knot] Add list of failing/slow ecosystem projects #17474

carljm · 2025-04-18T22:30:46Z

Summary

I ran red-knot on every project in mypy-primer. I moved every project where red-knot ran to completion (fast enough, and mypy-primer could handle its output) into good.txt, so it will run in our CI.

The remaining projects I left listed in bad.txt, with a comment summarizing the failure mode (a few don't fail, they are just slow -- on a debug build, at least -- or output too many diagnostics for mypy-primer to handle.)

We will now run CI on 109 projects; 34 are left in bad.txt.

The main question about this PR is whether running mypy-primer on 111 projects is too much for CI: does it take too long? does it make the mypy-primer output overwhelming? If it is too much, I can split good.txt into run-in-ci.txt and good.txt, and we can pick some subset to run in CI.

Test Plan

CI on this PR!

github-actions · 2025-04-18T22:38:04Z

`mypy_primer` results

No ecosystem changes detected ✅

AlexWaygood · 2025-04-18T22:41:16Z

I'm pretty strongly in favour of running (a lot) more mypy_primer projects in CI. Apart from anything else, we've already had several cases where the mypy_primer workflow has caught new panics for us, which seem very important for us to know about.

It looks like the changes here mean that it now runs in six minutes, which is quite a lot slower than the two minutes it took previously. We could consider sharding the mypy_primer workflow between several CI jobs and joining the artifacts afterwards, which is what both mypy and typeshed do in their mypy_primer workflows.

carljm · 2025-04-18T23:52:49Z

6 minutes is reasonable enough that I'm tempted to just land this as-is and move on; I don't want to spend a lot of time wrangling GitHub Actions. But maybe if it's easy to borrow the mypy/typeshed sharding setup, I could look at that. This does make mypy-primer the long pole in our CI (replacing windows and release-mode tests at 4 minutes.)

carljm · 2025-04-19T13:43:33Z

I tried to do some work towards sharding in #17475, but I feel like I've reached my timebox on that. I have the sharded runs working fine and uploading artifacts, but the "comment" workflow needs significant changes in order to download all the diff artifacts and combine them, and our current comment workflow is quite different from the typeshed/mypy ones. Compounding the difficulty, it appears that a workflow that is triggered on completion of another workflow (like the mypy primer comment one) only runs if it exists on the main branch of the repo, so it seems like debugging it would have to occur on main branch.

So at this point my proposal would be to merge this and accept that ecosystem checks take 6-8 minutes. Open to alternative proposals, including "put more work into sharding" or "reduce the project set."

carljm · 2025-04-19T14:00:32Z

Hmm, scratch that; something caused the last run to take 20min and time out. Will need to dig into that before we can consider merging this.

## Summary Takes the `good.txt` changes from #17474, and removes the following projects: - arrow (not part of mypy_primer upstream) - freqtrade, hydpy, ibis, pandera, xarray (saw panics locally, all related to try_metaclass cycles) Increases the mypy_primer CI run time to ~4 min. ## Test Plan Three successful CI runs.

sharkdp

@carljm Thank you for this. After updating mypy_primer to run from upstream (which also includes a lot of recent changes to project dependencies), I re-analyzed all projects here. See the commit history for a log of the changes that I did. I moved five projects to bad.txt (all try_metaclass_ panics) which were recently part of good.txt (merged separately). I also moved four projects to good.txt. The hang on Tanjun is probably related to #17537, I reclassified it.

MichaReiser · 2025-04-22T12:25:59Z

I don't know if performance is still a concern and if it is mainly red knot or mypy primer being slow. If it still is a concern and it's mainly red knot, then consider creating a new build profile which performs a release build but disables LTO (you could even experiment with less aggressive optimizations). LTO tends to be the slowest step and this could be a good trade between a faster red-knot without spending too much time on compiling.

sharkdp · 2025-04-22T12:31:04Z

I don't know if performance is still a concern and if it is mainly red knot or mypy primer being slow.

The red knot compilation used to be the bottleneck (when we ran on a few projects only), which is why I made it a debug build. Now it's the setup (clone + dependency installation) + the actual execution of red knot, which is the bottleneck. So we could consider switching back to a release build, with the disadvantage that we would have worse backtraces and no debug-assertions.

If it still is a concern and it's mainly red knot, then consider creating a new build profile which performs a release build but disables LTO (you could even experiment with less aggressive optimizations). LTO tends to be the slowest step and this could be a good trade between a faster red-knot without spending too much time on compiling.

👍

MichaReiser · 2025-04-22T12:32:37Z

with the disadvantage that we would have worse backtraces and no debug-assertions.

We could enable debug assertions, similar to the profiling profile

* main: (37 commits) [red-knot] Add list of failing/slow ecosystem projects (#17474) [red-knot] mypy_primer: extend ecosystem checks (#17544) [red-knot] Move `InstanceType` to its own submodule (#17525) [red-knot] mypy_primer: capture backtraces (#17543) [red-knot] mypy_primer: Use upstream repo (#17500) [red-knot] `typing.dataclass_transform` (#17445) Update dependency react-resizable-panels to v2.1.8 (#17513) Update dependency smol-toml to v1.3.3 (#17505) Update dependency uuid to v11.1.0 (#17517) Update actions/setup-node action to v4.4.0 (#17514) [red-knot] Fix variable name (#17532) [red-knot] Add basic subtyping between class literal and callable (#17469) [`pyupgrade`] Add fix safety section to docs (`UP030`) (#17443) [`perflint`] Allow list function calls to be replaced with a comprehension (`PERF401`) (#17519) Update pre-commit dependencies (#17506) [red-knot] Simplify visibility constraint handling for `*`-import definitions (#17486) [red-knot] Detect (some) invalid protocols (#17488) [red-knot] Correctly identify protocol classes (#17487) Update dependency ruff to v0.11.6 (#17516) Update Rust crate shellexpand to v3.1.1 (#17512) ...

AlexWaygood added the ty Multi-file analysis & type inference label Apr 18, 2025

AlexWaygood added the ci Related to internal CI tooling label Apr 18, 2025

carljm marked this pull request as ready for review April 18, 2025 23:51

carljm requested review from AlexWaygood, dcreager and sharkdp as code owners April 18, 2025 23:51

carljm force-pushed the cjm/addprojects branch from 95939da to 767884b Compare April 18, 2025 23:55

carljm closed this Apr 19, 2025

carljm reopened this Apr 19, 2025

carljm force-pushed the cjm/addprojects branch from 767884b to b78112a Compare April 19, 2025 13:36

carljm marked this pull request as draft April 19, 2025 14:00

sharkdp mentioned this pull request Apr 22, 2025

[red-knot] mypy_primer: extend ecosystem checks #17544

Merged

carljm and others added 2 commits April 22, 2025 13:41

[red-knot] update mypy-primer projects list

caaaca6

Add five new try_metaclass_ cycle panics

028e225

sharkdp force-pushed the cjm/addprojects branch from b78112a to 028e225 Compare April 22, 2025 11:43

sharkdp changed the title ~~[red-knot] update mypy-primer projects list~~ [red-knot] Add list of failing/slow ecosystem projects Apr 22, 2025

sharkdp added 8 commits April 22, 2025 13:52

Tanjun also cycle panics

ed780f6

dragonchain succeeds

fc52844

pandas check succeeds, but slow to process in mypy_primer

ec40a94

Running on pip works

6559e02

Running on poetry works

2cce095

Running on setuptools works

fbce5a7

spark is a cycle panic

255b299

Minor consistency change

8baaac8

sharkdp marked this pull request as ready for review April 22, 2025 12:05

sharkdp approved these changes Apr 22, 2025

View reviewed changes

sharkdp merged commit 0299a52 into main Apr 22, 2025
23 checks passed

sharkdp deleted the cjm/addprojects branch April 22, 2025 12:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[red-knot] Add list of failing/slow ecosystem projects #17474

[red-knot] Add list of failing/slow ecosystem projects #17474

Uh oh!

carljm commented Apr 18, 2025 •

edited by sharkdp

Loading

Uh oh!

github-actions bot commented Apr 18, 2025 •

edited

Loading

Uh oh!

AlexWaygood commented Apr 18, 2025

Uh oh!

carljm commented Apr 18, 2025

Uh oh!

carljm commented Apr 19, 2025 •

edited

Loading

Uh oh!

carljm commented Apr 19, 2025

Uh oh!

sharkdp left a comment

Uh oh!

Uh oh!

MichaReiser commented Apr 22, 2025

Uh oh!

sharkdp commented Apr 22, 2025 •

edited

Loading

Uh oh!

MichaReiser commented Apr 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[red-knot] Add list of failing/slow ecosystem projects #17474

[red-knot] Add list of failing/slow ecosystem projects #17474

Uh oh!

Conversation

carljm commented Apr 18, 2025 • edited by sharkdp Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test Plan

Uh oh!

github-actions bot commented Apr 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

mypy_primer results

Uh oh!

AlexWaygood commented Apr 18, 2025

Uh oh!

carljm commented Apr 18, 2025

Uh oh!

carljm commented Apr 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

carljm commented Apr 19, 2025

Uh oh!

sharkdp left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

MichaReiser commented Apr 22, 2025

Uh oh!

sharkdp commented Apr 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MichaReiser commented Apr 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

carljm commented Apr 18, 2025 •

edited by sharkdp

Loading

github-actions bot commented Apr 18, 2025 •

edited

Loading

`mypy_primer` results

carljm commented Apr 19, 2025 •

edited

Loading

sharkdp commented Apr 22, 2025 •

edited

Loading