Harmonise methods for distinguishing different Python source types #13682

AlexWaygood · 2024-10-08T17:42:37Z

Summary

I was looking into #13246 to see if we could land the proposed change as part of Ruff 0.7, and noticed that we're currently pretty inconsistent in how we determine whether something is a Python source file (and, if so, what kind of source file it is). This PR tries to make it so that we only ever use `ruff_python_ast::PySourceType` for that purpose. This removes duplicated logic, reduces the number of magic "py" or "pyi" strings across the codebase, and should make it easier to make changes such as #13246 in the future.
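
To illustrate the idea of routing every "what kind of file is this?" check through one type, here is a minimal sketch. `SourceKind` and its methods are hypothetical stand-ins written for this example; they are not the real `PySourceType` API, and the choice to return `None` for unrecognised extensions is an assumption of the sketch.

```rust
use std::path::Path;

/// Hypothetical stand-in for a source-type enum (not the real `PySourceType`).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum SourceKind {
    Python,
    Stub,
    Ipynb,
}

impl SourceKind {
    /// Classify a path by its extension, comparing case-sensitively.
    /// Returns `None` when there is no extension or it is not recognised.
    fn try_from_path(path: &Path) -> Option<Self> {
        match path.extension()?.to_str()? {
            "py" => Some(Self::Python),
            "pyi" => Some(Self::Stub),
            "ipynb" => Some(Self::Ipynb),
            _ => None,
        }
    }

    /// Convenience predicate so callers never compare against a magic string.
    fn is_stub(self) -> bool {
        matches!(self, Self::Stub)
    }
}
```

With a helper like this, call sites ask `SourceKind::try_from_path(path)` instead of comparing extensions against string literals, so the list of recognised extensions lives in one place.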

Test Plan

cargo test

AlexWaygood · 2024-10-08T17:58:01Z

crates/red_knot_server/src/session/index.rs

-        } else if Path::new(url.path())
-            .extension()
-            .map_or(false, |ext| ext.eq_ignore_ascii_case("ipynb"))
-        {
+        } else if PySourceType::from(url.path()).is_ipynb() {


This compares case-sensitively, whereas the existing code compares case-insensitively. I don't actually know if that's correct for Jupyter notebooks, but I do know that casing does matter for other Python files: `import foo` succeeds if the `foo` module is found at `foo.py`, but fails if the `foo` module is found at `foo.PY`.

Hard to get data on this...

It seems like overkill to make the breaking change for ipynb. I think the `.PY` import logic does make sense, though?

> This compares case-sensitively, whereas the existing code compares case-insensitively. I don't actually know if that's correct for Jupyter notebooks, but I do know that casing does matter for other Python files: `import foo` succeeds if the `foo` module is found at `foo.py`, but fails if the `foo` module is found at `foo.PY`.

Is this also true on case-insensitive file systems? I do think that what the import resolver accepts is different from what we consider a Python file.

My experience is that most applications don't consider file extensions to be case-sensitive. Naming a file test.txt or test.TXT doesn't change the application in which my desktop environment opens the file. That's why I think we shouldn't treat file extensions as case-sensitive, but I can see how this is beyond the scope of this PR.

It does make sense that the module resolver only tests for the existence of `.py`, and whether `.PY` is considered to match depends on the file system's case sensitivity.

For now I will revert these changes so that this PR is a "pure refactor" that does not change functionality. I think we may need to review this, though. In most places, we currently do case-sensitive matching for file extensions. Maybe that's correct or maybe it's incorrect, but it seems like we're pretty inconsistent right now. (And if that inconsistency is correct, because we need to apply different logic in different situations, then we could at least add some comments to explain it more clearly 😄)
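
The two behaviours discussed in this thread can be contrasted in a small sketch. The function names are hypothetical and exist only for this example; `Path::extension` and `OsStr::eq_ignore_ascii_case` are standard-library calls, the latter being the one used by the code removed in the diff above.

```rust
use std::path::Path;

/// Case-sensitive check: `foo.PY` does NOT match `"py"`.
/// (Hypothetical helper, for illustration only.)
fn has_extension_exact(path: &Path, expected: &str) -> bool {
    path.extension()
        .map_or(false, |ext| ext.to_str() == Some(expected))
}

/// Case-insensitive check (ASCII only): `nb.IPYNB` matches `"ipynb"`.
/// (Hypothetical helper, for illustration only.)
fn has_extension_ignore_case(path: &Path, expected: &str) -> bool {
    path.extension()
        .map_or(false, |ext| ext.eq_ignore_ascii_case(expected))
}
```

Both helpers only inspect the path string. Whether `foo.PY` can actually be opened under the name `foo.py` is a separate question that depends on the file system, as noted above.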

github-actions · 2024-10-08T18:04:50Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

Formatter (stable)

✅ ecosystem check detected no format changes.

Formatter (preview)

✅ ecosystem check detected no format changes.

crates/ruff_linter/src/rules/flake8_builtins/rules/builtin_module_shadowing.rs

MichaReiser · 2024-10-09T12:35:41Z

Could you create an issue to revisit the case-insensitive extension matching and assign it to the next milestone? We should revisit this soon.

codspeed-hq · 2024-10-09T13:19:58Z

CodSpeed Performance Report

Merging #13682 will degrade performances by 4.13%

Comparing `alex/remove-redundant-routines` (ac777ae) with `main` (b9827a4)

Summary

❌ 1 regressions
✅ 31 untouched benchmarks

⚠️ Please fix the performance issues or acknowledge them on CodSpeed.

Benchmarks breakdown

| | Benchmark | `main` | `alex/remove-redundant-routines` | Change |
|---|---|---|---|---|
| ❌ | `lexer[numpy/globals.py]` | 28.9 µs | 30.1 µs | -4.13% |

AlexWaygood · 2024-10-09T13:24:05Z

The CodSpeed flamegraph for this PR is very noisy, so it's hard to tell, but I think that's just a false positive. I can't see anything that would plausibly be related to this PR.

AlexWaygood · 2024-10-09T13:29:40Z

> Could you create an issue to revisit the case-insensitive extension matching and assign it to the next milestone? We should revisit this soon.

I opened #13691

AlexWaygood added the internal (An internal refactor or improvement) label Oct 8, 2024

AlexWaygood requested review from MichaReiser and carljm as code owners October 8, 2024 17:42

Harmonise methods for distinguishing different Python source types

6251505

AlexWaygood force-pushed the alex/remove-redundant-routines branch from bd426ea to 6251505 October 8, 2024 17:45

AlexWaygood commented Oct 8, 2024


revert changes that result in different behaviour

a54cf34

AlexWaygood requested review from zanieb and removed request for carljm October 9, 2024 10:49

MichaReiser approved these changes Oct 9, 2024


crates/ruff_linter/src/rules/flake8_builtins/rules/builtin_module_shadowing.rs (outdated, resolved)

Add a helper

ac777ae

AlexWaygood enabled auto-merge (squash) October 9, 2024 13:16

AlexWaygood merged commit 5b4afd3 into main Oct 9, 2024
19 checks passed

AlexWaygood deleted the alex/remove-redundant-routines branch October 9, 2024 13:18

AlexWaygood mentioned this pull request Oct 9, 2024

Audit how we determine whether a file is a "Python source file" #13691

Open
