-
Notifications
You must be signed in to change notification settings - Fork 13.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Recover wrong-cased keywords that start items #99918
Conversation
This comment has been minimized.
This comment has been minimized.
This seems reasonable and I think we could even get those more complex item signatures working as well with this approach. |
8c787fd
to
46c6c37
Compare
I've added the support for functions (the original intent of this PR), the code is somewhat messy, but I think it works. |
if case_insensitive | ||
&& let Some((ident, /* is_raw */ false)) = self.token.ident() | ||
&& ident.as_str().to_lowercase() == kw.as_str().to_lowercase() { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Surprised rustfix didn't catch this:
if case_insensitive | |
&& let Some((ident, /* is_raw */ false)) = self.token.ident() | |
&& ident.as_str().to_lowercase() == kw.as_str().to_lowercase() { | |
if case_insensitive | |
&& let Some((ident, /* is_raw */ false)) = self.token.ident() | |
&& ident.as_str().to_lowercase() == kw.as_str().to_lowercase() | |
{ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Did you mean rustfmt? It doesn't work at all with if let chains currently
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We also don't really have a style guide for it, so 🤷
&& let Some((ident, /* is_raw */ false)) = self.token.ident() | ||
&& ident.as_str().to_lowercase() == kw.as_str().to_lowercase() { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
&& let Some((ident, /* is_raw */ false)) = self.token.ident() | |
&& ident.as_str().to_lowercase() == kw.as_str().to_lowercase() { | |
&& let Some((ident, /* is_raw */ false)) = self.token.ident() | |
&& ident.as_str().to_lowercase() == kw.as_str().to_lowercase() | |
{ |
/// Eats a keyword, optionally ignoring the case. | ||
/// If the case differs (and is ignored) an error is issued. | ||
/// This is useful for recovery. | ||
fn eat_keyword_case(&mut self, kw: Symbol, case_insensitive: bool) -> bool { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What happens if you do the insensitive check unconditionally? I guess that doesn't work because you need to pass in that the next token can be the keyword or at least an arbitrary ident... 🤔
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, being always case insensitive doesn't work because you may have a keyword | ident
, for example here:
https://github.com/rust-lang/rust/blob/6a25a433b62701ed48e85857a3b9f29bc4ba758f/compiler/rustc_parse/src/parser/item.rs#L219
If we'd always use case insensitive checks, then Use!()
would stop working. I had similar problems while working on this, where
impl Const /* <-- type */ {}
was wrongly parsed as impl const /* waiting for a trait */
#![allow(unused_imports)] | ||
|
||
fn main() {} | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Follow up work: handle fn foo(_: &Dyn Debug) {}
|
||
if case == Case::Insensitive | ||
&& let Some((ident, /* is_raw */ false)) = self.token.ident() | ||
&& ident.as_str().to_lowercase() == kw.as_str().to_lowercase() { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Now I'm mildly amused at the thought of introducing a CamelCased keyword 😄
@estebank just to check: this is waiting on review, right? or do I have to do something? |
☔ The latest upstream changes (presumably #102297) made this pull request unmergeable. Please resolve the merge conflicts. |
(ping @estebank) |
@bors r+ |
☀️ Test successful - checks-actions |
Finished benchmarking commit (5b82ea7): comparison URL. Overall result: ❌ regressions - ACTION NEEDEDNext Steps: If you can justify the regressions found in this perf run, please indicate this with @rustbot label: +perf-regression Instruction countThis is a highly reliable metric that was used to determine the overall result at the top of this comment.
Max RSS (memory usage)ResultsThis is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.
CyclesThis benchmark run did not return any relevant results for this metric. |
Uh oh, this looks like a genuine perf regression |
@WaffleLapkin can you spend time on this? |
I can, although I'm not sure what the next step would be |
if case == Case::Insensitive | ||
&& let Some((ident, /* is_raw */ false)) = self.token.ident() | ||
&& ident.as_str().to_lowercase() == kw.as_str().to_lowercase() { | ||
true | ||
} else { | ||
false | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't see any obvious reason for the slow down, but can you try putting up PRs that revert some of these changes independently? Let's try to find if there's a single method that's introducing most of the perf regression (and it very well could be that it's just worse cache locality due to change in method inlining or arguments or something "silly" like that).
So here is the result of my investigation. Cleared by-hand (please, someone, teach me how to profile rustc properly ;-;) top of the cachegrind diff output:
The thing to notice here is that there are a lot of Upon further investigation I've found this function, where I accidentally used rust/compiler/rustc_parse/src/parser/item.rs Lines 2244 to 2261 in 96ddd32
I don't think it breaks the parser per se ( |
Fix perf regression by correctly matching keywords This should (hopefully) fix regression from rust-lang#99918 r? `@estebank`
Recover wrong-cased keywords that start items (_this pr was inspired by [this tweet](https://twitter.com/Azumanga/status/1552982326409367561)_) r? `@estebank` We've talked a bit about this recovery, but I just wanted to make sure that this is the right approach :) For now I've only added the case insensitive recovery to `use`s, since most other items like `impl` blocks, modules, functions can start with multiple keywords which complicates the matter.
Fix perf regression by correctly matching keywords This should (hopefully) fix regression from rust-lang#99918 r? `@estebank`
(this pr was inspired by this tweet)
r? @estebank
We've talked a bit about this recovery, but I just wanted to make sure that this is the right approach :)
For now I've only added the case insensitive recovery to
use
s, since most other items likeimpl
blocks, modules, functions can start with multiple keywords which complicates the matter.