parser: Keep current and previous tokens precisely #69006

petrochenkov · 2020-02-09T21:14:19Z

...including their unnormalized forms.
Add more documentation for them.

Hopefully, this will help to eliminate footguns like #68728 (comment).

I'll try to address the FIXMEs in separate PRs during the next week.
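
As a rough illustration of what keeping the current and previous tokens "precisely" could look like, here is a toy model, not the actual `rustc_parse` code. The field names follow the `unnormalized_` naming used in this PR, but everything else is an assumption made for the sketch: `Span` is a plain offset pair, `TokenKind::Interpolated` stands in for an interpolated macro variable such as `NtIdent`, and normalization is assumed to swap in the span where the identifier was originally written.

```rust
/// Toy span: (start, end) offsets. Hypothetical, not rustc's `Span`.
type Span = (usize, usize);

#[derive(Clone, Debug, PartialEq)]
enum TokenKind {
    /// A plain identifier written directly in the source.
    Ident(String),
    /// Stand-in for an interpolated `$ident`: the name plus the span
    /// where the identifier was originally written (an assumption).
    Interpolated(String, Span),
    Comma,
    Eof,
}

#[derive(Clone, Debug, PartialEq)]
struct Token {
    kind: TokenKind,
    span: Span,
}

impl Token {
    /// Turn an interpolated identifier into a plain one; other tokens are unchanged.
    fn normalize(&self) -> Token {
        match &self.kind {
            TokenKind::Interpolated(name, orig_span) => Token {
                kind: TokenKind::Ident(name.clone()),
                span: *orig_span,
            },
            _ => self.clone(),
        }
    }
}

struct Parser {
    rest: std::vec::IntoIter<Token>,
    token: Token,
    unnormalized_token: Token,
    prev_token: Token,
    unnormalized_prev_token: Token,
}

impl Parser {
    fn new(tokens: Vec<Token>) -> Parser {
        let dummy = Token { kind: TokenKind::Eof, span: (0, 0) };
        let mut parser = Parser {
            rest: tokens.into_iter(),
            token: dummy.clone(),
            unnormalized_token: dummy.clone(),
            prev_token: dummy.clone(),
            unnormalized_prev_token: dummy,
        };
        parser.bump(); // load the first real token
        parser
    }

    /// Advance by one token, updating all four fields together so that the
    /// "previous" pair always describes exactly the token just consumed.
    fn bump(&mut self) {
        let end = self.unnormalized_token.span.1;
        let next = self
            .rest
            .next()
            .unwrap_or(Token { kind: TokenKind::Eof, span: (end, end) });
        let normalized = next.normalize();
        self.prev_token = std::mem::replace(&mut self.token, normalized);
        self.unnormalized_prev_token = std::mem::replace(&mut self.unnormalized_token, next);
    }
}
```

In this model the normalized and unnormalized forms can disagree on both kind and span, which is why code that builds spans from "the previous token" has to be explicit about which form it means.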

Centril · 2020-02-10T00:37:04Z

src/librustc_parse/parser/mod.rs

+    /// FIXME: Remove in favor of `(unnormalized_)prev_token().kind`.
    prev_token_kind: PrevTokenKind,
+    /// Equivalent to `unnormalized_prev_token().span`.
+    /// FIXME: Remove in favor of `(unnormalized_)prev_token().span`.


This is used so frequently that we might want `self.prev_span()` (unless those occurrences are meant to be replaced with `self.prev_token.span`).
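
A minimal sketch of what such a helper might look like, assuming (as the doc comment in the diff says) that the previous span is equivalent to the span of the unnormalized previous token. The `Token`, `Span`, and `Parser` types here are simplified stand-ins, and `span_since` is a hypothetical name added only to show the typical "from `lo` to the end of the previous token" use.

```rust
/// Toy span: (start, end) offsets. Hypothetical, not rustc's `Span`.
type Span = (usize, usize);

#[derive(Clone, Debug)]
struct Token {
    text: String,
    span: Span,
}

struct Parser {
    /// The token just consumed, exactly as it appeared in the token stream.
    unnormalized_prev_token: Token,
}

impl Parser {
    /// Convenience accessor replacing a separately stored `prev_span` field.
    fn prev_span(&self) -> Span {
        self.unnormalized_prev_token.span
    }

    /// Hypothetical helper: the span from the start of `lo` to the end of
    /// the previously consumed token.
    fn span_since(&self, lo: Span) -> Span {
        (lo.0, self.prev_span().1)
    }
}
```

With an accessor there is a single source of truth, so the span cannot drift out of sync with the token it describes.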

Centril · 2020-02-10T00:37:34Z

@bors r+

bors · 2020-02-10T00:37:36Z

📌 Commit cd7a428 has been approved by Centril

@ghost

Rollup of 6 pull requests

Successful merges:

- #68694 (Reduce the number of `RefCell`s in `InferCtxt`.)
- #68966 (Improve performance of coherence checks)
- #68976 (Make `num::NonZeroX::new` an unstable `const fn`)
- #68992 (Correctly parse `mut a @ b`)
- #69005 (Small graphviz improvements for the new dataflow framework)
- #69006 (parser: Keep current and previous tokens precisely)

Failed merges:

r? @ghost

@Centril

parser: Remove `Parser::prev_token_kind` Follow-up to rust-lang#69006. r? @Centril

@Centril

parser: Simplify treatment of macro variables in `Parser::bump`

Follow-up to rust-lang#69006. Token normalization for `$ident` and `$lifetime` is merged directly into `bump`. The special "unknown macro variable" diagnostic for unexpected `$`s is removed because it prevented legal code from compiling (as a result, `bump` no longer calls itself recursively and can't make `prev_token` inconsistent).

r? @Centril

@Centril

parser: Cleanup `Parser::bump_with` and its uses Follow-up to rust-lang#69006. r? @Centril

@Centril

parser: `token` -> `normalized_token`, `nonnormalized_token` -> `token`

So, after rust-lang#69006, its follow-ups, and an attempt to remove `Parser::prev_span`, I came to the conclusion that the unnormalized token and its span are what you want in most cases, so they should be the default. Normalization only makes a difference in the few cases where we are checking against `token::Ident` or `token::Lifetime` specifically. This PR uses `normalized_token` for those cases.

Using normalization explicitly means that people writing code have to remember about `NtIdent` and `NtLifetime` in general. (That is alleviated by the fact that `token.ident()` and `fn parse_ident_*` are already written.) Remembering about `NtIdent` was, however, already kind of necessary, because the implicit normalization was performed only for the current/previous token, but not for things like `look_ahead`. As a result, most of the token classification methods in `token.rs` already take `NtIdent` into account (this PR fixes a few pre-existing minor mistakes, though).

The next step is removing `normalized(_prev)_token` entirely and replacing it with `token.ident()` (mostly) and `token.normalize()` (occasionally). I want to make that a separate PR and run it through perf. `normalized_token` filled on every bump has the potential both to avoid repeated normalization and to do unnecessary work in advance (it probably doesn't matter anyway, since the normalization is very cheap).

r? @Centril
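
To make the "normalize on the fly" direction concrete, here is a small sketch of a token-level `ident()` query that handles both plain and interpolated identifiers, so it works equally for the current token and for look-ahead tokens. This is a simplified illustration and not the real `token.rs` API: the signature of `ident()`, the `Interpolated` variant (a stand-in for `NtIdent`), `is_ident_named`, and the free function `look_ahead` are all assumptions made for the example.

```rust
/// Toy span: (start, end) offsets. Hypothetical, not rustc's `Span`.
type Span = (usize, usize);

#[derive(Clone, Debug, PartialEq)]
enum TokenKind {
    /// A plain identifier written directly in the source.
    Ident(String),
    /// Stand-in for an interpolated `$ident`: the name plus the span
    /// where the identifier was originally written (an assumption).
    Interpolated(String, Span),
    Comma,
}

#[derive(Clone, Debug, PartialEq)]
struct Token {
    kind: TokenKind,
    span: Span,
}

impl Token {
    /// Name and span of the identifier this token denotes, if any.
    /// Interpolated identifiers are unwrapped here, at the point of use,
    /// instead of being normalized ahead of time when the parser advances.
    fn ident(&self) -> Option<(&str, Span)> {
        match &self.kind {
            TokenKind::Ident(name) => Some((name.as_str(), self.span)),
            TokenKind::Interpolated(name, orig_span) => Some((name.as_str(), *orig_span)),
            _ => None,
        }
    }

    /// A classification method built on `ident()`, so it automatically
    /// takes interpolated identifiers into account.
    fn is_ident_named(&self, expected: &str) -> bool {
        self.ident().map_or(false, |(name, _)| name == expected)
    }
}

/// Hypothetical look-ahead: the token `dist` positions after `pos`.
/// No stored normalized copy is needed; callers query `ident()` directly.
fn look_ahead(tokens: &[Token], pos: usize, dist: usize) -> Option<&Token> {
    tokens.get(pos + dist)
}
```

Because the unwrapping lives in the token query rather than in parser state, the same check behaves identically at any look-ahead distance.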

@Centril

rustc_parse: Remove `Parser::normalized(_prev)_token` Perform the "normalization" (renamed to "uninterpolation") on the fly when necessary. The final part of #69579 #69384 #69376 #69211 #69034 #69006. r? @Centril

rust-highfive assigned Centril Feb 9, 2020

rust-highfive added the S-waiting-on-review label (Status: Awaiting review from the assignee but also interested parties.) Feb 9, 2020

Centril reviewed Feb 10, 2020

bors added the S-waiting-on-bors label (Status: Waiting on bors to run and complete tests. Bors will change the label on completion.) and removed the S-waiting-on-review label (Status: Awaiting review from the assignee but also interested parties.) Feb 10, 2020

Dylan-DPC-zz mentioned this pull request Feb 10, 2020

Rollup of 6 pull requests #69012

Merged

bors merged commit cd7a428 into rust-lang:master Feb 10, 2020

petrochenkov mentioned this pull request Feb 10, 2020

parser: Remove Parser::prev_token_kind #69034

Merged

Dylan-DPC-zz pushed a commit to Dylan-DPC-zz/rust that referenced this pull request Feb 12, 2020

Rollup merge of rust-lang#69034 - petrochenkov:notokind, r=Centril

42f371c

petrochenkov mentioned this pull request Feb 16, 2020

parser: Simplify treatment of macro variables in Parser::bump #69211

Merged

This was referenced Feb 22, 2020

parser: Cleanup Parser::bump_with and its uses #69376

Merged

parser: token -> normalized_token, nonnormalized_token -> token #69384

Merged

Dylan-DPC-zz pushed a commit to Dylan-DPC-zz/rust that referenced this pull request Feb 23, 2020

Rollup merge of rust-lang#69376 - petrochenkov:bumpwith, r=Centril

d6414f5

petrochenkov mentioned this pull request Mar 7, 2020

rustc_parse: Remove Parser::normalized(_prev)_token #69801

Merged

petrochenkov mentioned this pull request May 2, 2022

Remove NtIdent and NtLifetime. #96627

Closed

petrochenkov deleted the prevspan2 branch February 22, 2025 18:33
