Rewrite tokenization with `proc-macro2` tokens #146

alexcrichton · 2017-05-26T21:00:48Z

This ended up being a bit larger of a commit than I intended! I imagine that
this'll be one of the larger of the commits working towards #142. The purpose of
this commit is to use an updated version of the quote crate which doesn't work
with strings but rather works with tokens from the proc-macro2 crate. The
proc-macro2 crate itself is based on the proposed API for proc_macro itself,
and will continue to mirror it. The hope is that we'll flip an easy switch
eventually to use compiler tokens, whereas for now we'll stick to string parsing
at the lowest layer.
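
As a rough illustration of the difference between strings and tokens, here is a minimal std-only sketch. `Span`, `TokenKind`, `Token`, and `lex` are hypothetical names made up for this example; they are not the proc-macro2 API. The point is only that each token carries a kind and a span, and that for now the tokens are produced by parsing a string at the lowest layer.

```rust
/// Hypothetical source location, as byte offsets into the input string.
#[derive(Debug, Clone, Copy, PartialEq)]
pub struct Span {
    pub lo: usize,
    pub hi: usize,
}

/// Hypothetical token kinds; a real token API distinguishes more cases.
#[derive(Debug, Clone, PartialEq)]
pub enum TokenKind {
    /// A run of alphanumeric characters or underscores.
    Word(String),
    /// Any other single non-whitespace character.
    Op(char),
}

/// A token is a kind plus the span it came from.
#[derive(Debug, Clone, PartialEq)]
pub struct Token {
    pub kind: TokenKind,
    pub span: Span,
}

/// Stand-in for the "string parsing at the lowest layer" step:
/// split a string into tokens, recording a span for each one.
pub fn lex(src: &str) -> Vec<Token> {
    let chars: Vec<(usize, char)> = src.char_indices().collect();
    let is_word = |ch: char| ch.is_alphanumeric() || ch == '_';
    let mut tokens = Vec::new();
    let mut i = 0;
    while i < chars.len() {
        let (lo, c) = chars[i];
        if c.is_whitespace() {
            i += 1;
            continue;
        }
        let mut j = i + 1;
        let kind = if is_word(c) {
            // Extend the word as far as it goes.
            while j < chars.len() && is_word(chars[j].1) {
                j += 1;
            }
            TokenKind::Word(chars[i..j].iter().map(|&(_, ch)| ch).collect())
        } else {
            TokenKind::Op(c)
        };
        // The span ends where the next character starts (or at end of input).
        let hi = if j < chars.len() { chars[j].0 } else { src.len() };
        tokens.push(Token { kind, span: Span { lo, hi } });
        i = j;
    }
    tokens
}
```

Swapping `lex` for tokens handed over by the compiler would leave the `Token` consumers unchanged, which is the kind of "easy switch" described above.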

The largest change here is the addition of span information to the AST. Building
on the previous PRs to refactor the AST, this keeps the AST relatively easy to
digest and use from a user perspective: it's just a few extra fields on the
side. The fallout from this was quite large throughout the printing feature of
the crate. The parsing, fold, and visit features then followed suit and were
updated as well.

This commit also changes the semantics of the AST somewhat. Previously syn
inferred which tokens should be printed. For example, for a closure argument syn
would automatically not print the colon in a: b if the type listed was "infer
this type". Now the colon is a separate field and must be kept in sync with the
type listed, as the colon and type are each printed unconditionally (emitting no
output if both are None).
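
To make the closure-argument change concrete, here is a minimal std-only sketch. `Span`, `Colon`, `ClosureArg`, `to_tokens`, and `print_arg` are hypothetical stand-ins invented for illustration; they are not the actual syn types or signatures. It shows the colon stored as its own optional, span-carrying field, and a printer that emits whatever fields are present without inferring anything.

```rust
/// Hypothetical source location, as byte offsets.
#[derive(Debug, Clone, Copy, PartialEq, Default)]
pub struct Span {
    pub lo: usize,
    pub hi: usize,
}

/// Hypothetical `:` token; the only data it carries is its span.
#[derive(Debug, Clone, Copy, PartialEq, Default)]
pub struct Colon(pub Span);

/// Hypothetical closure argument such as `a` or `a: b`.
#[derive(Debug, Clone, PartialEq)]
pub struct ClosureArg {
    pub name: String,
    /// Present only when a type annotation is written out.
    pub colon: Option<Colon>,
    /// The annotated type, kept as a plain string in this sketch.
    pub ty: Option<String>,
}

impl ClosureArg {
    /// Append this argument's tokens to `out`, printing each field as-is.
    /// Nothing is inferred: a `None` field simply emits no output.
    pub fn to_tokens(&self, out: &mut Vec<String>) {
        out.push(self.name.clone());
        if self.colon.is_some() {
            out.push(":".to_string());
        }
        if let Some(ty) = &self.ty {
            out.push(ty.clone());
        }
    }
}

/// Render one argument as a space-separated token string.
pub fn print_arg(arg: &ClosureArg) -> String {
    let mut out = Vec::new();
    arg.to_tokens(&mut out);
    out.join(" ")
}
```

In this sketch an argument with both fields set prints as `a : b`, and one with both fields `None` prints as `a`. Setting the type but leaving the colon as `None` prints `a b`, which is the kind of out-of-sync output the caller is now responsible for avoiding.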


alexcrichton · 2017-05-26T21:03:46Z

Alright so this is a bit of a monster PR! I'm hoping this is the final version of the AST before we close out #142. Next steps I can see are:

- Review the proc-macro2 crate
- Publish the proc-macro2 crate
- Review the changes to the quote crate
- Publish the quote crate
- Review this PR
- Update serde_derive to work with this PR before merging perhaps?
- Merge

So far I'm pretty happy with how this turned out; it ended up being pretty easy to implement parsing, ToTokens, visiting, and folding after this change. I don't think there's any redundant information (only what's needed), but an extra pair of eyes is always helpful!

alexcrichton · 2017-05-26T21:06:18Z

I'll also note that I ended up diverging somewhat from the proposed proc_macro API. I tried to stick as closely as possible to the current revision in rust-lang/rust#40939, but to get all the tests passing in this repo (especially the round-trip ones) I had to significantly bolster the Literal API to meet the various needs.

I imagine we'll solve all these issues before stabilizing proc_macro, however.

alexcrichton · 2017-05-26T21:07:53Z

Also note that this does not parse from tokens yet; this only updates tokenization. All parser code is "untouched" apart from the updates necessary to generate the new AST.

dtolnay · 2017-05-26T23:23:25Z

This is pretty clearly the future so I am comfortable merging this and following up with any necessary tweaks separately, rather than futzing with dependent PRs.

My only concern is that I would like to have the real proc-macro API from rust-lang/rust#40939 in better shape before publishing either proc-macro2 or syn. Experimenting with a bolstered API is fine, but when publishing a syn release I would like to make sure that proc-macro2 has "stable" and "unstable" modules that are perfectly identical.

alexcrichton · 2017-05-27T03:12:07Z

Yeah sounds good to me. If it's ok I think we should hold off on publishing proc-macro2 and the next version of this crate until the change to rustc happens. I think that the current API is sufficient albeit very difficult to work around, but we could make do if need be.

My current thinking is that when the proc_macro API lands on nightly we'll freeze the stable module and the interface of the proc_macro2 crate to look exactly like that. As the proc_macro API changes (if it does) we'll update the unstable module to do a translation between the old and the new, maintaining API compatibility of the proc_macro2 crate itself. Then finally once we get closer to stabilization I figure we can release a new breaking change.
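
One way to picture the stable/unstable plan is the std-only sketch below. Everything in it is an assumption made for illustration: the `unstable` Cargo feature name, the `imp` alias, the `UpstreamLiteral` stand-in, and the single `integer` constructor are hypothetical and are not taken from the proc-macro2 source. The idea shown is that both modules expose the same signatures, the crate's public type forwards to whichever one is selected, and the unstable side translates to a differently shaped upstream type.

```rust
use std::fmt;

/// Frozen implementation, modeled here as a plain string.
#[cfg(not(feature = "unstable"))]
mod stable {
    use std::fmt;

    pub struct Literal(String);

    impl Literal {
        pub fn integer(n: i64) -> Literal {
            Literal(n.to_string())
        }
    }

    impl fmt::Display for Literal {
        fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
            f.write_str(&self.0)
        }
    }
}

/// Translation layer with the same signatures as `stable`.
#[cfg(feature = "unstable")]
mod unstable {
    use std::fmt;

    /// Stand-in for a newer upstream type with a different shape.
    struct UpstreamLiteral {
        value: i64,
    }

    pub struct Literal(UpstreamLiteral);

    impl Literal {
        pub fn integer(n: i64) -> Literal {
            Literal(UpstreamLiteral { value: n })
        }
    }

    impl fmt::Display for Literal {
        fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
            write!(f, "{}", self.0.value)
        }
    }
}

// Pick exactly one implementation at compile time.
#[cfg(not(feature = "unstable"))]
use stable as imp;
#[cfg(feature = "unstable")]
use unstable as imp;

/// Public type whose interface stays fixed regardless of the backend.
pub struct Literal(imp::Literal);

impl Literal {
    pub fn integer(n: i64) -> Literal {
        Literal(imp::Literal::integer(n))
    }
}

impl fmt::Display for Literal {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        self.0.fmt(f)
    }
}
```

Callers only ever see the outer `Literal`, so in this sketch a change on the upstream side is absorbed inside `unstable` without touching the public interface.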

alexcrichton mentioned this pull request May 26, 2017

Update to operate with proc-macro2 tokens dtolnay/quote#37

Merged

dtolnay merged commit d630f9d into dtolnay:master May 26, 2017

alexcrichton deleted the new-tokens2 branch May 27, 2017 03:10

antoyo mentioned this pull request May 28, 2017

Better error reporting antoyo/relm#27

Closed

dtolnay mentioned this pull request Jan 16, 2018

Rename => token to FatArrow #330

Closed
