Implement token-based handling of attributes during expansion by Aaron1011 · Pull Request #82608 · rust-lang/rust

Aaron1011 · 2021-02-27T20:10:32Z

This PR modifies the macro expansion infrastructure to handle attributes
in a fully token-based manner. As a result:

Derives macros no longer lose spans when their input is modified
by eager cfg-expansion. This is accomplished by performing eager
cfg-expansion on the token stream that we pass to the derive
proc-macro
Inner attributes now preserve spans in all cases, including when we
have multiple inner attributes in a row.

This is accomplished through the following changes:

New structs AttrAnnotatedTokenStream and AttrAnnotatedTokenTree are introduced.
These are very similar to a normal TokenTree, but they also track
the position of attributes and attribute targets within the stream.
They are built when we collect tokens during parsing.
An AttrAnnotatedTokenStream is converted to a regular TokenStream when
we invoke a macro.
Token capturing and LazyTokenStream are modified to work with
AttrAnnotatedTokenStream. A new ReplaceRange type is introduced, which
is created during the parsing of a nested AST node to make the 'outer'
AST node aware of the attributes and attribute target stored deeper in the token stream.
When we need to perform eager cfg-expansion (either due to #[derive] or #[cfg_eval]), we tokenize and reparse our target, capturing additional information about the locations of #[cfg] and #[cfg_attr] attributes at any depth within the target. This is a performance optimization, allowing us to perform less work in the typical case where captured tokens never have eager cfg-expansion run.

Aaron1011 · 2021-02-27T20:11:19Z

@bors try @rust-timer queue

rust-timer · 2021-02-27T20:11:20Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2021-02-27T20:11:29Z

⌛ Trying commit ac71b40b84add10df1cc6b4394ca53f5c3bfe16d with merge b86108850e24b4dd26ad05cb34d04ef489f2385a...

bors · 2021-02-27T21:01:55Z

☀️ Try build successful - checks-actions
Build commit: b86108850e24b4dd26ad05cb34d04ef489f2385a (b86108850e24b4dd26ad05cb34d04ef489f2385a)

rust-timer · 2021-02-27T21:01:57Z

Queued b86108850e24b4dd26ad05cb34d04ef489f2385a with parent ec7f8d9, future comparison URL.

rust-timer · 2021-02-27T23:23:47Z

Finished benchmarking try commit (b86108850e24b4dd26ad05cb34d04ef489f2385a): comparison url.

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. Please note that if the perf results are neutral, you should likely undo the rollup=never given below by specifying rollup- to bors.

Importantly, though, if the results of this run are non-neutral do not roll this PR up -- it will mask other regressions or improvements in the roll up.

@bors rollup=never
@rustbot label: +S-waiting-on-review -S-waiting-on-perf

petrochenkov · 2021-02-28T19:41:45Z

@Aaron1011
Could you move tests to a separate PR or commit? I'm interested in diffs in test outputs before and after the compiler changes.

Aaron1011 · 2021-02-28T19:43:36Z

@petrochenkov Sure

Aaron1011 · 2021-02-28T20:15:20Z

Opened #82643

compiler/rustc_ast/src/ast_like.rs

compiler/rustc_ast/src/attr/mod.rs

compiler/rustc_ast/src/tokenstream.rs

compiler/rustc_ast/src/mut_visit.rs

compiler/rustc_expand/src/config.rs

compiler/rustc_expand/src/lib.rs

petrochenkov · 2021-02-28T20:28:45Z

(Still need to review changes in rustc_parse.)

Aaron1011 · 2021-02-28T21:30:38Z

@bors try @rust-timer queue

rust-timer · 2021-02-28T21:30:39Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2021-02-28T21:30:47Z

⌛ Trying commit 87977c6a776ca0b4075f0998bde21267b3addcd6 with merge f1c431c58af7789983cbdfd61c470034c37eb631...

bors · 2021-02-28T22:17:01Z

☀️ Try build successful - checks-actions
Build commit: f1c431c58af7789983cbdfd61c470034c37eb631 (f1c431c58af7789983cbdfd61c470034c37eb631)

rust-timer · 2021-02-28T22:17:02Z

Queued f1c431c58af7789983cbdfd61c470034c37eb631 with parent 573a697, future comparison URL.

rust-timer · 2021-03-01T01:11:04Z

Finished benchmarking try commit (f1c431c58af7789983cbdfd61c470034c37eb631): comparison url.

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. Please note that if the perf results are neutral, you should likely undo the rollup=never given below by specifying rollup- to bors.

Importantly, though, if the results of this run are non-neutral do not roll this PR up -- it will mask other regressions or improvements in the roll up.

@bors rollup=never
@rustbot label: +S-waiting-on-review -S-waiting-on-perf

Aaron1011 · 2021-03-01T05:19:58Z

There appears to be a significant hit from being unable to bail out early from collect_tokens_trailing_token when parsing an expression. We have several different options:

Accept the performance hit. This is a pretty bad option, as I suspect this could be a measurable slowdown on large, complex projects.
Refactor expression parsing to determine in advance if the specific type of expression we are parsing (e.g. an if expression) supports inner attributes. However, the expression parsing code is already quite complicated, and this would require introducing even more complexity.
Speed up the slower path of collect_tokens_trailing_token. Given the current design of the parser, I think this will prove somewhat difficult. We need to clone the current token and the TokenCursor, which I think is causing most of the slowdown
Try to detect if inner attributes are definitely not present in the token stream. This would be kind of a hack, since we'd be effectively re-implementing part of the parser in collect_tokens_trailing_token. If inner attributes are present, then we will have the tokens <open_delimiter> # ! ... <close_delimiter> somewhere in the input. However, src/test/ui/proc-macro/weird-braces.rs shows that finding where this occurs is very tricky without actually parsing all of the tokens.

Aaron1011 · 2021-04-11T01:29:17Z

I've rebased against master - this should now be ready to merge.

petrochenkov · 2021-04-11T05:24:35Z

r=me with commits squashed.

This PR modifies the macro expansion infrastructure to handle attributes in a fully token-based manner. As a result: * Derives macros no longer lose spans when their input is modified by eager cfg-expansion. This is accomplished by performing eager cfg-expansion on the token stream that we pass to the derive proc-macro * Inner attributes now preserve spans in all cases, including when we have multiple inner attributes in a row. This is accomplished through the following changes: * New structs `AttrAnnotatedTokenStream` and `AttrAnnotatedTokenTree` are introduced. These are very similar to a normal `TokenTree`, but they also track the position of attributes and attribute targets within the stream. They are built when we collect tokens during parsing. An `AttrAnnotatedTokenStream` is converted to a regular `TokenStream` when we invoke a macro. * Token capturing and `LazyTokenStream` are modified to work with `AttrAnnotatedTokenStream`. A new `ReplaceRange` type is introduced, which is created during the parsing of a nested AST node to make the 'outer' AST node aware of the attributes and attribute target stored deeper in the token stream. * When we need to perform eager cfg-expansion (either due to `#[derive]` or `#[cfg_eval]`), we tokenize and reparse our target, capturing additional information about the locations of `#[cfg]` and `#[cfg_attr]` attributes at any depth within the target. This is a performance optimization, allowing us to perform less work in the typical case where captured tokens never have eager cfg-expansion run.

Aaron1011 · 2021-04-11T05:32:56Z

@bors r=petrochenkov

bors · 2021-04-11T05:32:57Z

📌 Commit a93c4f0 has been approved by petrochenkov

bors · 2021-04-11T07:36:41Z

⌛ Testing commit a93c4f0 with merge ba6275b...

bors · 2021-04-11T10:04:41Z

☀️ Test successful - checks-actions
Approved by: petrochenkov
Pushing ba6275b to master...

EliaGeretto · 2021-04-12T12:19:22Z

I believe this PR introduced a regression, breaking the clang-sys crate. I performed a bisection with cargo-bisect-rustc and this change seems to be the culprit. I opened an issue in the clang-sys repo because I am not sure where the problem is. It should be reproducible with cargo build --features runtime in that repo.

Aaron1011 · 2021-04-12T15:51:50Z

@EliaGeretto: Thanks for finding that! I've opened #84130 to fix it

rust-highfive assigned petrochenkov Feb 27, 2021

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Feb 27, 2021

Aaron1011 mentioned this pull request Feb 27, 2021

[WIP] Implement token-based handling of attributes #80689

Closed

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Feb 27, 2021

rustbot removed the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Feb 27, 2021

petrochenkov reviewed Feb 28, 2021

View reviewed changes

Aaron1011 mentioned this pull request Feb 28, 2021

Add more proc-macro attribute tests #82643

Merged

Aaron1011 force-pushed the feature/final-preexp-tts branch from ac71b40 to 87977c6 Compare February 28, 2021 21:30

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Feb 28, 2021

rustbot removed the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Mar 1, 2021

This comment has been minimized.

Sign in to view

This was referenced Apr 11, 2021

Compiler loses location information before calling macros (sometimes) #43081

Closed

Remove #[cfg] attributes during cfg-expansion #84110

Closed

EliaGeretto mentioned this pull request Apr 12, 2021

Build fails on nightly-2021-04-11 with runtime feature enabled. KyleMayes/clang-sys#127

Closed

nrc mentioned this pull request Apr 13, 2021

Fix two bugs with insert and delete tikv/client-rust#253

Merged

pnkfelix mentioned this pull request Apr 14, 2021

clang-sys 0.29.3 fails to build on nightly #84162

Closed

emilio mentioned this pull request Apr 14, 2021

clang-sys no longer builds on the latest nightly #84188

Closed

jyn514 mentioned this pull request Apr 14, 2021

Fix lookahead with None-delimited group #84130

Merged

RalfJung mentioned this pull request Apr 27, 2021

rustfmt no longer builds after rust-lang/rust#84310 #84538

Closed

nikomatsakis mentioned this pull request May 6, 2021

add back support for inner attributes on non-block expressions? #84879

Closed

hellow554 mentioned this pull request May 18, 2021

Nightly rejects some macro-generated doc attribute values accepted by stable #85432

Closed

jyn514 mentioned this pull request May 18, 2021

Fix incorrect gating of nonterminals in key-value attributes #85445

Closed

Aaron1011 mentioned this pull request May 25, 2021

regression: proc-macro derive unparseable #85692

Closed

hellow554 mentioned this pull request Jul 2, 2021

ICE when using serde derives with an invalid inner doc comment #86781

Closed

samlich mentioned this pull request Jul 29, 2021

ICE: Should not have unglued last token with cfg attr #87577

Open

apiraino mentioned this pull request Aug 12, 2021

ICE "Found outer attribute Attribute" #87936

Closed

eggyal mentioned this pull request Nov 28, 2021

Avoid uneccessary clone of Annotatable #91324

Merged

This was referenced Apr 27, 2022

Less NoDelim #96421

Merged

Remove hacks in make_token_stream. #96543

Merged

matthiaskrgr mentioned this pull request Oct 16, 2023

ice: Mismatched open/close delims #116781

Closed

tgross35 mentioned this pull request Jul 8, 2024

Clear inner_attr_ranges regularly. #127477

Merged

Uh oh!

Conversation

Aaron1011 commented Feb 27, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Aaron1011 commented Feb 27, 2021

Uh oh!

rust-timer commented Feb 27, 2021

Uh oh!

bors commented Feb 27, 2021

Uh oh!

bors commented Feb 27, 2021

Uh oh!

rust-timer commented Feb 27, 2021

Uh oh!

rust-timer commented Feb 27, 2021

Uh oh!

petrochenkov commented Feb 28, 2021

Uh oh!

Aaron1011 commented Feb 28, 2021

Uh oh!

Aaron1011 commented Feb 28, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

petrochenkov commented Feb 28, 2021

Uh oh!

Aaron1011 commented Feb 28, 2021

Uh oh!

rust-timer commented Feb 28, 2021

Uh oh!

bors commented Feb 28, 2021

Uh oh!

bors commented Feb 28, 2021

Uh oh!

rust-timer commented Feb 28, 2021

Uh oh!

rust-timer commented Mar 1, 2021

Uh oh!

Aaron1011 commented Mar 1, 2021

Uh oh!

This comment has been minimized.

This comment has been minimized.

Aaron1011 commented Apr 11, 2021

Uh oh!

petrochenkov commented Apr 11, 2021

Uh oh!

Aaron1011 commented Apr 11, 2021

Uh oh!

bors commented Apr 11, 2021

Uh oh!

bors commented Apr 11, 2021

Uh oh!

bors commented Apr 11, 2021

Uh oh!

EliaGeretto commented Apr 12, 2021

Uh oh!

Aaron1011 commented Apr 12, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

Aaron1011 commented Feb 27, 2021 •

edited

Loading