Skip to content

Conversation

dianqk
Copy link
Member

@dianqk dianqk commented Jun 20, 2025

The PR introduces support for debug information within dead statements. Currently, only the reference statement is supported, which is sufficient to fix #128081.

I don't modify Stable MIR, as I don't think we need debug information when using it.

This PR represents the debug information for the dead reference statement via #dbg_value. For example, let _foo_b = &foo.b becomes #dbg_value(ptr %foo, !22, !DIExpression(DW_OP_plus_uconst, 4, DW_OP_stack_value), !26). You can see this here: https://rust.godbolt.org/z/d43js6adv.

The general principle for handling debug information is to never provide less debug information than the optimized LLVM IR.

The current rules for dropping debug information in this PR are:

I haven't drop debuginfos in MatchBranchSimplification, because LLVM also pick one branch for it.

For the perf result:

I expected this to introduce some regressions; however, the results mixed the effects of inlining. Looking at the doc profile, this is a clear optimization. One potential regression I'm investigating is serde-1.0.219-debug-full.

@rustbot rustbot added A-LLVM Area: Code generation parts specific to LLVM. Both correctness bugs and optimization-related issues. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Jun 20, 2025
@rust-log-analyzer

This comment has been minimized.

@dianqk dianqk force-pushed the mir-stmt-debuginfo branch from 15c968a to 6b013d4 Compare June 20, 2025 08:01
@rust-log-analyzer

This comment has been minimized.

@dianqk dianqk force-pushed the mir-stmt-debuginfo branch from 6b013d4 to 51576e7 Compare June 20, 2025 09:23
@rust-log-analyzer

This comment has been minimized.

@dianqk
Copy link
Member Author

dianqk commented Jun 20, 2025

@bors2 try @rust-timer queue

@rust-timer

This comment has been minimized.

@rust-bors
Copy link

rust-bors bot commented Jun 20, 2025

⌛ Trying commit 51576e7 with merge eb83156

To cancel the try build, run the command @bors2 try cancel.

rust-bors bot added a commit that referenced this pull request Jun 20, 2025
Introduce debuginfo to statements in MIR

Not ready for reviewing. Something known:

- [ ] Retain debuginfo when concatenating bbs
- [ ] Document about when to drop debuginfos (don't be worse than the optimized LLVM IR)
- [ ] Missing tests

r? ghost
@rustbot rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Jun 20, 2025
@rust-bors
Copy link

rust-bors bot commented Jun 20, 2025

☀️ Try build successful (CI)
Build commit: eb83156 (eb83156169ae3bbdd1385d498455dbc44283f5ff, parent: 18491d5be00eb3ed2f1ccee2ac5b792694f2a7a0)

@rust-timer

This comment has been minimized.

@rust-timer
Copy link
Collaborator

Finished benchmarking commit (eb83156): comparison URL.

Overall result: ❌✅ regressions and improvements - please read the text below

Benchmarking this pull request means it may be perf-sensitive – we'll automatically label it not fit for rolling up. You can override this, but we strongly advise not to, due to possible changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please do so in sufficient writing along with @rustbot label: +perf-regression-triaged. If not, please fix the regressions and do another perf run. If its results are neutral or positive, the label will be automatically removed.

@bors rollup=never
@rustbot label: -S-waiting-on-perf +perf-regression

Instruction count

Our most reliable metric. Used to determine the overall result above. However, even this metric can be noisy.

mean range count
Regressions ❌
(primary)
0.9% [0.2%, 14.5%] 51
Regressions ❌
(secondary)
0.8% [0.1%, 1.6%] 52
Improvements ✅
(primary)
-0.4% [-2.1%, -0.2%] 35
Improvements ✅
(secondary)
-0.3% [-0.5%, -0.2%] 39
All ❌✅ (primary) 0.4% [-2.1%, 14.5%] 86

Max RSS (memory usage)

Results (primary 2.3%, secondary 3.4%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

mean range count
Regressions ❌
(primary)
2.3% [0.8%, 4.7%] 54
Regressions ❌
(secondary)
3.6% [0.7%, 7.4%] 38
Improvements ✅
(primary)
- - 0
Improvements ✅
(secondary)
-3.3% [-3.3%, -3.3%] 1
All ❌✅ (primary) 2.3% [0.8%, 4.7%] 54

Cycles

Results (primary 3.2%, secondary 2.5%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

mean range count
Regressions ❌
(primary)
3.9% [1.1%, 14.4%] 7
Regressions ❌
(secondary)
2.5% [2.0%, 3.3%] 3
Improvements ✅
(primary)
-2.0% [-2.0%, -2.0%] 1
Improvements ✅
(secondary)
- - 0
All ❌✅ (primary) 3.2% [-2.0%, 14.4%] 8

Binary size

Results (primary 0.5%, secondary 0.3%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

mean range count
Regressions ❌
(primary)
0.7% [0.0%, 2.2%] 123
Regressions ❌
(secondary)
1.1% [0.0%, 3.5%] 59
Improvements ✅
(primary)
-0.9% [-6.5%, -0.1%] 11
Improvements ✅
(secondary)
-0.5% [-13.0%, -0.1%] 55
All ❌✅ (primary) 0.5% [-6.5%, 2.2%] 134

Bootstrap: 691.482s -> 692.445s (0.14%)
Artifact size: 371.94 MiB -> 372.12 MiB (0.05%)

@rustbot rustbot added perf-regression Performance regression. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels Jun 20, 2025
@dianqk dianqk force-pushed the mir-stmt-debuginfo branch from 51576e7 to e72c3ae Compare June 21, 2025 02:31
@dianqk
Copy link
Member Author

dianqk commented Jun 21, 2025

@bors2 try @rust-timer queue

@rust-timer

This comment has been minimized.

@rust-bors
Copy link

rust-bors bot commented Jun 21, 2025

⌛ Trying commit e72c3ae with merge 77d5c6a

To cancel the try build, run the command @bors2 try cancel.

rust-bors bot added a commit that referenced this pull request Jun 21, 2025
Introduce debuginfo to statements in MIR

Not ready for reviewing. Something known:

- [ ] Retain debuginfo when concatenating bbs
- [ ] Document about when to drop debuginfos (don't be worse than the optimized LLVM IR)
- [ ] Missing tests

r? ghost
@rustbot rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Jun 21, 2025
@rust-log-analyzer

This comment has been minimized.

@rust-bors
Copy link

rust-bors bot commented Jun 21, 2025

☀️ Try build successful (CI)
Build commit: 77d5c6a (77d5c6a20a77fabdee3790a412618b82178e9ab4, parent: 15c701fbc995eb6c5b3a86021c18185f8eee020d)

@rust-timer

This comment has been minimized.

@rust-timer
Copy link
Collaborator

Finished benchmarking commit (77d5c6a): comparison URL.

Overall result: ❌✅ regressions and improvements - please read the text below

Benchmarking this pull request means it may be perf-sensitive – we'll automatically label it not fit for rolling up. You can override this, but we strongly advise not to, due to possible changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please do so in sufficient writing along with @rustbot label: +perf-regression-triaged. If not, please fix the regressions and do another perf run. If its results are neutral or positive, the label will be automatically removed.

@bors rollup=never
@rustbot label: -S-waiting-on-perf +perf-regression

Instruction count

Our most reliable metric. Used to determine the overall result above. However, even this metric can be noisy.

mean range count
Regressions ❌
(primary)
0.9% [0.1%, 14.5%] 53
Regressions ❌
(secondary)
0.8% [0.1%, 1.6%] 50
Improvements ✅
(primary)
-0.4% [-3.0%, -0.2%] 46
Improvements ✅
(secondary)
-0.3% [-0.5%, -0.2%] 41
All ❌✅ (primary) 0.3% [-3.0%, 14.5%] 99

Max RSS (memory usage)

Results (primary 2.4%, secondary 3.4%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

mean range count
Regressions ❌
(primary)
2.4% [0.5%, 5.7%] 38
Regressions ❌
(secondary)
4.2% [1.3%, 6.2%] 25
Improvements ✅
(primary)
- - 0
Improvements ✅
(secondary)
-1.7% [-3.0%, -1.1%] 4
All ❌✅ (primary) 2.4% [0.5%, 5.7%] 38

Cycles

Results (primary 2.1%, secondary 0.5%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

mean range count
Regressions ❌
(primary)
3.0% [0.8%, 14.1%] 10
Regressions ❌
(secondary)
3.3% [2.3%, 4.3%] 2
Improvements ✅
(primary)
-2.3% [-2.9%, -1.7%] 2
Improvements ✅
(secondary)
-2.2% [-2.6%, -1.9%] 2
All ❌✅ (primary) 2.1% [-2.9%, 14.1%] 12

Binary size

Results (primary 0.5%, secondary 0.3%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

mean range count
Regressions ❌
(primary)
0.7% [0.0%, 3.0%] 122
Regressions ❌
(secondary)
1.1% [0.0%, 3.5%] 59
Improvements ✅
(primary)
-0.7% [-4.9%, -0.0%] 15
Improvements ✅
(secondary)
-0.5% [-12.7%, -0.0%] 55
All ❌✅ (primary) 0.5% [-4.9%, 3.0%] 137

Bootstrap: 690.617s -> 691.47s (0.12%)
Artifact size: 371.83 MiB -> 372.01 MiB (0.05%)

@rustbot rustbot removed the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Jun 21, 2025
@bors
Copy link
Collaborator

bors commented Jun 25, 2025

☔ The latest upstream changes (presumably #142870) made this pull request unmergeable. Please resolve the merge conflicts.

GuillaumeGomez added a commit to GuillaumeGomez/rust that referenced this pull request Aug 6, 2025
mir: Do not modify NonUse in `super_projection_elem`

Split from rust-lang#142771.
r? cjgillot
rust-timer added a commit that referenced this pull request Aug 7, 2025
Rollup merge of #144998 - dianqk:visit-no-use-proj, r=cjgillot

mir: Do not modify NonUse in `super_projection_elem`

Split from #142771.
r? cjgillot
rust-timer added a commit that referenced this pull request Aug 7, 2025
Rollup merge of #144996 - dianqk:simplifycfg-collapse_goto_chain-changed, r=cjgillot

simplifycfg: Mark as changed when start is modified in collapse goto chain

Split from #142771.
r? cjgillot
@dianqk dianqk force-pushed the mir-stmt-debuginfo branch 2 times, most recently from 7a0c75b to 458ad93 Compare August 10, 2025 14:48
Copy link
Contributor

@cjgillot cjgillot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is looking very good thanks! I'm not certain I'm the best reviewer for the codegen part, so I just left a few questions.

@@ -312,8 +321,11 @@ macro_rules! make_mir_visitor {
index += 1;
}

let location = Location { block, statement_index: index };
for debuginfo in after_last_stmt_debuginfos as & $($mutability)? [_] {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

in &$(mutability)? after_last_stmt_debuginfos?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Dereferencing does not happen with this syntax.

) -> Option<Place<'tcx>> {
// Compute the place that we are storing to, if any
let destination = match stmt_kind {
StatementKind::Assign(box (place, rvalue)) => (rvalue.is_safe_to_remove()
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nit, easier for me to understand:

            StatementKind::Assign(box (_, rvalue)) if !rvalue.is_safe_to_remove() => None,
            StatementKind::Assign(box (place, _)) if stmt_kind.as_debuginfo().is_some() => Some(*place),

and then proceed to reasoning according to debuginfo_locals.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a little scary for me, their order can be accidentally re-ordered.

@@ -160,19 +160,23 @@ impl<'ll> DebugInfoBuilderMethods for Builder<'_, 'll, '_> {
&mut self,
dbg_var: &'ll DIVariable,
dbg_loc: &'ll DILocation,
variable_alloca: Self::Value,
is_declared: bool,
val: Self::Value,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we just have 2 methods? a dbg_var_addr and a dbg_var_value?

pub(crate) fn codegen_stmt_debuginfo(&mut self, bx: &mut Bx, debuginfo: &StmtDebugInfo<'tcx>) {
match debuginfo {
StmtDebugInfo::AssignRef(dest, place) => {
let place_ref = match self.locals[place.local] {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This local is poorly named. It represents place.local, not the while place. local_ref?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Comment on lines 135 to 145
(Some(place_ref), false) => {
(place_ref.val, place_ref.layout, place.projection.as_slice())
}
(Some(place_ref), true) => {
let projected_ty =
place_ref.layout.ty.builtin_deref(true).unwrap_or_else(|| {
bug!("deref of non-pointer {:?}", place_ref)
});
let layout = bx.cx().layout_of(projected_ty);
(place_ref.val, layout, &place.projection[1..])
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't understand the logic here. If this is an indirect place, we change the type but we don't change the val?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The layout is used to calculate offset with the projection. The function name should be debug_new_ref_to_local , not debug_new_value_to_local. I'm looking a better logic.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

}
}

fn debug_new_value_to_local_as_var(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should this be inlined? This should allow to compute offsets only once outside the loop.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

let ref_v1 = &ref_foo.1;
let ref_v2 = &ref_foo.2;
ref_foo.0
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do you have an example where we take the address of an Index projection?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

terminator.successors_mut(|successor| {
self.collapse_goto_chain(successor, &mut changed);
});
if changed && let Some(identical_succ) = identical_succ {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we introduce a more localized chain_changed variable?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The code has been removed in 8702b70.

let mut succ_debuginfos =
self.basic_blocks[succ].after_last_stmt_debuginfos.clone();
self.basic_blocks[current].after_last_stmt_debuginfos.extend(&mut succ_debuginfos);
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is there a way to look at pred_count to avoid a clone?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

8702b70. I don't think I should move debuginfos before the terminator. The new change should be clearer and easier to maintain, and it also fixes a case where debuginfos was lost.

use crate::util::is_within_packed;

/// Performs the optimization on the body
///
/// The `borrowed` set must be a `DenseBitSet` of all the locals that are ever borrowed in this
/// body. It can be generated via the [`borrowed_locals`] function.
fn eliminate<'tcx>(tcx: TyCtxt<'tcx>, body: &mut Body<'tcx>) {
fn eliminate<'tcx>(tcx: TyCtxt<'tcx>, body: &mut Body<'tcx>) -> bool {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could you doc-comment what this bool is?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@rust-log-analyzer

This comment has been minimized.

@dianqk
Copy link
Member Author

dianqk commented Aug 17, 2025

Sorry I haven't replied to all of them. A little busy...

@dianqk dianqk force-pushed the mir-stmt-debuginfo branch from a4ce010 to b0d20dd Compare August 28, 2025 14:28
@rustbot
Copy link
Collaborator

rustbot commented Aug 28, 2025

This PR was rebased onto a different master commit. Here's a range-diff highlighting what actually changed.

Rebasing is a normal part of keeping PRs up to date, so no action is needed—this note is just to help reviewers.

@rust-log-analyzer

This comment has been minimized.

@dianqk dianqk added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Aug 28, 2025
@dianqk dianqk force-pushed the mir-stmt-debuginfo branch from b0d20dd to 39c5bf2 Compare August 29, 2025 00:22
@rust-log-analyzer

This comment has been minimized.

@dianqk dianqk force-pushed the mir-stmt-debuginfo branch from 39c5bf2 to 450c929 Compare August 29, 2025 00:45
@rust-log-analyzer

This comment has been minimized.

dianqk added 4 commits August 29, 2025 21:35
`&_1` is only valid if the pass mode is indirect.
Move all debuginfos to the last basic block if each bb has
only one predecessor.
Add a test case for the index projection.
@dianqk dianqk force-pushed the mir-stmt-debuginfo branch from 450c929 to 84d42cf Compare August 29, 2025 13:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A-LLVM Area: Code generation parts specific to LLVM. Both correctness bugs and optimization-related issues. perf-regression Performance regression. S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Bad codegen for non-copy-derived struct with all Copy derived fields
8 participants