Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Using the iterator inside the for loop causes UB #16011

Closed
bluss opened this issue Jul 26, 2014 · 8 comments
Closed

Using the iterator inside the for loop causes UB #16011

bluss opened this issue Jul 26, 2014 · 8 comments

Comments

@bluss
Copy link
Member

bluss commented Jul 26, 2014

Experimenting with using the iterator inside the loop (awsome feature), found weird bug

fn main() {
    let mut it = range(0u, 5);
    for i in it {
        println!("{}", i)
        if i == -23 {
            it.next();
        }
    }
}

Using rustc 0.12.0-pre (66a0b52 2014-07-25 20:36:10 +0000)

Actual Output:

1
1
1
1
1

Expected Output:

1
2
3
4
5

Bug needs optimization level 2 or higher -- so it is probably undefined behavior-related.

The actual condition doesn't matter, I just picked one that would never be true.

@bluss
Copy link
Member Author

bluss commented Jul 26, 2014

cc @pcwalton

@bluss bluss changed the title Using the iterator inside the for loop causes weird bug Using the iterator inside the for loop causes UB Jul 26, 2014
@japaric
Copy link
Member

japaric commented Jul 26, 2014

@blake2-ppc

This looks like a regression in the borrow checker. You're not supposed to be able to mutably borrow it inside the for loop, because the for i in it already mutably borrows it.

You can confirm this is a regression by testing your program in the playpen, but compiling it with the 0.11 snapshot, where it gets flagged as incorrect (reason: double mutable borrow).

@thestinger
Copy link
Contributor

It doesn't have to borrow it as &mut, it's only calling an &mut self method on it and the return value isn't tied to that self. It was borrowing it as &mut before due to the AST expansion hack.

@eddyb
Copy link
Member

eddyb commented Jul 27, 2014

Interestingly enough, trying to reduce the IR by isolating the println call produces the correct result even with optimizations.

Yes, it does seem an optimized println call is important for this bug to manifest.
It reuses a pointer into the value of the Option<uint> that next returns, as part of the for loop, for printing.
However... It all seems fine until you notice the LLVM lifetime intrinsics: that Option gets llvm.lifetime.start called for it before the loop (because of that pointer reuse thing) then after each iteration. There is no llvm.lifetime.end call for it.
cc @dotdash

On a second thought, there are no such intrinsics without optimizations so they must've been added by inlining or other LLVM passes. I'll leave this to the experts.

@dotdash
Copy link
Contributor

dotdash commented Jul 28, 2014

Looks like a mis-optimization.

; ModuleID = '<stdin>'
target datalayout = "e-i64:64-f80:128-n8:16:32:64-S128"
target triple = "x86_64-unknown-linux-gnu"

define void @test(i1 %c) {
entry-block:
  %x = alloca i8
  br label %loop_in

loop_in:                                          ; preds = %loop_out, %loop_body, %entry-block
  call void @llvm.lifetime.start(i64 8, i8* %x)
  br label %loop_body

loop_body:                                        ; preds = %loop_in
  br i1 %c, label %loop_in, label %loop_out

loop_out:                                         ; preds = %loop_body
  call void @llvm.lifetime.end(i64 8, i8* %x)
  br label %loop_in

out:                                              ; No predecessors!
  ret void
}

; Function Attrs: nounwind
declare void @llvm.lifetime.start(i64, i8* nocapture) unnamed_addr #0

; Function Attrs: nounwind
declare void @llvm.lifetime.end(i64, i8* nocapture) unnamed_addr #0

attributes #0 = { nounwind }

Looks like this after SimplifyCFG:

; ModuleID = 'test.no-opt.bc'
target datalayout = "e-i64:64-f80:128-n8:16:32:64-S128"
target triple = "x86_64-unknown-linux-gnu"

define void @test(i1 %c) {
entry-block:
  %x = alloca i8
  br label %loop_body

loop_body:                                        ; preds = %loop_body, %entry-block
  call void @llvm.lifetime.start(i64 8, i8* %x)
  br label %loop_body
}

; Function Attrs: nounwind
declare void @llvm.lifetime.start(i64, i8* nocapture) unnamed_addr #0

; Function Attrs: nounwind
declare void @llvm.lifetime.end(i64, i8* nocapture) unnamed_addr #0

attributes #0 = { nounwind }

@dotdash
Copy link
Contributor

dotdash commented Jul 28, 2014

The above on its own isn't enough. In addition to that the loop rotate pass moves the lifetime start from the start of the loop to the end, and that kills it.

@dotdash
Copy link
Contributor

dotdash commented Jul 28, 2014

Fix submitted upstream at http://reviews.llvm.org/D4699

@luqmana
Copy link
Member

luqmana commented Aug 4, 2014

Fixed by #16080.

@luqmana luqmana closed this as completed Aug 4, 2014
bors added a commit that referenced this issue Aug 6, 2014
Simplifying the code of methods: `nth`, `fold`, `rposition`, and iterators: `Filter`, `FilterMap`, `SkipWhile`.

```
before
test iter::bench_multiple_take      ... bench:        15 ns/iter (+/- 0)
test iter::bench_rposition          ... bench:       349 ns/iter (+/- 94)
test iter::bench_skip_while         ... bench:       158 ns/iter (+/- 6)

after
test iter::bench_multiple_take      ... bench:        15 ns/iter (+/- 0)
test iter::bench_rposition          ... bench:       314 ns/iter (+/- 2)
test iter::bench_skip_while         ... bench:       107 ns/iter (+/- 0)
```
@koalazen has the code for `Skip`.

Once #16011 is fixed, `min_max` could use a for loop.
lnicola pushed a commit to lnicola/rust that referenced this issue Jan 3, 2024
…ild-on-save, r=Veykril

feat: add proc-macro rebuild on save option

Related: rust-lang#15033

I need some advice on how to test it.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants