-
Notifications
You must be signed in to change notification settings - Fork 13.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Miscompilations on AArch64 since "[SimplifyCFG] don't sink common insts too soon" #38898
Comments
assigned to @TNorthover |
To narrow down the problem, we can compile the source to optimized IR: Then pass that IR to the backend: And run the code: Does that produce the wrong result? If yes, turn off optimization in the backend: Does that produce the wrong result? If yes, the bug is probably in the IR. If not, the bug is probably in the backend...and so r320749 is probably not the problem. (It's possible that there are multiple bugs interacting, but let's hope not...) |
Thanks for these points! It does indeed look like this commit isn't to blame, as the issue disappears if building the IR with llc -O0. That would also explain why it only appears on AArch64. Unfortunately, that also gets me more or less back to square one about figuring out the reason for it, but I guess I'll have to strip down the optimization behaviour change further to zoom in on what might be causing it. |
Reduced test case |
After reducing the issue further, it is clear that the issue lies somewhere around AArch64 CCMP instructions. CC:ing a few people who probably know a bit or two around that. The issue seems to be that there is a condition that looks like this:
After lowering this to CCMPs, the effective form of the condition turns out into this:
To reproduce, compile the attached reproduction case e.g. like this: $ clang -O2 -target aarch64-linux-gnu ref_mvs-preproc.c -o repro The generated assembly shows the issue rather clearly: While the issue is clearly visible here, I do see in the comment for emitConditionalComparison in AArch64ISelLowering.cpp that it is prepared to handle mixed AND/OR conditions, but apparently something goes wrong here. |
You cannot directly express It seems though something goes wrong with that computation... |
I think for this to work properly we would need to emit the expression as: |
I've got a fix for the issue now. Just need to cleanup the code some more and create a testcase before I can upload a review. |
Thanks for looking at this! Yes, I found these functions; but the code seemed like it already meant to take care of this and it wasn't really obvious where the assumptions went wrong. |
https://reviews.llvm.org/D54137 should fix this. |
I can confirm that the current version of the patch fixes this issue - thanks! |
Tim, is this OK to merge to the release_70 branch? |
FWIW, merging that fix also requires merging https://reviews.llvm.org/rL346203 as a preceding cleanup. |
Yep, I think they're both good. |
Merged: r348636 r348642 |
Extended Description
I'm running into a case where code seems to miscompile after SVN r320749, "[SimplifyCFG] don't sink common insts too soon (#34603 )" (part of LLVM 6.0, but the same issues can be reproduced with the latest trunk version as well).
I can't (yet) point out exactly where the new generated code is wrong, but this commit changed the outcome of the attached code.
To reproduce (somewhat), compile the attached sample with "clang -std=c99 -O3 -fvisibility=hidden -fomit-frame-pointer -ffast-math --target=aarch64-linux-gnu -c ref_mvs-preproc.c". If compiled with clang built from before SVN r320749, the compiled code does what it is supposed to, while if compiled with a later version, it produces incorrect results.
I have tried looking at the output from compiling with
-mllvm -print-after-all
to look at differences between before and after this commit, and there obviously are differences, but nothing that I could spot that stands out as obviously incorrect.Surprisingly, the same code built for other architectures (both 32 and 64 bit x86, and armv7) with newer clang/llvm versions run just fine without any of the misbehaviour as I run into on AArch64.
Can someone spot what this SimplifyCFG change does wrt to this code sample, if there's some overlooked case? Or are the transformations correct and it just happens to trigger buggy codepaths in the AArch64 target after the transformation?
The text was updated successfully, but these errors were encountered: