Only emit #[link_name] attribute if necessary. #1558

michaelwoerister · 2019-05-07T12:48:56Z

This PR makes bindgen not emit a #[link_name] attribute if it detects that it is not necessary.

Why do we want to do this?

Because when ThinLTO is performed across Rust/C/C++ language boundaries, the linker plugin will see both sides of the code at the LLVM IR level (as opposed to the machine code level). The #[link_name] attribute causes calls from Rust to C/C++ code to reference symbol names with backend-mangling applied to them (e.g. _foo@4 for a stdcall function) while the definition of the function on the C/C++ side has the name before this kind of mangling is applied (i.e. just foo for that same function). The linker (or respectively, the LLVM linker plugin) will thus not find any function named _foo@4 and report an undefined symbol error. The changes in this PR try to keep symbol names in sync at the LLVM IR level too (in more cases than before).

How does it work?

Before emitting an #[link_name] attribute, bindgen now checks if the Rust name will end up as the intended name anyway and in that case just not emit the attribute. It does so by looking at the Rust name (canonical_name), the proposed link_name, and the calling convention of a function. If the Rust name, with calling convention specific mangling applied to it, is equal to the link_name, then the attribute is not needed.

What are the downsides of this approach?

This approach does not work for C++ manglings with additional backend mangling applied, like for example __ZN3bar3FOOE (Itanium mangled C++ global with additional macOS-specific leading underscore). I think this applies to anything C++ on macOS :/

Could we do better?

Ideally, we'd add a #[link_name] attribute with the mangled name without the backend-specific mangling and without the leading \0 char. Then LLVM could just do the right thing. Unfortunately it seems that libClang does not provide this.

We could also try to actively remove backend part of mangled names; but doing so robustly on all platforms might be tricky.

@emilio, what do you think?

emilio · 2019-05-12T01:13:55Z

This looks reasonable to me. As far as I can tell, regarding this bit:

I think this applies to anything C++ on macOS :/

This wouldn't be a regression, right? I agree that trying to un-apply the mangling depending on current target would be a bit brittle. It's what we were doing before doing we switched to #[link_name]...

We could try to add an API to LLVM for this, if somebody ends up needing it. But all the high-level APIs that libclang exposes also don't expose any options for this.

emilio · 2019-05-12T01:14:57Z

src/codegen/mod.rs

+
+            // This is something we don't recognize, stay on the safe side
+            // by emitting the `#[link_name]` attribute
+            Some(_) => {


Maybe worth logging a debug! message or such? Otherwise no braces.

I couldn't think of a good message, so I removed the braces.

emilio · 2019-05-12T01:16:48Z

src/codegen/mod.rs

+
+            // Check that the suffix starts with '@' and is all ASCII decimals
+            // after that.
+            if suffix[0] != b'@' || !suffix[1..].iter().all(u8::is_ascii_digit)


It'd be good to have tests for this. Do we have them? I see call-conv-field has a change for @0, do we have one for longer suffixes?

I'll add one with a longer suffix.

michaelwoerister · 2019-05-13T09:39:44Z

I think this applies to anything C++ on macOS :/

This wouldn't be a regression, right?

Apparently (according to https://bugzilla.mozilla.org/show_bug.cgi?id=1486042#c42) things already work on macOS (that is things seem to already work on macOs without this patch). That is a bit surprising to me. I need to double check if Clang includes the leading underscore there already in LLVM IR (instead of it being added by the backend).

I agree that trying to un-apply the mangling depending on current target would be a bit brittle. It's what we were doing before doing we switched to #[link_name]...

We could try to add an API to LLVM for this, if somebody ends up needing it. But all the high-level APIs that libclang exposes also don't expose any options for this.

LLVM has a few useful methods for this on DataLayout, e.g.:

hasMicrosoftFastStdCallMangling()
doNotMangleLeadingQuestionMark()
hasLinkerPrivateGlobalPrefix()
getGlobalPrefix()

Replicating the logic would still be complicated but at least the actual per-target settings would be defined in LLVM. Unfortunately, LLVM's C interface does not expose any of these methods.

michaelwoerister · 2019-05-13T12:26:41Z

I've added some fastcall and stdcall tests but they seem to fail for LLVM 3.8. Looking at other tests, it seems like it is expected for LLVM 3.8 to behave differently. Is there a way I can make a test apply to all LLVM versions except one? Or do I have to duplicate the test case for all LLVM versions?

michaelwoerister · 2019-05-14T11:14:25Z

For reference: I just verified that the current version of this PR solves the cross-language ThinLTO problem for Firefox on x86 Windows.

emilio

Sorry, for some reason I didn't get the notification from Github that you had pushed.

michaelwoerister · 2019-05-15T08:44:03Z

Sorry, for some reason I didn't get the notification from Github that you had pushed.

Yeah, GH doesn't notify about that. I planned to ping you but wanted to wait until travis went green.

Can I help with publishing this on crates.io somehow?

highfive added the S-awaiting-review label May 7, 2019

michaelwoerister force-pushed the link-name branch 5 times, most recently from a4cd80a to d466754 Compare May 8, 2019 10:27

michaelwoerister changed the title ~~[WIP] Only emit #[link_name] attribute if necessary.~~ Only emit #[link_name] attribute if necessary. May 10, 2019

emilio approved these changes May 12, 2019

View reviewed changes

michaelwoerister force-pushed the link-name branch from 51ef923 to 01c0d17 Compare May 14, 2019 14:12

michaelwoerister added 2 commits May 14, 2019 16:15

Only emit #[link_name] attribute if necessary.

89ae362

Update tests to account for changed #[link_name] attribute emission.

be17028

michaelwoerister force-pushed the link-name branch from 01c0d17 to 8a97e04 Compare May 14, 2019 14:16

Add test cases for x86 Windows calling conventions.

920d285

michaelwoerister force-pushed the link-name branch from 8a97e04 to 920d285 Compare May 14, 2019 15:07

emilio approved these changes May 14, 2019

View reviewed changes

emilio merged commit 01a3e62 into rust-lang:master May 14, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Only emit #[link_name] attribute if necessary. #1558

Only emit #[link_name] attribute if necessary. #1558

Uh oh!

michaelwoerister commented May 7, 2019 •

edited

Loading

Uh oh!

emilio commented May 12, 2019

Uh oh!

emilio May 12, 2019

Uh oh!

michaelwoerister May 14, 2019

Uh oh!

emilio May 12, 2019

Uh oh!

michaelwoerister May 13, 2019

Uh oh!

michaelwoerister commented May 13, 2019

Uh oh!

michaelwoerister commented May 13, 2019

Uh oh!

michaelwoerister commented May 14, 2019

Uh oh!

emilio left a comment

Uh oh!

michaelwoerister commented May 15, 2019

Uh oh!

Uh oh!

Only emit #[link_name] attribute if necessary. #1558

Only emit #[link_name] attribute if necessary. #1558

Uh oh!

Conversation

michaelwoerister commented May 7, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why do we want to do this?

How does it work?

What are the downsides of this approach?

Could we do better?

Uh oh!

emilio commented May 12, 2019

Uh oh!

emilio May 12, 2019

Choose a reason for hiding this comment

Uh oh!

michaelwoerister May 14, 2019

Choose a reason for hiding this comment

Uh oh!

emilio May 12, 2019

Choose a reason for hiding this comment

Uh oh!

michaelwoerister May 13, 2019

Choose a reason for hiding this comment

Uh oh!

michaelwoerister commented May 13, 2019

Uh oh!

michaelwoerister commented May 13, 2019

Uh oh!

michaelwoerister commented May 14, 2019

Uh oh!

emilio left a comment

Choose a reason for hiding this comment

Uh oh!

michaelwoerister commented May 15, 2019

Uh oh!

Uh oh!

michaelwoerister commented May 7, 2019 •

edited

Loading