replace the implementation of inline functions #14527

thestinger · 2014-05-29T23:08:21Z

Rust's implementation of #[inline] functions is far from ideal. It's a major contributor to both slow compile times and bloated binaries.

There are an enormous number of #[inline] functions in the standard libraries, and most have far more than one layer of inner #[inline] function calls. The work to convert these functions to LLVM IR, optimize and translate to machine code is duplicated for every single crate. The inline pass is not used at --opt-level=0 and --opt-level=1, so the result is wasted time and duplicated code without any benefits.

It is possible to implement #[inline] functions without duplicating the work of converting to LLVM IR and optimizing. The compiler can also entirely avoid any duplicated function bodies when the optimization level is not high enough for inline, the function is used as a function pointer or it is above the threshold.

Before compiling all of the functions in a library crate, Rust should create an LLVM module with all of the externally reachable inline functions in the crate. It will run the optimization passes on this LLVM module before continuing to compile, and it end up stored as metadata in the rlib or dynamic library in the bytecode format.

The compiler will then continue on with the compilation of the other functions in the crate. The work to generate optimized LLVM IR from the externally reachable #[inline] is already complete and can be reused. These functions will not be marked internal, because other crates will be able to call through to these.

Now, when Rust is compiling another crate, it can start by fetching the LLVM bytecode for the required inline functions. These functions will be marked available_externally and use the original symbol from the source library, so that if inlining does not occur there will be no duplicate code. At --opt-level=0 and --opt-level=1, it can simply generate an external call immediately and ignore the bytecode blob.

It would also be possible to leverage this for instantiations of generic functions, by making the instantiations already done by the library available externally as LLVM bytecode blobs in the metadata.

The text was updated successfully, but these errors were encountered:

emberian · 2014-06-18T17:53:25Z

cc me

reem · 2015-03-02T08:10:55Z

Visiting for triage. I have nothing to add, but this is very interesting.

Mark-Simulacrum · 2018-07-29T02:34:00Z

@michaelwoerister Is the recent work with reusing upstream codegen for monomorphizations relevant here? It seems like maybe enabling that for inline functions would be good.

cc #47317

jonas-schievink · 2020-01-13T19:18:15Z

I expect this issue will be irrelevant when MIR-only rlibs happen (is that still the plan?)

bstrie · 2021-05-23T00:00:19Z

I expect this issue will be irrelevant when MIR-only rlibs happen

This is in reference to #38913 .

workingjubilee · 2022-07-19T03:01:46Z

#38913 was closed, so this will probably remain relevant for a while. 🙃

…able-externally, r=<try> Default-enable share-generics, with available_externally to still allow inlining. WIP, just experimenting, not clear whether this works for other codegen backends, or how practical of a tradeoff it is. cc rust-lang#14527 r? `@Mark-Simulacrum` for now

thestinger added I-slow labels May 29, 2014

thestinger mentioned this issue May 29, 2014

Consider using available_externally linkage for cross-crate inlined functions #10212

Closed

huonw mentioned this issue Aug 2, 2014

avoid redundant translation of items during monomorphization #16059

Merged

spernsteiner mentioned this issue Aug 5, 2014

use available_externally linkage for cross-crate inlined functions #16270

Closed

eddyb mentioned this issue Oct 12, 2014

Stop compressing metadata #9365

Closed

emberian self-assigned this Mar 25, 2015

emberian removed their assignment Jan 5, 2016

dotdash mentioned this issue Jan 15, 2016

Cross-crate #[inline] functions are handled badly in unoptimized builds #30933

Closed

Mark-Simulacrum added C-enhancement Category: An issue proposing an enhancement or a PR with one. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Jul 21, 2017

alexcrichton mentioned this issue Oct 20, 2017

Tracking issue for enabling multiple CGUs in release mode by default #45320

Closed

11 tasks

jonas-schievink added the A-codegen Area: Code generation label Jan 13, 2020

Sl1mb0 mentioned this issue Sep 26, 2021

Optimize codegen scheduling #89281

Open

5 tasks

workingjubilee added the C-optimization Category: An issue highlighting optimization opportunities or PRs implementing such label Oct 8, 2023

This was referenced Apr 5, 2024

Enable -Zshare-generics for inline(never) functions #123244

Merged

Default-enable share-generics, with available_externally to still allow inlining. #123610

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

replace the implementation of inline functions #14527

replace the implementation of inline functions #14527

thestinger commented May 29, 2014

emberian commented Jun 18, 2014

reem commented Mar 2, 2015

Mark-Simulacrum commented Jul 29, 2018

jonas-schievink commented Jan 13, 2020

bstrie commented May 23, 2021

workingjubilee commented Jul 19, 2022

replace the implementation of inline functions #14527

replace the implementation of inline functions #14527

Comments

thestinger commented May 29, 2014

emberian commented Jun 18, 2014

reem commented Mar 2, 2015

Mark-Simulacrum commented Jul 29, 2018

jonas-schievink commented Jan 13, 2020

bstrie commented May 23, 2021

workingjubilee commented Jul 19, 2022