gh-101282: Apply BOLT optimisations to libpython for shared builds #104709

indygreg · 2023-05-20T23:12:42Z

(This change is a quick and dirty way to merge some of the build system improvements I'm proposing in gh-101093 before the 3.12 feature freeze. I had intended to expand the scope to fix some longstanding deficiencies in the build system around profile-guided builds. But I'm getting soft resistance in review so close to the freeze deadline, and it is clear that we need a simpler solution to hit the 3.12 deadline. While this change is quick and dirty, it attempts to not make things worse.)

Before this change, we only applied BOLT to the main python binary. After this change, we also apply BOLT to libpython when a shared library build is configured. In shared library builds, most of the C code is in libpython, so applying BOLT to libpython is critical to realizing BOLT's benefits.

This change also reworks how BOLT instrumentation is applied. It effectively removes the readelf-based logic added in gh-101525 and replaces it with a mechanism that saves a copy of the pre-BOLT binary and restores that copy when necessary. This allows us to perform BOLT optimizations without having to manually delete the output binary to force a new BOLT run.
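The save-and-restore idea can be sketched roughly as below. This is a minimal illustration, not the rules from Makefile.pre.in: the helper name `prepare_prebolt` and the `.prebolt` suffix are assumptions chosen for the example.

```shell
#!/bin/sh
# Hypothetical helper; the real Makefile.pre.in rules may differ.
# The ".prebolt" suffix is an assumption made for illustration.

# Ensure the binary at $1 is the pristine (pre-BOLT) build:
# - first run: keep a copy of the freshly linked binary
# - later runs: put the saved copy back so BOLT never processes
#   an already-optimized binary
prepare_prebolt() {
    bin="$1"
    if [ -e "${bin}.prebolt" ]; then
        # A previous BOLT run replaced ${bin}; start again from the original.
        cp "${bin}.prebolt" "${bin}"
    else
        cp "${bin}" "${bin}.prebolt"
    fi
}
```

Calling the helper before each instrumentation step means the output binary never has to be deleted by hand to get a clean BOLT run.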

We also add a new make target for purging bolt files and hook it up to clean so bolt state is purged when appropriate.

.gitignore rules have been added to ignore files related to bolt.

Before and after this refactor, make will no-op after a previous run. Both versions also share common make DAG deficiencies: targets can fail to trigger as often as they need to, or can trigger prematurely in certain scenarios. For example, after this change you may need to rm profile-bolt-stamp to force a BOLT run, because there aren't appropriate non-phony targets for BOLT's make target to depend on. Fixing this is a non-trivial amount of work that will likely have to wait until the 3.13 window.
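As a small sketch of that workaround, the helper below removes the stamp file so the next make invocation re-runs the BOLT steps. The function name `force_bolt_rerun` and the build-directory argument are assumptions for illustration; only the `profile-bolt-stamp` file name comes from the description above.

```shell
#!/bin/sh
# Hypothetical helper: remove the BOLT stamp file from a build directory
# (defaults to the current directory) so make no longer treats the BOLT
# step as already done.
force_bolt_rerun() {
    builddir="${1:-.}"
    # -f keeps this a no-op when the stamp does not exist yet.
    rm -f "${builddir}/profile-bolt-stamp"
}

# Typical use from the build directory:
#   force_bolt_rerun && make
```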

To make it easier to iterate on custom BOLT settings, the flags to pass to instrumentation and application are now defined in configure and can be overridden by passing BOLT_INSTRUMENT_FLAGS and BOLT_APPLY_FLAGS.
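A sketch of how such an override might look, assuming the two variables are given to configure as `VAR=value` arguments alongside `--enable-bolt`. The flag values are illustrative llvm-bolt options, not the defaults chosen by this change, and the helper `bolt_configure_cmd` is hypothetical. It only prints the command (a dry run) so the example has no side effects.

```shell
#!/bin/sh
# Illustrative values only; check `llvm-bolt --help` for your LLVM version.
BOLT_INSTRUMENT_FLAGS=""
BOLT_APPLY_FLAGS="-reorder-blocks=ext-tsp -reorder-functions=hfsort -split-functions"

# Hypothetical helper: print the configure invocation instead of running it.
# $1 = instrumentation flags, $2 = application flags
bolt_configure_cmd() {
    printf '%s\n' "./configure --enable-bolt BOLT_INSTRUMENT_FLAGS=\"$1\" BOLT_APPLY_FLAGS=\"$2\""
}

bolt_configure_cmd "$BOLT_INSTRUMENT_FLAGS" "$BOLT_APPLY_FLAGS"
```

Copying the printed line into a shell in the source tree would run the actual configure step with the custom flags.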

Issue: Enhance the BOLT build process #101282

corona10

Please update the docs for BOLT_INSTRUMENT_FLAGS and BOLT_APPLY_FLAGS: https://docs.python.org/3.12/using/configure.html#performance-options

bedevere-bot · 2023-05-21T08:53:28Z

A Python core developer has requested some changes be made to your pull request before we can consider merging it. If you could please address their requests along with any other requests in other reviews from core developers that would be appreciated.

Once you have made the requested changes, please leave a comment on this pull request containing the phrase `I have made the requested changes; please review again`. I will then notify any core developers who have left a review that you're ready for them to take another look at this pull request.

corona10

Overall LGTM.

indygreg · 2023-05-21T20:15:02Z

> Please update docs for BOLT_INSTRUMENT_FLAGS and BOLT_APPLY_FLAGS: https://docs.python.org/3.12/using/configure.html#performance-options

Done in latest push.

erlend-aasland

I cleaned up the docs and AC code; hope you don't mind.

I left some questions. Regarding BOLT technical stuff, I lean on Dong-hee's review.

.gitignore

Makefile.pre.in

erlend-aasland · 2023-05-21T22:29:13Z

> Done in latest push.

We would appreciate it if you didn't force-push:

- It does not play very nicely with the GitHub UX (it messes up CI runs, commit history, review comments, etc.)
- We often collaborate on PRs; pulling in new changes using git merge --no-ff is friendlier to our workflow

(This is also mentioned in the devguide.)

configure.ac

erlend-aasland · 2023-05-22T11:45:56Z

Thanks, Greg and Dong-hee!

indygreg requested review from corona10 and erlend-aasland as code owners May 20, 2023 23:12

bedevere-bot added the awaiting review label May 20, 2023

bedevere-bot mentioned this pull request May 20, 2023

Enhance the BOLT build process #101282

Closed

4 tasks

erlend-aasland requested review from mdboom and zware May 21, 2023 07:11

corona10 requested changes May 21, 2023

bedevere-bot removed the awaiting review label May 21, 2023

bedevere-bot added the awaiting changes label May 21, 2023

corona10 reviewed May 21, 2023

indygreg force-pushed the rework-bolt branch from f7851cd to 555686d Compare May 21, 2023 20:14

erlend-aasland added 2 commits May 21, 2023 23:43

Fixup doc formatting

52a266e

M4 style nits

3cf0f36

erlend-aasland reviewed May 21, 2023

.gitignore (resolved)

Makefile.pre.in (resolved)

Makefile.pre.in (resolved)

erlend-aasland reviewed May 21, 2023

configure.ac (resolved)

Remove extra backtick

972dcb2

erlend-aasland requested a review from corona10 May 22, 2023 11:37

erlend-aasland approved these changes May 22, 2023

bedevere-bot added awaiting merge and removed awaiting changes labels May 22, 2023

erlend-aasland changed the title ~~gh-101282: rework the BOLT build process~~ gh-101282: Apply BOLT optimisations to libpython for shared builds May 22, 2023

erlend-aasland merged commit 5360cb3 into python:main May 22, 2023

bedevere-bot removed the awaiting merge label May 22, 2023
