unix: BOLT fixes #463

indygreg · 2025-01-01T19:36:11Z

As part of investigating failures with BOLT when upgrading to LLVM 19, I found and fixed a few issues with BOLT.

First, test_embed had been segfaulting on BOLT instrumented binaries. Why I'm not entirely sure. But the segfault only seems to occur in instrumentation mode. These tests are doing low-level things with the interpreter. So I suspect some kind of global mutable state issue or something.

I found the exact tests triggering the segfaults and added annotations to skip them.

The CPython build system treats the segfault as fatal on 3.13 but not 3.12. This means that on 3.12 we were only running a subset of tests and not collecting BOLT instrumentation nor applying optimizations for all tests after test_embed.

The removal of the segfault enables us to enable BOLT on 3.13+.

Second, LLVM 19.x has a hard error when handling PIC compiled functions containing computed gotos. It appears prior versions of LLVM could silently have buggy behavior in this scenario. We need to skip functions with computed gotos to allow LLVM 19.x to work with BOLT. It makes sense to apply this patch before LLVM 19.x upgrade to prevent bugs with computed gotos.

Third, I noticed BOLT was complaining about the lack of -update-debug-sections during instrumentation.

The 2nd and 3rd issues require common arguments to both BOLT instrumentation and application invocations. The patch fixing both introduces a new configure variable to hold common BOLT arguments. This patch is a good candidate for upstreaming.

indygreg · 2025-01-01T21:08:07Z

Bleh. More test_embed failures on 3.14 that need resolved...

As part of investigating failures with BOLT when upgrading to LLVM 19, I found and fixed a few issues with BOLT. First, `test_embed` had been segfaulting on BOLT instrumented binaries. Why I'm not entirely sure. But the segfault only seems to occur in instrumentation mode. These tests are doing low-level things with the interpreter. So I suspect some kind of global mutable state issue or something. I found the exact tests triggering the segfaults and added annotations to skip them. The CPython build system treats the segfault as fatal on 3.13 but not 3.12. This means that on 3.12 we were only running a subset of tests and not collecting BOLT instrumentation nor applying optimizations for all tests after `test_embed`. The removal of the segfault enables us to enable BOLT on 3.13+. Second, LLVM 19.x has a hard error when handling PIC compiled functions containing computed gotos. It appears prior versions of LLVM could silently have buggy behavior in this scenario. We need to skip functions with computed gotos to allow LLVM 19.x to work with BOLT. It makes sense to apply this patch before LLVM 19.x upgrade to prevent bugs with computed gotos. Third, I noticed BOLT was complaining about the lack of `-update-debug-sections` during instrumentation. The 2nd and 3rd issues require common arguments to both BOLT instrumentation and application invocations. The patch fixing both introduces a new configure variable to hold common BOLT arguments. This patch is a good candidate for upstreaming.

zanieb · 2025-01-01T21:34:57Z

cpython-unix/build-cpython.sh

-        # Due to a SEGFAULT when running `test_embed` with BOLT instrumented binaries, we can't use
-        # BOLT on Python 3.13+.
-        # TODO: Find a fix for this or consider skipping these tests specifically
-        echo "BOLT is disabled on Python 3.13+"


Yay! Thank you.

zanieb · 2025-01-01T21:35:23Z

cpython-unix/build-cpython.sh

+# On 3.12 (minimum BOLT version), the segfault causes the test harness to
+# abort and BOLT optimization uses the partial test results. On 3.13, the segfault
+# is a fatal error.
+if [ -n "${PYTHON_MEETS_MINIMUM_VERSION_3_10}" ]; then


Should these also be gated by -n "${BOLT_CAPABLE}" so we don't disable tests unnecessarily?

We don't actually run the full suite of tests. So it doesn't matter today.

(It would be nice to actually run the full test suite against the built distribution but that's for another PR I suppose.)

Won't these be reflected in the distributed Python too? Like, a consumer would have these tests disabled?

Agree it doesn't seem critical.

Oh, right, I forgot we distributed the unit tests.

I'll file an issue to clean things up. Agree we could do something better here. And we may be paving over a legit bug somewhere by disabling tests that segfault.

This is a redo of #420, which was merged prematurely. With the BOLT changes from #463 merged, LLVM 19 _just works_. As part of this we also modernize the BOLT apply settings to follow the recommendations at https://llvm.org/devmtg/2024-03/slides/practical-use-of-bolt.pdf. This includes enabling support for loading hot code from a huge page at runtime. This should _just work_ and could result in perf wins via improved iTLB hit rate, etc.

zanieb · 2025-01-04T06:04:39Z

Sort of tragically, I cannot reproduce the test_embed failures in CPython alone so I'm going to have to start using components here one by one until it occurs? Or attach a debugger to this build? I'm not sure. It's looking hard.

zanieb · 2025-01-04T06:06:29Z

Also, I'm not seeing anything related to

Second, LLVM 19.x has a hard error when handling PIC compiled functions containing computed gotos. It appears prior versions of LLVM could silently have buggy behavior in this scenario. We need to skip functions with computed gotos to allow LLVM 19.x to work with BOLT.

Am I missing some configuration to encounter this behavior?

edit: I reproduced this with a configure invocation copied from CI... just need to minimize it now.

zanieb · 2025-01-04T06:58:55Z

Ah okay --enable-shared is needed to reproduce both of these failures — and second fix is required before I can get nice test_embed segfaults :)

indygreg force-pushed the gps/bolt-fixes branch from 37b0cb6 to a4d5bad Compare January 1, 2025 19:37

indygreg mentioned this pull request Jan 1, 2025

Upgrade LLVM to 19.1.6 #462

Merged

indygreg force-pushed the gps/bolt-fixes branch from a4d5bad to 50395d7 Compare January 1, 2025 20:01

indygreg mentioned this pull request Jan 1, 2025

[BOLT] [3.12] Python 3.12.7 --enable bolt option not working python/cpython#124948

Open

indygreg force-pushed the gps/bolt-fixes branch from 50395d7 to 5c1e3a3 Compare January 1, 2025 21:19

zanieb self-requested a review January 1, 2025 21:26

zanieb added platform:darwin Specific to the macOS platform platform:linux Specific to the Linux platform labels Jan 1, 2025

indygreg force-pushed the gps/bolt-fixes branch from 5c1e3a3 to 1f321c4 Compare January 1, 2025 21:30

zanieb reviewed Jan 1, 2025

View reviewed changes

zanieb approved these changes Jan 1, 2025

View reviewed changes

indygreg mentioned this pull request Jan 1, 2025

Better handle BOLT skipped tests #465

Closed

indygreg merged commit 4859cdf into main Jan 1, 2025
297 checks passed

indygreg deleted the gps/bolt-fixes branch January 1, 2025 23:08

This was referenced Jan 2, 2025

[BOLT] llvm-bolt crashes during Python 3.13.1 build with addDebugFilenameToUnit segmentation fault llvm/llvm-project#121213

Closed

[BOLT] Support computed goto and allow map addrs inside functions llvm/llvm-project#120267

Merged

This was referenced Jan 3, 2025

Bolt instrumentation missing -update-debug-sections python/cpython#128437

Closed

gh-128437: Add BOLT_COMMON_FLAGS with -update-debug-sections python/cpython#128455

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

unix: BOLT fixes #463

unix: BOLT fixes #463

indygreg commented Jan 1, 2025

indygreg commented Jan 1, 2025

zanieb Jan 1, 2025

zanieb Jan 1, 2025

indygreg Jan 1, 2025

zanieb Jan 1, 2025

indygreg Jan 1, 2025

zanieb commented Jan 4, 2025

zanieb commented Jan 4, 2025 •

edited

Loading

zanieb commented Jan 4, 2025

unix: BOLT fixes #463

unix: BOLT fixes #463

Conversation

indygreg commented Jan 1, 2025

indygreg commented Jan 1, 2025

zanieb Jan 1, 2025

Choose a reason for hiding this comment

zanieb Jan 1, 2025

Choose a reason for hiding this comment

indygreg Jan 1, 2025

Choose a reason for hiding this comment

zanieb Jan 1, 2025

Choose a reason for hiding this comment

indygreg Jan 1, 2025

Choose a reason for hiding this comment

zanieb commented Jan 4, 2025

zanieb commented Jan 4, 2025 • edited Loading

zanieb commented Jan 4, 2025

zanieb commented Jan 4, 2025 •

edited

Loading