Rollup of 4 pull requests #84739

Dylan-DPC-zz · 2021-04-30T08:04:42Z

Successful merges:

- #84522 (Reuse sys::unix::cmath on other platforms)
- #84638 (Unignore a couple of tests)
- #84658 (Be stricter about rejecting LLVM reserved registers in asm!)
- #84725 ([Arm64] use isb instruction instead of yield in spin loops)

Failed merges:

r? @ghost
@rustbot modify labels: rollup

Reuse `sys::unix::cmath` on other platforms

Reuse `sys::unix::cmath` on all non-`windows` platforms. `unix` is chosen as the canonical location instead of `unsupported` or `common` because `unsupported` doesn't make sense semantically and `common` is reserved for code that is supported on all platforms. Also, `unix` is already the home of some non-`windows` code that is technically not exclusive to `unix`, like `unix::path`.

…ulacrum Unignore a couple of tests

Be stricter about rejecting LLVM reserved registers in asm!

LLVM will silently produce incorrect code if these registers are used as operands.

cc `@rust-lang/wg-inline-asm`

[Arm64] use isb instruction instead of yield in spin loops

On arm64 we have seen on several databases that ISB (instruction synchronization barrier) is better to use than yield in a spin loop. The yield instruction is a nop. The isb instruction puts the processor to sleep for some short time. isb is a good equivalent to the pause instruction on x86.

Below is an experiment that shows the effects of yield and isb on Arm64 and the time of a pause instruction on x86 Intel processors. The micro-benchmarks use https://github.com/google/benchmark.git

```
$ cat a.cc
static void BM_scalar_increment(benchmark::State& state) {
  int i = 0;
  for (auto _ : state)
    benchmark::DoNotOptimize(i++);
}
BENCHMARK(BM_scalar_increment);
static void BM_yield(benchmark::State& state) {
  for (auto _ : state)
    asm volatile("yield"::);
}
BENCHMARK(BM_yield);
static void BM_isb(benchmark::State& state) {
  for (auto _ : state)
    asm volatile("isb"::);
}
BENCHMARK(BM_isb);
BENCHMARK_MAIN();

$ g++ -o run a.cc -O2 -lbenchmark -lpthread
$ ./run

--------------------------------------------------------------
Benchmark                    Time             CPU   Iterations
--------------------------------------------------------------

AWS Graviton2 (Neoverse-N1) processor:
BM_scalar_increment      0.485 ns        0.485 ns   1000000000
BM_yield                 0.400 ns        0.400 ns   1000000000
BM_isb                    13.2 ns         13.2 ns     52993304

AWS Graviton (A-72) processor:
BM_scalar_increment      0.897 ns        0.874 ns    801558633
BM_yield                 0.877 ns        0.875 ns    800002377
BM_isb                    13.0 ns         12.7 ns     55169412

Apple Arm64 M1 processor:
BM_scalar_increment      0.315 ns        0.315 ns   1000000000
BM_yield                 0.313 ns        0.313 ns   1000000000
BM_isb                    9.06 ns         9.06 ns     77259282
```

```
static void BM_pause(benchmark::State& state) {
  for (auto _ : state)
    asm volatile("pause"::);
}
BENCHMARK(BM_pause);

Intel Skylake processor:
BM_scalar_increment      0.295 ns        0.295 ns   1000000000
BM_pause                  41.7 ns         41.7 ns     16780553
```

Tested on Graviton2 aarch64-linux with `./x.py test`.
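For context on where such a hint shows up in user code, here is a minimal sketch of a spin lock that calls `std::hint::spin_loop()`, the standard-library spin hint. `SpinLock` and `contended_count` are hypothetical names used only for illustration and are not part of the PR; which instruction the hint lowers to (`yield`, `isb`, `pause`, or nothing) depends on the target and the toolchain version.

```rust
use std::hint;
use std::sync::atomic::{AtomicBool, AtomicUsize, Ordering};
use std::sync::Arc;
use std::thread;

/// Hypothetical minimal spin lock, for illustration only.
pub struct SpinLock {
    locked: AtomicBool,
}

impl SpinLock {
    pub const fn new() -> Self {
        SpinLock { locked: AtomicBool::new(false) }
    }

    pub fn lock(&self) {
        // Try to flip `locked` from false to true; retry until it succeeds.
        while self
            .locked
            .compare_exchange_weak(false, true, Ordering::Acquire, Ordering::Relaxed)
            .is_err()
        {
            // Tell the CPU we are busy-waiting. The emitted instruction is
            // target-specific and chosen by the standard library.
            hint::spin_loop();
        }
    }

    pub fn unlock(&self) {
        self.locked.store(false, Ordering::Release);
    }
}

/// Spawns `threads` workers; each increments a shared counter `iters` times
/// while holding the lock, then the final count is returned.
pub fn contended_count(threads: usize, iters: usize) -> usize {
    let lock = Arc::new(SpinLock::new());
    let counter = Arc::new(AtomicUsize::new(0));
    let handles: Vec<_> = (0..threads)
        .map(|_| {
            let lock = Arc::clone(&lock);
            let counter = Arc::clone(&counter);
            thread::spawn(move || {
                for _ in 0..iters {
                    lock.lock();
                    // Separate load and store: only correct because the
                    // spin lock serializes this critical section.
                    let v = counter.load(Ordering::Relaxed);
                    counter.store(v + 1, Ordering::Relaxed);
                    lock.unlock();
                }
            })
        })
        .collect();
    for handle in handles {
        handle.join().unwrap();
    }
    counter.load(Ordering::Relaxed)
}
```

Calling `contended_count(4, 1000)` exercises the contended path, where waiting threads sit in the `hint::spin_loop()` call. The cost of each hint is what the micro-benchmark above measures per instruction.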

Dylan-DPC-zz · 2021-04-30T08:05:56Z

@bors r+ rollup=never p=5

bors · 2021-04-30T08:05:58Z

📌 Commit 7a1462f has been approved by Dylan-DPC

bors · 2021-04-30T09:22:00Z

⌛ Testing commit 7a1462f with merge 4a07567b863f1b1bf0d2d7341f366ce6778ba8e4...

rust-log-analyzer · 2021-04-30T09:50:00Z

The job dist-various-2 failed! Possible cause of the failure (guessed by this bot):

[RUSTC-TIMING] gimli test:false 5.402
[RUSTC-TIMING] object test:false 10.260
warning: dropping unsupported crate type `dylib` for target `x86_64-fortanix-unknown-sgx`

error: invalid register `rbx`: rbx is used internally by LLVM and cannot be used as an operand for inline asm
  --> library/std/src/sys/sgx/ext/arch.rs:37:13
   |
37 |             in("rbx") request,


error: invalid register `rbx`: rbx is used internally by LLVM and cannot be used as an operand for inline asm
  --> library/std/src/sys/sgx/ext/arch.rs:65:13
   |
65 |             in("rbx") targetinfo,

error: aborting due to 2 previous errors; 1 warning emitted

[RUSTC-TIMING] std test:false 1.859
[RUSTC-TIMING] std test:false 1.859
error: could not compile `std`

To learn more, run the command again with --verbose.
command did not execute successfully: "/checkout/obj/build/x86_64-unknown-linux-gnu/stage0/bin/cargo" "build" "--target" "x86_64-fortanix-unknown-sgx" "-Zbinary-dep-depinfo" "-j" "16" "--release" "--locked" "--color" "always" "--features" "panic-unwind backtrace compiler-builtins-c" "--manifest-path" "/checkout/library/test/Cargo.toml" "--message-format" "json-render-diagnostics"
failed to run: /checkout/obj/build/bootstrap/debug/bootstrap dist --host= --target x86_64-fuchsia,aarch64-fuchsia,wasm32-unknown-unknown,wasm32-wasi,sparcv9-sun-solaris,x86_64-pc-solaris,x86_64-unknown-linux-gnux32,x86_64-fortanix-unknown-sgx,nvptx64-nvidia-cuda,armv7-unknown-linux-gnueabi,armv7-unknown-linux-musleabi,i686-unknown-freebsd
Build completed unsuccessfully in 0:24:28
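The two errors above reject `rbx` as an explicit operand. One common way around this kind of rejection, sketched here under assumptions, is to pass the value in a compiler-chosen register and swap it into `rbx` only inside the asm block, restoring `rbx` before the block ends. `add_one_via_rbx` is a hypothetical stand-in: the `lea` takes the place of whatever instruction actually needs its input in `rbx`. The sketch uses the stabilized `std::arch::asm!` syntax rather than the feature-gated macro that was available when this rollup was tested.

```rust
/// Hypothetical example: feed `value` to an instruction through `rbx`
/// without naming `rbx` as an operand.
#[cfg(target_arch = "x86_64")]
pub fn add_one_via_rbx(value: u64) -> u64 {
    use std::arch::asm;
    let result: u64;
    unsafe {
        asm!(
            // Swap: rbx now holds `value`, `tmp` holds the saved rbx.
            "xchg {tmp}, rbx",
            // Stand-in for an instruction that reads its input from rbx.
            "lea {res}, [rbx + 1]",
            // Restore the original rbx before leaving the block.
            "mov rbx, {tmp}",
            tmp = inout(reg) value => _,
            res = out(reg) result,
            options(nostack, nomem, preserves_flags),
        );
    }
    result
}

/// Portable fallback so the sketch also builds on other architectures.
#[cfg(not(target_arch = "x86_64"))]
pub fn add_one_via_rbx(value: u64) -> u64 {
    value.wrapping_add(1)
}
```

Because `tmp` is an `inout` operand and `res` is a plain `out`, the compiler assigns them different registers, so the saved `rbx` value is not overwritten before it is restored.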

bors · 2021-04-30T09:52:27Z

💔 Test failed - checks-actions

mark-i-m and others added 9 commits April 27, 2021 21:20

unignore a couple of tests

cf46fb1

Reuse unix::cmath

26fb1e3

Be stricter about rejecting LLVM reserved registers in asm!

e6a731e

fix test

6697b0d

Rollup merge of rust-lang#84638 - mark-i-m:unignore-tests, r=Mark-Sim…

730cc84

…ulacrum Unignore a couple of tests

Rollup merge of rust-lang#84658 - Amanieu:reserved_regs, r=petrochenkov

c923562

Be stricter about rejecting LLVM reserved registers in asm! LLVM will silently produce incorrect code if these registers are used as operands. cc `@rust-lang/wg-inline-asm`

rustbot added the `rollup` label (A PR which is a rollup) Apr 30, 2021

bors added the `S-waiting-on-bors` label (Status: Waiting on bors to run and complete tests. Bors will change the label on completion.) Apr 30, 2021

bors added the `S-waiting-on-review` label (Status: Awaiting review from the assignee but also interested parties.) and removed the `S-waiting-on-bors` label Apr 30, 2021

jackh726 mentioned this pull request Apr 30, 2021

Be stricter about rejecting LLVM reserved registers in asm! #84658

Merged

jackh726 closed this Apr 30, 2021

Dylan-DPC-zz deleted the rollup-6182psa branch April 30, 2021 11:11
