In LLVM backend, track which floats are guaranteed to be arithmetic, which makes the canonicalization a no-op. #934

nlewycky · 2019-11-07T18:53:44Z

Description

This is a reimplementation of the patch in PR #651.

Extend state.rs ExtraInfo to track more information about floats. In addition to tracking whether the value has a pending canonicalization of NaNs, also track whether the value is known to be arithmetic (which includes infinities, regular values, and non-signalling NaNs, a.k.a. "arithmetic NaNs" in the WebAssembly spec). When the value is arithmetic, the correct sequence of operations to canonicalize the value is a no-op. Therefore, we create a lattice where pending+arithmetic=arithmetic.
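To make the lattice concrete, here is a minimal sketch of the "pending+arithmetic=arithmetic" rule. It is not the code from this patch: the flag values, the `PENDING_F32` and `ARITHMETIC_F32` constants, and the `is_arithmetic_f32` helper are assumptions made for illustration, and only the f32 half is shown (f64 would be tracked the same way).

```rust
use std::ops::BitOr;

/// Per-value float facts. Flag values are illustrative assumptions,
/// not the ones used in state.rs.
#[derive(Debug, Default, Clone, Copy, PartialEq, Eq)]
pub struct ExtraInfo {
    state: u8,
}

const PENDING_F32: u8 = 1;
const ARITHMETIC_F32: u8 = 2;

impl ExtraInfo {
    pub const fn pending_f32_nan() -> ExtraInfo {
        ExtraInfo { state: PENDING_F32 }
    }
    pub const fn arithmetic_f32() -> ExtraInfo {
        ExtraInfo { state: ARITHMETIC_F32 }
    }
    pub const fn has_pending_f32_nan(&self) -> bool {
        (self.state & PENDING_F32) != 0
    }
    // Hypothetical helper name, introduced for this sketch.
    pub const fn is_arithmetic_f32(&self) -> bool {
        (self.state & ARITHMETIC_F32) != 0
    }
}

impl BitOr for ExtraInfo {
    type Output = Self;

    /// Union of two sets of facts about the same value. Being arithmetic
    /// makes canonicalization a no-op, so it cancels a pending flag.
    fn bitor(self, other: Self) -> Self {
        let state = if self.is_arithmetic_f32() || other.is_arithmetic_f32() {
            ARITHMETIC_F32
        } else if self.has_pending_f32_nan() || other.has_pending_f32_nan() {
            PENDING_F32
        } else {
            0
        };
        ExtraInfo { state }
    }
}
```

With these definitions, `ExtraInfo::pending_f32_nan() | ExtraInfo::arithmetic_f32()` yields the arithmetic state with no pending flag left.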

Also, this extends the tracking to track all values, including non-SIMD integers. That's why there are more places where pending canonicalizations are applied.

Looking at c-wasm-simd128-example, this provides no performance change to the non-SIMD case (takes 58s on my noisy dev machine). The SIMD case drops from 46s to 29s.

Review

Add a short description of the change to the CHANGELOG.md file

losfair · 2019-11-10T10:32:32Z

lib/llvm-backend/src/state.rs

+pub struct ExtraInfo {
+    state: u8,
+}
+impl ExtraInfo {


Why do we need to distinguish between f32 and f64 NaNs in ExtraInfo, since the opcodes and values are already typed?

Because of the SIMD opcodes which aren't typed. They're all just v128, so we might go from a F64x2Add to a F32x4Mul with no cast in between.

v128_to_f32x4 checks for a pending f64 canonicalization and applies it, while not applying a pending f32 canonicalization. v128_to_f64x2 does the opposite.
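The rule for the two casts can be modelled without any LLVM types. The sketch below is a simplified stand-in, not the backend code: the `Pending` and `View` enums and the `resolve_for_view` function are hypothetical names. It returns which canonicalization would have to be emitted before reinterpreting a v128, and what stays pending afterwards.

```rust
/// Hypothetical stand-in for the pending state tracked on one v128 value.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Pending {
    None,
    F32Nan,
    F64Nan,
}

/// Lane interpretation requested by the next opcode.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum View {
    F32x4,
    F64x2,
}

/// Returns (canonicalization to emit now, pending state left afterwards).
pub fn resolve_for_view(pending: Pending, view: View) -> (Option<View>, Pending) {
    match (pending, view) {
        // Same lane type: the canonicalization can stay pending.
        (Pending::F32Nan, View::F32x4) | (Pending::F64Nan, View::F64x2) => (None, pending),
        // Lane type changes: the old canonicalization must be applied first,
        // because it can no longer be expressed in the new lane layout.
        (Pending::F64Nan, View::F32x4) => (Some(View::F64x2), Pending::None),
        (Pending::F32Nan, View::F64x2) => (Some(View::F32x4), Pending::None),
        // Nothing pending: nothing to do.
        (Pending::None, _) => (None, Pending::None),
    }
}
```

For example, going from an F64x2Add result (pending f64) to an F32x4Mul operand corresponds to `resolve_for_view(Pending::F64Nan, View::F32x4)`, which asks for the f64x2 canonicalization to be applied immediately.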

MarkMcCaskey

Looks good! Looks like there's a few small mistakes but otherwise good!

MarkMcCaskey · 2019-11-26T19:07:54Z

CHANGELOG.md

@@ -24,6 +24,7 @@ Special thanks to [@newpavlov](https://github.com/newpavlov) and [@Maxgy](https:
 - [#939](https://github.com/wasmerio/wasmer/pull/939) Fix bug causing attempts to append to files with WASI to delete the contents of the file
 - [#940](https://github.com/wasmerio/wasmer/pull/940) Update supported Rust version to 1.38+
 - [#923](https://github.com/wasmerio/wasmer/pull/923) Fix memory leak in the C API caused by an incorrect cast in `wasmer_trampoline_buffer_destroy`
+- [#934](https://github.com/wasmerio/wasmer/pull/934) Simplify float expressions in the LLVM backend.


Changelog entry should be moved

MarkMcCaskey · 2019-11-26T19:38:56Z

lib/llvm-backend/src/state.rs

    // machine, but which might not be in the LLVM value. The conversion to
    // arithmetic NaN is pending. It is required for correctness.
-    PendingF32NaN,
+    pub fn pending_f32_nan() -> ExtraInfo {


These functions can be const

MarkMcCaskey · 2019-11-26T19:39:14Z

lib/llvm-backend/src/state.rs

+        ExtraInfo { state: 8 }
+    }
+
+    pub fn has_pending_f32_nan(&self) -> bool {


These functions may be able to be const

They are. Done.

MarkMcCaskey · 2019-11-26T19:39:48Z

lib/llvm-backend/src/state.rs

-        ExtraInfo::None
+        ExtraInfo { state: 0 }
+    }
+}


This is the same as derive(Default) on ExtraInfo

So it is! Done.
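For reference, the two suggestions above (const functions and a derived `Default`) combine roughly as follows. This is a sketch under assumptions: the flag value `1` and the two constants at the bottom are illustrative and do not come from the patch.

```rust
// derive(Default) produces ExtraInfo { state: 0 }, matching the
// hand-written impl that it replaces.
#[derive(Debug, Default, Clone, Copy, PartialEq, Eq)]
pub struct ExtraInfo {
    state: u8,
}

impl ExtraInfo {
    // The flag value 1 is an assumption for this sketch.
    pub const fn pending_f32_nan() -> ExtraInfo {
        ExtraInfo { state: 1 }
    }

    pub const fn has_pending_f32_nan(&self) -> bool {
        (self.state & ExtraInfo::pending_f32_nan().state) != 0
    }
}

// Because the functions are const, they can be evaluated at compile time.
pub const PENDING_F32: ExtraInfo = ExtraInfo::pending_f32_nan();
pub const PENDING_F32_IS_PENDING: bool = PENDING_F32.has_pending_f32_nan();
```

The derived default carries no flags, so `ExtraInfo::default().has_pending_f32_nan()` is false.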

MarkMcCaskey · 2019-11-26T19:40:31Z

lib/llvm-backend/src/state.rs

+
+    fn bitor(self, other: Self) -> Self {
+        assert!(!(self.has_pending_f32_nan() && other.has_pending_f64_nan()));
+        assert!(!(self.has_pending_f64_nan() && other.has_pending_f32_nan()));


If you only want these in debug mode, use debug_assert!; as written, these asserts will run on every use.

Good idea, done.
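A reduced sketch of what the change looks like, assuming illustrative flag values and a simplified union that ignores the arithmetic flags. `debug_assert!` is checked only when debug assertions are enabled (the default for non-optimized builds), so optimized builds skip both checks.

```rust
use std::ops::BitOr;

#[derive(Debug, Default, Clone, Copy, PartialEq, Eq)]
pub struct ExtraInfo {
    state: u8,
}

// Flag values are assumptions for this sketch.
const PENDING_F32: u8 = 1;
const PENDING_F64: u8 = 2;

impl ExtraInfo {
    pub const fn pending_f32_nan() -> ExtraInfo {
        ExtraInfo { state: PENDING_F32 }
    }
    pub const fn pending_f64_nan() -> ExtraInfo {
        ExtraInfo { state: PENDING_F64 }
    }
    pub const fn has_pending_f32_nan(&self) -> bool {
        (self.state & PENDING_F32) != 0
    }
    pub const fn has_pending_f64_nan(&self) -> bool {
        (self.state & PENDING_F64) != 0
    }
}

impl BitOr for ExtraInfo {
    type Output = Self;

    fn bitor(self, other: Self) -> Self {
        // A value cannot be pending as both f32 and f64 at once.
        // These checks vanish when debug assertions are disabled.
        debug_assert!(!(self.has_pending_f32_nan() && other.has_pending_f64_nan()));
        debug_assert!(!(self.has_pending_f64_nan() && other.has_pending_f32_nan()));
        // Simplified: plain union of the flags.
        ExtraInfo { state: self.state | other.state }
    }
}
```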

lib/llvm-backend/src/code.rs

…ackend. Not wired up yet.

It seemed like a good idea at the time, but in practice we discard the extra info all or almost all of the time. This also introduces a new bug. In an operation like multiply, it's valid to multiply two values, one with a pending NaN and one without. As written, in the SIMD case (because of the two kinds of pending in play), we assert.

Unfortunately, this is quite buggy. For something as simple as F32Sub, to combine two ExtraInfos, we want to add a new pending_f32_nan(), unless both of the inputs are arithmetic_f32(). In this commit, we incorrectly calculate that we don't need a pending_f32_nan if either one of the inputs was arithmetic_f32().

We want to ignore the incoming pending NaN state (since the pending will propagate to the output if there was one on the input), and we want to add a new pending NaN state if we can (that is to say, if it isn't cancelled out by both inputs having arithmetic state). Do this by discarding the pending states on the inputs, intersecting them (to keep only the arithmetic state), then unioning in a pending NaN state (which might do nothing, if the result is arithmetic). If the above sounds confusing, keep in mind that when a value is arithmetic, the act of performing the "NaN canonicalization" is a no-op. Thus, being arithmetic cancels out pending NaN states.
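The strip, intersect, then union sequence can be sketched for a two-operand f32 operation such as F32Sub. This is an illustration, not the code in code.rs: the flag values and the names `strip_pending`, `is_arithmetic_f32`, and `f32_binop_result` are assumptions, and only f32 is modelled.

```rust
use std::ops::{BitAnd, BitOr};

#[derive(Debug, Default, Clone, Copy, PartialEq, Eq)]
pub struct ExtraInfo {
    state: u8,
}

// Flag values are assumptions for this sketch.
const PENDING_F32: u8 = 1;
const ARITHMETIC_F32: u8 = 2;

impl ExtraInfo {
    pub const fn pending_f32_nan() -> ExtraInfo {
        ExtraInfo { state: PENDING_F32 }
    }
    pub const fn arithmetic_f32() -> ExtraInfo {
        ExtraInfo { state: ARITHMETIC_F32 }
    }
    pub const fn has_pending_f32_nan(&self) -> bool {
        (self.state & PENDING_F32) != 0
    }
    pub const fn is_arithmetic_f32(&self) -> bool {
        (self.state & ARITHMETIC_F32) != 0
    }
    /// Drop the pending flag and keep the arithmetic flag.
    pub const fn strip_pending(&self) -> ExtraInfo {
        ExtraInfo { state: self.state & !PENDING_F32 }
    }
}

impl BitAnd for ExtraInfo {
    type Output = Self;
    /// Intersection: a fact survives only if it holds for both inputs.
    fn bitand(self, other: Self) -> Self {
        ExtraInfo { state: self.state & other.state }
    }
}

impl BitOr for ExtraInfo {
    type Output = Self;
    /// Union, where arithmetic cancels pending.
    fn bitor(self, other: Self) -> Self {
        let state = if self.is_arithmetic_f32() || other.is_arithmetic_f32() {
            ARITHMETIC_F32
        } else if self.has_pending_f32_nan() || other.has_pending_f32_nan() {
            PENDING_F32
        } else {
            0
        };
        ExtraInfo { state }
    }
}

/// ExtraInfo for the result of a two-operand f32 operation.
pub fn f32_binop_result(lhs: ExtraInfo, rhs: ExtraInfo) -> ExtraInfo {
    (lhs.strip_pending() & rhs.strip_pending()) | ExtraInfo::pending_f32_nan()
}
```

Because the inputs are intersected, the result is arithmetic only when both inputs are. With a single arithmetic input the result still carries a pending NaN, which is the case the earlier commit computed incorrectly.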

Fix a bug in Operator::Select and add a comment to explain the intention. Use derived default for ExtraInfo. Make ExtraInfo associated functions const. Turn two asserts into debug_asserts.

nlewycky · 2019-11-26T20:29:16Z

bors r+

934: In LLVM backend, track which floats are guaranteed to be arithmetic, which makes the canonicalization a no-op. r=nlewycky a=nlewycky

# Description

This is a reimplementation of the patch in PR #651.

Extend state.rs ExtraInfo to track more information about floats. In addition to tracking whether the value has a pending canonicalization of NaNs, also track whether the value is known to be arithmetic (which includes infinities, regular values, and non-signalling NaNs (aka. "arithmetic NaNs" in the webassembly spec)). When the value is arithmetic, the correct sequence of operations to canonicalize the value is a no-op. Therefore, we create a lattice where pending+arithmetic=arithmetic.

Also, this extends the tracking to track all values, including non-SIMD integers. That's why there are more places where pending canonicalizations are applied.

Looking at c-wasm-simd128-example, this provides no performance change to the non-SIMD case (takes 58s on my noisy dev machine). The SIMD case drops from 46s to 29s.

# Review

- [ ] Add a short description of the change to the CHANGELOG.md file

Co-authored-by: Nick Lewycky <nick@wasmer.io>

bors · 2019-11-26T21:09:51Z

Build succeeded

wasmerio.wasmer

nlewycky requested a review from losfair as a code owner November 7, 2019 18:53

losfair reviewed Nov 10, 2019

losfair approved these changes Nov 11, 2019

nlewycky force-pushed the feature/llvm-nan-fix-2 branch from 446b48c to 6b2f17b Compare November 19, 2019 21:25

MarkMcCaskey approved these changes Nov 26, 2019

nlewycky added 8 commits November 26, 2019 12:20

Add "known to not contain non-arithmetic NaNs" to ExtraInfo in LLVM b…

fafc7ad

…ackend. Not wired up yet.

Initial implementation of "known to be arithmetic NaN / not NaN".

26c8fd5

Give that panic! a message. Also, make it an unreachable!.

d1ce8ee

Add changelog entry.

d3fabe5

Address review feedback from Mark.

ff73c5d

Fix a bug in Operator::Select and add a comment to explain the intention. Use derived default for ExtraInfo. Make ExtraInfo associated functions const. Turn two asserts into debug_asserts.

nlewycky force-pushed the feature/llvm-nan-fix-2 branch from 6b2f17b to ff73c5d Compare November 26, 2019 20:29

bors bot merged commit ff73c5d into master Nov 26, 2019

bors bot deleted the feature/llvm-nan-fix-2 branch November 26, 2019 21:09

nlewycky mentioned this pull request Dec 2, 2019

Fix LLVM speed regression #651

Closed
