Use copysign LLVM intrinsic rather than bithack ourselves #39768

wsmoses · 2021-02-20T20:49:13Z

LLVM has an internal intrinsic for copysign which is both more compatible across architecture than assuming an and of a sign bit and also better enables LLVM optimizations that understand what functional operation is occuring.

Moreover, while downstream users such as Enzyme.jl are differentiating through an increasing number of bithacks, using the proper intrinsic for this operation makes analysis dramatically easier.

oscardssmith · 2021-02-20T21:30:19Z

Have you done performance testing to ensure this isn't a regression?

wsmoses · 2021-02-20T21:39:58Z

Update: it appears this (and likely flipsign and signbit) implementations are broken on some systems.

From Julia's libm implementation (https://github.com/JuliaMath/openlibm/blob/b34f107e24e97cd7b4eedc6868e330a9ff321120/src/fpmath.h#L98), the sign bit is not guaranteed to be in the place Julia current expects it to be.

Keno

Seems fine to me. I believe this intrinsic didn't exist when this code was first written.

Keno · 2021-02-20T22:25:41Z

nanosoldier run would be good

vchuravy · 2021-02-20T22:29:47Z

After briefly being reachable, Nanosoidier came down with the cold again...

I did some small scale tests for vectorization and performance was equivalent.

Keno · 2021-02-23T00:59:13Z

I'm not sure there is justification to backport this to 1.6 at this point. It's not a bugfix, and there is always the possibility of finding fun new LLVM behavior with changes like this. It can go into 1.6.1 if it proves safe on master.

wsmoses · 2021-02-23T05:06:45Z

Per my reading of the Julia libm source code (linked above), I believe there would be a miscompilation without this patch for any architecture that has __BYTE_ORDER__ == __ORDER_LITTLE_ENDIAN__ and __FLOAT_WORD_ORDER__ == __ORDER_BIG_ENDIAN__.

Now I'm not sure what architecture actually has those properties (@vchuravy and I did some thinking and came up empty), but thought would throw out there.

Keno · 2021-02-23T05:09:04Z

None of our supported platforms have that property. I'm not even sure any LLVM-supported platforms do.

vtjnash · 2021-02-26T17:18:50Z

@nanosoldier runbenchmarks("scalar" && ("Complex{Float64}" || "Complex{Float32}"), vs=":master")

vtjnash · 2021-02-26T17:19:58Z

@nanosoldier runbenchmarks("scalar" && ("Complex{Float64}" || "Complex{Float32}"), vs=":master")

nanosoldier · 2021-02-26T17:43:29Z

Your benchmark job has completed, but no benchmarks were actually executed. Perhaps your tag predicate contains misspelled tags? cc @christopher-dG

vchuravy · 2021-03-03T22:11:31Z

@nanosoldier runbenchmarks(ALL, vs=":master")

nanosoldier · 2021-03-04T14:41:01Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @christopher-dG

Use copysign LLVM intrinsic rather than bithack ourselves

832c924

wsmoses force-pushed the copysign branch from feffeac to 832c924 Compare February 20, 2021 20:56

vchuravy requested review from Keno and vtjnash February 20, 2021 20:59

vchuravy added the compiler:codegen Generation of LLVM IR and native code label Feb 20, 2021

Keno approved these changes Feb 20, 2021

View reviewed changes

Keno added the needs nanosoldier run This PR should have benchmarks run on it label Feb 20, 2021

vchuravy added the backport 1.6 Change should be backported to release-1.6 label Feb 20, 2021

KristofferC mentioned this pull request Feb 22, 2021

Backports for 1.6-RC2 #39614

Merged

52 tasks

Keno removed the backport 1.6 Change should be backported to release-1.6 label Feb 23, 2021

vtjnash approved these changes Feb 23, 2021

View reviewed changes

vchuravy merged commit 3d0b60d into JuliaLang:master Mar 4, 2021

vchuravy added the backport 1.6 Change should be backported to release-1.6 label May 6, 2021

KristofferC mentioned this pull request May 11, 2021

Backports for Julia-1.6.2 #40702

Merged

45 tasks

KristofferC removed the backport 1.6 Change should be backported to release-1.6 label Jul 12, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use copysign LLVM intrinsic rather than bithack ourselves #39768

Use copysign LLVM intrinsic rather than bithack ourselves #39768

wsmoses commented Feb 20, 2021 •

edited

Loading

oscardssmith commented Feb 20, 2021

wsmoses commented Feb 20, 2021

Keno left a comment

Keno commented Feb 20, 2021

vchuravy commented Feb 20, 2021

Keno commented Feb 23, 2021

wsmoses commented Feb 23, 2021 •

edited

Loading

Keno commented Feb 23, 2021

vtjnash commented Feb 26, 2021 •

edited

Loading

vtjnash commented Feb 26, 2021

nanosoldier commented Feb 26, 2021

vchuravy commented Mar 3, 2021

nanosoldier commented Mar 4, 2021

Use copysign LLVM intrinsic rather than bithack ourselves #39768

Use copysign LLVM intrinsic rather than bithack ourselves #39768

Conversation

wsmoses commented Feb 20, 2021 • edited Loading

oscardssmith commented Feb 20, 2021

wsmoses commented Feb 20, 2021

Keno left a comment

Choose a reason for hiding this comment

Keno commented Feb 20, 2021

vchuravy commented Feb 20, 2021

Keno commented Feb 23, 2021

wsmoses commented Feb 23, 2021 • edited Loading

Keno commented Feb 23, 2021

vtjnash commented Feb 26, 2021 • edited Loading

vtjnash commented Feb 26, 2021

nanosoldier commented Feb 26, 2021

vchuravy commented Mar 3, 2021

nanosoldier commented Mar 4, 2021

wsmoses commented Feb 20, 2021 •

edited

Loading

wsmoses commented Feb 23, 2021 •

edited

Loading

vtjnash commented Feb 26, 2021 •

edited

Loading