-
Notifications
You must be signed in to change notification settings - Fork 13.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Change rint
to roundeven
in round_ties_even
implementation
#136459
Comments
I tested the below code on a handful of Tier1/2/3 targets with llc define noundef float @round_f32(float noundef %x) unnamed_addr {
start:
%0 = tail call float @llvm.round.f32(float %x)
ret float %0
}
define noundef float @rint_f32(float noundef %x) unnamed_addr {
start:
%0 = tail call float @llvm.rint.f32(float %x)
ret float %0
}
define noundef float @roundeven_f32(float noundef %x) unnamed_addr {
start:
%0 = tail call float @llvm.roundeven.f32(float %x)
ret float %0
}
define noundef double @round_f64(double noundef %x) unnamed_addr {
start:
%0 = tail call double @llvm.round.f64(double %x)
ret double %0
}
define noundef double @rint_f64(double noundef %x) unnamed_addr {
start:
%0 = tail call double @llvm.rint.f64(double %x)
ret double %0
}
define noundef double @roundeven_f64(double noundef %x) unnamed_addr {
start:
%0 = tail call double @llvm.roundeven.f64(double %x)
ret double %0
} |
Keep in mind that the C standard and related documents are written with FENV_ACCESS in mind, but LLVM’s normal (not “constrained”) intrinsics operate in what LLVM calls the default floating point environment, where rounding mode is always to nearest and exceptions are ignored, same as what Rust does. The lowering on most platforms still raises exceptions but that’s because it’s the most efficient lowering and the difference doesn’t matter for programs that adhere to the “default fenv” rules. So I don’t see any reason to change the lowering from Rust to LLVM IR. |
rint
to roundeven
rint
to roundeven
in round_ties_even
implementation
I should have clarified this but the impression I got from other threads is that |
FWIW, LLVM also has As far as I can tell, the LLVM semantics of |
See also llvm/llvm-project#77561 |
I think that documentations is just outdated, in reality all three support vectorization: https://github.com/llvm/llvm-project/blob/76e73ae6af1cecffaf977391e52bb9c410c14ff1/llvm/lib/Analysis/VectorUtils.cpp#L94-L97 |
That would also work. But since C23 is now out I feel as though we may as well just use |
Ideally LLVM would have only one intrinsic for this operation and deprecate/remove (auto-upgrade) the other equivalent ones, so that this confusion doesn’t continue indefinitely. If this happens, Rust should of course be updated accordingly. Otherwise, switching from one equivalent intrinsic to another seems like pointless churn that risks regressions and risks propagating the confusion further. |
I guess we can just rename the |
I agree that it would be best to only have one intrinsic. But I disagree that changing to a function with obvious implications would somehow make things more confusing; Also the intrinsics already exist https://doc.rust-lang.org/std/intrinsics/fn.roundevenf32.html, removing |
Oh dear, why do we have that one too? It doesn’t seem to be used anywhere in-tree. Would be good to clean that up and only have one intrinsic, but I don’t think anything has changed since #95317 (comment) so we probably can’t just replace rint with roundeven. At minimum we’d need a fallback definition of the C symbols that LLVM may try to call in compiler-builtins. That seems like more work than consolidating on a single intrinsic on the Rust side but still lowering it to LLVM’s rint intrinsics. |
Yeah, having a single Rust intrinsic for this would definitely be a good first cleanup step, so that the decision of how to lower to LLVM can be adjusted entirely locally in the LLVM codegen backend. |
#136543 cleans up our intrinsics. Unless we have any evidence that |
There are plans to replace the constrained FP intrinsics with operand bundles on normal intrinsics, so I'm not sure this makes sense anymore. |
I’m the one who added these intrinsics to Rust, and the stable AFAICT, the only difference between these intrinsics is which C function call LLVM will lower them to, if it does lower them to such a call. |
intrinsics: unify rint, roundeven, nearbyint in a single round_ties_even intrinsic LLVM has three intrinsics here that all do the same thing (when used in the default FP environment). There's no reason Rust needs to copy that historically-grown mess -- let's just have one intrinsic and leave it up to the LLVM backend to decide how to lower that. Suggested by `@hanna-kruppe` in rust-lang#136459; Cc `@tgross35`
intrinsics: unify rint, roundeven, nearbyint in a single round_ties_even intrinsic LLVM has three intrinsics here that all do the same thing (when used in the default FP environment). There's no reason Rust needs to copy that historically-grown mess -- let's just have one intrinsic and leave it up to the LLVM backend to decide how to lower that. Suggested by ``@hanna-kruppe`` in rust-lang#136459; Cc ``@tgross35``
intrinsics: unify rint, roundeven, nearbyint in a single round_ties_even intrinsic LLVM has three intrinsics here that all do the same thing (when used in the default FP environment). There's no reason Rust needs to copy that historically-grown mess -- let's just have one intrinsic and leave it up to the LLVM backend to decide how to lower that. Suggested by ```@hanna-kruppe``` in rust-lang#136459; Cc ```@tgross35```
intrinsics: unify rint, roundeven, nearbyint in a single round_ties_even intrinsic LLVM has three intrinsics here that all do the same thing (when used in the default FP environment). There's no reason Rust needs to copy that historically-grown mess -- let's just have one intrinsic and leave it up to the LLVM backend to decide how to lower that. Suggested by `@hanna-kruppe` in rust-lang#136459; Cc `@tgross35` try-job: test-various
intrinsics: unify rint, roundeven, nearbyint in a single round_ties_even intrinsic LLVM has three intrinsics here that all do the same thing (when used in the default FP environment). There's no reason Rust needs to copy that historically-grown mess -- let's just have one intrinsic and leave it up to the LLVM backend to decide how to lower that. Suggested by `@hanna-kruppe` in rust-lang#136459; Cc `@tgross35` try-job: test-various
intrinsics: unify rint, roundeven, nearbyint in a single round_ties_even intrinsic LLVM has three intrinsics here that all do the same thing (when used in the default FP environment). There's no reason Rust needs to copy that historically-grown mess -- let's just have one intrinsic and leave it up to the LLVM backend to decide how to lower that. Suggested by `@hanna-kruppe` in rust-lang#136459; Cc `@tgross35` try-job: test-various
Rollup merge of rust-lang#136543 - RalfJung:round-ties-even, r=tgross35 intrinsics: unify rint, roundeven, nearbyint in a single round_ties_even intrinsic LLVM has three intrinsics here that all do the same thing (when used in the default FP environment). There's no reason Rust needs to copy that historically-grown mess -- let's just have one intrinsic and leave it up to the LLVM backend to decide how to lower that. Suggested by `@hanna-kruppe` in rust-lang#136459; Cc `@tgross35` try-job: test-various
intrinsics: unify rint, roundeven, nearbyint in a single round_ties_even intrinsic LLVM has three intrinsics here that all do the same thing (when used in the default FP environment). There's no reason Rust needs to copy that historically-grown mess -- let's just have one intrinsic and leave it up to the LLVM backend to decide how to lower that. Suggested by `@hanna-kruppe` in rust-lang/rust#136459; Cc `@tgross35` try-job: test-various
intrinsics: unify rint, roundeven, nearbyint in a single round_ties_even intrinsic LLVM has three intrinsics here that all do the same thing (when used in the default FP environment). There's no reason Rust needs to copy that historically-grown mess -- let's just have one intrinsic and leave it up to the LLVM backend to decide how to lower that. Suggested by `@hanna-kruppe` in rust-lang/rust#136459; Cc `@tgross35` try-job: test-various
intrinsics: unify rint, roundeven, nearbyint in a single round_ties_even intrinsic LLVM has three intrinsics here that all do the same thing (when used in the default FP environment). There's no reason Rust needs to copy that historically-grown mess -- let's just have one intrinsic and leave it up to the LLVM backend to decide how to lower that. Suggested by `@hanna-kruppe` in rust-lang#136459; Cc `@tgross35` try-job: test-various
intrinsics: unify rint, roundeven, nearbyint in a single round_ties_even intrinsic LLVM has three intrinsics here that all do the same thing (when used in the default FP environment). There's no reason Rust needs to copy that historically-grown mess -- let's just have one intrinsic and leave it up to the LLVM backend to decide how to lower that. Suggested by `@hanna-kruppe` in rust-lang#136459; Cc `@tgross35` try-job: test-various
intrinsics: unify rint, roundeven, nearbyint in a single round_ties_even intrinsic LLVM has three intrinsics here that all do the same thing (when used in the default FP environment). There's no reason Rust needs to copy that historically-grown mess -- let's just have one intrinsic and leave it up to the LLVM backend to decide how to lower that. Suggested by `@hanna-kruppe` in rust-lang/rust#136459; Cc `@tgross35` try-job: test-various
C has three reasonably similar ways to round a single float to even. I'll just copy the docs here,
nearbyint
:rint
:And
roundeven
, which is available since C23:We currently use
rintf16
,rintf32
,rintf64
, andrintf128
(e.g. https://doc.rust-lang.org/std/intrinsics/fn.rintf32.html) as the intrinsics to implementround_ties_even
. These map to LLVM'srint
and similar:rust/compiler/rustc_codegen_llvm/src/intrinsic.rs
Lines 129 to 132 in 613bdd4
rint
is not exactly what we want here because it is supposed to raise inexact. We should change toroundeven
because it matches Rust's preference of ignoring rounding modes and never raising fp exceptions.This may be blocked on platform support, e.g. llvm/llvm-project#73588.that issue looks like it is about the vector version.Cc #55107.
The text was updated successfully, but these errors were encountered: