-
Notifications
You must be signed in to change notification settings - Fork 286
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add support for feature detection on FreeBSD/aarch64 (WIP) #611
Conversation
How old are we talking regarding old oses? Trapping signals is quite bad for libraries. |
@lu-zero well, older than FreeBSD 12 (which is currently in release candidate phase). The number of people who use Rust + FreeBSD + AArch64 is currently quite small, especially considering that rustc currently doesn't build on FreeBSD/aarch64 without a very recent patch that hasn't landed yet :) So it should be fine to ignore that. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for this! I'm trying to verify the implementation against online docs but I'm getting lost pretty quickly unfortunately. This looks though like it's based on another implementation perhaps?
I’d like to review this with calm tomorrow afternoon if it isn’t very
pressing.
w.r.t. signal-handling-based detection, there were some intrinsic issues
with this (e.g. how reliable is it that the proper exception will be raised
on a CPU that “doesn’t know” the instruction being executed?). However, in
some OSes the alternatives aren’t clear (e.g. windows), as long as “it
works for you” and doesn’t break anything I think it is something worth
getting experience with.
Would it be possible to get CI set up for aarch64-FreeBSD?
|
Of course :)
Every aarch64 CPU knows I can see how trying to detect the SIMD/etc instructions themselves via signal handling can be unreliable, yes.
I have a public buildbot.. |
Update: no need to read the values on different cores, the kernel already presents the same view of the registers on all cores https://reviews.freebsd.org/D17137#393947 So this is good as is, pretty much. I can add some code comments about this I guess. |
FWIW I personally think that we shouldn't try to trap instructions here, if FreeBSD handles this and "makes the instruction work" that's fine by me and if we're not compatible with older versions that's probably ok for now too |
stdsimd/arch/detect/mod.rs
Outdated
cfg_if! { | ||
if #[cfg(all(target_arch = "aarch64", any(target_os = "freebsd")))] { | ||
#[path = "os/aarch64.rs"] | ||
mod aarch64; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think I would prefer to move this into the cfg_if!
below:
cfg_if! {
...
} else if #[cfg(target_os = "freebsd")] {
#[cfg(target_arch = "aarch64")]
#[path = ...]
mod aarch64;
#[path = "os/freebsd/mod.rs"]
mod os;
}
...
}
When we need to use aarch64
from somewhere else, we can figure out how to do that, but there is no need to do so here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice work @myfreeweb, I just have a few minor comments.
So I think this is a good idea. We do the same for
IIRC
On Linux, we ask the OS "What is this process allowed to do?" via the OS interfaces (e.g. auxiliary vectors). If Linux decides to migrate the process to a different core, it has to make sure that the core is at least as capable as the one we are running on because we can cache the answer to the question. If the FreeBSD way of querying this information is via the When the OS fails, all bets are off anyways. For example, see A tale of big.LITTLE gone wrong, where Samsung Exynos aarch64 LITTLE cores had atomic support but the big cores did not (usually big cores are "better" than little cores, and the OS assumed that all SoCs were like this). So the Apps would start in LITTLE cores, do run-time feature detection, detect that they can use atomics, cache that, start using them, and some time after the OS would move them to a big core and the process would SIGILL because the big cores did not support atomics. Working around "bugs" like these here feels like trying to do too much: we'd need OS APIs that allow us to query the set of potential cores the process might be moved to, APIs to query the features of these cores, and then select the minimum subset which the process supports, which might be very sub-optimal if the OS never decides to move the process to a different core.
What's the recommended FreeBSD way of handling this ? We don't support Windows+aarch64 but the recommended approach to do run-time feature detection there is to "try to use an intrinsic, and use SEH to handle the error if its not available" (see #120 (comment)) . I agree with @lu-zero and @alexcrichton that we should avoid this if possible. |
Yes this works on Linux now too, since version 4.11. Whether MRS is privileged depends on what register is being read, i.e., only ones ending with '_EL0' can possibly be read directly from userspace. EL1 ones like these cause an exception and must be emulated by the kernel if it wants (normally it will abort the process with SIGILL or similar). |
Cool! We should explore using this in Linux then, it is way easier than using auxiliary vectors. |
Yep, the emulated msr only returns features supported by all cores. (Looks like you replied before I updated the original post.)
That's where I linked with the "weird SoCs where cores have unequal features" text :)
Similar to Windows, I guess. For an API that's literally "run a privileged instruction and expect it to be emulated", not much you can do other than trapping SIGILL. But we can just ignore this. The FreeBSD/aarch64 userbase is small enough that "just upgrade your OS to >=12" is fine :) |
We can leave adding support for FreeBSD == 11 to "future work". If someone complains, I'd guess we can just disable this with a config flag or by querying the FreeBSD version somehow later. This is looking good, just a couple of nits open (don't forget to run |
e07fb5d
to
b426542
Compare
b426542
to
1f3885b
Compare
Updated with comments and suggested
The CI rustfmt failure is:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Thank you a lot @myfreeweb ! |
On AArch64, FreeBSD does not currently provide
HWCAP
in the ELF auxiliary vector.Reads of the ID registers are trapped though, and they are accessible in userspace.
TODO: The OS-dependent code should trapI guess it's fine™ to expect that to work.SIGILL
before calling the reading function, in order to work on older OS versions that didn't expose the registers.TODO: The OS-dependent code should run the reading function on all cores (via thread affinity) and take a union of features, because there are weird SoCs where cores have unequal features.On FreeBSD, the register view is the same on each coreReferences: