-
Notifications
You must be signed in to change notification settings - Fork 136
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Graviton2-b #330
base: master
Are you sure you want to change the base?
Graviton2-b #330
Conversation
#include <x86intrin.h> | ||
#if defined(__GNUC__) && defined(__aarch64__) | ||
#define SIMDE_ENABLE_NATIVE_ALIASES | ||
#include "../simde/x86/avx2.h" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for the PR.
Where is ../simde/x86/avx2.h
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also, why limit it to aarch64? ppc64* could probably be fixed by this as well.
Apologies, I forgot that bit. I could update the branch either via a git
submodule or local copy (from https://github.com/simd-everywhere/simde). Do
you have a preference?
…On Mon, Apr 4, 2022 at 9:48 PM Dinghua Li ***@***.***> wrote:
***@***.**** commented on this pull request.
------------------------------
In src/kmlib/kmrns.h
<#330 (comment)>:
> @@ -8,7 +8,12 @@
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>
-#include <x86intrin.h>
+#if defined(__GNUC__) && defined(__aarch64__)
+ #define SIMDE_ENABLE_NATIVE_ALIASES
+ #include "../simde/x86/avx2.h"
Thank you for the PR.
Where is ../simde/x86/avx2.h?
—
Reply to this email directly, view it on GitHub
<#330 (review)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAE7ERRJN45FKO3V6RIIABLVDPAYPANCNFSM5SRCFAEA>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
Found that with this PR
and change Line 268 in 0abdd48
to
should be sufficient to get it compiled. |
FYI. We have three Lines 613 to 630 in f8afe5d
If we have a good enough way to check POPCNT or BMI2 support on ARM we can revisit the POPCNT and BMI2 hardware acceleration options for ARM. |
@@ -5,6 +5,11 @@ | |||
#ifndef MEGAHIT_CPU_DISPATCH_H | |||
#define MEGAHIT_CPU_DISPATCH_H | |||
|
|||
#if defined(__GNUC__) && defined(__aarch64__) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Again, why only aarch64?
Another attempt at an aarch64 support. I made some changes to CMakelists.txt to prevent errors at config time. Added some ifdefs around the arch-specific parts so you should get identical x86_64 performance now and working binaries on aarch64 (only graviton2 tested so far).