More simd/Neon functions #950

Merged
merged 8 commits into from
Nov 2, 2019

amaury-ml

Continuing the Neon support work. This PR adds simple geometric functions that Neon can help make faster.
I'm not expecting a large performance improvement from this PR alone, but avoiding constant round trips between scalar and vector code for simple arithmetic functions helps the overall performance of glm on Arm cores.
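
As a rough illustration of the kind of geometric function meant here, the following is a minimal sketch of a 4-component dot product using Neon intrinsics, with a scalar fallback so it also builds on non-Arm targets. The name `example_dot4` is hypothetical and is not taken from this PR or from glm's API; the actual implementation in the PR may differ.

```cpp
#if defined(__ARM_NEON)
#   include <arm_neon.h>
#endif

// Hypothetical helper for illustration only; not glm's API.
// Computes a[0]*b[0] + a[1]*b[1] + a[2]*b[2] + a[3]*b[3].
inline float example_dot4(const float* a, const float* b)
{
#if defined(__ARM_NEON)
    // Load both vectors and multiply lane by lane, all in vector registers.
    float32x4_t va   = vld1q_f32(a);
    float32x4_t vb   = vld1q_f32(b);
    float32x4_t prod = vmulq_f32(va, vb);
#   if defined(__aarch64__)
    // AArch64 has a horizontal add across all four lanes.
    return vaddvq_f32(prod);
#   else
    // 32-bit Arm: add the two halves, then add the remaining pair.
    float32x2_t sum = vadd_f32(vget_low_f32(prod), vget_high_f32(prod));
    sum = vpadd_f32(sum, sum);
    return vget_lane_f32(sum, 0);
#   endif
#else
    // Scalar fallback when Neon is not available.
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2] + a[3] * b[3];
#endif
}
```

The only transfer from vector to scalar registers happens once, at the end, which is the kind of round trip the description above is trying to keep to a minimum.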

Amaury Le Leyzour and others added 8 commits September 10, 2019 10:44
__aarch64__ is the correct macro; __arm64__ is the one used on iOS.
But detecting the memory model does not need any of these architecture defines at all.
__LP64__ means long and pointer are 64 bits, and __ILP32__ means int, long and pointer are 32 bits.
That is enough, and it avoids errors such as writing __arch64__ (the correct spelling is __aarch64__) or forgetting __arm64__ (iOS).
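
The idea in this commit message can be sketched as follows. This is only an illustration under stated assumptions: the macro `EXAMPLE_MODEL_BITS` and the helper function are hypothetical names, not glm's actual defines, and the check assumes 8-bit bytes.

```cpp
// Hypothetical macro name for illustration only; not glm's actual define.
// No architecture macro (__aarch64__, __arm64__, ...) is consulted.
#if defined(__LP64__)
#   define EXAMPLE_MODEL_BITS 64   // long and pointers are 64 bits
#elif defined(__ILP32__)
#   define EXAMPLE_MODEL_BITS 32   // int, long and pointers are 32 bits
#else
#   define EXAMPLE_MODEL_BITS 0    // unknown: the compiler defines neither macro
#endif

// Returns true when the detected model agrees with the real pointer width,
// or when the model could not be detected at all (assumes 8-bit bytes).
inline bool example_model_matches_pointer_size()
{
    return EXAMPLE_MODEL_BITS == 0
        || EXAMPLE_MODEL_BITS == static_cast<int>(sizeof(void*) * 8);
}
```

Because the data-model macros describe the memory model directly, there is no architecture name to misspell or leave out.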
@amaury-ml
Author

Any chance this could go in in the near future?

@Groovounet Groovounet self-assigned this Nov 2, 2019
@Groovounet Groovounet added this to the GLM 0.9.9 milestone Nov 2, 2019
@Groovounet Groovounet merged commit c11944c into g-truc:master Nov 2, 2019
@Groovounet
Member

One test was red, but that was just test platform instability.

Thanks for contributing!
Christophe
