openblas for ios failing with libopenblas.a #571
Comments
Does this happen with
This exact test fails with ios-x86_64.
It's strange that Xcode would have problems with x86 assembly. In any case, the optimized kernels for ARM64 were already disabled because the assembler doesn't work with them, so that should be OK.
The latest commit disables the assembly-language optimizations. Let me know if you get the chance to try it out with version 0.3.0-1.4.2-SNAPSHOT. Thanks!
The changes are included in version 1.4.2. Thanks for reporting, and let me know if you still see any issues related to this!
When an application is bundled with the Accelerate framework, Nd4j and its BLAS calls work fine. When it is bundled with libopenblas.a, the results are incorrect. For example, the resulting INDArray below contains only zeros.
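import org.nd4j.linalg.api.ndarray.INDArray;
import org.nd4j.linalg.factory.Nd4j;

// 1x2 input, filled by linear index
INDArray unin = Nd4j.create(1, 2);
unin.putScalar(0, 0d);
unin.putScalar(1, 1d);
// 2x3 weights, filled by linear index
INDArray unw = Nd4j.create(2, 3);
unw.putScalar(0, 1d);
unw.putScalar(1, 2d);
unw.putScalar(2, 1d);
unw.putScalar(3, 3d);
unw.putScalar(4, 1d);
unw.putScalar(5, 4d);
// Preallocated 1x3 result; mmuli writes the product of unin and unw into it
INDArray unret = Nd4j.createUninitialized(1, 3);
unin.mmuli(unw, unret);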
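For comparison, here is a minimal plain-Java sketch of the product the snippet above should yield. It is not part of the original report. It assumes the linear indices passed to putScalar fill the arrays in row-major (C) order, so that unin is [[0, 1]] and unw is [[1, 2, 1], [3, 1, 4]]. The class name ExpectedMmul and its multiply helper are hypothetical and do not come from Nd4j.

```java
import java.util.Arrays;

// Hypothetical helper, not part of Nd4j: a reference matrix product
// to compare against what mmuli writes into unret.
class ExpectedMmul {
    // Multiplies an (n x k) matrix by a (k x m) matrix with plain loops.
    static double[][] multiply(double[][] a, double[][] b) {
        int n = a.length, k = b.length, m = b[0].length;
        double[][] out = new double[n][m];
        for (int i = 0; i < n; i++) {
            for (int j = 0; j < m; j++) {
                for (int p = 0; p < k; p++) {
                    out[i][j] += a[i][p] * b[p][j];
                }
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Assumed row-major layout of the arrays built in the report.
        double[][] unin = {{0, 1}};
        double[][] unw = {{1, 2, 1}, {3, 1, 4}};
        // prints [[3.0, 1.0, 4.0]]
        System.out.println(Arrays.deepToString(multiply(unin, unw)));
    }
}
```

Under that layout assumption, a correct BLAS backend would leave the values 3, 1, 4 in unret, so an all-zero result points at the gemm call rather than at the input setup.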