Description
Hello Xianyi !!! Hope you're doing well.
I am actually using Openblas for my neural network application on ARM.
Openblas builds without any issues on CortexA15.
However, when I run my application , it always crashes with seg fault.
When I inspect the core on GDB , I get the following info:
Program terminated with signal 11, Segmentation fault.
#0 0x00194dc0 in axpy_kernel_S4 ()
(gdb) where
#0 0x00194dc0 in axpy_kernel_S4 ()
#1 0x00167340 in cblas_saxpy ()
#2 0xb63946d0 in ?? ()
#3 0xb63946d0 in ?? ()
Backtrace stopped: previous frame identical to this frame (corrupt stack?)
FYI : my arm board has 4 cortex A15 cores and vfpv3,vpfv4 and neon are all supported.
I compiled Openblas with the following options supported by my toolchain:
-marm -mfpu=vfpv3 -mfloat-abi=softfp
Can you please look into the matter , as it is becoming very difficult for me to debug the assembly code.