Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bump package version + bump openblas version to 0.3.22 so that building on avx512 platforms pass #103

Closed
wants to merge 1 commit into from

Conversation

chebbyChefNEQ
Copy link

@chebbyChefNEQ chebbyChefNEQ commented May 20, 2023

CPU: 7950X
OS: Ubuntu 22.04

build was failing on:

  In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:49,
                   from ../kernel/x86_64/../arm/../simd/intrin.h:51,
                   from ../kernel/x86_64/../arm/sum.c:33:
  /usr/lib/gcc/x86_64-linux-gnu/11/include/avx512fintrin.h:4517:1: error: inlining failed in call to ‘always_inline’ ‘_mm512_shuffle_f32x4’: target specific option mismatch
   4517 | _mm512_shuffle_f32x4 (__m512 __A, __m512 __B, const int __imm)
        | ^~~~~~~~~~~~~~~~~~~~
  In file included from ../kernel/x86_64/../arm/../simd/intrin.h:61,
                   from ../kernel/x86_64/../arm/sum.c:33:
  ../kernel/x86_64/../arm/../simd/intrin_avx512.h:29:18: note: called from here
     29 |     __m512 h32 = _mm512_shuffle_f32x4(sum32, sum32, _MM_SHUFFLE(1, 0, 3, 2));
        |                  ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:49,
                   from ../kernel/x86_64/../arm/../simd/intrin.h:51,
                   from ../kernel/x86_64/../arm/sum.c:33:
  /usr/lib/gcc/x86_64-linux-gnu/11/include/avx512fintrin.h:12520:1: error: inlining failed in call to ‘always_inline’ ‘_mm512_add_ps’: target specific option mismatch
  12520 | _mm512_add_ps (__m512 __A, __m512 __B)
        | ^~~~~~~~~~~~~
  In file included from ../kernel/x86_64/../arm/../simd/intrin.h:61,
                   from ../kernel/x86_64/../arm/sum.c:33:
  ../kernel/x86_64/../arm/../simd/intrin_avx512.h:28:20: note: called from here
     28 |     __m512 sum32 = _mm512_add_ps(a, h64);
        |                    ^~~~~~~~~~~~~~~~~~~~~
  In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:49,
                   from ../kernel/x86_64/../arm/../simd/intrin.h:51,
                   from ../kernel/x86_64/../arm/sum.c:33:
  /usr/lib/gcc/x86_64-linux-gnu/11/include/avx512fintrin.h:4517:1: error: inlining failed in call to ‘always_inline’ ‘_mm512_shuffle_f32x4’: target specific option mismatch
   4517 | _mm512_shuffle_f32x4 (__m512 __A, __m512 __B, const int __imm)
        | ^~~~~~~~~~~~~~~~~~~~
  In file included from ../kernel/x86_64/../arm/../simd/intrin.h:61,
                   from ../kernel/x86_64/../arm/sum.c:33:
  ../kernel/x86_64/../arm/../simd/intrin_avx512.h:27:18: note: called from here
     27 |     __m512 h64 = _mm512_shuffle_f32x4(a, a, _MM_SHUFFLE(3, 2, 3, 2));
        |                  ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  make[1]: *** [Makefile.L1:699: ssum_k.o] Error 1
  make[1]: *** Waiting for unfinished jobs....
  ../kernel/x86_64/omatcopy_rt.c: In function ‘somatcopy_k_rt’:
  ../kernel/x86_64/omatcopy_rt.c:130:3: warning: ‘dst_tmp’ may be used uninitialized in this function [-Wmaybe-uninitialized]
    130 |   __asm__ __volatile__(\
        |   ^~~~~~~
  ../kernel/x86_64/omatcopy_rt.c:145:22: note: ‘dst_tmp’ was declared here
    145 |   float *src, *dst, *dst_tmp, *src_base, *dst_base;
        |                      ^~~~~~~
  make: *** [Makefile:177: libs] Error 1

I checked out https://github.com/xianyi/OpenBLAS on v0.3.21 and got the same error with make. Once I bumped to v0.3.22 the build passes. This smell like a bug in OpenBLAS. I'll dig around and see if there was an issue opened on this

@termoshtt
Copy link
Member

OpenBLAS update is done in #108 . Thanks PR.

@termoshtt termoshtt closed this Jan 20, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

2 participants