[BLAS][portBLAS] Add bindings for half and some gemm_batch group APIs #576

Rbiessy · 2024-09-30T16:00:51Z

Description

Add missing features needed to run llama.cpp with oneMKL+portBLAS:

Add an option to enable sycl::half bindings
Add some missing bindings for batch_gemm
Avoid some warnings in the tests

Checklist

All Submissions

Do all unit tests pass locally? Log using the CT tests and examples: test_log_portblas_example.txt test_log_portblas_ct.txt. The tests are currently very slow so not all the RT tests were run.
Have you formatted the code using clang-format?

Rbiessy · 2024-09-30T16:06:34Z

Hello @al3x-jp, I have been told you are keen to test oneMKL+portBLAS with llama.cpp. This branch should enable everything that is needed. Would you be able to try it and let us know if you run into any issue?
You should just need to add -DENABLE_PORTBLAS_BACKEND=ON -DPORTBLAS_ENABLE_HALF=ON when compiling oneMKL Interface.

s-Nick · 2024-10-01T08:26:10Z

src/CMakeLists.txt

@@ -45,6 +45,11 @@ foreach(domain ${TARGET_DOMAINS})
  add_subdirectory(${domain})
 endforeach()

+if (PORTBLAS_ENABLE_HALF)
+  # Set the variable used for C++ macro
+  set(ENABLE_PORTBLAS_HALF ON)


According to #554 and PR #571 this var should be named with the prefix ONEAPI_ONEMKL_. Could you update it?
I know it would be the only one named correctly now, but I hope that PR will be merged soon.

Yes I was planning to update this PR once #571 is merged. It will be easier to do once I can add ONEAPI_ONEMKL_ENABLE_PORTBLAS_HALF to the list you introduce in https://github.com/oneapi-src/oneMKL/pull/571/files#diff-148715d6ea0c0ea0a346af3f6bd610d010d490eca35ac6a9b408748f7ca9e3f4R54

s-Nick · 2024-10-01T08:26:15Z

src/blas/backends/portblas/portblas_batch.cxx

@@ -695,7 +710,7 @@ sycl::event gemm_batch(sycl::queue &queue, oneapi::mkl::transpose *transa,
                       const double **b, std::int64_t *ldb, double *beta, double **c,
                       std::int64_t *ldc, std::int64_t group_count, std::int64_t *group_size,
                       const std::vector<sycl::event> &dependencies) {
-    throw unimplemented("blas", "gemm_batch", " for USM");
+    throw unimplemented("blas", "gemm_batch", " for USM using double");


Could you explain me why it doesn't work with double? I thought that in portBLAS if it works with float it works with double. Did you test it?

It can probably be added, I'm looking this. I have found that the gemm_batch tests only use group_count=5. I need to test this with group_count=1.

s-Nick · 2024-10-01T08:26:30Z

src/blas/backends/portblas/portblas_level3_half.cpp

 // BUFFER
 void gemm(sycl::queue &queue, oneapi::mkl::transpose transa, oneapi::mkl::transpose transb,
          std::int64_t m, std::int64_t n, std::int64_t k, sycl::half alpha,
          sycl::buffer<sycl::half, 1> &a, std::int64_t lda, sycl::buffer<sycl::half, 1> &b,
          std::int64_t ldb, sycl::half beta, sycl::buffer<sycl::half, 1> &c, std::int64_t ldc) {
+#ifdef ENABLE_PORTBLAS_HALF
+    CALL_PORTBLAS_FN(::blas::_gemm, queue, transa, transb, m, n, k, alpha, a, lda, b, ldb, beta, c,
+                     ldc);


Currently, we don't support any row major operator in portBLAS, so they are generally not enabled.
Even if it works now, unless it is necessary, I would not enable it to avoid the confusion of having only one row major operator supported.

There are already checks that throw unimplemented when row_major is used here: https://github.com/oneapi-src/oneMKL/blob/develop/src/blas/backends/portblas/portblas_common.hpp#L193
All the other functions have already their bindings ready for row_major because they use this pattern: https://github.com/oneapi-src/oneMKL/blob/develop/src/blas/backends/portblas/portblas_level3_float.cpp#L54
Just portblas_level3_half.cpp looks a bit different

Rbiessy · 2024-11-07T13:54:36Z

Closing as I don't think there is enough interest right now.

[BLAS][portBLAS] Add bindings for half and some gemm_batch group APIs

3f683b3

s-Nick reviewed Oct 1, 2024

View reviewed changes

Rbiessy added 2 commits October 1, 2024 13:35

Add support for double gemm_batch

e0e23d0

Test group API with group_count=1

783f000

Rbiessy closed this Nov 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BLAS][portBLAS] Add bindings for half and some gemm_batch group APIs #576

[BLAS][portBLAS] Add bindings for half and some gemm_batch group APIs #576

Rbiessy commented Sep 30, 2024

Rbiessy commented Sep 30, 2024 •

edited

Loading

s-Nick Oct 1, 2024

Rbiessy Oct 1, 2024

s-Nick Oct 1, 2024

Rbiessy Oct 1, 2024

s-Nick Oct 1, 2024

Rbiessy Oct 1, 2024

Rbiessy commented Nov 7, 2024

[BLAS][portBLAS] Add bindings for half and some gemm_batch group APIs #576

[BLAS][portBLAS] Add bindings for half and some gemm_batch group APIs #576

Conversation

Rbiessy commented Sep 30, 2024

Description

Checklist

All Submissions

Rbiessy commented Sep 30, 2024 • edited Loading

s-Nick Oct 1, 2024

Choose a reason for hiding this comment

Rbiessy Oct 1, 2024

Choose a reason for hiding this comment

s-Nick Oct 1, 2024

Choose a reason for hiding this comment

Rbiessy Oct 1, 2024

Choose a reason for hiding this comment

s-Nick Oct 1, 2024

Choose a reason for hiding this comment

Rbiessy Oct 1, 2024

Choose a reason for hiding this comment

Rbiessy commented Nov 7, 2024

Rbiessy commented Sep 30, 2024 •

edited

Loading