An example using cuBLAS library in alpaka #2430

mehmetyusufoglu · 2024-11-22T11:08:01Z

This example uses cuBLAS library for matrix multiplication by using allocated alpaka buffers and alpaka queue. Another example is using rocBLAS library.

Cmake file is still to be changed for CI to not fail for other backends.

    auto alpakaStream = alpaka::getNativeHandle(queue);

    // cuBLAS setup from alpaka stream
    cublasHandle_t cublasHandle;
    cublasCreate(&cublasHandle);
    cublasSetStream(cublasHandle, alpakaStream);

    // Perform matrix multiplication: C = A * B
    float alpha = 1.0f, beta = 0.0f; // Set beta to 0.0f to overwrite C
    cublasSgemm(
        cublasHandle,
        CUBLAS_OP_N,
        CUBLAS_OP_N, // No transpose for A and B
        M,
        N,
        K, // Dimensions: C = A * B
        &alpha,
        alpaka::getPtrNative(bufDevA), M ...
    );
    alpaka::wait(queue); // Wait for multiplication to complete```

psychocoderHPC · 2024-11-25T11:53:31Z

example/useBLASInAlpaka/src/useBLASInAlpaka.cpp

+        N,
+        K, // Dimensions: C = A * B
+        &alpha,
+        alpaka::getPtrNative(bufDevA),


better use std::data() instead of alpaka::getPtrNative()

ok, done, thanks.

psychocoderHPC · 2024-11-25T11:55:20Z

example/useBLASInAlpaka/src/useBLASInAlpaka.cpp

+    Idx const K = 3; // Columns in A and rows in B
+
+    // Define device and queue
+    using Acc = alpaka::AccGpuCudaRt<Dim1D, Idx>;


Could you please use the CUDA tag and derive the ACC from the tag? THis will reduce the work as soon as we refactor the accelerators.

I used the standard tags like other examples, but prevented the configuration of this example at cmake if ACC_CUDA_ONLY cmake variable is not set.

Anyway i used cuda tag as you suggested. This example could have a direct main rather than using ExampleTags since only will run with single backend.

using Acc = alpaka::TagToAcc<alpaka::TagGpuCudaRt, Dim1D, Idx>;

@psychocoderHPC I agree with @mehmetyusufoglu . It does not make sense to use the same template, like in the other examples. The code can be only used with the CUDA backend. Therefore we need no complicated iteration over the enabled tags.

example/useCuBLASInAlpaka/CMakeLists.txt

mehmetyusufoglu · 2024-12-11T12:28:45Z

This PR is closed because the changes is added to #2433 since 2 PR's will share the same directory in examples directory. Rather than being 2 separate directories in examples.

mehmetyusufoglu marked this pull request as draft November 22, 2024 11:08

mehmetyusufoglu changed the title ~~[Wip] An example using BLAS library in alpaka~~ An example using BLAS library in alpaka Nov 22, 2024

mehmetyusufoglu force-pushed the exampleUsingCuBlas branch 2 times, most recently from ae8e4ed to a0dee21 Compare November 22, 2024 11:47

psychocoderHPC requested changes Nov 25, 2024

View reviewed changes

psychocoderHPC added this to the 2.0.0 milestone Nov 25, 2024

psychocoderHPC added the Type:Example label Nov 25, 2024

mehmetyusufoglu force-pushed the exampleUsingCuBlas branch 3 times, most recently from 07acce5 to 2b3ef4f Compare November 26, 2024 14:53

mehmetyusufoglu marked this pull request as ready for review November 26, 2024 14:53

mehmetyusufoglu force-pushed the exampleUsingCuBlas branch from 2b3ef4f to f920100 Compare November 26, 2024 14:56

mehmetyusufoglu changed the title ~~An example using BLAS library in alpaka~~ An example using cuBLAS library in alpaka Nov 28, 2024

mehmetyusufoglu force-pushed the exampleUsingCuBlas branch 2 times, most recently from 4f98208 to 3194bb3 Compare November 29, 2024 12:20

mehmetyusufoglu commented Nov 29, 2024

View reviewed changes

example/useCuBLASInAlpaka/CMakeLists.txt Outdated Show resolved Hide resolved

mehmetyusufoglu force-pushed the exampleUsingCuBlas branch from 3194bb3 to 4d2bd86 Compare December 2, 2024 10:34

Run cuBLAS functions from alpaka

c2dee1c

mehmetyusufoglu force-pushed the exampleUsingCuBlas branch from 4d2bd86 to c2dee1c Compare December 2, 2024 12:22

mehmetyusufoglu closed this Dec 11, 2024

mehmetyusufoglu mentioned this pull request Dec 11, 2024

an example using rocBLAS and cuBLAS in alpaka code #2433

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

An example using cuBLAS library in alpaka #2430

An example using cuBLAS library in alpaka #2430

mehmetyusufoglu commented Nov 22, 2024 •

edited

Loading

psychocoderHPC Nov 25, 2024

mehmetyusufoglu Nov 26, 2024

psychocoderHPC Nov 25, 2024

mehmetyusufoglu Nov 26, 2024

mehmetyusufoglu Nov 28, 2024

SimeonEhrig Dec 2, 2024

mehmetyusufoglu commented Dec 11, 2024

An example using cuBLAS library in alpaka #2430

An example using cuBLAS library in alpaka #2430

Conversation

mehmetyusufoglu commented Nov 22, 2024 • edited Loading

psychocoderHPC Nov 25, 2024

Choose a reason for hiding this comment

mehmetyusufoglu Nov 26, 2024

Choose a reason for hiding this comment

psychocoderHPC Nov 25, 2024

Choose a reason for hiding this comment

mehmetyusufoglu Nov 26, 2024

Choose a reason for hiding this comment

mehmetyusufoglu Nov 28, 2024

Choose a reason for hiding this comment

SimeonEhrig Dec 2, 2024

Choose a reason for hiding this comment

mehmetyusufoglu commented Dec 11, 2024

mehmetyusufoglu commented Nov 22, 2024 •

edited

Loading