ci : add cublas to windows release #1271

Green-Sky · 2023-05-02T00:10:02Z

This adds a cuBLAS build to the windows ci.

edit: did a test release, please test https://github.com/ggerganov/llama.cpp/releases/tag/ci_cublas-31ff9e2

Open questions:

~~the cuda dll's are huge, should we ship them? they also don't change (often)~~
- it generates a separate .zip with just the cuda dlls
~~do we need the blasLt dll? ironically named "lite", it's the largest dll (>400mb)~~
- yes
the toolkit install takes ages. I tried using only a select install, but that never worked.
since it takes ages, maybe not require it for merge.
~~which cuda version to use. I set it to 12.1, but that requires the very latest driver.~~
- I decided on both 11.7.1 and 12.1.0
should we enable shared for all builds, so we distribute the .dll
other stuff i forgot, since the turn around time is >20min, it's a real hell to debug.

eula allowing redist https://docs.nvidia.com/cuda/eula/index.html#attachment-a

slaren · 2023-05-02T00:21:07Z

If GitHub can handle the big releases, it is probably better to distribute the DLLs than to ask users to install the CUDA toolkit. Maybe also include an additional artifact that doesn't include the DLLs so that people who already have them don't need to download them again?

slaren · 2023-05-02T00:28:14Z

do we need the blasLt dll? ironically named "lite", it's the largest dll (>400mb)

I think so, the cublas DLL depends on cublaslt:

Green-Sky · 2023-05-02T00:40:48Z

Maybe also include an additional artifact that doesn't include the DLLs so that people who already have them don't need to download them again?

sounds good

Green-Sky · 2023-05-03T19:03:30Z

performing a manual release here https://github.com/ggerganov/llama.cpp/actions/runs/4875560144

edit: good i tested this, i forgot a depends

Green-Sky · 2023-05-03T20:27:44Z

@slaren can you try the ci binaries here https://github.com/ggerganov/llama.cpp/releases/tag/ci_cublas-45d94c8 ?

Green-Sky · 2023-05-03T20:36:56Z

Looks like no one is running a cuda 12 capable driver 🤣 , gonna add 11.7

slaren · 2023-05-03T21:32:15Z

These binaries work well for me! No issues.

Green-Sky · 2023-05-04T00:03:48Z

@slaren can you try the 11.7 binaries too? https://github.com/ggerganov/llama.cpp/releases/tag/ci_cublas-31ff9e2

slaren · 2023-05-04T00:11:21Z

Also works, but as expected I need to download the DLLs as well, as my CUDA toolkit is 12.1. The 12.1 binaries work without the DLL.

slaren · 2023-05-04T18:14:47Z

since it takes ages, maybe not require it for merge.

IMO it is fine as is, but we could consider running the CI tests for cuBLAS only when ggml-cuda.h/cu have been modified. However, that may not achieve much beyond testing that the builds completes until we have a model small enough to use in CI tests.

Green-Sky · 2023-05-04T18:46:00Z

but we could consider running the CI tests for cuBLAS only when ggml-cuda.h/cu have been modified.

afaik we cant run cuda at all in the ci. 🙂
its the install of the cuda toolkit that takes for ever.

slaren · 2023-05-04T18:57:17Z

Yeah of course, I don't know what I was thinking. Checking that the build completes on Windows can still be useful, usually I do my testing under Linux/WSL2. It may also help find issues with the CUDA 11.7 build.

Green-Sky · 2023-05-04T19:00:58Z

My plan was to do the linux side next and provide ubuntu focal based binaries. :)

Green-Sky · 2023-05-05T20:56:34Z

let's see if someone complains, doing linux next.

Green-Sky · 2023-05-05T20:58:57Z

should have done linux first, it's an order of magnitude faster...
https://github.com/Green-Sky/llama.cpp/releases/tag/ci_cublas_linux-f1758f6

Green-Sky added build Compilation issues windows Issues specific to Windows labels May 2, 2023

Green-Sky mentioned this pull request May 2, 2023

cuBLAS - windows - static not compiling #1092

Closed

Green-Sky marked this pull request as ready for review May 2, 2023 00:44

sw mentioned this pull request May 2, 2023

CI: add Windows CLBlast and OpenBLAS builds #1277

Merged

Green-Sky mentioned this pull request May 3, 2023

Add cuBLAS build workflow and fix error causing lines in CMakeLists ggml-org/whisper.cpp#867

Merged

Green-Sky force-pushed the ci_cublas branch 6 times, most recently from eb69d98 to 44286d3 Compare May 3, 2023 18:31

Green-Sky force-pushed the ci_cublas branch from 44286d3 to 45d94c8 Compare May 3, 2023 19:11

Green-Sky force-pushed the ci_cublas branch from 45d94c8 to 0e38458 Compare May 3, 2023 20:39

ci : add cublas to windows release

31ff9e2

Green-Sky force-pushed the ci_cublas branch from 0e38458 to 31ff9e2 Compare May 3, 2023 21:21

slaren approved these changes May 5, 2023

View reviewed changes

Green-Sky merged commit a3b85b2 into ggml-org:master May 5, 2023

KerfuffleV2 pushed a commit to KerfuffleV2/llama.cpp that referenced this pull request May 6, 2023

ci : add cublas to windows release (ggml-org#1271)

1413a69

Green-Sky deleted the ci_cublas branch May 15, 2023 14:32

Bearsaerker mentioned this pull request Mar 12, 2025

Eval bug: Gemma 3 extremly slow prompt processing when using quantized kv cache. #12352

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci : add cublas to windows release #1271

ci : add cublas to windows release #1271

Green-Sky commented May 2, 2023 •

edited

Loading

slaren commented May 2, 2023

slaren commented May 2, 2023

Green-Sky commented May 2, 2023

Green-Sky commented May 3, 2023 •

edited

Loading

Green-Sky commented May 3, 2023

Green-Sky commented May 3, 2023

slaren commented May 3, 2023

Green-Sky commented May 4, 2023

slaren commented May 4, 2023

slaren commented May 4, 2023

Green-Sky commented May 4, 2023

slaren commented May 4, 2023 •

edited

Loading

Green-Sky commented May 4, 2023

Green-Sky commented May 5, 2023

Green-Sky commented May 5, 2023

ci : add cublas to windows release #1271

ci : add cublas to windows release #1271

Conversation

Green-Sky commented May 2, 2023 • edited Loading

slaren commented May 2, 2023

slaren commented May 2, 2023

Green-Sky commented May 2, 2023

Green-Sky commented May 3, 2023 • edited Loading

Green-Sky commented May 3, 2023

Green-Sky commented May 3, 2023

slaren commented May 3, 2023

Green-Sky commented May 4, 2023

slaren commented May 4, 2023

slaren commented May 4, 2023

Green-Sky commented May 4, 2023

slaren commented May 4, 2023 • edited Loading

Green-Sky commented May 4, 2023

Green-Sky commented May 5, 2023

Green-Sky commented May 5, 2023

Green-Sky commented May 2, 2023 •

edited

Loading

Green-Sky commented May 3, 2023 •

edited

Loading

slaren commented May 4, 2023 •

edited

Loading