
Fix for issue #876 (#1012)

Merged 1 commit into ggerganov:master on Jun 25, 2023
Conversation

burningion (Contributor)

For the following issue:

#876 (comment)

GPU support won't build without changing the architecture flag from `native` to `all`. Confirmed working with a 4090.
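For reference, a minimal sketch of the kind of nvcc invocation at stake (file names are illustrative; the real build goes through the whisper.cpp Makefile):

```sh
# -arch=native asks nvcc to detect the local GPU; it fails on some
# driver/toolkit combinations:
nvcc --forward-unknown-to-host-compiler -arch=native -c ggml-cuda.cu -o ggml-cuda.o

# -arch=all embeds code for every architecture the toolkit supports,
# so it always builds, at the cost of a larger binary:
nvcc --forward-unknown-to-host-compiler -arch=all -c ggml-cuda.cu -o ggml-cuda.o
```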

@ggerganov (Owner) left a comment

Do we know that `all` does not reduce the performance compared to `native`?

@ggerganov ggerganov merged commit 207a12f into ggerganov:master Jun 25, 2023
@byte-6174 (Contributor)
It most certainly would in some sense. For example, when I compile with `all` the binary is 1.5 MB, vs. 812 KB when I give the exact architecture I am compiling for.
Specifying the exact architecture is the best way to get the most optimized code for it, since it does not embed intermediate PTX code. With `all`, nvcc generates PTX and postpones final compilation to runtime. NVIDIA notes this leads to app startup delays. It can be mitigated by caching, however.
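The startup delay comes from JIT-compiling the embedded PTX for the local GPU on first launch. A sketch of the documented NVIDIA environment variables that control the JIT cache (defaults may vary by CUDA version):

```sh
# JIT-compiled PTX is cached on disk, so only the first run pays the
# startup penalty:
export CUDA_CACHE_PATH="$HOME/.nv/ComputeCache"  # default cache location on Linux
export CUDA_CACHE_MAXSIZE=1073741824             # allow the cache to grow to 1 GiB
# CUDA_CACHE_DISABLE=1 turns the cache off and forces a JIT on every run.
```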

@byte-6174 (Contributor)
For example, I just ran a binary; the load time for a model (models/ggml-base.en.bin) was:

-gpu-architecture=all:
whisper_print_timings: load time = 4452.92 ms

vs
-gpu-architecture=sm_72:
whisper_print_timings: load time = 224.77 ms
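As a side note for anyone reproducing this comparison: on a recent NVIDIA driver, the matching `sm_XX` value for the local GPU can be queried like so (a sketch; the `compute_cap` query field requires a newer nvidia-smi):

```sh
# Prints the local GPU's compute capability, e.g. "7.2" -> -arch=sm_72,
# or "8.9" (RTX 4090) -> -arch=sm_89:
nvidia-smi --query-gpu=compute_cap --format=csv,noheader
```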

@ggerganov (Owner)

So, is it worth switching to `all`?
Maybe keep `native` by default and only use `all` when a certain build option is set.

@byte-6174 (Contributor)

Yes, CUDA optimizations are highly specific to the hardware. `all` will allow compilation without errors, but produces a suboptimal binary,
whereas `native` only works for some architecture/CUDA version combinations, but when it works it produces the best binary 😄

In that regard, I like the above suggestion. However, I'm reading in other comments that for some arch/CUDA combos neither works :( and one has to specify the exact architecture, like sm_72 etc.
So the Makefile needs some fine-tuning when it comes to CUDA. A sketch of the default-with-override idea is below.
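A minimal sketch of that idea, relying on the standard make rule that command-line variable assignments override the Makefile's own definitions (`CUDA_ARCH` is a hypothetical name, not an existing whisper.cpp option; `WHISPER_CUBLAS` is the CUDA build flag assumed here):

```sh
#!/bin/sh
# Default to native for the best binary; override with CUDA_ARCH=all
# or CUDA_ARCH=sm_XX on toolchains where native fails.
CUDA_ARCH="${CUDA_ARCH:-native}"

# Command-line variables override the Makefile's own NVCCFLAGS:
make WHISPER_CUBLAS=1 \
     NVCCFLAGS="--forward-unknown-to-host-compiler -arch=${CUDA_ARCH}"
```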

@FerLuisxd commented Jul 8, 2023

It doesn't work on a 2080 with CUDA 12.1.
See #1082
