Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feat/vulkan #170

Merged
merged 6 commits into from
Sep 13, 2024
Merged

Feat/vulkan #170

merged 6 commits into from
Sep 13, 2024

Conversation

thewh1teagle
Copy link
Contributor

@thewh1teagle thewh1teagle commented Aug 18, 2024

Add support for vulkan backend in latest whisper.cpp.

Depends on #169
Improve version of #168


Also changed whisper.cpp commit to a bit older commit since latest whisper.cpp is broken with vulkan.


Tested on Windows 11. It works fast.

arizhih and others added 6 commits August 8, 2024 15:55
@newfla newfla mentioned this pull request Sep 13, 2024
@tazz4843 tazz4843 merged commit d8b6aef into tazz4843:master Sep 13, 2024
13 checks passed
@tazz4843
Copy link
Owner

Going to make one final release with OpenCL and then another major release with Vulkan, as I found a 80% performance loss in my testing when I tried Vulkan.

@thewh1teagle
Copy link
Contributor Author

as I found a 80% performance loss in my testing when I tried Vulkan

What specific performance do you mean? the speed or the quality? (Because there's some issues I heard about the quality)
If it's about speed, which OS and what GPU you have?

@tazz4843
Copy link
Owner

tazz4843 commented Sep 13, 2024

Speed, Arch Linux with an Arc A770

@thewh1teagle
Copy link
Contributor Author

Speed, Arch Linux with an Arc A770

If it's that drastic like 80% slower maybe the logs suggest that whisper fallback to CPU?

@tazz4843
Copy link
Owner

The logs do show whisper running on Vulkan. It's not a major hit to my use case to stay on an old version of whisper forever so I'm likely just going to do that.

ggml_vulkan: Found 1 Vulkan devices:
Vulkan0: Intel(R) Arc(tm) A770 Graphics (DG2) (Intel open-source Mesa driver) | uma: 0 | fp16: 1 | warp size: 32
whisper_model_load: Intel(R) Arc(tm) A770 Graphics (DG2) total size =  1176.00 MB
whisper_model_load: model size    = 1176.00 MB
whisper_backend_init_gpu: using Vulkan backend

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants