sync : ggml (new ops, new backend, etc) #1602

ggerganov · 2023-12-07T13:50:28Z

No description provided.

ggerganov · 2023-12-07T15:07:03Z

@slaren Did I mess-up the sync, or is this some issue with Windows:

https://github.com/ggerganov/whisper.cpp/actions/runs/7129122907/job/19412511626

slaren · 2023-12-07T15:23:26Z

This seems to be caused by the code to register backends automatically on startup. The macros used for this depend heavily on the compiler and can be finicky, but it should work on MSVC. It does work on ggml and llama.cpp, so I am not sure what is different here.

zshannon · 2023-12-07T22:03:02Z

there's a bug somewhere in this commit for metal. running the whisper.swiftui example on macOS, it compiles then crashes when clicking "transcribe" with a console error "validateComputeFunctionArguments:1056: failed assertion `Compute Function(kernel_soft_max_4): missing buffer binding at index 1 for src1[0].'".

same thing works fine with previous commit 3163090.

sorry to not have more info, the metal bits go way over my head and make me so so grateful for these awesome libraries :)

ggerganov · 2023-12-08T11:45:46Z

@zshannon Should be fixed on master.

By the way, can you explain how you run the whisper.swiftui example?
When I try to run it, I get the following error, and I cannot figure out how to fix it:

Edit: nevermind - something was wrong with my Xcode project. I re-cloned the repo from scratch and it works now

landtanin · 2023-12-12T07:41:21Z

Hi @ggerganov , on the latest commit 6335933, I'm still seeing crashes from ggml-metal with the error Compute Function(kernel_soft_max_4): missing buffer binding at index 1 for src1[0].

I naively tried the solution from #1607 but to no avail.

Crashes occur when I run it on an iPhone 13 with the ggml-base model.

To make it work on the same phone, I either

switch to the ggml-tiny
switch to commit 3163090

Notes: I use a simple wav file with only a few seconds of spoken words. My app code is based on whisper.swiftui

P.S. this can never be said enough, thanks for this amazing project 🙌

landtanin · 2023-12-16T09:06:47Z

This is fixed in v1.5.2 released 👍

* sync : ggml (new ops, new backend, etc) * whisper : remove obsolete broadcasting code * ggml : remove backend self-registers + fix ggml_concat + n_task logic * metal : fix assert * metal : print resource path * whisper : fix bug if metal init fails

AuroraWright · 2023-12-18T01:33:35Z

This commit seems to break large-v3 CoreML for me.
I reverted 8171e62 7bc4d22 and this one, and using this audio file https://drive.google.com/file/d/1HWS1pExtOR2mYr0q7w_8ZCl69BjlFZad/view?usp=sharing with large-v3 (command line: -t 10 -pp -osrt -l ja output.wav ) it starts repeating "カタカナで裁判官" at 00:09:30.680 and never recovers. I can consistently reproduce it every time I try
Interestingly, the same issue is not occurring on large-v3 without CoreML

ggerganov · 2023-12-18T08:23:26Z

@AuroraWright Can you confirm that on latest master using CoreML with large-v3 does not work for you? Does it work if you add the -mc 0 flag?

Does anyone else experience issues on latest master with Metal / CoreML?

AuroraWright · 2023-12-18T16:38:00Z

@ggerganov yeah with latest master too, it seems mc = 0 fixes it
actually my bad, even before this commit with CoreML it avoids that specific repetition but then eventually breaks/starts repeating at 00:48:54 (same audio).
I also noticed with large-v2+CoreML and latest master it just prints garbage from 50:37 to 51:05 (eg 00:50:39,430 --> 00:50:40,730 ーーーーーーーーーーーーーーーーーーーー ), and everything is fine without CoreML.

* origin/master: bench.py : add different large models (ggerganov#1655) wchess : update README.md release : v1.5.2 wchess : update readme wchess : whisper assisted chess (ggerganov#1595) sync : ggml (Metal fixes, new ops, tests) (ggerganov#1633) cmake : target windows 8 or above for prefetchVirtualMemory in llama-talk (ggerganov#1617) cmake : Fix bug in httplib.h for mingw (ggerganov#1615) metal : fix `ggml_metal_log` vargs (ggerganov#1606) whisper.objc : disable timestamps for real-time transcription whisper : more debug messages + fix fallback logic metal : fix soft_max kernel src1 argument (ggerganov#1602) sync : ggml (new ops, new backend, etc) (ggerganov#1602) server : pass max-len argument to the server (ggerganov#1574) ios : Remove `#if arch(arm)` check for using Metal (ggerganov#1561) ggml : Fix 32-bit compiler warning (ggerganov#1575) ggml : re-enable blas for src0 != F32 (ggerganov#1583)

* sync : ggml (new ops, new backend, etc) * whisper : remove obsolete broadcasting code * ggml : remove backend self-registers + fix ggml_concat + n_task logic * metal : fix assert * metal : print resource path * whisper : fix bug if metal init fails

sync : ggml (new ops, new backend, etc)

f5ca5b0

whisper : remove obsolete broadcasting code

ba5c2ba

ggerganov added 4 commits December 7, 2023 20:28

ggml : remove backend self-registers + fix ggml_concat + n_task logic

997c3a6

metal : fix assert

3fcd3cf

metal : print resource path

a7a1211

whisper : fix bug if metal init fails

9e480f5

ggerganov merged commit afce6fa into master Dec 7, 2023
72 checks passed

ggerganov deleted the sync branch December 7, 2023 20:27

ggerganov added a commit that referenced this pull request Dec 8, 2023

metal : fix soft_max kernel src1 argument (#1602)

7bc4d22

landtanin pushed a commit to landtanin/whisper.cpp that referenced this pull request Dec 16, 2023

metal : fix soft_max kernel src1 argument (ggerganov#1602)

c798a49

iThalay pushed a commit to iThalay/whisper.cpp that referenced this pull request Sep 23, 2024

metal : fix soft_max kernel src1 argument (ggerganov#1602)

0d6dc2e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sync : ggml (new ops, new backend, etc) #1602

sync : ggml (new ops, new backend, etc) #1602

ggerganov commented Dec 7, 2023

ggerganov commented Dec 7, 2023

slaren commented Dec 7, 2023

zshannon commented Dec 7, 2023

ggerganov commented Dec 8, 2023 •

edited

Loading

landtanin commented Dec 12, 2023 •

edited

Loading

landtanin commented Dec 16, 2023

AuroraWright commented Dec 18, 2023 •

edited

Loading

ggerganov commented Dec 18, 2023

AuroraWright commented Dec 18, 2023 •

edited

Loading

sync : ggml (new ops, new backend, etc) #1602

sync : ggml (new ops, new backend, etc) #1602

Conversation

ggerganov commented Dec 7, 2023

ggerganov commented Dec 7, 2023

slaren commented Dec 7, 2023

zshannon commented Dec 7, 2023

ggerganov commented Dec 8, 2023 • edited Loading

landtanin commented Dec 12, 2023 • edited Loading

landtanin commented Dec 16, 2023

AuroraWright commented Dec 18, 2023 • edited Loading

ggerganov commented Dec 18, 2023

AuroraWright commented Dec 18, 2023 • edited Loading

ggerganov commented Dec 8, 2023 •

edited

Loading

landtanin commented Dec 12, 2023 •

edited

Loading

AuroraWright commented Dec 18, 2023 •

edited

Loading

AuroraWright commented Dec 18, 2023 •

edited

Loading