Add q8_0 models to download-ggml-model.sh #2589
Merged

Conversation
ggerganov approved these changes (Nov 28, 2024)
bygreencn added a commit to bygreencn/whisper.cpp that referenced this pull request (Dec 3, 2024)
bygreencn added a commit to bygreencn/whisper.cpp that referenced this pull request (Dec 5, 2024)
mrienstra added a commit to mrienstra/whisper.cpp that referenced this pull request (Dec 11, 2024): "Introduced in ggerganov#2589"
lyapple2008 pushed two commits to lyapple2008/whisper.cpp.mars that referenced this pull request (Feb 4, 2025)
Model names as per the v1.7.2 announcement.

This PR adds -q8_0 models for tiny, base, small, medium, large-v2, and large-v3-turbo.

With the changes in this PR, the remaining differences between the model list in the v1.7.2 announcement and models/download-ggml-model.sh are:

Only in models/download-ggml-model.sh:
- tiny.en
- tiny.en-q5_1
- base.en-q5_1
- small.en
- small.en-tdrz
- small.en-q5_1
- medium.en
- medium.en-q5_0
- large-v1
- large-v3
- large-v3-q5_0

Only in the v1.7.2 announcement:
- tiny
- tiny-q5_0
- base-q5_0
- small-q5_0
- medium-q5_1
- medium-dis
- large-v2-q5_1
- large-v2-dis
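For context, the change itself amounts to adding new names to the list of downloadable models in models/download-ggml-model.sh. As a minimal sketch (assuming the script keeps its available model names in a plain shell variable; the actual surrounding list layout may differ), the added entries would look something like:

```sh
# Sketch: q8_0 entries added to the model list in
# models/download-ggml-model.sh (rest of the list omitted)
models="tiny-q8_0
base-q8_0
small-q8_0
medium-q8_0
large-v2-q8_0
large-v3-turbo-q8_0"
```

After this PR, a q8_0 model can be fetched the same way as any other model, for example:

```sh
# Downloads ggml-large-v3-turbo-q8_0.bin into the models/ directory
sh ./models/download-ggml-model.sh large-v3-turbo-q8_0
```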