
Retry allocation with fallback flags #2451

Merged
merged 1 commit into from
Oct 6, 2024

Conversation

SRHMorris
Contributor

This fixes issue #2450.

I'm unsure of the exact cause, as I'm not overly familiar with Vulkan, so feel free to reject this if you can think of a better solution.

@ggerganov
Owner

@0cc4m What do you think about this patch? Should we merge it upstream?

@0cc4m
Contributor

0cc4m commented Oct 6, 2024

@ggerganov Yeah, this should go upstream. I haven't tested it yet, but it looks good. It might even fix ggerganov/llama.cpp#9734.

Owner

@ggerganov left a comment


I just tested it, and it also fixes the issue reported in #2411 (comment).

@ggerganov ggerganov merged commit 9f346d0 into ggerganov:master Oct 6, 2024
44 checks passed
bygreencn added a commit to bygreencn/whisper.cpp that referenced this pull request Oct 12, 2024
# By Georgi Gerganov (18) and others
# Via Georgi Gerganov
* tag 'v1.7.1': (43 commits)
  release : v1.7.1
  vulkan : retry allocation with fallback flags (ggerganov#2451)
  release : v1.7.0
  scripts : bench v3-turbo
  whisper : remove mel leftover constants (396089f)
  whisper : zero-out the KV cache upon clear (ggerganov#2445)
  objc : fix build
  metal : zero-init buffer contexts (#0)
  whisper : revert mel-related changes (#0)
  whisper : adapt to latest ggml (skip) (#0)
  ggml : fix typo in example usage ggml_gallocr_new (ggml/984)
  ggml : fixes after sync (ggml/983)
  ggml-backend : add device and backend reg interfaces (llama/9707)
  Fixed dequant precision issues in Q4_1 and Q5_1 (llama/9711)
  ggml-backend : add device and backend reg interfaces (llama/9707)
  Initial cmake support of SYCL for AMD GPUs (llama/9658)
  vulkan : do not use tensor->extra (llama/9407)
  ggml/ex: calculate accuracy in graph, adapt MNIST (ggml/980)
  ggml: refactor cross entropy loss CPU impl. (ggml/976)
  scripts : sync ggml-backend.cpp
  ...

# Conflicts:
#	bindings/javascript/package.json
lyapple2008 pushed a commit to lyapple2008/whisper.cpp.mars that referenced this pull request Nov 2, 2024
Co-authored-by: Samuel Morris <samuel.morris@artlist.io>