Fix #3404 #3437

xaedes · 2023-10-02T12:50:34Z

This PR fixes #3404. The shapes for init model of grouped-query-attention models was wrong.
It was probably a code regression that happened during refactoring.

the shapes for init model of gqa models was wrong

…example * 'master' of github.com:ggerganov/llama.cpp: (24 commits) convert : fix Baichuan2 models by using vocab size in config.json (ggerganov#3299) readme : add project status link ggml : fix build after ggerganov#3329 llm : add Refact model (ggerganov#3329) sync : ggml (conv 1d + 2d updates, UB fixes) (ggerganov#3468) finetune : readme fix typo (ggerganov#3465) ggml : add RISC-V Vector Support for K-Quants and improved the existing intrinsics (ggerganov#3453) main : consistent prefix/suffix coloring (ggerganov#3425) llama : fix session saving/loading (ggerganov#3400) llama : expose model's rope_freq_scale in the API (ggerganov#3418) metal : alibi for arbitrary number of heads (ggerganov#3426) cmake : make LLAMA_NATIVE flag actually use the instructions supported by the processor (ggerganov#3273) Work on the BPE tokenizer (ggerganov#3252) convert : fix vocab size when not defined in hparams (ggerganov#3421) cmake : increase minimum version for add_link_options (ggerganov#3444) CLBlast: Add broadcast support for matrix multiplication (ggerganov#3402) gguf : add BERT, MPT, and GPT-J arch info (ggerganov#3408) gguf : general usability improvements (ggerganov#3409) cmake : make CUDA flags more similar to the Makefile (ggerganov#3420) finetune : fix ggerganov#3404 (ggerganov#3437) ...

the shapes for init model of gqa models was wrong

fix ggerganov#3404

f80994b

the shapes for init model of gqa models was wrong

ggerganov approved these changes Oct 2, 2023

View reviewed changes

ggerganov merged commit a03ce38 into ggerganov:master Oct 2, 2023
31 checks passed

yusiwen pushed a commit to yusiwen/llama.cpp that referenced this pull request Oct 7, 2023

finetune : fix ggerganov#3404 (ggerganov#3437)

9f5302d

the shapes for init model of gqa models was wrong

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix #3404 #3437

Fix #3404 #3437

xaedes commented Oct 2, 2023 •

edited

Loading

Fix #3404 #3437

Fix #3404 #3437

Conversation

xaedes commented Oct 2, 2023 • edited Loading

xaedes commented Oct 2, 2023 •

edited

Loading