llama : allow gguf RoPE keys to be overridden with defaults #3240
Conversation
Force-pushed from d863550 to 76988cd
I have an older model, codellama-34b.Q6_K.gguf, and it outputs garbage after commit a5661d7. Why would that be?
Well, codellama-34b does use a different rope_freq_base, so maybe this commit did somehow break it. What is the sha256sum of your model file? There are two versions.
1002ef07afb15d208819c844f9b0e28b98b3c0c8da3df54521e727c17b45917c
I'm still looking into this, but there's something really odd about that model file: the strings for the keys all have an embedded NUL byte at the end, which is counted towards their length. I don't know of any version of the llama.cpp GGUF writer that did that.
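The oddity described above can be checked directly, since GGUF metadata keys are length-prefixed strings. Below is a minimal sketch (not part of llama.cpp; the function name `first_gguf_key` is made up for illustration) that reads the first metadata key from a GGUF file and reports whether the counted length includes an embedded NUL byte:

```python
import struct

def first_gguf_key(path):
    """Read the first metadata key from a GGUF file and report whether
    its length-prefixed string contains an embedded NUL byte."""
    with open(path, "rb") as f:
        if f.read(4) != b"GGUF":
            raise ValueError("not a GGUF file")
        version, = struct.unpack("<I", f.read(4))
        # GGUF v2+ uses 64-bit counts/lengths; v1 used 32-bit.
        fmt, size = ("<Q", 8) if version >= 2 else ("<I", 4)
        n_tensors, = struct.unpack(fmt, f.read(size))
        n_kv, = struct.unpack(fmt, f.read(size))
        key_len, = struct.unpack(fmt, f.read(size))
        key = f.read(key_len)
        return key, b"\x00" in key
```

A well-formed file would return something like `(b"general.architecture", False)`; the broken file described above would return `True` in the second position.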
@shibe2 Do you observe the issue with a model newly converted using the latest version of the code?
LlongOrca-7B-16k works with default parameters on 569550d. I converted it again just now. Output on a short prompt matches exactly the output of my previous conversion. Also, information retrieval from a long text works. |
@klosax is this what you had in mind? It seems like the simplest way to resolve your TODO comment. I'd like to find a solution to this before I add more RoPE keys in PR #2268.
This way, the fallback value is hardcoded in llm_load_hparams and not exposed anywhere else. This shouldn't matter, because with GGUF the API user can't assume the value of these parameters before they've loaded the model.
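The fallback behavior described above can be sketched as follows. This is a hedged illustration, not llama.cpp's actual `llm_load_hparams` code: the key names follow the GGUF naming convention, and the default values shown are illustrative. Each RoPE hyperparameter is taken from the model's GGUF metadata when the key is present, and otherwise falls back to a hardcoded default that is not exposed outside the loader:

```python
# Illustrative defaults, hardcoded inside the loader (assumed values).
ROPE_DEFAULTS = {
    "llama.rope.freq_base": 10000.0,
    "llama.rope.scale_linear": 1.0,
}

def load_rope_hparams(metadata: dict) -> dict:
    """Return RoPE hyperparameters, preferring values stored in the
    model's GGUF metadata and falling back to hardcoded defaults."""
    return {key: metadata.get(key, default)
            for key, default in ROPE_DEFAULTS.items()}
```

With this shape, a model that stores its own `llama.rope.freq_base` (as codellama-34b does) gets that value, while older files missing the key silently get the defaults, which is why the API user cannot assume these values before loading the model.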