OK, I sort of figured it out. For others looking into this:
Use the GGUF branch
Run it like this: cargo run --release -- infer -m ~/llms/mistral-7b-instruct-v0.1.Q4_K_S.gguf -p "Write a long story" -n 5000 -r mistralai/Mistral-7B-v0.1
The important bit is the -r mistralai/Mistral-7B-v0.1 argument at the end, which tells it which tokenizer to use.
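For reference, here is the same command broken out with flag-by-flag comments. The flag meanings are my reading of this thread rather than the CLI's documented help, so treat them as assumptions:

```sh
# Run from the root of the repo, with the gguf branch checked out.
# -m: path to the local GGUF model file
# -p: the prompt to generate from
# -n: number of tokens to generate (here up to 5000)
# -r: Hugging Face repo to fetch the tokenizer from, needed because the
#     branch's embedded GGUF tokenizer support is currently disabled
cargo run --release -- infer \
    -m ~/llms/mistral-7b-instruct-v0.1.Q4_K_S.gguf \
    -p "Write a long story" \
    -n 5000 \
    -r mistralai/Mistral-7B-v0.1
```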
Yup - the gguf branch's embedded tokenizer support doesn't quite work, so I've disabled it for now. I'm rebuilding the library in develop to target the latest llama.cpp, but that's going to take some time.
Do you all support this? https://mistral.ai/news/announcing-mistral-7b/