Issue converting Mistral models #1542
The vocab size of those two models is 32002; the original Mistral is 32000. I'll look into this later, but I'm not sure the current loader (in 3.21) will work as is.
Thanks for the response. It's the code that is linked above: sentencepiece 0.1.99 (May 2nd) loading tokenizer.model from the non-converted model from HF. Btw, I've got an approach that I think makes sense (minimal code changes) for faster-whisper batching: SYSTRAN/faster-whisper#59 (comment). If you guys think it looks okay, I can code it up and test.
If the tokenizer.model file is exactly the same as the original Mistral one, it will not tokenize those two new tokens, so you do indeed need to add them manually with their corresponding ids. While decoding, you need to remove ids 32000 and 32001 and translate them to whatever you want (probably nothing).
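To illustrate the point above, here is a minimal sketch (the helper names and the toy decoder are hypothetical, not part of CTranslate2): the base Mistral tokenizer only covers ids 0..31999, so the two tokens these fine-tunes add have to be registered with ids 32000/32001 at encode time and stripped, or translated, again before decoding.

```python
# Assumed mapping for these fine-tunes; check added_tokens.json to be sure.
ADDED_TOKENS = {32000: "<|im_end|>", 32001: "<|im_start|>"}

def decode_with_added_tokens(ids, base_decode):
    """Remove the added-token ids, then hand the rest to the base decoder."""
    plain = [i for i in ids if i not in ADDED_TOKENS]  # "translate to nothing"
    return base_decode(plain)

# Toy stand-in for the real SentencePiece decoder, just for illustration.
toy_vocab = {506: "Can", 368: " you", 2270: " name"}
base_decode = lambda ids: "".join(toy_vocab.get(i, "?") for i in ids)

print(decode_with_added_tokens([506, 368, 2270, 32000], base_decode))
# -> Can you name
```

In a real setup, `base_decode` would be the SentencePiece `decode` call on the unmodified tokenizer.model, which is exactly the decoder that cannot handle ids 32000/32001 on its own.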
So do we have a completed Mistral converter yet? Wish I could help, but I know exactly 0.0 about C++ programming.
The conversion code is here; it uses the transformers lib, as I understand it:
I'm not pretending to understand the details, though :)
Thanks, I'll have to update CTranslate2. Honestly, I was waiting for the Mistral converter to be finalized, because I thought I read there were some bugs in it, discovered after the last CTranslate2 release, that were still being addressed. You might not know...
I have been trying to convert these models, but am hitting a problem when running them:
https://huggingface.co/ehartford/dolphin-2.2.1-mistral-7b/tree/main
https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B
Using a command like this:
ct2-transformers-converter --model ./dolphin-2.2.1-mistral-7b --quantization int8_float16 --output_dir dolphin-2.2.1-mistral-7b-int8_float16-3.21.0
I have tried conversion with transformers v4.34.0 and v4.35.1 (Transformers support for Mistral was added in v4.34.0), and have tried running the conversion with ctranslate2 versions 3.16.0, 3.18.0, 3.20.0, and 3.21.0 (all but the latest needed an edit to ctranslate2/converters/transformers.py for Mistral).
I tried running with 3.16.0, 3.18.0, 3.20.0, and 3.21.0 using the chat server code (#1329), and hit this error:
If I add print(this_result.sequences_ids[0]), then it logs:
[506, 368, 2270, 22067, 24992, 1159, 28804, 28789, 28766, 321, 28730, 416, 28766, 28767, 32000]
I guess the 32000 is related to the problem, some vocab issue; I don't understand that stuff. Otherwise I've made an error somewhere.
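Given the earlier comment about removing ids 32000 and 32001 before decoding, one workaround sketch (an assumption on my part, not the project's official fix) is to drop any generated ids at or above the base vocab size, since the unmodified tokenizer.model cannot map them to text:

```python
BASE_VOCAB_SIZE = 32000  # size of the original Mistral tokenizer

def strip_added_ids(ids, base_size=BASE_VOCAB_SIZE):
    """Drop ids the base tokenizer.model cannot decode (32000, 32001)."""
    return [i for i in ids if i < base_size]

# The sequence logged above: the trailing 32000 is the problematic id.
sequence_ids = [506, 368, 2270, 22067, 24992, 1159, 28804,
                28789, 28766, 321, 28730, 416, 28766, 28767, 32000]
print(strip_added_ids(sequence_ids))
# everything that remains is within the base vocabulary
```

Filtering this way loses the information that an `<|im_end|>`-style token was emitted, so in a chat loop you would typically also treat those ids as a stop condition rather than silently discarding them.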
EDIT: tail of the vocabulary.json
The funny thing is I was certain I had it running yesterday, but maybe I was mistaken.
Any pointers? :)