Where to Get the Tokenizer When Converting GPT4ALL #887

Closed
reynolds087 opened this issue Apr 11, 2023 · 3 comments

Comments

@reynolds087

I'm trying to follow the instructions for GPT4ALL.
Here is the first line of code it says to run:

python3 convert-gpt4all-to-ggml.py models/gpt4all-7B/gpt4all-lora-quantized.bin ./models/tokenizer.model

I am getting an error because "tokenizer.model" does not exist at the specified location in this repository, as far as I can tell. It also does not come with the model if you download it from nomic-ai's GitHub. Where is this file supposed to come from?

Traceback (most recent call last):
  File "E:\Programs\llama.cpp\convert-gpt4all-to-ggml.py", line 107, in <module>
    main()
  File "E:\Programs\llama.cpp\convert-gpt4all-to-ggml.py", line 102, in main
    tokenizer = SentencePieceProcessor(args.tokenizer_model)
  File "C:\Temp\WPy64-31001\python-3.10.0.amd64\lib\site-packages\sentencepiece\__init__.py", line 447, in Init
    self.Load(model_file=model_file, model_proto=model_proto)
  File "C:\Temp\WPy64-31001\python-3.10.0.amd64\lib\site-packages\sentencepiece\__init__.py", line 905, in Load
    return self.LoadFromFile(model_file)
  File "C:\Temp\WPy64-31001\python-3.10.0.amd64\lib\site-packages\sentencepiece\__init__.py", line 310, in LoadFromFile
    return _sentencepiece.SentencePieceProcessor_LoadFromFile(self, arg)
OSError: Not found: "./models/tokenizer.model": No such file or directory Error #2

@clxyder

clxyder commented Apr 11, 2023

https://huggingface.co/decapoda-research/llama-7b-hf/blob/main/tokenizer.model

Download this and drop it in the models directory.
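
For anyone who prefers to script that step, here is a minimal Python sketch. It assumes the linked Hugging Face repository is still reachable and that you run it from the root of your llama.cpp checkout. The helper names (to_download_url, target_path, fetch_tokenizer) are made up for illustration and are not part of llama.cpp. The one detail worth knowing is that a "blob" link opens a web page, while the same path with "resolve" in its place serves the raw file.

```python
from pathlib import Path
from urllib.request import urlretrieve

# Link posted in this thread (a web page URL, not a direct download).
BLOB_URL = "https://huggingface.co/decapoda-research/llama-7b-hf/blob/main/tokenizer.model"


def to_download_url(blob_url: str) -> str:
    """Turn a Hugging Face 'blob' page link into the matching raw-file link."""
    return blob_url.replace("/blob/", "/resolve/", 1)


def target_path(blob_url: str, models_dir: str = "models") -> Path:
    """Place the file in models_dir under the file name taken from the URL."""
    return Path(models_dir) / blob_url.rsplit("/", 1)[-1]


def fetch_tokenizer(blob_url: str = BLOB_URL, models_dir: str = "models") -> Path:
    """Download the tokenizer unless it is already present; return its path."""
    dest = target_path(blob_url, models_dir)
    if not dest.is_file():
        dest.parent.mkdir(parents=True, exist_ok=True)
        # Needs network access; raises an error if the repository is unavailable.
        urlretrieve(to_download_url(blob_url), str(dest))
    return dest
```

Calling fetch_tokenizer() should leave the file at models/tokenizer.model, which is the path the convert command in the original post expects as its second argument.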

@clxyder

clxyder commented Apr 12, 2023

Hey @reynolds087, if this solved your problem, can you close the issue?

@j-f1 closed this as completed Apr 12, 2023

@lordadamson

Could you guys add this to the documentation? I had the same issue and expected that someone would have asked about it in the issues, and luckily someone had.
