
Add a missing step to the gpt4all instructions #690

Merged
merged 1 commit into ggml-org:master on Apr 2, 2023

Conversation

ThatcherC (Contributor)

migrate-ggml-2023-03-30-pr613.py is needed to get gpt4all running.

When following the README instructions for gpt4all, I encountered the following error:

/data/llama.cpp$ python3 convert-gpt4all-to-ggml.py models/gpt4all-models/gpt4all-lora-quantized.bin models/tokenizer.model

/data/llama.cpp$ ./main -m models/gpt4all-models/gpt4all-lora-unfiltered-quantized.bin -n 128
main: seed = 1680382134
llama_model_load: loading model from 'models/gpt4all-models/gpt4all-lora-unfiltered-quantized.bin' - please wait ...
models/gpt4all-models/gpt4all-lora-unfiltered-quantized.bin: invalid model file (bad magic [got 0x67676d66 want 0x67676a74])
    you most likely need to regenerate your ggml files
    the benefit is you'll get 10-100x faster load times
    see https://github.com/ggerganov/llama.cpp/issues/91
    use convert-pth-to-ggml.py to regenerate from original pth
    use migrate-ggml-2023-03-30-pr613.py if you deleted originals
llama_init_from_file: failed to load model
main: error: failed to load model 'models/gpt4all-models/gpt4all-lora-unfiltered-quantized.bin'
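
The "bad magic" line is the telltale sign: the file carries the older ggmf magic (0x67676d66), while ./main now expects the newer ggjt magic (0x67676a74) that migrate-ggml-2023-03-30-pr613.py produces. As a quick sanity check (just a sketch, not a README step, and the path is from my setup), you can dump the first four bytes of a model file, which hold the magic stored little-endian:

```sh
# Hypothetical check: print the first four bytes of a model file
# (the ggml magic, stored little-endian on disk).
#   66 6d 67 67  ->  0x67676d66 ('ggmf'), what convert-gpt4all-to-ggml.py currently emits
#   74 6a 67 67  ->  0x67676a74 ('ggjt'), what ./main expects after migration
od -A n -t x1 -N 4 models/gpt4all-models/gpt4all-lora-unfiltered-quantized.bin
```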

I think the gpt4all conversion script is a bit out of date and produces files that need to be converted with migrate-ggml-2023-03-30-pr613.py.

I was able to run `python3 migrate-ggml-2023-03-30-pr613.py models/gpt4all-models/gpt4all-lora-unfiltered-quantized.bin models/gpt4all-models/gpt4all-lora-unfiltered-quantized-pr613.bin`, which seemed to do the trick. After that, `./main -m models/gpt4all-models/gpt4all-lora-unfiltered-quantized-pr613.bin` ran successfully!

This PR updates the README to add that `migrate-ggml-2023-03-30-pr613.py` step.
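
Put together, the full sequence looks roughly like this (a sketch based on the commands above, using the unfiltered model, and assuming convert-gpt4all-to-ggml.py writes its converted output back over the input .bin as the flow above implies; your model and tokenizer paths may differ):

```sh
# 1. Convert the gpt4all model to ggml format (existing README step)
python3 convert-gpt4all-to-ggml.py models/gpt4all-models/gpt4all-lora-unfiltered-quantized.bin models/tokenizer.model

# 2. Migrate the converted file to the newer ggjt format (the step this PR adds)
python3 migrate-ggml-2023-03-30-pr613.py models/gpt4all-models/gpt4all-lora-unfiltered-quantized.bin models/gpt4all-models/gpt4all-lora-unfiltered-quantized-pr613.bin

# 3. Run inference on the migrated file
./main -m models/gpt4all-models/gpt4all-lora-unfiltered-quantized-pr613.bin -n 128
```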

sw (Contributor) commented Apr 2, 2023

Yes, the `*-to-ggml.py` scripts need to be updated; I've opened an issue for this: #704

In the meantime, I think it makes sense to update the README.

Edit: it looks like #545 might make this irrelevant; let's see if that one can be merged soon.

prusnak merged commit d8d4e86 into ggml-org:master on Apr 2, 2023