The problem with the conversion with the new convert.py #966

Closed
SrVill opened this issue Apr 14, 2023 · 8 comments · Fixed by #991

Comments

SrVill commented Apr 14, 2023

Hello! Help me figure this out:

F:\Models\digitous-Alpacino13b>convert.py --dump-single F:\Models\digitous-Alpacino13b\4bit.safetensors
Traceback (most recent call last):
  File "F:\Models\digitous-Alpacino13b\convert.py", line 1145, in <module>
    main()
  File "F:\Models\digitous-Alpacino13b\convert.py", line 1116, in main
    model_plus = lazy_load_file(args.model)
  File "F:\Models\digitous-Alpacino13b\convert.py", line 853, in lazy_load_file
    return lazy_load_safetensors_file(fp, path)
  File "F:\Models\digitous-Alpacino13b\convert.py", line 753, in lazy_load_safetensors_file
    model = {name: convert(info) for (name, info) in header.items()}
  File "F:\Models\digitous-Alpacino13b\convert.py", line 753, in <dictcomp>
    model = {name: convert(info) for (name, info) in header.items()}
  File "F:\Models\digitous-Alpacino13b\convert.py", line 745, in convert
    assert 0 <= begin <= end <= len(byte_buf)
AssertionError

Where is the error here: in the script, or is there a problem with the model? The model is from here: https://huggingface.co/digitous/Alpacino13b/tree/main
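
For context: the assertion checks that each tensor's byte range, parsed from the safetensors JSON header, lies inside the memory-mapped file. Below is a minimal sketch of that loading path (assumed names and layout, not the literal convert.py code) showing where a stale file offset breaks the bounds check:

    import json
    import mmap
    import struct

    def lazy_load_safetensors_sketch(fp):
        # safetensors layout: an 8-byte little-endian header length, then
        # a JSON header mapping tensor names to dtype, shape, and
        # [begin, end) byte offsets relative to the data section.
        header_size, = struct.unpack("<Q", fp.read(8))
        header = json.loads(fp.read(header_size))
        byte_buf = mmap.mmap(fp.fileno(), 0, access=mmap.ACCESS_READ)
        # On Windows, mmap.mmap resets the raw file offset, so tell() no
        # longer points at the start of the data section here; computing
        # 8 + header_size directly would avoid the file offset entirely.
        data_start = fp.tell()
        for name, info in header.items():
            if name == "__metadata__":
                continue
            begin = data_start + info["data_offsets"][0]
            end = data_start + info["data_offsets"][1]
            # With a stale data_start these bounds are wrong, and the
            # equivalent check in convert.py fails with AssertionError:
            assert 0 <= begin <= end <= len(byte_buf)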

kickturn commented Apr 14, 2023

Same error here. I'm using a different model, but I get the exact same error on the latest commit.

Traceback (most recent call last):
  File "D:\llama\llama.cpp\convert.py", line 1146, in <module>
    main()
  File "D:\llama\llama.cpp\convert.py", line 1126, in main
    model_plus = load_some_model(args.model)
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\llama\llama.cpp\convert.py", line 1053, in load_some_model
    models_plus.append(lazy_load_file(path))
                       ^^^^^^^^^^^^^^^^^^^^
  File "D:\llama\llama.cpp\convert.py", line 854, in lazy_load_file
    return lazy_load_safetensors_file(fp, path)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\llama\llama.cpp\convert.py", line 754, in lazy_load_safetensors_file
    model = {name: convert(info) for (name, info) in header.items()}
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\llama\llama.cpp\convert.py", line 754, in <dictcomp>
    model = {name: convert(info) for (name, info) in header.items()}
                   ^^^^^^^^^^^^^
  File "D:\llama\llama.cpp\convert.py", line 746, in convert
    assert 0 <= begin <= end <= len(byte_buf)
AssertionError

kickturn commented Apr 15, 2023

Apparently I fixed it by using Linux. Use WSL or an alternative and run the script again. I'm not versed enough in Python to figure out the error, but on Linux it doesn't hit the AssertionError, with no alterations whatsoever.

comex (Contributor) commented Apr 15, 2023

Let's see...

comex added a commit to comex/llama.cpp that referenced this issue Apr 15, 2023
Calling `mmap.mmap` on Windows apparently resets the file offset of the
raw file object (and makes the BufferedReader return a *negative* file
offset).  For safetensors, avoid using the file offset after calling
mmap.  For GGML format, explicitly save and restore the offset.

Fixes ggml-org#966.
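
The save-and-restore half of that fix looks roughly like this; a sketch of the pattern the commit message describes, not the literal patch:

    import mmap

    def map_file_preserving_offset(fp):
        # On Windows, mmap.mmap can reset the underlying file offset, so
        # save it before mapping and restore it afterwards; code that
        # keeps reading from the file object is then unaffected.
        saved_offset = fp.tell()
        buf = mmap.mmap(fp.fileno(), 0, access=mmap.ACCESS_READ)
        fp.seek(saved_offset)
        return buf
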
kickturn commented Apr 15, 2023

Just checked, and the commit fixed it!

SrVill (Author) commented Apr 15, 2023

The same error. Maybe I'm doing something wrong?

C:\llama.cpp>convert.py 4bit.safetensors --outtype q4_1 --outfile 4ggml.bin
Loading model file 4bit.safetensors
Traceback (most recent call last):
  File "C:\llama.cpp\convert.py", line 1145, in <module>
    main()
  File "C:\llama.cpp\convert.py", line 1125, in main
    model_plus = load_some_model(args.model)
  File "C:\llama.cpp\convert.py", line 1052, in load_some_model
    models_plus.append(lazy_load_file(path))
  File "C:\llama.cpp\convert.py", line 853, in lazy_load_file
    return lazy_load_safetensors_file(fp, path)
  File "C:\llama.cpp\convert.py", line 753, in lazy_load_safetensors_file
    model = {name: convert(info) for (name, info) in header.items()}
  File "C:\llama.cpp\convert.py", line 753, in <dictcomp>
    model = {name: convert(info) for (name, info) in header.items()}
  File "C:\llama.cpp\convert.py", line 745, in convert
    assert 0 <= begin <= end <= len(byte_buf)
AssertionError

prusnak pushed a commit that referenced this issue Apr 15, 2023
Calling `mmap.mmap` on Windows apparently resets the file offset of the
raw file object (and makes the BufferedReader return a *negative* file
offset).  For safetensors, avoid using the file offset after calling
mmap.  For GGML format, explicitly save and restore the offset.

Fixes #966.
NextGA-OSS commented Dec 16, 2023

Still seeing similar issues. Latest repo, trying to set up an Apple M2 build using Mixtral:

% python3 convert.py ./models/mixtral-instruct-8x7b/ \
         --outfile ./models/mixtral-instruct-8x7b/ggml-model-f16.gguf \
         --outtype f16
Loading model file models/mixtral-instruct-8x7b/model-00001-of-00019.safetensors
Traceback (most recent call last):
  File "/Users/user/Documents/Workspace/llama/convert.py", line 1279, in <module>
    main()
  File "/Users/user/Documents/Workspace/llama/convert.py", line 1207, in main
    model_plus = load_some_model(args.model)
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/user/Documents/Workspace/llama/convert.py", line 1140, in load_some_model
    models_plus.append(lazy_load_file(path))
                       ^^^^^^^^^^^^^^^^^^^^
  File "/Users/user/Documents/Workspace/llama/convert.py", line 797, in lazy_load_file
    raise ValueError(f"unknown format: {path}")
ValueError: unknown format: models/mixtral-instruct-8x7b/model-00001-of-00019.safetensors
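
For context, convert.py picks a loader based on the first bytes of the file and raises this "unknown format" error when nothing matches. A rough sketch of how safetensors detection of this kind works (assumed logic, not the actual convert.py code):

    import struct

    def looks_like_safetensors(path):
        # A safetensors file starts with an 8-byte little-endian header
        # length, followed by a JSON object ("{"...) of that length.
        with open(path, "rb") as fp:
            first8 = fp.read(8)
            if len(first8) < 8:
                return False
            header_size, = struct.unpack("<Q", first8)
            # An implausibly large value means the first 8 bytes were
            # not a safetensors header length.
            if header_size > 100 * 1024 * 1024:
                return False
            return fp.read(1) == b"{"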

prusnak (Collaborator) commented Dec 16, 2023

> Still seeing similar issues.

It's a very different issue; please open a new one.

shrijayan commented

@SrVill Facing the same issue in Google Colab. Any solution?
