Ollama create Error: unsupported architecture #6

Open · simonnxren opened this issue Nov 21, 2024 · 6 comments
@simonnxren

I get the "unsupported architecture" error when creating models or converting to GGUF with llama.cpp.

It looks like the model has the same architecture as Llama 3.2V. Could you help me with this?

@XuGW-Kevin
Collaborator

Hi, this model has exactly the same architecture as Llama 3.2V.

@XuGW-Kevin
Collaborator

XuGW-Kevin commented Nov 21, 2024

In theory, you can run this model on any platform that supports llama3.2V.
I'm not familiar with Ollama, but if you can share the error, I'll try to help with that.

@simonnxren
Author

> In theory, you can run this model on any platform that supports llama3.2V. I'm not familiar with Ollama, but if you can share the error, I'll try to help with that.

cmd (the model name points to the HF repo I downloaded):

```
ollama create llama32V_cot
```

output:

```
transferring model data 100%
converting model
Error: unsupported architecture
```
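For reference, the shape of the full invocation (a sketch only; the directory path and Modelfile contents are illustrative, not the exact ones used):

```sh
# Write a minimal Modelfile whose FROM points at the downloaded HF repo directory
cat > Modelfile <<'EOF'
FROM ./Llama-3.2V-11B-cot
EOF

# Build the Ollama model from it; this is the step that fails during conversion
ollama create llama32V_cot -f Modelfile
```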
`ollama run llama3.2-vision`, by contrast, runs without a problem.

I just figured out why it is not working: the official Llama 3.2 Vision model is distributed already converted to GGUF, so it can run directly. But with `ollama create`, the server needs to convert the safetensors to GGUF with llama.cpp in the background, and llama.cpp does not yet support the mllama architecture, so the conversion fails.
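The failure is reproducible with llama.cpp's own converter (a minimal sketch, assuming the repo is downloaded to ./Llama-3.2V-11B-cot):

```sh
# llama.cpp's HF-to-GGUF conversion script, run against the local model directory
python convert_hf_to_gguf.py ./Llama-3.2V-11B-cot --outfile llama32v-cot.gguf
# fails with: ERROR:hf-to-gguf:Model MllamaForConditionalGeneration is not supported
```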

@simonnxren
Author

> Hi, this model has exactly the same architecture as Llama 3.2V.

Could you upload a GGUF version as well? I am not sure how the official Llama 3.2 Vision model was converted.

@XuGW-Kevin
Collaborator

It seems that one can use:

```
pip install model2gguf
model2gguf convert --model_id "Xkev/Llama-3.2V-11B-cot" --output_file "output_file.gguf"
```

I will try to upload one later. (I haven't tried that before, honestly.)

@simonnxren
Author

> It seems that one can use:
>
> pip install model2gguf
> model2gguf convert --model_id "Xkev/Llama-3.2V-11B-cot" --output_file "output_file.gguf"
>
> I will try to upload one later. (I haven't tried that before, honestly.)

Ehh... I just tried the package. It uses llama.cpp in the backend as well, so it does not work; I got the same error:

```
INFO:hf-to-gguf:Loading model: Llama-3.2V-11B-cot
ERROR:hf-to-gguf:Model MllamaForConditionalGeneration is not supported
```
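(A quick way to confirm the architecture the converter is rejecting is to read it out of the repo's config.json; a minimal check, assuming a local download:)

```sh
# Print the declared architectures from the HF model config
python -c "import json; print(json.load(open('Llama-3.2V-11B-cot/config.json'))['architectures'])"
# expected output: ['MllamaForConditionalGeneration']
```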
