Skip to content

llama_runner crash when running HF models #12528

@guangy10

Description

@guangy10

🐛 Describe the bug

Build and run llama_runner from latest trunk (7534d4854b94cb49ea54d0e937c5af59ab18d2a9) still crashes:
cmake-out/examples/models/llama/llama_main --model_path=smollm3/smollm3-3b-8da4w.pte --tokenizer_path=smollm3/tokenizer.json --prompt="Once upon a time"

stacktrace:

I tokenizers:regex.cpp:27] Registering override fallback regex
I tokenizers:hf_tokenizer.cpp:109] Setting up normalizer...
libc++abi: terminating due to uncaught exception of type nlohmann::json_abi_v3_11_3::detail::type_error: [json.exception.type_error.304] cannot use at() with null
Abort trap: 6

The PTE and tokenizer can be downloaded here: https://huggingface.co/pytorch/SmolLM3-3B-8da4w (need to join the organization first), the one I previously validated on-device using llama_runner built from a commit about 15 days ago.

Follow this instruction to build llama_runner: https://github.com/pytorch/executorch/blob/main/examples/models/llama/README.md#step-3-run-on-your-computer-to-validate

Versions

latest trunk (7534d4854b94cb49ea54d0e937c5af59ab18d2a9)

cc @larryliu0820 @JacobSzwejbka @lucylq

Metadata

Metadata

Assignees

Labels

module: runtimeIssues related to the core runtime and code under runtime/triagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate module

Type

Projects

Status

Done

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions