llama_runner crash when running HF models

### 🐛 Describe the bug

Build and run llama_runner from latest trunk (`7534d4854b94cb49ea54d0e937c5af59ab18d2a9`) still crashes:
`cmake-out/examples/models/llama/llama_main --model_path=smollm3/smollm3-3b-8da4w.pte --tokenizer_path=smollm3/tokenizer.json --prompt="Once upon a time"`

stacktrace:
```
I tokenizers:regex.cpp:27] Registering override fallback regex
I tokenizers:hf_tokenizer.cpp:109] Setting up normalizer...
libc++abi: terminating due to uncaught exception of type nlohmann::json_abi_v3_11_3::detail::type_error: [json.exception.type_error.304] cannot use at() with null
Abort trap: 6
```

The PTE and tokenizer can be downloaded here: https://huggingface.co/pytorch/SmolLM3-3B-8da4w (need to join the organization first), the one I previously validated on-device using llama_runner built from a commit about 15 days ago. 

Follow this instruction to build llama_runner: https://github.com/pytorch/executorch/blob/main/examples/models/llama/README.md#step-3-run-on-your-computer-to-validate

### Versions

latest trunk (`7534d4854b94cb49ea54d0e937c5af59ab18d2a9`)

cc @larryliu0820 @JacobSzwejbka @lucylq

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

llama_runner crash when running HF models #12528

🐛 Describe the bug

Versions

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

llama_runner crash when running HF models #12528

Description

🐛 Describe the bug

Versions

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions