Hard-coded Llama-3 model name pattern matching breaks scripts/convert_hf_checkpoint.py


This line makes the HF-to-gptfast converter treat `CodeLlama-34b` as a Llama-3 model, which results in errors.
https://github.com/pytorch-labs/gpt-fast/blob/main/scripts/convert_hf_checkpoint.py#L37
```
is_llama3 = "Llama-3" in model_name
```

```
File "gpt-fast/scripts/convert_hf_checkpoint.py", line 43, in convert_hf_checkpoint
    bin_files = [bin for bin in original_dir.iterdir() if pattern.match(bin.name)]
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Hard-coded Llama-3 model name pattern matching breaks scripts/convert_hf_checkpoint.py #177

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Hard-coded Llama-3 model name pattern matching breaks scripts/convert_hf_checkpoint.py #177

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions