-
Notifications
You must be signed in to change notification settings - Fork 368
Support latest (breaking) changes to GGML file format #61
Conversation
Hi! Thanks a lot for the PR 😄
That's really strange. IIRC the only changes to the format are that now vocab scores are embedded in the model. Weights should be exactly the same. Can you make a test with |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good! Thanks again for the changes. Can you sort out the merge conflict and the formatting / clippy issues from CI?
Once that's taken care of, we're ready to merge 😄
Yup, will doublecheck output vs. main branch on an older model to ensure all is okay. |
Breaking changes in ggml: - Scores added in vocabulary - Format version added - Magic updated
- Generalize u32, i32 and f32 reading without a boilerplate-y trait
Yeah looks good to me. @setzer22 can you approve the workflow? |
Merged! Took care of some minor clippy lints myself. Thanks again :) |
Latest ggml models have:
This PR maintains support for older ('legacy') models
A model downloaded and converted on latest master in
llama.cpp
now works withllama-rs
but the output seems qualitatively worse. Haven't played around much withllama-rs
though so it's hard to say whether this is a regression.