Add Phi-3-mini-4k-instruct checkpoint #1341
There is a modeling_*.py file. |
Haha, I finally got the weights loaded, but of course it's never easy ... it's generating gibberish.
Let the easter egg hunt begin 😭 |
Some more tidbits via Daniel Han:
|
Ok, it's becoming more interesting. |
Looks like the sliding window number was a typo: https://huggingface.co/microsoft/Phi-3-mini-4k-instruct/commit/b043e05a86cfc77f8d53eb0edf6a33e39afbcb5e |
The current code is in an ugly state, but at least the model produces the same output as the HF one. The missing piece is the tokenizer: it has a smaller vocab size (32k vs 50k) that was extended by 64 special tokens. |
Yeah, that sounds about right based on the Phi-3 paper:
|
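To make the vocabulary arithmetic from the comments above concrete, here is a minimal sketch. It assumes that "32k" means exactly 32,000 base tokens, and the `round_up_to_multiple` helper and the multiple of 64 are hypothetical illustrations of how an embedding table might be padded, not something stated in this PR.

```python
# Assumption: "32k" in the comment above is read as exactly 32,000 tokens.
BASE_VOCAB_SIZE = 32_000
# From the comment above: the vocabulary was extended by 64 special tokens.
NUM_SPECIAL_TOKENS = 64


def extended_vocab_size(base: int, extra: int) -> int:
    """Total number of token ids after appending `extra` special tokens."""
    return base + extra


def round_up_to_multiple(n: int, multiple: int) -> int:
    """Smallest multiple of `multiple` that is >= n (hypothetical helper)."""
    return -(-n // multiple) * multiple


total = extended_vocab_size(BASE_VOCAB_SIZE, NUM_SPECIAL_TOKENS)
print(total)  # 32064
# 32064 is already a multiple of 64, so padding to 64 changes nothing.
print(round_up_to_multiple(total, 64))  # 32064
```

Under these assumptions the embedding and output layers need 32,064 rows, so a checkpoint converted with a 50k-sized vocabulary in the config would not match the weight shapes.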
It required a number of changes, but it works. @rasbt Could you check the changes in the READMEs? I'm not 100% sure that I've done them correctly. |
Thanks so much! I am currently moving and offline until the weekend/Monday. Will take a look when I am back! |
This is a tremendous PR. Thanks so much @Andrei-Aksionov . Just tried the model and it works great!
Looks good, thanks!
I think the tests are failing because of the new Eval Harness release (https://pypi.org/project/lm-eval/#history). I can look into it in a separate PR. |
Yep, this is the reason. |
All good now. Big thanks again @Andrei-Aksionov !! |
Add other config files (will be added in a separate PR)