-
Notifications
You must be signed in to change notification settings - Fork 169
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove hf_auth_token use #1822
base: main
Are you sure you want to change the base?
Remove hf_auth_token use #1822
Conversation
Currently marking it as draft since 13B and 70B paths need testing. |
If we only download the mlir we wouldn't hit the token right? |
I did try doing that but during the run saw that we will hit that issue - because we're using tokenizers to decode each generated token. And this tokenizer is being instantiated as per the HF repo we use. |
Even this would work since we're anyway blocking the IR generation. I verified it on CPU for llama2 7B. With this PR we don't need to maintain config files for tokenizer but we're changing the base HF repo and this would impact the workflow when the IR generation is given a green signal. But with the other PR we only need to incur an overhead for maintaining the config files - keeping rest of the infra same. |
fe4fcd6
to
1745c8f
Compare
-- This commit removes `--hf_auth_token` uses from vicuna.py. -- It adds llama2 models based on daryl49's HF. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>
1745c8f
to
c76c519
Compare
-- This commit removes
--hf_auth_token
uses from vicuna.py.-- It adds llama2 models based on daryl49's HF.
Signed-off-by: Abhishek Varma abhishek@nod-labs.com