-
Notifications
You must be signed in to change notification settings - Fork 10.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tracking: LoRA #964
Comments
really desperate to start uing LoRA, however I use GPTQ-4bit-32g.GGML will this be a problem? |
So far, we've seen issues with quality on 4 bit base model. That being said, it has produced reasonable output for me some of the time. It is still under investigation. |
Would this be a good place to request support for multiple lora adapters sharing a similar base model? See here for inspiration: lm-sys/FastChat#1905 |
was done here #2095 also not sure this issue is the right one |
This issue was closed because it has been inactive for 14 days since being marked as stale. |
Here are some outstanding issues for LoRA:
mul_mat([16 X 5120], [16 X 5120])
takes 120ms - 24x slower than expected #956)--export-lora
flag); interactively (?)) (feature: interactively exporting loaded model to binfile #904)The text was updated successfully, but these errors were encountered: