Fix `revision` and other huggingface_hub kwargs in .from_quantized() #205

TheBloke · 2023-07-25T13:34:22Z

These kwargs were not being passed through to .from_quantized() so none of them were working. That's now fixed.

I also made the AutoGPTQForCausalLM.from_pretrained() call use the huggingface_hub args, so a model can be loaded directly from the hub (including with a revision or token), quantized, and then pushed back to the hub as GPTQ.
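For readers unfamiliar with this kind of bug, here is a minimal sketch of the pass-through pattern described above. It is not the code from this PR: `HUB_KWARG_NAMES`, `split_hub_kwargs`, `fake_download`, `load_quantized`, and the repo and branch names are all hypothetical, and the download step is a stand-in that touches no network.

```python
from typing import Any, Dict, Tuple

# Assumed set of download-related keyword names; the PR does not list them.
HUB_KWARG_NAMES = ("revision", "token", "cache_dir", "local_files_only")


def split_hub_kwargs(kwargs: Dict[str, Any]) -> Tuple[Dict[str, Any], Dict[str, Any]]:
    """Separate hub download options from the remaining keyword arguments."""
    hub_kwargs = {k: v for k, v in kwargs.items() if k in HUB_KWARG_NAMES}
    other_kwargs = {k: v for k, v in kwargs.items() if k not in HUB_KWARG_NAMES}
    return hub_kwargs, other_kwargs


def fake_download(repo_id: str, filename: str, **hub_kwargs: Any) -> str:
    """Stand-in for a hub download call; describes the request instead of fetching."""
    revision = hub_kwargs.get("revision", "main")
    return f"{repo_id}@{revision}/{filename}"


def load_quantized(repo_id: str, **kwargs: Any) -> Dict[str, Any]:
    """Hypothetical loader that forwards hub options to every download it makes."""
    hub_kwargs, model_kwargs = split_hub_kwargs(kwargs)
    # The bug class fixed here: calling the download without **hub_kwargs
    # would silently ignore `revision` and fall back to the default branch.
    path = fake_download(repo_id, "config.json", **hub_kwargs)
    return {"path": path, "model_kwargs": model_kwargs}
```

With this shape, a call such as `load_quantized("user/model-GPTQ", revision="my-branch", device="cuda:0")` resolves files against `my-branch`, while `device` stays with the model-loading arguments.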

TheBloke added 2 commits July 25, 2023 12:48

Fix revision and other huggingface_hub args for .from_quantized(), which were not being passed through

c9124e3

Extend huggingface_hub features to AutoGPTQForCausalLM.from_pretrained() so models can be quantised from the hub including using a private token and revision/branch etc

eeaf5eb

TheBloke mentioned this pull request Jul 25, 2023

Fix cuda bug #202

Merged

PanQiWei merged commit 2456f71 into AutoGPTQ:main Jul 26, 2023
