Closed
Description
Name and Version
Tested on latest pull.
Problem description & steps to reproduce
I'm not sure if this is a bug or intended:
- Is `--outtype` supposed to convert to the type, e.g. is `--outtype f16` on an `f32` LoRA adapter in PEFT format supposed to create an `f16` LoRA in GGUF format (i.e. the same command semantics as `convert_hf_to_gguf_update.py`)?
- Or is it just a "type hint" in case it can't infer the type of the PEFT format LoRA?
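To make the first interpretation concrete, here is a minimal sketch of what I mean by "convert to the type". It is not the converter's actual code: it uses only Python's standard `struct` module, and the `lora_a` values and the `cast_f32_to_f16` helper are made up for illustration. Under "convert" semantics I would expect each f32 value to be rounded to half precision, so the stored tensor data is half the size.

```python
import struct


def cast_f32_to_f16(values):
    """Pack values as IEEE 754 half precision and read them back.

    Hypothetical helper for illustration only; returns the raw f16 bytes
    and the values as they look after rounding to f16.
    """
    fmt = f"<{len(values)}e"  # 'e' is the struct code for a 16-bit float
    packed = struct.pack(fmt, *values)
    return packed, list(struct.unpack(fmt, packed))


# Made-up LoRA weights, as they would be stored in an f32 adapter.
lora_a = [0.1, -0.25, 3.0]

f32_bytes = struct.pack(f"<{len(lora_a)}f", *lora_a)  # 4 bytes per value
f16_bytes, f16_values = cast_f32_to_f16(lora_a)       # 2 bytes per value

print(len(f32_bytes), len(f16_bytes))
print(f16_values)
```

With "type hint" semantics, by contrast, I would expect the data to stay at 4 bytes per value whenever the source type can be inferred as `f32`.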
I also tried running llama-quant on the resulting LoRA in f32 GGUF format, but it doesn't like being given these files...
Are non-f32 LoRAs even allowed?
If it is a bug then I'll do some more investigation and see if I can find out what is happening.