
Community contribution: Adding GGUF support for more architectures #33260

Open · 10 of 15 tasks
SunMarc opened this issue Sep 2, 2024 · 28 comments

Labels: Feature request (Request for a new feature) · Good Second Issue (Issues that are more difficult to do than "Good First" issues - give it a try if you want!)

Comments

@SunMarc (Member) commented Sep 2, 2024

Feature request

Recently, we added the ability to load GGUF files within transformers.

The goal is to let users further train/fine-tune their GGUF models.

The workflow:

1. Load the GGUF file in transformers: the weights are dequantized to fp32, then loaded as regular PyTorch weights.
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "TheBloke/TinyLlama-1.1B-Chat-v1.0-GGUF"
filename = "tinyllama-1.1b-chat-v1.0.Q6_K.gguf"

tokenizer = AutoTokenizer.from_pretrained(model_id, gguf_file=filename)
model = AutoModelForCausalLM.from_pretrained(model_id, gguf_file=filename)
2. Train/fine-tune the model (a minimal sketch follows after step 3).

3. Convert the model back to GGUF for use in the ggml ecosystem, using the convert_hf_to_gguf.py script, or the gguf-my-repo space if you pushed your model to the Hub:

tokenizer.save_pretrained('directory')
model.save_pretrained('directory')

!python ${path_to_llama_cpp}/convert_hf_to_gguf.py ${directory}
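As a minimal sketch of step 2, here is what fine-tuning could look like, reusing the TinyLlama checkpoint from above. The one-step loop, example string, and learning rate are illustrative placeholders, not a recommended recipe:

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "TheBloke/TinyLlama-1.1B-Chat-v1.0-GGUF"
filename = "tinyllama-1.1b-chat-v1.0.Q6_K.gguf"

tokenizer = AutoTokenizer.from_pretrained(model_id, gguf_file=filename)
model = AutoModelForCausalLM.from_pretrained(model_id, gguf_file=filename)
model.train()

# After dequantization the weights are plain fp32 PyTorch tensors,
# so a standard training loop applies (toy single step shown here).
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)
batch = tokenizer("GGUF models can be fine-tuned too!", return_tensors="pt")
optimizer.zero_grad()
loss = model(**batch, labels=batch["input_ids"]).loss
loss.backward()
optimizer.step()

# Save in transformers format, then convert back to GGUF as shown above.
tokenizer.save_pretrained("directory")
model.save_pretrained("directory")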

Let's try to add GGUF support for more architectures! Currently supported architectures are:

  • Llama
  • Mistral
  • Qwen2

It would be great to add support for more architectures, such as

... and many more (feel free to suggest more architectures! The model needs to be integrated in transformers).

Adding this feature requires following the same protocol as in this PR:

  1. Update GGUF_TENSOR_MAPPING and GGUF_CONFIG_MAPPING to map the tensor/config names in the GGUF file to their transformers counterparts (a sketch of steps 1 and 2 follows below).
  2. Create a GGUFXXXConverter(XXXConverter) class to convert the GGUF tokenizer to a transformers one.
  3. Write tests.
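For orientation, here is a rough sketch of what steps 1 and 2 look like for a hypothetical "xxx" architecture. All entries and class names below are illustrative assumptions: the exact GGUF tensor/config keys and the converter plumbing depend on the architecture and on the current layout of src/transformers/integrations/ggml.py, so treat this as an outline rather than a drop-in implementation.

# Step 1: map GGUF tensor/config names to transformers names
# (illustrative entries for a Llama-like "xxx" architecture).
GGUF_TENSOR_MAPPING = {
    "xxx": {
        "token_embd": "model.embed_tokens",
        "blk": "model.layers",
        "attn_q": "self_attn.q_proj",
        "attn_k": "self_attn.k_proj",
        "attn_v": "self_attn.v_proj",
        "ffn_up": "mlp.up_proj",
        "ffn_down": "mlp.down_proj",
        "output_norm": "model.norm",
        "output.weight": "lm_head.weight",
    },
}

GGUF_CONFIG_MAPPING = {
    "xxx": {
        "context_length": "max_position_embeddings",
        "block_count": "num_hidden_layers",
        "embedding_length": "hidden_size",
        "feed_forward_length": "intermediate_size",
        "attention.head_count": "num_attention_heads",
        "vocab_size": "vocab_size",
    },
}

# Step 2: reuse the architecture's existing tokenizer converter
# (XxxConverter stands in for it), feeding it a proxy tokenizer built
# from the fields stored in the GGUF file (vocab, merges, special tokens).
class GGUFXxxConverter(XxxConverter):
    def __init__(self, tokenizer_dict):
        self.proto = GGUFTokenizerSkeleton(tokenizer_dict)
        self.original_tokenizer = self.proto
        self.additional_kwargs = {}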

If you are interested in taking up the challenge, comment below with the architecture you want to integrate and open a PR!

Once you open a PR, feel free to ping @SunMarc @LysandreJik @ArthurZucker for a review!

Motivation

Support for more GGUF models

Your contribution

Reviewing PRs and possibly adding support for more models

@SunMarc SunMarc added the Feature request Request for a new feature label Sep 2, 2024
@SunMarc SunMarc added the Good Second Issue Issues that are more difficult to do than "Good First" issues - give it a try if you want! label Sep 2, 2024
@VladOS95-cyber (Contributor)

@SunMarc I am going to take Qwen2Moe

@KingNish24

@SunMarc I want to take Gemma2

@junejae (Contributor) commented Sep 3, 2024

@SunMarc May I suggest & take T5? It seems the GGUF version of the T5 encoder is widely used together with FLUX.

@010kim (Contributor) commented Sep 3, 2024

@SunMarc Hello! Unless someone else is working on this model already, may I take MiniCPM-V?

@SunMarc (Member, Author) commented Sep 3, 2024

> @SunMarc May I suggest & take T5? It seems the GGUF version of the T5 encoder is widely used together with FLUX.

Added @junejae!

> @SunMarc Hello! Unless someone else is working on this model already, may I take MiniCPM-V?

Hi @010kim, thanks for the interest! The MiniCPM-V model relies on trust_remote_code=True, so I don't think we can add GGUF support for it for now. We don't want code in transformers that depends on modeling files hosted on the Hub. I will think about extending trust_remote_code=True to GGUF support, so that the model's author can add it themselves!

@010kim (Contributor) commented Sep 3, 2024

> Hi @010kim, thanks for the interest! The MiniCPM-V model relies on trust_remote_code=True, so I don't think we can add GGUF support for it for now. We don't want code in transformers that depends on modeling files hosted on the Hub. I will think about extending trust_remote_code=True to GGUF support, so that the model's author can add it themselves!

@SunMarc Thank you so much for your response. It also makes sense that the author should work on it. What about Cohere? Can I take it?

@jungnerd (Contributor) commented Sep 5, 2024

Hi @SunMarc 👋🏻
May I work on the CLIP model if nobody is working on it?

@SunMarc (Member, Author) commented Sep 5, 2024

Hey @jungnerd! The model you choose needs to be supported by the HF-to-GGUF conversion script. See the script here.

@g-prz (Contributor) commented Sep 9, 2024

Hey @SunMarc 🙋‍♂️
I'd like to try my hand at contributing to this issue; can I take Falcon? 🦅

@VladOS95-cyber (Contributor)

Hi @SunMarc, I'll take Bloom if nobody is working on it.

@fabxoe (Contributor) commented Sep 12, 2024

Hi @SunMarc, I'd like to handle the work related to Codestral :)

@jungnerd (Contributor)

> Hey @jungnerd! The model you choose needs to be supported by the HF-to-GGUF conversion script. See the script here.

There is a conversion script for the CLIP model (clip.cpp). Can I use this to contribute?

g-prz mentioned this issue Sep 13, 2024
@cjfghk5697 (Contributor)

Hi @SunMarc,
I'm interested in this issue. Would it be okay if I worked on the BLIP model?

@cjfghk5697 (Contributor)

> Hi @SunMarc, I'm interested in this issue. Would it be okay if I worked on the BLIP model?

Hi @SunMarc, I'd like to work on the BLIP model, but after researching, I found that it might be challenging due to the vision model structure. Would it be alright if I switched to working on the Smol model instead?

@g-prz (Contributor) commented Sep 19, 2024

Hey @SunMarc 🤗
Gonna continue with granite 🪨

@cjfghk5697 (Contributor) commented Sep 21, 2024

@SunMarc
I checked the Smol model and confirmed that it's already functioning well without needing any further work. The issue mentions that supporting the Smol model would be beneficial, but is there any specific work required?

If not, I'll proceed with switching to the dbrx model.

@SunMarc (Member, Author) commented Sep 23, 2024

> @SunMarc
> I checked the Smol model and confirmed that it's already functioning well without needing any further work. The issue mentions that supporting the Smol model would be beneficial, but is there any specific work required?
>
> If not, I'll proceed with switching to the dbrx model.

Oh indeed, that's because it uses the Llama architecture.

@VladOS95-cyber (Contributor) commented Sep 29, 2024

Hi @SunMarc! I am going to start working on StableLM model

@yijun-lee (Contributor)

Is any work being done on Gemma2? If not, I would like to proceed with it!
@SunMarc @KingNish24

g-prz mentioned this issue Oct 3, 2024
@VladOS95-cyber (Contributor)

Hi @SunMarc! I suppose GPT2 GGUF is not supported yet; if that's the case, I'll take it.

@fabxoe (Contributor) commented Oct 5, 2024

> Hi @SunMarc, I'd like to handle the work related to Codestral :)

Codestral's tokenizer is just the Llama tokenizer, so it looks like there's no Codestral-specific code to handle.

This was referenced Oct 5, 2024
@010kim (Contributor) commented Oct 6, 2024

> @SunMarc Thank you so much for your response. It also makes sense that the author should work on it. What about Cohere? Can I take it?

I went through the code and was able to load the Cohere GGUF model, but could not load the tokenizer. This is because the Cohere slow tokenizer is not implemented in Hugging Face (only the fast tokenizer is available for Cohere). Is there a way around this? @SunMarc

@VladOS95-cyber (Contributor)

Hey @SunMarc! I'll take Starcoder2 as the next model.

@VladOS95-cyber (Contributor)

Hi @SunMarc! I am going to start working on Mamba

@farrosalferro (Contributor)

Are you still working on Gemma2, @yijun-lee @KingNish24? If not, is it possible for me to try working on it? Thank you!

@yijun-lee (Contributor)

> Are you still working on Gemma2, @yijun-lee @KingNish24? If not, is it possible for me to try working on it? Thank you!

I'm running behind schedule, but I'm making progress! I'll handle it.

@farrosalferro (Contributor) commented Nov 8, 2024

> I'm running behind schedule, but I'm making progress! I'll handle it.

Glad to hear it! Then is it possible for me to try working on Nemotron? @SunMarc

@farrosalferro (Contributor)

Could you please check my PR, @SunMarc? Thank you!
Add Nemotron GGUF Loading Support
