Investigate supporting StarCoder #1326
Comments
It looks like gpt-2. I'd imagine it's like codegen where you can use the normal gpt-2/gpt-j conversion/quantization scripts/main binary. You just need enough memory to convert/run it.
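For context, the gpt-2/gpt-j quantization mentioned here packs weights into small blocks of 4-bit integers with one float scale per block. A minimal NumPy sketch of that idea (illustrative only, not llama.cpp's or ggml's actual code; function names and the block size of 32 are assumptions):

```python
import numpy as np

def quantize_q4(weights, block_size=32):
    """Sketch of Q4_0-style quantization: 4-bit ints plus one scale per block."""
    w = np.asarray(weights, dtype=np.float32).reshape(-1, block_size)
    # Scale each block so its max-magnitude value fits in the signed 4-bit range.
    scale = np.max(np.abs(w), axis=1, keepdims=True) / 7.0
    scale[scale == 0] = 1.0  # avoid division by zero for all-zero blocks
    q = np.clip(np.round(w / scale), -8, 7).astype(np.int8)
    return q, scale

def dequantize_q4(q, scale):
    """Recover approximate float weights from quantized blocks."""
    return (q.astype(np.float32) * scale).reshape(-1)
```

The roundtrip error per weight is bounded by half the block scale, which is why outlier values in a block hurt the precision of everything else in that block.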
If it is like Salesforce's CodeGen model, check out this repo: https://github.com/ravenscroftj/turbopilot
It's an independent model. BTW it uses the GPT-2 tokenizer.
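The GPT-2 tokenizer referenced here is a byte-level BPE, which starts from a fixed byte-to-printable-unicode mapping so every byte sequence is representable. A self-contained sketch of that mapping (reproduced from memory of OpenAI's published GPT-2 encoder; treat the details as an approximation):

```python
def bytes_to_unicode():
    """Map every byte 0..255 to a printable unicode character (GPT-2 style)."""
    # Printable byte ranges keep their own character.
    bs = (list(range(ord("!"), ord("~") + 1))
          + list(range(ord("\u00a1"), ord("\u00ac") + 1))
          + list(range(ord("\u00ae"), ord("\u00ff") + 1)))
    cs = bs[:]
    n = 0
    # Non-printable bytes are shifted up past 255 to unused code points.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))
```

This is why GPT-2-family vocabularies show tokens like `Ġdef` (the `Ġ` is the remapped space byte), and it means a model can reuse this tokenizer without sharing any architecture with GPT-2.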
I'm very much interested in this too.

Working on it! Should be coming soon.

PR to support StarCoder/SantaCoder is ready! And results looking fine 🔥
I have seen it is supported in ggml-org/ggml#146, but how can this be used in llama.cpp?
It cannot be used in llama.cpp as it's not the same model architecture. But you can find a repo similar to llama.cpp for StarCoder at https://github.com/bigcode-project/starcoder.cpp @christianwengert
BigCode just released StarCoder. This is a 15B model trained on 1T tokens of GitHub code. This seems like it could be an amazing replacement for GPT-3.5 and maybe GPT-4 for local coding assistance and IDE tooling!
More info: https://huggingface.co/bigcode
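A back-of-envelope check on whether a 15B model is practical locally: memory is roughly parameters times bits per weight. A small sketch (the 4.5 bits/weight figure for 4-bit formats is an assumption that accounts for per-block scale overhead):

```python
def model_memory_gib(n_params, bits_per_weight):
    """Approximate weight memory in GiB: params * bits / 8 bytes, / 2**30."""
    return n_params * bits_per_weight / 8 / 2**30

fp16_gib = model_memory_gib(15e9, 16)   # ~27.9 GiB at fp16
q4_gib = model_memory_gib(15e9, 4.5)    # ~7.9 GiB 4-bit quantized (assumed overhead)
```

So at fp16 the weights alone exceed most consumer GPUs, but a 4-bit quantization brings it into reach of machines with ~16 GB of RAM (KV cache and activations add more on top).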