Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Investigate supporting starcode #1326

Closed
Azeirah opened this issue May 4, 2023 · 8 comments
Closed

Investigate supporting starcode #1326

Azeirah opened this issue May 4, 2023 · 8 comments
Assignees
Labels
help wanted Extra attention is needed model Model specific

Comments

@Azeirah
Copy link
Contributor

Azeirah commented May 4, 2023

Bigcode just released starcoder. This is a 15B model trained on 1T Github tokens. This seems like it could be an amazing replacement for gpt-3.5 and maybe gpt-4 for local coding assistance and IDE tooling!

More info: https://huggingface.co/bigcode

@ghost
Copy link

ghost commented May 4, 2023 via email

@ggerganov
Copy link
Member

If it is like Saleforce's CodeGen model, checkout this repo: https://github.com/ravenscroftj/turbopilot

@ggerganov ggerganov added help wanted Extra attention is needed model Model specific labels May 5, 2023
@mrdc
Copy link

mrdc commented May 8, 2023

If it is like Saleforce's CodeGen model, checkout this repo: https://github.com/ravenscroftj/turbopilot

It’s an independent model. BTW it uses tokenizer from gpt-2.

@redthing1
Copy link

I'm very much interested in this too.

@NouamaneTazi
Copy link
Collaborator

Working on it! Should be coming soon

@NouamaneTazi NouamaneTazi self-assigned this May 10, 2023
@NouamaneTazi
Copy link
Collaborator

PR to support StarCoder/SantaCoder is ready! And results looking fine 🔥
ggml-org/ggml#146

@christianwengert
Copy link

I have seen it is supported in ggml-org/ggml#146 but how I can this be used in llama.cpp?

@NouamaneTazi
Copy link
Collaborator

NouamaneTazi commented May 18, 2023

It cannot be used in llama.cpp as it's not the same model architecture. but you can find a similar repo to llama.cpp for starcoder in https://github.com/bigcode-project/starcoder.cpp @christianwengert

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed model Model specific
Projects
None yet
Development

No branches or pull requests

6 participants