Skip to content

Making a "quantize-ggml_16bit-to-gptq.py" script? #618

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
BadisG opened this issue Mar 30, 2023 · 3 comments
Closed

Making a "quantize-ggml_16bit-to-gptq.py" script? #618

BadisG opened this issue Mar 30, 2023 · 3 comments
Labels
enhancement New feature or request stale

Comments

@BadisG
Copy link

BadisG commented Mar 30, 2023

Hello,

I know the quantize.py converts a ggml 16 bits into a ggml 4 bits RTN.
Do you think it's possible to create a script that converts a ggml 16 bits into a ggml 4bits GPTQ?

Referring to this repository, it appears that the current implementation of the quantization relies only on GPU, which demands a significant amount of VRAM and might not be suitable for the average user.

A new script, which we could call "quantize-ggml_16bit-to-gptq.py", could be designed to use only CPU and RAM resources, making it more accessible to the general public.

@FNsi
Copy link
Contributor

FNsi commented Mar 30, 2023

Maybe you can check ggml.c to edit the q4 type?

@gjmulder gjmulder added the enhancement New feature or request label Mar 30, 2023
@prusnak
Copy link
Collaborator

prusnak commented Mar 30, 2023

quantize.py does not do any conversion by itself - it just calls ./quantize, so you might want to dig into examples/quantize/quantize.cpp for answers

@github-actions github-actions bot added the stale label Mar 25, 2024
Copy link
Contributor

This issue was closed because it has been inactive for 14 days since being marked as stale.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request stale
Projects
None yet
Development

No branches or pull requests

4 participants