This repository has been archived by the owner on Jun 24, 2024. It is now read-only.
This should be relatively straightforward - it reads in the original ggml model, runs the quantization functions over the data, and writes it out to disk.
The exciting possibility is for parallelisation 👀 - all you should have to do is scan through the file to determine the tensor boundaries, then build an iterator from it and feed it to rayon. It would be a huge improvement over the C++ version, and it would be practically free!
Is there currently a way to convert models to ggml format? I'm close to getting `quantize` into a working demo and was wondering if this should also be ported for the PR.
Hm, just do the simplest possible thing for now and we'll figure out a new CLI later. There are several changes landing in the CLI soon, so we should avoid doing anything complicated until that's entirely resolved.
Split this off from #21 as it's a separate issue.