Support importing GGUF files #1187
If GGUF contains the model graph information, then we can use burn-import's ONNX facility. In burn-import, we convert the ONNX graph to an IR (intermediate representation) (see this doc), so it would be possible to convert the model graph to IR and generate source code + weights. If GGUF contains only weights, we can go the burn-import PyTorch route, where we only load the weights.
From my brief research, the GGUF format contains metadata + tensor weights. This aligns with the burn-import PyTorch route rather than the burn-import/ONNX route. It means the model needs to be constructed in Burn first, with the weights then loaded into it. Here is one Rust lib to parse GGUF files: https://github.com/Jimexist/gguf
GGUF spec: ggerganov/ggml#302
Parser in Rust: https://github.com/Jimexist/gguf
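To make the "metadata + tensor weights" point concrete, here is a minimal sketch of parsing the fixed-size GGUF file header, based on my reading of the spec linked above (little-endian magic `GGUF`, `u32` version, `u64` tensor count, `u64` metadata KV count). This is illustrative only, not a full parser like the linked crate, and the field names are my own:

```rust
// Hypothetical minimal GGUF header parser, assuming the layout from the
// GGUF spec: 4-byte magic "GGUF", u32 version, u64 tensor_count,
// u64 metadata_kv_count, all little-endian.
#[derive(Debug, PartialEq)]
struct GgufHeader {
    version: u32,
    tensor_count: u64,
    metadata_kv_count: u64,
}

fn parse_header(bytes: &[u8]) -> Result<GgufHeader, String> {
    if bytes.len() < 24 {
        return Err("file too short for GGUF header".to_string());
    }
    if &bytes[0..4] != b"GGUF" {
        return Err("missing GGUF magic".to_string());
    }
    // Little-endian decoding helpers for fixed-width fields.
    let u32le = |b: &[u8]| u32::from_le_bytes(b.try_into().unwrap());
    let u64le = |b: &[u8]| u64::from_le_bytes(b.try_into().unwrap());
    Ok(GgufHeader {
        version: u32le(&bytes[4..8]),
        tensor_count: u64le(&bytes[8..16]),
        metadata_kv_count: u64le(&bytes[16..24]),
    })
}

fn main() {
    // Hand-built header bytes: version 3, 2 tensors, 5 metadata entries.
    let mut buf = Vec::new();
    buf.extend_from_slice(b"GGUF");
    buf.extend_from_slice(&3u32.to_le_bytes());
    buf.extend_from_slice(&2u64.to_le_bytes());
    buf.extend_from_slice(&5u64.to_le_bytes());
    println!("{:?}", parse_header(&buf).unwrap());
}
```

After the header come the metadata key-value pairs and tensor info blocks, which is where a real importer would map names/shapes onto a Burn model's record fields.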
I apologize if this seems too far-fetched, but it seemed in line with how the ONNX generation works.