koboldcpp-1.0.8beta #13
LostRuins
announced in
Announcements
koboldcpp-1.0.8beta

Changed the compiler optimization flag from -O3 to -Ofast. This should increase generation speed even more, but I'm not sure whether anything will break, so please let me know if it does.

To use, download and run koboldcpp.exe. Alternatively, drag and drop a compatible llama.cpp quantized model onto the .exe, or run it and select the model manually in the popup dialog. Once the model is loaded, you can connect like this (or use the full KoboldAI client):

http://localhost:5001
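Besides the browser UI, the local server can also be queried over HTTP. The sketch below is illustrative only: it assumes a running koboldcpp instance on the default port 5001 exposing the KoboldAI-compatible /api/v1/generate endpoint, and the prompt text and max_length value are placeholder choices, not anything prescribed by this release.

```shell
# Hedged sketch: send a generation request to a locally running koboldcpp
# server (assumes default port 5001 and the KoboldAI-compatible API).
URL="http://localhost:5001/api/v1/generate"

# Illustrative payload; field values here are arbitrary examples.
PAYLOAD='{"prompt": "Once upon a time", "max_length": 32}'

# POST the request; prints the JSON response if the server is up,
# otherwise reports that no server was reachable.
curl -s "$URL" \
  -H 'Content-Type: application/json' \
  -d "$PAYLOAD" || echo "No koboldcpp server reachable at $URL"
```

If the server is running with a model loaded, the response is a JSON object containing the generated text.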
This discussion was created from the release koboldcpp-1.0.8beta.