Since the PRs linked below in llama.cpp, it is possible to hot-swap LoRA adapters. This makes it possible to personalise NPCs and other GenAI game features by fine-tuning adapters that can then be quickly swapped on the same base model in memory.
Since llama.cpp is being updated in #209, it would be nice to check how easily this feature can be integrated.
Todo list
bin files for adapters are now deprecated in favour of the new gguf files; the documentation should be amended wherever they appear.
Add a link in the documentation on how to convert adapters to gguf.
Add an example on performing a hot-swap.
Run a test using the new gguf format (I suspect the new adapters will automatically be usable in hot-swapping mode).
Test using multiple adapters and hot-swapping them; add code if needed, and add an example in the examples folder.
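On the conversion item: llama.cpp ships a conversion script for PEFT adapters. A minimal sketch of the invocation is below; the paths and the exact flag names are assumptions, so check the script's --help in your llama.cpp checkout.

```shell
# Convert a PEFT LoRA adapter to the new gguf format using llama.cpp's
# conversion script. All paths below are placeholders.
python convert_lora_to_gguf.py path/to/lora_adapter \
    --base path/to/base_model \
    --outfile npc_adapter.gguf
```

The resulting npc_adapter.gguf is what would then be loaded next to the base model for hot-swapping.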
Wow thanks for the detailed request ⭐!
I implemented multiple LoRAs and hot-swapping in #210. Do you want to have a look at it and let me know what you think?
I haven't added a sample or description in the Readme yet.
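For reference, the runtime side of the llama.cpp hot-swap PRs looks roughly like the sketch below. This is an assumption-laden outline, not the implementation from #210: the llama_lora_adapter_* names follow the API introduced by those PRs, the model and adapter paths are placeholders, and error handling is omitted.

```cpp
#include "llama.h"

// Sketch: hot-swapping LoRA adapters on one base model kept in memory.
// Assumes the llama_lora_adapter_* API from the llama.cpp hot-swap PRs.
int main() {
    llama_model_params mparams = llama_model_default_params();
    llama_model * model = llama_load_model_from_file("base_model.gguf", mparams);

    llama_context_params cparams = llama_context_default_params();
    llama_context * ctx = llama_new_context_with_model(model, cparams);

    // Load two fine-tuned adapters once; they are attached to the model.
    llama_lora_adapter * npc_a = llama_lora_adapter_init(model, "npc_a.gguf");
    llama_lora_adapter * npc_b = llama_lora_adapter_init(model, "npc_b.gguf");

    // Activate adapter A for this context (scale 1.0f)...
    llama_lora_adapter_set(ctx, npc_a, 1.0f);
    // ...generate for NPC A here...

    // Hot-swap: deactivate A, activate B, with no model reload.
    llama_lora_adapter_remove(ctx, npc_a);
    llama_lora_adapter_set(ctx, npc_b, 1.0f);
    // ...generate for NPC B here...

    llama_lora_adapter_clear(ctx);  // detach all adapters from this context
    llama_free(ctx);
    llama_free_model(model);
    return 0;
}
```

The key point for the game use case is that init is paid once per adapter, while set/remove on a context are cheap, so switching NPC personalities does not require reloading the base model.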
Related links
Hot lora PRs in llama.cpp:
Discord threads: