Adding topic steering on layers #5119

benxh1995 · 2024-01-24T20:46:09Z

Prerequisites

Please answer the following questions for yourself before submitting an issue.

[ x ] I am running the latest code. Development is very rapid so there are no tagged versions as of now.
[ x ] I carefully followed the README.md.
[ x ] I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
[ x ] I reviewed the Discussions, and have a new bug or useful enhancement to share.

Feature Description

Add the ability to set/apply steers on a layer by layer basis in order to ensure alignment with specific concepts, tokens or words.

Motivation

Based on recent papers and research on LLM steerability through applying steers on specific layers, I believe that llama.cpp would greatly benefit from incorporating a feature like this.

The colab in the section below should provide a very interesting application which results in far higher reasoning abilities, for example from a 3B.

Possible Implementation

Links to known implementations:
https://github.com/Mihaiii/llm_steer
https://colab.research.google.com/github/Mihaiii/llm_steer/blob/main/demo/llm_steer_demo.ipynb

And

https://github.com/nrimsky/LM-exp/tree/main/steering

The text was updated successfully, but these errors were encountered:

Engininja2 · 2024-01-24T21:48:35Z

Fyi there was draft PR experimenting with steering vectors #1472.

peerschuett · 2024-02-06T15:53:47Z

I took a look at #1472 , but the structure of llama.cpp changed in the meantime too much. In #1472 the steering was done in this function https://github.com/ggerganov/llama.cpp/blob/95dc4d7270e04bef3792a61be9bb045cb74a9fc4/llama.cpp#L1132 , but the whole function doesn't exist in this form any more.

github-actions · 2024-03-18T01:33:08Z

This issue is stale because it has been open for 30 days with no activity.

github-actions · 2024-04-02T01:08:25Z

This issue was closed because it has been inactive for 14 days since being marked as stale.

benxh1995 added the enhancement New feature or request label Jan 24, 2024

peerschuett mentioned this issue Feb 6, 2024

Request: Allow for adjustments at the layer-level, for a practically two-fold increase in LLM handling ability by prompters #4843

Closed

github-actions bot added the stale label Mar 18, 2024

github-actions bot closed this as completed Apr 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding topic steering on layers #5119

Adding topic steering on layers #5119

benxh1995 commented Jan 24, 2024

Engininja2 commented Jan 24, 2024

peerschuett commented Feb 6, 2024 •

edited

Loading

github-actions bot commented Mar 18, 2024

github-actions bot commented Apr 2, 2024

Adding topic steering on layers #5119

Adding topic steering on layers #5119

Comments

benxh1995 commented Jan 24, 2024

Prerequisites

Feature Description

Motivation

Possible Implementation

Engininja2 commented Jan 24, 2024

peerschuett commented Feb 6, 2024 • edited Loading

github-actions bot commented Mar 18, 2024

github-actions bot commented Apr 2, 2024

peerschuett commented Feb 6, 2024 •

edited

Loading