v0.3.0

github-actions released this 07 Mar 12:12
· 98 commits to main since this release

v0.3.0 (2024-03-07)

Feature

  • feat: steering experiments (#132)

  • Add experimental code

  • fix: make_country_capital script

  • feat: add code to run steering experiment

  • update experiments code

  • fix: add --config_path arg

  • fix: config yaml parsing

  • chore: add more configs

  • chore: add even more configs

  • refactor: plotting

  • feat: add script to run sweep

  • fix: do not set completion template by default

  • refactor sweeps

  • refactor: token concept sweep

  • fix: bugbears

  • chore: add comments

  • fix: steering_index of datasets

  • test: steering token index

  • updating steering_vectors library version

  • evaluate on more layers

  • refactor: use steering-vectors code, log instead of print

  • chore: fix docstring

  • test: training, evaluating steering vectors

  • fix: minor


Co-authored-by: Daniel CH Tan <dtch1997@users.noreply.github.com>
Co-authored-by: David Chanin <chanindav@gmail.com> (8d1bd7d)