-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: improve steering experiments utils #147
Conversation
WalkthroughThe recent updates aim to enhance steering metric evaluation and analysis in the project. New functionalities and notebooks have been introduced to explore steering efficiency, compare aggregation methods, and evaluate steerability across diverse concepts. Changes include the addition of specific evaluation notebooks, updates to handle steering vectors, and improvements in steering experiment configurations. Changes
Note Pull Request Summarized by CodeRabbit FreeYour organization is on the Free plan. CodeRabbit will generate a high-level summary and a walkthrough for each pull request. For a comprehensive line-by-line review, please upgrade your subscription to CodeRabbit Pro by visiting https://coderabbit.ai TipsChatThere are 3 ways to chat with CodeRabbit:
Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments. CodeRabbit Commands (invoked as PR comments)
Additionally, you can add CodeRabbit Configration File (
|
Summary by CodeRabbit
statsmodels
dependency topyproject.toml
.choose_steerability_metric.ipynb
,evaluate_steering_efficiency.ipynb
,ablate_aggregator.ipynb
, andevaluate_steerability.ipynb
.repepo/steering/plots/utils.py
.repepo/steering/run_experiment.py
with new functions and modified signature.run_sweep.py
by accepting**kwargs
.run_experiment
functionality intest_run_experiment.py
.