generated from ApolloResearch/sample
Enjoyed this paper, and I was able to train successfully using the gpt2_e2e_recon.yaml config! I'm hoping to do follow-up experiments, and am wondering:
What's the simplest way to tell whether a training run was successful? There are so many graphs in my wandb workspace; are there one or two that you generally focus on? I'd like to try changing some of the parameters (SAE width, etc.) and track the effects.
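For concreteness, once I know which metrics matter, I was planning to compare runs with a small helper like the one below. It takes a metric history as a list of row dicts (the shape you get from iterating wandb's `run.history()`) and pulls the last logged value of each metric. The metric names in the example (`"loss/total"`, `"loss/recon"`) are just my guesses, not necessarily what this repo logs:

```python
def final_metrics(history, keys):
    """Return {key: last logged value} for each requested key found in history.

    history: list of dicts, one per logged step (rows from run.history()).
    keys: metric names to extract. Keys never logged are simply absent
    from the result.
    """
    out = {}
    for row in history:
        for key in keys:
            if key in row and row[key] is not None:
                out[key] = row[key]  # keep overwriting; the last row wins
    return out


# Toy example with made-up metric names:
history = [
    {"step": 0, "loss/total": 3.2, "loss/recon": 2.9},
    {"step": 100, "loss/total": 1.1, "loss/recon": 0.9},
]
print(final_metrics(history, ["loss/total", "loss/recon"]))
```

Does comparing final values like this make sense, or do you look at the full curves?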
I'm interested in training e2e SAEs on specific layers of other language models, for example layer 8 of bert-base-uncased. But the config files are a bit intimidating for me. Can you offer some guidance on how I could adapt a config file to a new language model like this?
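To show what I mean, here is my naive guess at the kind of edit involved. The key names are loosely copied from my reading of gpt2_e2e_recon.yaml and may well not match the repo's actual schema, and I'm not sure the chosen model/hook names are even supported:

```yaml
# Hypothetical sketch, NOT tested -- key names and values are my guesses,
# not the repo's actual config schema.
tlens_model_name: bert-base-uncased   # was: gpt2; unsure if supported
saes:
  sae_positions:
    - blocks.8.hook_resid_pre         # target layer 8
```

Is swapping the model name and hook position roughly the right mental model, or are there other fields (tokenizer, dataset, dimensions) that must change in lockstep?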