
Emergent semantics

This repo contains code for the following papers:

Emergent Representations of Program Semantics in Language Models Trained on Programs (ICML'24, arXiv)
Latent Causal Probing: A Formal Perspective on Probing with Causal Models of Data (COLM'24, arXiv)

General usage

Creating a conda env

conda create --prefix=./env --file environment.yml
conda activate ./env
Then generate the Karel dataset (see data/lib/karel_lib/README.md).
Training an LM

./scripts/train_lm.sh base karel
Training probes for one checkpoint

./scripts/train_probe.sh karel 76000
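To train probes for several checkpoints, the command above can be wrapped in a loop. The sketch below is not part of the repo: `run_probes` and the `DRY_RUN` variable are hypothetical names, and every step other than 76000 is illustrative, since this README does not say which checkpoint steps the LM run saves. By default it only prints the commands it would run.

```shell
#!/bin/sh
# Hypothetical helper (not part of this repo): run train_probe.sh once per
# checkpoint step. With DRY_RUN=1 (the default here) it only prints the
# commands, so the list can be checked before launching anything.
run_probes() {
  dataset="$1"
  shift
  for step in "$@"; do
    if [ "${DRY_RUN:-1}" = "1" ]; then
      echo "./scripts/train_probe.sh $dataset $step"
    else
      ./scripts/train_probe.sh "$dataset" "$step"
    fi
  done
}

# 72000 and 80000 are assumed steps; replace them with the steps your
# LM training run actually saved.
run_probes karel 72000 76000 80000
```

Setting `DRY_RUN=0` before calling `run_probes` executes the commands instead of printing them.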

ICML'24

Training the LM

./scripts/train_lm.sh karel_noloops_nocond "--output_dir filtered --learning_rate 5e-6 --num_warmup_steps 6000 --max_train_steps 80000 --lengths_to_filter 1 2 3 4 5"
Training probes for one checkpoint

./scripts/train_probe.sh karel_noloops_nocond 76000 "--eval_mode intervention --output_dir filtered --max_eval_samples 50000"

COLM'24

We reuse the checkpoints from ICML'24.
Training probes for one checkpoint

./scripts/train_probe.sh karel 76000 "--eval_mode causal --output_dir filtered --eval_dataset karel_15only_uniform_noloops_nocond_nomarks --max_eval_samples 50000"
