Skip to content

Official implementation for NeurIPS 2024 paper "On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability".

License

Notifications You must be signed in to change notification settings

ML-GSAI/MesaOpt-AR-Transformer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability

This is the official implementation for NeurIPS 2024 paper On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability.

Dependencies

conda env create -f environment.yaml

Hyperparameters Configuration

Detailed hyperparameters config can be found in Appendix B.

Simulation Experiments

bash main_train_ar.sh #with hyperparameters in Appendix B

Visualization

python plot.py #specify the output

About

Official implementation for NeurIPS 2024 paper "On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability".

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published