Doob’s Lagrangian: A Sample-Efficient Variational Approach to Transition Path Sampling

A novel variational approach to transition path sampling (TPS) based on the Doob’s h-transform. Our method can be used to sample transition paths between two meta-stable states of molecular systems.

A transition path of alanine dipeptide sampled using our method.

Visualization of the optimization process using our algorithm for 2D potential.

Running the deterministic and stochastic simulations using our algorithm for 2D potential.

FAQ

I am getting NaN values when running experiments on alanine dipeptide!

This is an issue on certain devices, and, so far, we haven't figured out the underlying reason. However, we have found out that:

Changing your floats to 64-bit precision prevents this problem from happening (at least on our machines), albeit at ~2x slower performance. To change to float64, simply search for all instances of jnp.float32 (as can be seen here) and change it to jnp.float64.
First-order systems usually do not exhibit this behavior. So you can also change your ode in the config (e.g., here) to first_order and see if this resolves the issue. In our tests, first-order ODE was sufficient for most setups.

Getting started

The best way to understand our method is to look at the google colab notebook which contains the necessary code for 2D potentials in one place. However, this notebook is very limited in scope and only contains the most basic examples. In the following, we will show the interfaces to run more complex examples. You can also look at the setups in the configs/ folder.

Setup

You can use the environment.yml file to setup this project. However, it only works on CPU.

conda env create -f environment.yml

We also provide a requirements.txt, and a pyproject.toml. So if you are using pixi you can instead run

pixi install --frozen

to install the dependencies and setup a virtual environment. Either activate the environment with pixi shell or use the provided pixi run command to run the scripts.

Running the code

Baselines

You can either use the TPS shooting baselines provided by us, or re-create them by running

python tps_baseline_mueller.py
PYTHONPATH='.' python eval/evaluate_mueller.py

to generate and evaluate transitions for the Müller-Brown toy-potential or use

python tps_baseline.py --mechanism two-way-shooting --num_paths 1000 --states phi-psi
# num_steps compiles multiple MD steps into a single one. This makes sampling faster but increases startup time. Only really worth it for long running simulations
python tps_baseline.py --mechanism two-way-shooting --num_paths 100 --fixed_length 1000 --states phi-psi --num_steps 50
python tps_baseline.py --mechanism two-way-shooting --num_paths 1000 --states rmsd
PYTHONPATH='.' python eval/evaluate_tps.py

for ALDP respectively.

Note: In both cases, you might want to change the paths that you want to generate and evaluate in the baseline or evaluation scripts.

Our Method

To sample trajectories with our method, we provide ready to go config files in configs/. You can run them with

python main.py --config configs/toy/mueller_single_gaussian.yaml
python main.py --config configs/toy/dual_channel_single_gaussian.yaml
python main.py --config configs/toy/dual_channel_two_gaussians.yaml

for the toy examples and

python main.py --config configs/aldp_diagonal_single_gaussian.yaml

for real molecular systems.

Citation

If you find our work useful, please consider citing our paper:

@inproceedings{du2024doob,
  author = {Du, Yuanqi and Plainer, Michael and Brekelmans, Rob and Duan, Chenru and No{\'e}, Frank and Gomes, Carla P. and Aspuru-Guzik, Al{\'a}n and Neklyudov, Kirill},
  title = {Doob’s Lagrangian: A Sample-Efficient Variational Approach to Transition Path Sampling},
  booktitle = {Advances in Neural Information Processing Systems},
  editor = {Globerson, A. and Mackey, L. and Belgrave, D. and Fan, A. and Paquet, U. and Tomczak, J. and Zhang, C.},
  pages = {65791--65822},
  publisher = {Curran Associates, Inc.},
  volume = {37},
  year = {2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 168 Commits
amber14		amber14
configs		configs
eval		eval
files		files
model		model
notebooks		notebooks
tests		tests
tps		tps
training		training
utils		utils
visualizations		visualizations
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
check_energy.py		check_energy.py
environment.yml		environment.yml
linear_interpolation.py		linear_interpolation.py
main.py		main.py
pixi.lock		pixi.lock
potentials.py		potentials.py
prepare_molecule.py		prepare_molecule.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
systems.py		systems.py
tps_baseline.py		tps_baseline.py
tps_baseline_mueller.py		tps_baseline_mueller.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Doob’s Lagrangian: A Sample-Efficient Variational Approach to Transition Path Sampling

FAQ

I am getting NaN values when running experiments on alanine dipeptide!

Getting started

Setup

Running the code

Baselines

Our Method

Citation

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

plainerman/Variational-Doob

Folders and files

Latest commit

History

Repository files navigation

Doob’s Lagrangian: A Sample-Efficient Variational Approach to Transition Path Sampling

FAQ

I am getting NaN values when running experiments on alanine dipeptide!

Getting started

Setup

Running the code

Baselines

Our Method

Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages