Atomate2 OpenMM integration & broader classical MD framework #782

orionarcher · 2024-03-20T22:32:31Z

Developed by @xperrylinn and @orionarcher

Summary

This PR builds out support for OpenMM with a framework that could be extended to support other MD codes, namely LAMMPS, Amber, and Gromacs. Rough visual example here

Core ideas:

Use the openff.interchange.Interchange object as a core engine-agnostic representation of an MD simulation.
Define a generic ClassicalMDTaskDocument and generic Interchange creation functions, then hand off responsibility for evolving the simulation to workflows for each MD engine. Currently only OpenMM support is implemented.
Support low-throughput workflows by making workflows that can easily output to local directories.
Use prev_task to pass metadata between jobs. Since OpenMM is programmatic, the core information lives in Python objects rather than in files in the local directory.
No InputSets. Since OpenMM is programmatic, InputSets and InputGenerators don't make conceptual sense.
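
A minimal sketch of the prev_task idea, assuming a maker attribute left as None should be inherited from the previous task and only then fall back to a default. PrevTask, SketchMaker, and the attribute names are hypothetical stand-ins for illustration, not the classes in this PR:

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class PrevTask:
    """Hypothetical stand-in for the metadata a finished job passes forward."""

    temperature: Optional[float] = None
    step_size: Optional[float] = None


@dataclass
class SketchMaker:
    """Hypothetical maker; None means 'inherit from prev_task or use the default'."""

    temperature: Optional[float] = None
    step_size: Optional[float] = None

    def resolve_attr(
        self, name: str, prev_task: Optional[PrevTask], default: Any
    ) -> Any:
        # 1. A value set explicitly on the maker wins.
        own = getattr(self, name)
        if own is not None:
            return own
        # 2. Otherwise inherit from the previous task, if it recorded a value.
        inherited = getattr(prev_task, name, None) if prev_task is not None else None
        if inherited is not None:
            return inherited
        # 3. Otherwise fall back to the supplied default.
        return default


prev = PrevTask(temperature=350.0)
maker = SketchMaker(step_size=0.002)

resolved_temperature = maker.resolve_attr("temperature", prev, default=298.0)
resolved_step_size = maker.resolve_attr("step_size", prev, default=0.001)
resolved_without_prev = maker.resolve_attr("temperature", None, default=298.0)
```

With this ordering a flow can set a value once on an early job and have later jobs pick it up without repeating it.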

What's implemented:

Integration of Atomate2 and OpenFF for classical MD simulations
Support for energy minimization, NPT, NVT, and temperature change jobs
Implementation of annealing and production workflows
Serialization and deserialization of OpenFF objects (Molecule, Topology, Interchange, Quantity) with monty
Utility functions for generating OpenFF Interchange objects and creating molecule specifications
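
To illustrate the serialization item, here is a rough sketch of attaching as_dict/from_dict to a class that cannot be edited directly. The Quantity class below is a hypothetical stand-in, not openff's Quantity, and the field names are assumptions; the "@module" and "@class" keys follow the convention monty uses to locate the class when decoding:

```python
import json


class Quantity:
    """Hypothetical stand-in for a value-with-units object."""

    def __init__(self, magnitude: float, unit: str) -> None:
        self.magnitude = magnitude
        self.unit = unit


def quantity_as_dict(self) -> dict:
    # monty-style dicts carry "@module" and "@class" so a decoder can find the class.
    return {
        "@module": type(self).__module__,
        "@class": type(self).__name__,
        "magnitude": self.magnitude,
        "unit": self.unit,
    }


def quantity_from_dict(cls, d: dict) -> "Quantity":
    # Rebuild the object from the plain fields; the "@" keys are ignored here.
    return cls(magnitude=d["magnitude"], unit=d["unit"])


# Attach the methods after the fact, as one would for a third-party class.
Quantity.as_dict = quantity_as_dict
Quantity.from_dict = classmethod(quantity_from_dict)

original = Quantity(300.0, "kelvin")
payload = json.dumps(original.as_dict())
restored = Quantity.from_dict(json.loads(payload))
```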

To do / open questions:

Need to add conditional imports that fail nicely. Where are some examples of this?
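
One common pattern for this, sketched with the standard library only: wrap the import and re-raise with a message that says what to install. The helper name require and the extra name "classical_md" are assumptions for illustration, not something this PR defines:

```python
import importlib
from types import ModuleType


def require(module_name: str, extra: str) -> ModuleType:
    """Import an optional dependency or raise an error that says how to get it."""
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        raise ImportError(
            f"{module_name!r} is required for this feature; "
            f"install the optional {extra!r} dependencies to use it."
        ) from exc


# A module that exists is returned unchanged.
json_module = require("json", extra="classical_md")

# A missing module produces the friendlier message instead of a bare failure.
try:
    require("package_that_does_not_exist_sketch", extra="classical_md")
except ImportError as exc:
    message = str(exc)
```

Calling the helper inside the function or maker that needs the dependency, rather than at module import time, keeps the rest of the package importable when the optional packages are absent.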

Future PR

Store trajectories in additional_stores, if available
The core openff.interchange.Interchange object can be larger than the 16 MB MongoDB document limit. Need to implement a workaround by zipping the object and/or putting it on S3.
Migrate schema dependencies to emmet
Migrate utility dependencies to pymatgen
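
A rough sketch of the zipping workaround, assuming the object is first serialized to JSON. The function names and the toy payload are hypothetical; anything that still exceeds the limit after compression would need external storage such as S3:

```python
import json
import zlib

MONGO_DOC_LIMIT_BYTES = 16 * 1024 * 1024  # MongoDB's maximum BSON document size


def pack(obj: dict) -> bytes:
    """Serialize to JSON and compress so large objects take less space."""
    return zlib.compress(json.dumps(obj).encode("utf-8"))


def unpack(blob: bytes) -> dict:
    """Reverse pack(): decompress and parse the JSON."""
    return json.loads(zlib.decompress(blob).decode("utf-8"))


def fits_in_document(blob: bytes, headroom: int = 1024 * 1024) -> bool:
    """Leave some headroom for the other fields stored in the same document."""
    return len(blob) + headroom <= MONGO_DOC_LIMIT_BYTES


# Toy payload standing in for a serialized Interchange.
fake_interchange = {"positions": [[0.0, 0.0, 0.0]] * 50_000}
raw_size = len(json.dumps(fake_interchange).encode("utf-8"))
blob = pack(fake_interchange)
```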

Related PRs:

PR to pymatgen would move most of atomate2.classical_md.utils upstream. PR #3729
PR to emmet would move atomate2.classical_md.schemas upstream. PR #975

Example usage

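from jobflow import Flow, run_locally

# generate_interchange and the Maker classes below come from the modules added in this PR.
mol_specs_dicts = [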
    {"smile": "CCO", "count": 10, "name": "ethanol"},
    {"smile": "O", "count": 200, "name": "water"},
]
inter_job = generate_interchange(mol_specs_dicts, mass_density=1)

production_maker = ProductionMaker(
    energy_maker=EnergyMinimizationMaker(),
    npt_maker=NPTMaker(steps=300000, pressure=1),
    anneal_maker=AnnealMaker.from_temps_and_steps(
        steps=1000000, anneal_temp=400, final_temp=300
    ),
    nvt_maker=NVTMaker(steps=5000000),
)

production_flow = production_maker.make(
    inter_job.output.interchange,
    output_dir="my/directory/",
    prev_task=inter_job.output,
)

wf = Flow([inter_job, production_flow])

run_locally(wf)

# will put outputs for the whole flow in "my/directory/"
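
For intuition about the anneal stage in the example, here is a small sketch of one plausible schedule: the total step count is split into three stages (heat, hold, cool) and the temperature changes linearly during the ramps. The even three-way split, the 300 K starting temperature, and both helper names are assumptions for illustration, not a description of what AnnealMaker.from_temps_and_steps does internally:

```python
from typing import List, Tuple


def split_steps(total: int) -> Tuple[int, int, int]:
    """Split a step budget into three stages; the remainder goes to the last."""
    third = total // 3
    return third, third, total - 2 * third


def linear_ramp(start: float, end: float, n_points: int) -> List[float]:
    """Evenly spaced temperatures from start to end, both endpoints included."""
    if n_points < 2:
        raise ValueError("n_points must be at least 2")
    step = (end - start) / (n_points - 1)
    return [start + i * step for i in range(n_points)]


raise_steps, hold_steps, lower_steps = split_steps(1_000_000)
heating = linear_ramp(300.0, 400.0, 5)  # assumed start at 300 K, anneal at 400 K
cooling = linear_ramp(400.0, 300.0, 5)  # back down to final_temp=300
```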

Additional dependencies introduced

openmm
openff-toolkit
openff-interchange
packmol
openbabel

These are all necessary for the classical MD setup and execution workflow.

Checklist

Before a pull request can be merged, the following items must be checked:

Code is in the standard Python style. The easiest way to handle this is to run the following in the correct sequence on your local machine. Start with running ruff and ruff format on your new code. This will automatically reformat your code to PEP8 conventions and fix many linting issues.
Doc strings have been added in the Numpy docstring format. Run ruff on your code.
Type annotations are highly encouraged. Run mypy to type check your code.
Tests have been added for any new functionality or bug fixes.
All linting and tests pass.

Note that the CI system will run all the above checks. But it will be much more
efficient if you already fix most errors prior to submitting the PR. It is highly
recommended that you use the pre-commit hook provided in the repository. Simply run
pre-commit install and a check will be run prior to allowing commits.

orionarcher · 2024-09-11T13:14:41Z

~~@utf I believe I've addressed all comments and this is ready to merge.~~

EDIT: hold off actually, I forgot that I disabled the openmm tests because a new emmet release is needed. Pinging the MP team.

orionarcher · 2024-09-20T15:36:04Z

I've pinned the emmet pre-release to get tests passing. @utf are you good to merge this once the new emmet version is released and tested against? If so, maybe @janosh can merge once that happens so you don't need to circle back.

janosh

huge amount of work here! 👍

.github/workflows/testing.yml

src/atomate2/openff/core.py

src/atomate2/openff/utils.py

src/atomate2/openmm/jobs/core.py

src/atomate2/openmm/jobs/generate.py

tests/openff_md/conftest.py

orionarcher · 2024-09-22T17:39:27Z

This is now pinned to the most recent version of emmet and ready to merge @utf.

Your previous suggestion to split out OpenMM and OpenFF was very good. They are now independent and the OpenMM workflows support MLFFs and can return structures to better interoperate with the rest of Atomate2.

janosh · 2024-09-23T18:08:45Z

.github/workflows/testing.yml

-      - uses: actions/setup-python@v5
-        with:
-          python-version: ${{ matrix.python-version }}
+      - name: Set up micromamba


@utf i think it could help to have the micromamba-dependent MD CI be its own job or even its own test-md.yml workflow. that would enable only running the MD tests when any MD source files change and would also uncouple the remaining CI from any install/env issues micromamba might encounter

I'd be happy with that solution!

Using micromamba saves us from manually building enumlib so I favor keeping it in the main CI.

utf · 2024-09-23T18:19:50Z

This really is fantastic. Thank you very much @orionarcher. @janosh, once you're happy I think we can merge.

rkingsbury · 2024-09-24T19:48:46Z

Amazing work, thanks for sticking with this @orionarcher !

…lsproject#782)

* Move remaining content from common.py to renamed base.py.
* Update core.py with skeleton code for openff_job and generate_interchange methods.
* Complete Calculation, CalculationInput, and CalculationOutput schemas for classical_md.openmm.tasks.py
* Implement BaseOpenMMMaker for classical_md.openmm workflow
* Update core jobs for classical_md.openmm to be compatible with new BaseOpenMMMaker
* Add MoleculeSpec to classical_md.schemas and modify OpenMMTaskDoc
* Implement attribute inheritance logic for base openmm maker and openmm jobs
* Add from/as_dict functions for openff topology, interchange, molecule, and quantity
* Fix serialization issues with conflicting pydantic versions between Interchange and atomate2
* Implement anneal and production workflows
* Update resolve_attr logic to set missing attributes
* Change ClassicalMDTaskDocument to OpenMMTaskDocument in base.py
* Store interchange intermediate as a JSON string to fix parsing issues
* Implement temperature change logic in TempChangeMaker
* Improve state reporter to append to state file
* Output taskdoc_json file to directory for easy building
* Enhance documentation for all components
* Implement micromamba for testing environment
* Change all docstrings to numpy format
* Add CodeCov for classical_md tests
* Rename "steps" argument to "n_steps" and "output_steps" to "steps" in CalculationOutput
* Add support for writing trajectory to HDF5 file
* Implement MDAReporter for trajectory output
* Add embed_traj argument to base_openmm_maker
* Add traj_blob keyword and switch interchange to type HexBytes
* Move classical_md schemas to emmet
* Implement OPLS force field support through ligpargen
* Create FauxInterchange object for OPLS compatibility
* Refactor OpenMMFlowMaker and BaseOpenMMMaker
* Add XMLMoleculeFF class for manipulating XML files representing OpenMM-compatible ForceFields
* Split utilities and jobs in jobs/opls.py into separate files
* Refactor utilities to isolate OpenFF dependency
* Add support for MACE-based interchanges
* Update OpenMM tutorial
* Support BaseOpenMMMaker returning structures
* Implement reading and writing of structure to/from OpenMMTaskDocument

---------

Co-authored-by: Alex Ganose <utf@users.noreply.github.com>
Co-authored-by: Orion Cohen <orioncohen@Orions-MBP.localdomain>
Co-authored-by: Janosh Riebesell <janosh.riebesell@gmail.com>

orionarcher added 30 commits March 6, 2024 21:34

Move remaining content from common.py to renamed base.py.

413c626

Update core.py to contain built out skeleton code for openff_job and generate_interchange methods.

28c3126

Rename utils and remove tasks.py

08329a7

Complete Calculation, CalculationInput, and CalculationOutput schemas for classical_md.openmm.tasks.py

cf79341

Completed initial draft of BaseOpenMMMaker for classical_md.openmm workflow

f21a031

Finished updating core jobs for classical_md.openmm to be compatible with new BaseOpenMMMaker

e6b648d

Minor reformat

693ea77

update base_openmm_maker.py slightly, will soon be removed so irrelevant

33b8cda

Add TODO item to utils.py

f67c444

Add MoleculeSpec to classical_md.schemas and slightly modify OpenMMTaskDoc

12779f4

Add new empty test files

a308801

Change from_prev_task -> resolve_attr

d255606

Add as_dict monkeypatch for openff.Molecule to init

57a06fb

Create attribute inheritance logic for base openmm maker and openmm jobs

a981d5e

Remove InputMoleculeSpec and transfer functionality to create_mol_spec function.

09156ec

Remove InputMoleculeSpec, Geometry, and process mol_specs commented out code.

1ddf7c0

Add testing files, not all used.

b3e83a5

Add content to partial charge test files.

6aa4c4b

Change test dir name from openmm -> openmm_md to solve issue with local pycharm debugger.

38bf78a

Create testing for utils.py functionality and fix small bug in utils.py.

c33a97e

Cleanup merge_specs test.

1e5ad58

Add interchange and temporary directory to conftest.py.

b31f940

Add from/as_dict functions for openff topology, interchange, molecule, and quantity.

6549b02

Some bug fixes to BaseOpenMMMaker and some changes to allow serialization despite conflicting pydantic versioning between Interchange (using v1) and atomate2 (using v2)

17eee5a

Change TaskDoc schema such that interchange is dict, in line with pydantic fix.

d1c8c9b

Make all CalculationInput and CalculationOutput arguments optional. Fix CalculationOutput.from_directory

b7d43d6

Add tests for BaseOpenMMMaker.

331278d

Add tests for core openmm makers and fix several discovered bugs in core.py

bd1bec1

Formatting change.

b851f2e

Add tests for as/from dict monkey patching.

af8f26b

orionarcher added 5 commits September 10, 2024 14:43

Fix type hinting on interchange.

fe9fec6

Merge branch 'main' into openff

7f32d24

Lint OpenMM and OpenFF

77ba3fa

Incorporate suggestions from utf into pyproject.toml and testing.yml.

745f1ff

Disable failing cclib test.

e2cbe68

orionarcher force-pushed the openff branch from 0720816 to e2cbe68 Compare September 10, 2024 19:57

orionarcher added 5 commits September 11, 2024 15:26

Refactor utilities so that openff is not a dependency for the openmm jobs.

2fff845

Refactor openmm utilities and generate to isolate openff dependency and enhance readability.

4c2e892

Add attempted import for openmmml to enable mace based interchanges and make importing openff to openmm base optional

44d4fd6

Update OpenMM tutorial

2355c5d

Update pyproject.toml and testing.yml

54c88c2

orionarcher force-pushed the openff branch from fe89d2d to 54c88c2 Compare September 20, 2024 13:05

Skip tests that require MDAnalysis 2.8.0

7e9e561

Remove [classical_md] from testing.yml

902ff5a

janosh requested changes Sep 20, 2024

orionarcher added 4 commits September 20, 2024 14:05

Respond to minor comments from Janosh

acd5bf1

Replace temp_dir fixture with tmp_path

8b45984

Rename interchange_meta -> mol_specs

58ad842

Support BaseOpenMMMaker returning structures

a6f965b

orionarcher and others added 2 commits September 23, 2024 11:45

Fix reading and writing of structure to/from OpenMMTaskDocument

1d86910

Merge branch 'main' into openff

d96b3a0

janosh approved these changes Sep 23, 2024

janosh merged commit 3d6a3a3 into materialsproject:main Sep 23, 2024
6 checks passed

janosh mentioned this pull request Nov 14, 2024

Migrate CI from pip to uv #831

Closed
