OpenFreeEnergy · richardjgowers · Jun 15, 2023 · May 31, 2023 · richardjgowers · Jun 2, 2023
diff --git a/docs/guide.rst b/docs/guide.rst
@@ -6,6 +6,7 @@ Guide to GUFE
    :maxdepth: 2
    :caption: Topics:
 
+   guide/overview
    guide/components_and_systems
    guide/protocols
    guide/serialization

diff --git a/docs/guide/overview.rst b/docs/guide/overview.rst
@@ -0,0 +1,83 @@
+GUFE Overview
+=============
+
+GUFE exists to define interoperable APIs for free energy calculations, which
+can be used by an ecosystem of Python packages to develop tools that focus
+on performing specific aspects of the free energy pipeline, while
+benefitting from other tools for other aspects.
+
+In order to do this, GUFE distinguishes several aspects of the process of
+calculating free energies, and define APIs for each of them. GUFE provides
+the underlying infrastructure; the real science is done in packages that
+implement parts of the GUFE APIs.
+
+Core data models
+----------------
+
+In order to ensure interoperability, GUFE defines objects that represent the
+core chemistry and alchemistry of a free energy pipeline, including
+molecules, chemical systems, and alchemical transformations. This provides a
+shared language that different tools use.
+
+Ligand network setup
+--------------------
+
+GUFE defines a basic API for the common case of performing alchemical
+transformations between small molecules, either for relative binding free
+energies of relative hydration free energies. This handles how mappings
+between different molecules are defined for alchemical transformations, 
+by defining both the :class:`.LigandAtomMapping` object that contains the
+details of a specific mapping, and the :class:`.AtomMapper` abstract API for
+an object that creates the mappings.
+
+Simulation settings
+-------------------
+
+In order to facilitate comparisons of different approaches, GUFE defines a
+hierarchy of simulation settings. This allows certain settings (such as
+temperature and pressure) to be consistent across different simulation
+tools, while allowing additional custom settings specific to a given tool to
+be defined.
+
+Protocols
+---------
+
+The actual simulation of a free energy calculation is defined by a GUFE
+:class:`.Protocol`. The :class:`.Protocol` is described as a set of tasks,
+each a :class:`.ProtocolUnit`, which may depend on other tasks. As such,
+they form a directed acyclic graph.
+
+GUFE does not implement any free energy protocols, but by providing the
+abstract API, allows protocol authors to create new simulation protocols
+without needing to focus on the details of execution or storage.
+
+Executors
+---------
+
+Executors actually run the simulations described by the :class:`.Protocol`.
+GUFE does not define an executor API, although it includes the very simple
+serial executor in :func:`.execute_DAG`.
+
+The responsibilities of an executor include running the tasks (units) for a
+:class:`.Protocol` and managing storage of output. GUFE contains some tools
+to facilitate that, particularly around storage, but it is up to the
+executor to determine how to/whether to use those.
+
+Strategies
+----------
+
+Strategies have yet to be implemented, but the GUFE design leaves a place
+for an object that, at the scale of an alchemical network, can dynamically
+decide where to focus more simulation effort based on the results that have
+been received so far. This will be useful for adaptive approaches to
+sampling a network.
+
+Core GUFE infrastructure
+------------------------
+
+Behind the scenes, GUFE implements a number of details common to all of its
+objects. Nearly all objects in GUFE are subclasses of
+:class:`.GufeTokenizable`, which sets a few requirements and behaviors on
+these objects, including that they are immutable, serializable, and
+hashable. These provide important guarantees to downstream packages that
+facilitate repeatability and reproducibility.