[Q] How to calculate host+guest binding free energy and enthalpy changes from CREGEN output? #11

ghost · 2020-09-10T00:56:25Z

Greetings. I want to calculate the free energy & enthalpy changes for the binding of a series of 1+ small molecule guests, to a neutral macrocyclic host, in water. I've performed a set of calculations like the following for the host, guests and guest@host complexes:

 crest guest@host.xyz -chrg 1 -g h2o -nci -mdlen 10 -prop hess -keepdir

Below is the CREGEN output from one of these guest@host runs, from which I calculate the free energy and enthalpy at 298K for the final ensemble as follows:

G(guest@host) = reference state Etot -323.174960188900 Ha - ensemble free energy (-1.280 kcal/mol) / 627.5 kcal/mol/Ha = -323.17292 Ha

and

H(guest@host) = G(guest@host -323.17292 Ha) + 298.15K * ensemble entropy (4.292 cal/mol K) / 1000 cal/kcal / 627.5 kcal/mol/Ha = -323.17088 Ha

and then, from identical calculations for the free host and free guests
deltaG(guest@host)= G(guest@host) - G(host) - G(guest)

Are the above calcs correct? In particular, am I treating the sign of the ensemble free energy correctly?

CREGEN - CONFORMER SYMMETRY ANALYSIS
threads = 12

input file name : crest_property.xyz
output file name : crest_property.xyz.sorted
number of atoms : 174
number of points on xyz files : 47
RMSD threshold : 0.1250
Bconst threshold : 15.0000
population threshold : 0.0500
conformer energy window /kcal : 6.0000
fragment in coord : 2
number of reliable points : 47
reference state Etot : -323.174960188900
running RMSDs...
done.
number of doubles removed by rot/RMSD : 0
total number unique points considered further : 47
Erel/kcal Etot weight/tot conformer set degen origin
1 0.000 -323.17496 0.26606 0.26606 1 1
2 0.144 -323.17473 0.20885 0.20885 2 1
3 0.462 -323.17422 0.12199 0.12199 3 1
... 44 entries omitted for brevity
T /K : 298.15
E lowest : -323.17496
ensemble average energy (kcal) : 0.496
ensemble entropy (J/mol K, cal/mol K) : 17.960 4.292
ensemble free energy (kcal/mol) : -1.280
population of lowest in % : 26.606
number of unique conformers for further calc 44
list of relative energies saved as "crest.energies"
Normal termination.

The text was updated successfully, but these errors were encountered:

pprcht · 2020-09-10T09:33:01Z

Hello,
In the calculation of G at temperature T your reference state is the total energy of your system, plus thermostatistical contributions from the respective frequency calculation and solvation free energy terms, so
G(T) = E_gas+δG_solv(T)+G_RRHO(T)
The ensemble free energy is another additive term G_conf(T)=-TS_conf, so in your calculation it should be the opposite sign G_tot(T)=G(T)+G_conf(T). For the enthalpy I don't think it can be calculated like this, because the way it is written above one would end up with G(T) again.

Obtaining good ensemble entropies (and hence free energies) is very difficult. Short runtimes will result in incomplete ensembles and therefore in faulty ensemble free energies. Reproducibility can also be an issue. A part of our research is currently dedicated to these problems and we are hoping to finalize it soon with some precise recommendations for ensemble entropy calculations. This will be the update to version 2.11 of crest.

ghost · 2020-09-11T13:43:53Z

Thanks for this Philipp. Is it legitimate to calculate the guest@host binding enthalpy from the thermochemical output of separate vibrational frequency/hessian runs on the minimum energy ensemble structures obtained from a the above CREST runs? ie H(298K)_guest@host-H(298)_host-H(298)_guest

Your point about short runtimes is well taken. In my systems of interest, using the default CREST -nci setup, luckily the host (cucurbit[8]uril) is quite rigid and by visual inspection of the MTD trajectories I can see the guests leave the CB8 after 2-5ps (depending on the Vbias parameter combination), so I end up with a significant proportion of cavity-bound structures versus surface-bound structures. In replicate runs I have been pleasantly surprised at how well CREST always finds the same minimum energy structures from very different starting poses and although the ensembles can differ a little, the distribution of the dominant low-energy structures is always nearly the same, producing less than 1kcal/mol differences in G_conf(T).

I have also done a few longer runs and it appears that, for these systems, the ensemble entropies converge after 30ps to give free energies that differ by less than 1kcal/mol from the 10ps runs. As even the shorter guest@host runs typically take 24-36 hrs on my old 12-core MacPro, I'm OK with the tradeoff of a small loss of accuracy.

Looking forward to what version 2.11 has to offer.

pprcht · 2020-09-13T10:38:38Z

Yes, it is legimate to take G(T) (and related thermostatistical values) from seperate calculations and only use G_conf(T) from the crest output. In fact, this might even be the preferred way to do it since thermochemistry calculations purely at GFN level are not as reliable as, e.g. DFT level. Usually G_conf(T) is much smaller than G(T), so you want to have a good description of the latter. For these calculations and good post-processing of the GFN ensemble at DFT level you could take look at the enso repository. This is a project in which we mainly focus on those kind of things.

pprcht closed this as completed Sep 29, 2022

matteo-maria-tommasini mentioned this issue Mar 13, 2023

file qcg_tmp/tmp_MTD/crest_conformers_0.xyz gets corrupt and fills all available disk space #178

Closed

adamhorvath99 mentioned this issue Oct 24, 2024

Unexpected Fortran runtime error associated with legacy algos #364

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Q] How to calculate host+guest binding free energy and enthalpy changes from CREGEN output? #11

[Q] How to calculate host+guest binding free energy and enthalpy changes from CREGEN output? #11

ghost commented Sep 10, 2020 •

edited by ghost

Loading

pprcht commented Sep 10, 2020

ghost commented Sep 11, 2020

pprcht commented Sep 13, 2020

[Q] How to calculate host+guest binding free energy and enthalpy changes from CREGEN output? #11

[Q] How to calculate host+guest binding free energy and enthalpy changes from CREGEN output? #11

Comments

ghost commented Sep 10, 2020 • edited by ghost Loading

pprcht commented Sep 10, 2020

ghost commented Sep 11, 2020

pprcht commented Sep 13, 2020

ghost commented Sep 10, 2020 •

edited by ghost

Loading