Basic unit conversion for experiments #11

FNTwin · 2023-10-16T18:48:01Z

Unit conversion for handling the experiments. I'm opening the PR while I clean up the API and add some tests.
It will take care of #9.

TODO

Tests
Some cleaning of the API

Implement a registry of the most common unit conversions (units.py) and modify the constructor for the datasets to keep track of the current units and provide a way to change them.
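As a rough illustration of the idea described above, here is a minimal sketch of a conversion registry plus a dataset that remembers its current unit. Every name in it (`ENERGY_TO_HARTREE`, `conversion_factor`, `ToyDataset`, `set_energy_unit`) is hypothetical and is not taken from `units.py` or the actual dataset constructors; only the physical constants are standard values.

```python
# Hypothetical sketch: these names are illustrative, not the openqdc API.

# Factors that convert one unit of each entry INTO the reference unit (hartree).
ENERGY_TO_HARTREE = {
    "hartree": 1.0,
    "ev": 1.0 / 27.211386245988,
    "kcal/mol": 1.0 / 627.5094740631,
    "kj/mol": 1.0 / 2625.4996394799,
}


def conversion_factor(src: str, dst: str, table: dict) -> float:
    """Return the multiplier that converts a value from `src` to `dst`."""
    try:
        # Go through the reference unit: src -> reference -> dst.
        return table[src.lower()] / table[dst.lower()]
    except KeyError as err:
        raise ValueError(f"Unknown unit: {err.args[0]!r}") from None


class ToyDataset:
    """Toy stand-in for a dataset that tracks its current energy unit."""

    def __init__(self, energies, energy_unit="hartree"):
        self.energies = list(energies)
        self.energy_unit = energy_unit

    def set_energy_unit(self, new_unit: str) -> None:
        """Convert stored energies in place and record the new unit."""
        factor = conversion_factor(self.energy_unit, new_unit, ENERGY_TO_HARTREE)
        self.energies = [e * factor for e in self.energies]
        self.energy_unit = new_unit.lower()


ds = ToyDataset([1.0, -0.5], energy_unit="hartree")
ds.set_energy_unit("ev")
print(ds.energy_unit, ds.energies)
```

Routing every conversion through a single reference unit keeps the table linear in the number of units, instead of needing one entry per pair.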

prtos · 2023-10-18T00:01:10Z

src/openqdc/datasets/gdml.py

-        "pbe-ts",
+        "ccsd/cc-pvdz",
+        "ccsd(t)/cc-pvdz",
+        # "pbe+mbd/light", #MD22


We are not using those methods. Should we?

It depends on whether by GDML we mean MD17, MD22, or both. Both are computed from AIMD with FHI-aims using all-electron DFT. MD17 uses the pbe functional with vdw-ts, while MD22 uses pbe+mbd with different settings per system.
I'm going to calculate the isolated atom energies for ccsd/cc-pvdz and ccsd(t)/cc-pvdz, and I'm currently creating the pseudopotential for the pbe functional, for the pbe+vdw-ts level of theory in DFT with PAW. We would have an issue with the many-body dispersion, as I can't use the correct light/tight convergence settings, so I think just using pbe+vdw-ts should be good enough.

prtos · 2023-10-18T00:04:04Z

src/openqdc/datasets/geom.py

@@ -59,6 +59,10 @@ class GEOM(BaseDataset):
    __name__ = "geom"
    __energy_methods__ = ["gfn2_xtb"]


@FNTwin Do we still need a basis set for semi-empirical methods?

We don't need to specify a basis set for semi-empirical methods, as they are fully defined by their name. You don't get to choose the basis set because it is built into the technique itself.
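To make the naming convention concrete: the method strings in this PR appear to follow a `method/basis` pattern for ab initio and DFT levels of theory, and a bare name for semi-empirical ones. The helper below is a hypothetical sketch of how such strings could be split. `split_method` is an assumed name, not a function from the codebase.

```python
# Hypothetical helper, not part of the codebase: splits a method string of the
# form "method/basis" and returns None for the basis when there is no "/".
from typing import Optional, Tuple


def split_method(method: str) -> Tuple[str, Optional[str]]:
    """Return (method_name, basis_set); basis_set is None if not specified."""
    name, sep, basis = method.partition("/")
    return name, (basis if sep else None)


# Strings taken from the diffs in this thread.
for m in ["gfn2_xtb", "ccsd/cc-pvdz", "ccsd(t)/cc-pvdz"]:
    print(m, "->", split_method(m))
```

A semi-empirical entry such as `gfn2_xtb` simply comes back with no basis set, which matches the point that the method name alone defines it.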

prtos · 2023-10-18T00:07:37Z

src/openqdc/datasets/molecule3d.py

@@ -46,15 +46,19 @@ def _read_sdf(sdf_path, properties_path):

 class Molecule3D(BaseDataset):
    __name__ = "molecule3d"
-    __energy_methods__ = ["b3lyp_6-31g*"]
+    __energy_methods__ = ["b3lyp/6-31g*"]
+    # UNITS MOST LIKELY WRONG, MUST CHECK THEM MANUALLY


@FNTwin is this already checked?

No, I need to download the dataset, run some calculations, and manually check which units they use, as they don't provide any info. This PR is only meant to provide a quick way to interface with openMLIP and convert the datasets between units for experimenting. A heavier PR, with every unit checked and validated, will come with the calculated isolated atom energies.

prtos · 2023-10-18T00:14:59Z

src/openqdc/datasets/qm7x.py


    energy_target_names = ["ePBE0", "eMBD"]

-    __force_methods__ = ["pbe-ts", "vdw"]
+    __force_methods__ = ["pbe+vdw-ts", "mbd"]


@FNTwin Are you sure the second force is mbd and not just vdw forces?

My mistake, corrected. It is a semi-empirical tight binding with a many-body dispersion correction, and pbe0 for the DFT calculation.

prtos · 2023-10-18T00:19:50Z

src/openqdc/datasets/waterclusters3_30.py

@@ -51,13 +53,13 @@ class WaterClusters(BaseDataset):

    # Energy in hartree, all zeros by default
    atomic_energies = np.zeros((MAX_ATOMIC_NUMBER,), dtype=np.float32)
-
+    # need to know where to find the data


https://sites.uw.edu/wdbase/database-of-water-clusters/

OK, so you downloaded the entire dataset generated from the flexible potential. With the isolated atom energy PR I'll manually check the data, as with the other uncertain datasets.

FNTwin added 5 commits October 13, 2023 10:48

Units WIP

f17aa1f

pcqm info

8d50d36

ISO Conv + Black .

3b89a14

Spice units, QoL

0835699

black .

b092f74

FNTwin requested a review from prtos as a code owner October 16, 2023 18:48

prtos reviewed Oct 18, 2023


Update energy target names nabladft.py

56f31cd

prtos reviewed Oct 18, 2023


FNTwin and others added 2 commits October 17, 2023 20:47

Correction qm7x

b45ca81

Merge branch 'main' into units

5eaafd0

prtos merged commit a86f884 into main Oct 21, 2023

prtos deleted the units branch October 21, 2023 17:24
