OpenFF Lipid Optimization Benchmark Supplement v1.0 #399

ntBre · 2024-10-30T21:33:14Z

This is the benchmarking counterpart to #394, constructed from full molecules in the LIPID MAPS database. I think this is generally ready to go (besides updating the main README), but I'm leaving it as a draft because @j-wags and I discussed possibly partitioning datasets by molecule size, to make it easier to request compute resources of a matching size. We weren't sure exactly where to draw that cutoff, but this set ranges from 6 to 106 atoms, so I thought it was worth discussing. (Edit: the splitting question is resolved by the new tagging mechanism in #412, so this should be good to go.)
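For illustration only, the size-based partitioning idea discussed above could be sketched as below. This is not the tagging mechanism from #412, and the bucket labels and cutoffs (`SIZE_BUCKETS`) are hypothetical placeholders, since no cutoff was agreed on in this thread; only the 6 to 106 atom range comes from the description.

```python
from collections import defaultdict

# Hypothetical size buckets: (label, inclusive upper bound on atom count).
# The thread never settled on a cutoff, so these numbers are placeholders.
SIZE_BUCKETS = [("small", 30), ("medium", 60), ("large", 106)]


def bucket_for(n_atoms: int) -> str:
    """Return the label of the first bucket whose upper bound fits n_atoms."""
    for label, upper in SIZE_BUCKETS:
        if n_atoms <= upper:
            return label
    raise ValueError(f"{n_atoms} atoms exceeds the largest bucket")


def partition_by_size(atom_counts: dict[str, int]) -> dict[str, list[str]]:
    """Group record names by size bucket, given a name -> atom count mapping."""
    groups: dict[str, list[str]] = defaultdict(list)
    for name, n_atoms in atom_counts.items():
        groups[bucket_for(n_atoms)].append(name)
    return dict(groups)
```

Calling `partition_by_size` with a mapping of record names to atom counts returns one list of names per bucket, which could then be submitted or tagged separately.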

New Submission Checklist

Created a new folder in the submissions directory containing the dataset
Added README.md describing the dataset (see here for examples)
All files used to produce the dataset are included with a description
Dataset follows the QCSubmit schema defined for Datasets, OptimizationDatasets and TorsionDriveDatasets
Dataset filename matches pattern dataset*.json; may feature a compression extension, such as .bz2
A PDF depicting the molecules is attached; for torsiondrives it should highlight the central bond, which qcsubmit can do automatically
QCSubmit validation passed
Made a new dataset entry in the mapping table in repository README.md
Ready to submit!
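As a rough sketch of the filename rule in the checklist above, the check below accepts names of the form dataset*.json with an optional compression extension. The regular expression and the set of accepted extensions (`bz2`, `gz`, `xz`) are assumptions made for illustration; the repository's own validation may apply a different rule.

```python
import re

# Assumed reading of the checklist item: "dataset", any characters, ".json",
# then an optional compression extension. The allowed extensions are a guess.
DATASET_NAME = re.compile(r"^dataset.*\.json(\.(bz2|gz|xz))?$")


def is_valid_dataset_filename(filename: str) -> bool:
    """Return True if the bare filename matches the assumed dataset pattern."""
    return DATASET_NAME.match(filename) is not None
```

Under these assumptions, the `dataset.json.bz2` file submitted in this PR passes the check.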

openff-dangerbot · 2024-10-30T21:35:38Z

QCSubmit Validation Report

	submissions/2024-10-30-OpenFF-Lipid-Optimization-Benchmark-Supplement-v1.0/dataset.json.bz2
Dataset Name	OpenFF Lipid Optimization Benchmark Supplement v1.0
Dataset Type	OptimizationDataset
Elements	O, H, C, Br, P, N, Cl, F, S, I
Valid Cmiles	🔥
Connected Dihedrals	🔥
No Linear Torsions	🔥
No Molecular Complexes	🔥
Valid Constraints	🔥
Complete Metadata	🔥

QC Specification Report

	submissions/2024-10-30-OpenFF-Lipid-Optimization-Benchmark-Supplement-v1.0/dataset.json.bz2/default
Specification Name	default
Method	B3LYP-D3BJ
Basis	DZVP
Wavefunction Protocol	none
Implicit Solvent
Keywords	{}
Validated	🔥
Valid SCF Properties	🔥
Full Basis Coverage	🔥

QCSubmit version information

	version
openff.qcsubmit	0.53.0
openff.toolkit	0.16.5
basis_set_exchange	0.10
qcelemental	0.28.0
rdkit	2024.09.2

openff-dangerbot · 2024-12-02T21:09:33Z

QCSubmit Validation Report

	submissions/2024-10-30-OpenFF-Lipid-Optimization-Benchmark-Supplement-v1.0/dataset.json.bz2
Dataset Name	OpenFF Lipid Optimization Benchmark Supplement v1.0
Dataset Type	OptimizationDataset
Elements	O, H, C, Br, P, N, Cl, F, S, I
Valid Cmiles	🔥
Connected Dihedrals	🔥
No Linear Torsions	🔥
No Molecular Complexes	🔥
Valid Constraints	🔥
Complete Metadata	🔥

QC Specification Report

	submissions/2024-10-30-OpenFF-Lipid-Optimization-Benchmark-Supplement-v1.0/dataset.json.bz2/default
Specification Name	default
Method	B3LYP-D3BJ
Basis	DZVP
Wavefunction Protocol	none
Implicit Solvent
Keywords	{}
Validated	🔥
Valid SCF Properties	🔥
Full Basis Coverage	🔥

QCSubmit version information

	version
openff.qcsubmit	0.54.0
openff.toolkit	0.16.6
basis_set_exchange	0.10
qcelemental	0.28.0
rdkit	2024.09.3

ntBre · 2024-12-02T21:47:48Z

Marking this ready for review now that #412 is in, and we don't need to worry about splitting it.

lilyminium

Minor note about the threshold -- otherwise LGTM. Looking forward to seeing how the split workers go!

submissions/2024-10-30-OpenFF-Lipid-Optimization-Benchmark-Supplement-v1.0/opt.toml

openff-dangerbot · 2024-12-03T16:00:17Z

QCSubmit Validation Report

	submissions/2024-10-30-OpenFF-Lipid-Optimization-Benchmark-Supplement-v1.0/dataset.json.bz2
Dataset Name	OpenFF Lipid Optimization Benchmark Supplement v1.0
Dataset Type	OptimizationDataset
Elements	O, H, C, Br, P, N, Cl, F, S, I
Valid Cmiles	🔥
Connected Dihedrals	🔥
No Linear Torsions	🔥
No Molecular Complexes	🔥
Valid Constraints	🔥
Complete Metadata	🔥

QC Specification Report

	submissions/2024-10-30-OpenFF-Lipid-Optimization-Benchmark-Supplement-v1.0/dataset.json.bz2/default
Specification Name	default
Method	B3LYP-D3BJ
Basis	DZVP
Wavefunction Protocol	none
Implicit Solvent
Keywords	{}
Validated	🔥
Valid SCF Properties	🔥
Full Basis Coverage	🔥

QCSubmit version information

	version
openff.qcsubmit	0.54.0
openff.toolkit	0.16.6
basis_set_exchange	0.10
qcelemental	0.28.0
rdkit	2024.09.3

lilyminium · 2024-12-03T23:07:41Z

LGTM, thank you Brent! I'll let you merge so you can figure out compute tags?

lilyminium · 2024-12-04T08:17:36Z

  File "/home/runner/work/qca-dataset-submission/qca-dataset-submission/./management/lifecycle.py", line 16, in <module>
    from qcelemental.models import Molecule
ModuleNotFoundError: No module named 'qcelemental'

Ah oops -- probably from #412. Surprised it's not pulled in by qcportal!

Edit: actually it's the backlog environment and script; I'll open a PR to only import on type checking.
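A minimal sketch of what "only import on type checking" usually looks like in Python follows. It is not the actual change made to management/lifecycle.py; the helper `count_atoms` is hypothetical and exists only to show an annotation that refers to `Molecule`.

```python
from __future__ import annotations  # keep annotations as unevaluated strings

from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Seen only by static type checkers; never executed at runtime, so the
    # script still starts in an environment without qcelemental installed.
    from qcelemental.models import Molecule


def count_atoms(molecule: Molecule) -> int:
    """Hypothetical helper: number of atoms, read from the `symbols` field."""
    return len(molecule.symbols)
```

Because `TYPE_CHECKING` is `False` at runtime, the guarded import is skipped and the `ModuleNotFoundError` above cannot be raised at module load. The trade-off is that `Molecule` can then appear only in annotations, not in runtime code such as `isinstance` checks.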

ntBre added 4 commits December 2, 2024 16:04

init

aee0528

regenerate input.smi after filtering fragment inchis

c14c52f

generate dataset

64a519b

add main readme entry

99cbb6d

ntBre force-pushed the lipid-molecules branch from 0bed114 to 99cbb6d December 2, 2024 21:05

ntBre marked this pull request as ready for review December 2, 2024 21:47

ntBre requested a review from lilyminium December 2, 2024 21:47

lilyminium approved these changes Dec 3, 2024

submissions/2024-10-30-OpenFF-Lipid-Optimization-Benchmark-Supplement-v1.0/opt.toml (outdated)

update opt.toml and dataset descriptions to 0.708

4665263

lilyminium added the tracking label Dec 3, 2024

lilyminium added tracking and removed tracking labels Dec 4, 2024
