ChemLoRA

Leveraging Large Language Models (LLMs) for Accurate Molecular Energy Predictions

Requirements

Data

The QM9-G4MP2 dataset is publicly available through Materials Data Facility (GitHub link).

Model Fine-Tuning

GPT-3 is fine-tuned on the QM9-G4MP2 dataset using the GPTChem framework. To run the provided Python script, execute the following command:

python gptchem_smiles.py

The runpeft.py script can be used to fine-tune any foundational LLM available in Hugging Face. For example, to fine-tune the gpt2 model, run the following command:

python runpeft.py "gpt2"

License

This software is released under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
.ipynb_checkpoints		.ipynb_checkpoints
checkpoints_selfies		checkpoints_selfies
checkpoints_smiles		checkpoints_smiles
old_scripts		old_scripts
output_jsons_selfies		output_jsons_selfies
output_jsons_smiles		output_jsons_smiles
pickles		pickles
plots		plots
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
gptchem_selfies.py		gptchem_selfies.py
gptchem_smiles.py		gptchem_smiles.py
runpeft.py		runpeft.py
runpeft_selfies.py		runpeft_selfies.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ChemLoRA

Leveraging Large Language Models (LLMs) for Accurate Molecular Energy Predictions

Requirements

Data

Model Fine-Tuning

License

About

Releases

Packages

Languages

License

ankur56/ChemLoRA

Folders and files

Latest commit

History

Repository files navigation

ChemLoRA

Leveraging Large Language Models (LLMs) for Accurate Molecular Energy Predictions

Requirements

Data

Model Fine-Tuning

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages