-
Notifications
You must be signed in to change notification settings - Fork 97
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error Encountered While Running Reinvent_TLRL.ipynb in REINVENT4 #88
Comments
Most likely from reinvent.models.transformer.core.vocabulary import Vocabulary |
Thank you very much for your response. The previous errors have been resolved, but when I run %%time !reinvent -l stage1.log $stage1_config_filename, a new message appears:Failed to find the pandas get_adjustment() function to patch
|
That is only a waning message due to RDKit not being able to cope with new Pandas versions (I believe versions 2.0 and above). Unless you use PandasTools there should be no impact. |
Hi, While continuing to debug Reinvent_TLRL.ipynb, I encountered the following error when running the cell in the notebook:%%time
|
You will need to copy the model file into that directory (see error message). You can find the download link for the file in the notebook. |
Hi, Thank you very much for your guidance. I have successfully run through every cell of both Reinvent_demo.ipynb and Reinvent_TLRL.ipynb, but I still need your help with some logical issues. If I want to generate a Prior model based on Chembl33, in the stage1.toml file of Reinvent_demo.ipynb, the code is as follows:run_type = "staged_learning" [parameters] prior_file = "/home/Anaconda3/envs/reinvent4/lib/python3.10/site-packages/reinvent/../priors/reinvent.prior" batch_size = 100 use_checkpoint = false [learning_strategy] type = "dap" [[stage]] max_score = 1.0 chkpt_file = 'stage1.chkpt' scoring_function.type = "custom_product" [stage.scoring] [[stage.scoring.component]] [[stage.scoring.component.custom_alerts.endpoint]] params.smarts = [ [[stage.scoring.component]] [[stage.scoring.component.QED.endpoint]] [[stage.scoring.component]] [[stage.scoring.component.NumAtomStereoCenters.endpoint]] transform.type = "left_step"
|
To create a new prior training you woul need to look into You probably also want to carefully consider why you need a new prior as it takes quite a bit of expertise to get this right. Chemical space coverage has probably not that much evolved in ChEMBL but if you want to support additional chemistry (the vocabulary is fixed) for example or think to support stereochemistry (but beware imbalanced data) then the current priors are limited in this. |
I have found reinvent/runmodes/create_model/create_reinvent.py, but I still don't know how to create an empty model. In REINVENT 3.2, there was a Create_Model_Demo.ipynb notebook that could be used to create an empty model with Chembl33. Could you please guide me on how to create an empty model with Chembl33 in REINVENT 4? I see that there are many pre-existing prior models in the Prior directory of REINVENT4, such as reinvent.prior. How are these models trained? Can these pre-existing prior models be used directly? How should each of these prior models in the Prior directory be used respectively? Is there a detailed usage guide? I would appreciate your continued guidance. Thank you! |
I can suggest to read our paper Reinvent 4: Modern AI–driven generative molecule design and the papers cited therein.
|
It's quite a coincidence. Before receiving your reply, I carefully read your paper "Reinvent 4: Modern AI-driven generative molecule design" published in the Journal of Cheminformatics this afternoon. I have a basic understanding of the logic and operation mechanism of REINVENT4. However, after reading this paper, there are still a few questions that need your guidance:
I look forward to your answers to these three questions. Thank you very much! |
3.You only need to train a new prior if you have specific needs in terms of supported chemistry. As for the other questions you will need to get that basic knowledge from the literature e.g. out paper. These things are not suitable for discussion in this forum. |
In the Reinvent_TLRL.ipynb notebook, the model.pt is annotated with: "This is a model that has been trained on free energy simulation data computed for the TNKS2 target." I browsed through the ChemProp GitHub site and it seems that ChemProp does not have the capability to compute binding free energy. I'm having trouble understanding this annotation, and would appreciate further clarification. |
ChemProp is software that allows the user to create deep learning models. The data to train on comes from the user. The model provided is just an example. |
Hi,
I have correctly installed REINVENT4 and generated the Reinvent_TLRL.ipynb file in the notebook directory using the jupytext command. When running the cell in Reinvent_TLRL.ipynb:
%%time
!reinvent -l stage1.log $stage1_config_filename
the following error message appears:
Traceback (most recent call last):
File "/home/Anaconda3/envs/reinvent4/bin/reinvent", line 8, in
sys.exit(main())
File "/home/Anaconda3/envs/reinvent4/lib/python3.10/site-packages/reinvent/Reinvent.py", line 302, in main
runner(input_config, actual_device, tb_logdir, responder_config)
File "/home/Anaconda3/envs/reinvent4/lib/python3.10/site-packages/reinvent/runmodes/RL/run_staged_learning.py", line 248, in run_staged_learning
adapter, _, model_type = create_adapter(prior_model_filename, "inference", device)
File "/home/Anaconda3/envs/reinvent4/lib/python3.10/site-packages/reinvent/runmodes/create_adapter.py", line 49, in create_adapter
compatibility_setup(model)
File "/home/Anaconda3/envs/reinvent4/lib/python3.10/site-packages/reinvent/runmodes/create_adapter.py", line 120, in compatibility_setup
from reinvent.models.mol2mol.models.vocabulary import Vocabulary
ImportError: cannot import name 'Vocabulary' from 'reinvent.models.mol2mol.models.vocabulary' (/home/Anaconda3/envs/reinvent4/lib/python3.10/site-packages/reinvent/models/mol2mol/models/vocabulary.py)
I need your help to resolve this issue. Thank you very much!
Best regards,
Jiyuan
The text was updated successfully, but these errors were encountered: