Optimizing Parameters for Integrating 3rd Party Docking Software #82

natarajbalakrishnan · 2024-05-19T23:00:36Z


Hi All,

I would like to integrate a 3rd party docking software, which takes approximately 4 seconds per molecule to dock. What are the best parameters, such as the number of molecules per epoch, batch size, learning rate, etc., that I can adopt for this scenario to optimize and expedite the process?

Regards,
Nataraj

Answered by halx


halx · 2024-05-20T06:22:07Z

Maintainer

Hi,

Many thanks for your interest in REINVENT and welcome to the community!

There is probably a bit of a discussion to be had about what "optimize and expedite" means and whether this should really be the goal. Fast learning may mean that you lock yourself into a region of chemical space too early. Having said that, in practice you would of course want to minimize the number of docking calls as much as possible.
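To put rough numbers on "minimize the docking calls", here is a minimal sketch that estimates the serial docking time of a run from the 4 seconds per molecule quoted in the question, and memoises scores so a repeated SMILES is docked only once. The names `docking_hours`, `run_docking` and `cached_dock` are hypothetical and not part of REINVENT; `run_docking` is a placeholder for the call into the 3rd party program.

```python
from functools import lru_cache

SECONDS_PER_DOCK = 4.0  # figure quoted in the question


def docking_hours(batch_size: int, steps: int, unique_fraction: float = 1.0) -> float:
    """Rough wall-clock hours if every unique molecule is docked serially."""
    calls = batch_size * steps * unique_fraction
    return calls * SECONDS_PER_DOCK / 3600.0


def run_docking(smiles: str) -> float:
    """Placeholder for the 3rd party docking program (hypothetical)."""
    run_docking.calls += 1
    return -float(len(smiles))  # dummy score, for illustration only


run_docking.calls = 0  # counts how often the expensive call is made


@lru_cache(maxsize=None)
def cached_dock(smiles: str) -> float:
    """Dock a SMILES string once and reuse the result on repeats."""
    return run_docking(smiles)
```

For example, `docking_hours(64, 100)` comes out at roughly 7 hours of serial docking, which is why it pays to dock fewer and better molecules. The cache only helps if the same SMILES string is sampled again, so its benefit depends on how diverse the sampling is.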

A typical approach would be to use staged learning. In the first stage(s) you would use rather cheap scoring components, e.g. to fix the chemistry with, say, QED or other scores that are important to you. In the follow-up stage(s) you would then phase in the expensive component(s). The idea is that you would carry out docking only on more reasonable (as per your definition) compounds. Staged learning may also help in cases where scoring components "oppose" each other. The next level could be active learning, but I believe we do not offer those workflows yet.
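The staging idea can be sketched in a few lines of plain Python. This is a schematic under stated assumptions, not REINVENT's configuration format or API: `Stage`, `run_stages`, the toy scores and `ToyAgent` are all hypothetical stand-ins, and the "learning" here just makes the sampled strings longer. The point it illustrates is that the expensive score is never called until the cheap stage has finished.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Stage:
    name: str
    score: Callable[[str], float]  # SMILES -> score in [0, 1]
    max_score: float               # leave the stage once the batch mean reaches this
    max_steps: int                 # hard cap on steps in this stage


def run_stages(stages: List[Stage], sample, update) -> Dict[str, int]:
    """Run the stages in order and return the number of steps each one used."""
    steps_used = {}
    for stage in stages:
        steps = 0
        while steps < stage.max_steps:
            batch = sample()
            scores = [stage.score(smi) for smi in batch]
            update(batch, scores)
            steps += 1
            if sum(scores) / len(scores) >= stage.max_score:
                break
        steps_used[stage.name] = steps
    return steps_used


# --- toy stand-ins below, all hypothetical ---
dock_calls = {"n": 0}  # counts calls to the expensive component


def cheap_score(smi: str) -> float:
    """Stand-in for a cheap component such as QED."""
    return min(len(smi) / 5.0, 1.0)


def dock_score(smi: str) -> float:
    """Stand-in for the expensive docking component."""
    dock_calls["n"] += 1
    return min(len(smi) / 10.0, 1.0)


def combined_score(smi: str) -> float:
    """Second-stage score: cheap component plus the phased-in docking."""
    return 0.5 * (cheap_score(smi) + dock_score(smi))


class ToyAgent:
    """Stand-in for the generative model; each update shifts what it samples."""

    def __init__(self) -> None:
        self.level = 0

    def sample(self) -> List[str]:
        return ["C" * (1 + self.level)] * 4

    def update(self, batch, scores) -> None:
        self.level += 1
```

With `agent = ToyAgent()`, calling `run_stages([Stage("cheap", cheap_score, 0.75, 50), Stage("docking", combined_score, 0.7, 50)], agent.sample, agent.update)` runs the cheap stage to its threshold first, and `dock_calls["n"]` only starts increasing once the second stage begins.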

Changing hyperparameters like batch size or learning rate may be useful too, but it would require a lot of experimentation to find the values best suited to your problem. Also keep in mind that, for example, batch size has a direct influence on the optimizer.
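One way to see why batch size matters to the optimizer, assuming each update is driven by a mean over per-molecule terms: a smaller batch gives a noisier estimate of that mean. The sketch below is generic statistics with synthetic Gaussian values, not a measurement of REINVENT, and `batch_mean_spread` is a hypothetical helper name.

```python
import random
from statistics import pstdev


def batch_mean_spread(batch_size: int, trials: int = 1000, seed: int = 0) -> float:
    """Standard deviation of the batch mean over many simulated batches."""
    rng = random.Random(seed)
    means = []
    for _ in range(trials):
        # Synthetic per-molecule values with unit variance.
        values = [rng.gauss(0.0, 1.0) for _ in range(batch_size)]
        means.append(sum(values) / batch_size)
    return pstdev(means)
```

Comparing `batch_mean_spread(16)` with `batch_mean_spread(256)` shows the spread shrinking roughly with the square root of the batch size. So halving the batch to save docking calls also makes each update noisier, which may call for a different learning rate.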

Many thanks,
Hannes.


natarajbalakrishnan · 2024-05-20T11:48:12Z

Author

Hi Hannes,
Thank you very much for your detailed response, it helps a lot.
