
add discussion in interpretability section and update molecular design #985

Closed
wants to merge 2 commits into from

Conversation

delton137
Contributor

Hello,

I have tried to make some changes to the manuscript. Unfortunately, I wasn't able to completely build the new version to pdf.

the build.sh runs until it outputs:

## INFO
Generated manuscript stats:
{
  "reference_counts": {
    "arxiv": 132,
    "doi": 425,
    "pmcid": 6,
    "url": 48,
    "total": 611
  },
  "word_count": 34912
}
Exporting HTML manuscript
Exporting PDF manuscript using Docker + Athena
docker: Cannot connect to the Docker daemon at unix:///var/run/docker.sock. Is the docker daemon running?.
See 'docker run --help'.

Please let me know if I'm doing something wrong. Perhaps you can accept only the changes I made to the manuscript and not the changes in the /build/ directory. Otherwise, feel free to reject this pull request. You can reach me at daniel.elton@nih.gov.

@AppVeyorBot

AppVeyor build 1.0.15 for commit 1cb14cb by @delton137 is now complete. The rendered manuscript from this build is temporarily available for download at:

@agitter
Collaborator

agitter commented Dec 30, 2019

Thanks for the contributions @delton137. I'm not sure how much reviewer capacity we currently have to help review and merge this pull request. Recent pull requests have struggled to find reviewers with the right domain expertise who could comment.

Anyone who wants to help review can please comment here.

If we do find reviewers, I recommend splitting these changes into two smaller pull requests, one for drug design and one for interpretability. That will help us discuss domain-specific changes. Keeping our convention of one sentence per line will also help us focus on which parts of the text actually changed. You will also need to exclude the files in the build directory as you noted above. These updates can wait until we find reviewers, however.

Did you see that build.sh error in a local build? Athena PDF generation worked correctly in both continuous integration services.

@delton137
Contributor Author

The problem seems to be on my end with docker not working for running Athena.

I'll keep an eye on this page. I am happy to contribute more content / copyediting, especially on drug-discovery. It is something I know quite a bit about as I wrote a review on deep learning for molecular design.

@cgreene
Member

cgreene commented Dec 31, 2019

Getting this going is in my goals for the first half of 2020. I have one more review I'm trying to get out first. Sorry!

@agitter
Collaborator

agitter commented Jan 1, 2020

@delton137 I can't commit to it immediately, but I could possibly review your drug discovery edits if no one else is able to.

@cgreene
Member

cgreene commented Feb 10, 2020

Hi @delton137 : can you remove the files in the build directory from your PR or enable commits by project maintainers? I'll be happy to go ahead and review this so we can get things moving forward. Thanks!

@delton137
Contributor Author

Hi,

I'm not sure how to remove the files that are in /build/ from my PR. Should I create a new PR?

Alternatively, can you only accept the relevant files when you accept the commit?

@delton137
Contributor Author

"Allow edits from maintainers." is/was checked.

@cgreene
Member

cgreene commented Feb 10, 2020

Interesting - it didn't let me make edits before, but it does now. I'm going to batch the deletions of the /build/ files. If you make edits locally, you'll have to pull the branch down again.

@delton137
Contributor Author

OK, thank you for your help and for going forward with the review!

@cgreene
Member

cgreene commented Feb 10, 2020

Ok! You should pull from your master before you make the next set of changes. I'll have a few more detailed suggestions.

@AppVeyorBot

AppVeyor build 1.0.21 for commit e7f6ca6 by @cgreene is now complete. The rendered manuscript from this build is temporarily available for download at:

Member

@cgreene cgreene left a comment

Can you make the text one-sentence-per-line for easier commenting and diffing? I had a few specific changes at this point but will probably suggest more when the one-sentence-per-line commit comes in.

Thanks!

It explores an estimated 10<sup>60</sup> synthesizable organic molecules with drug-like properties without explicit enumeration [@doi:10.1002/wcms.1104].
To test or score structures, algorithms like those discussed earlier are used.
To test or score structures, physics-based simulation may be used, or machine learning models based on the techniques discussed earlier, which are much more computationally efficient.
Member

Can you cite at least one example of each approach, plus support for the assertion about computational efficiency? I'm not sure if that claim pertains just to prediction or if it also includes training.

Even more interesting is the prospect of performing gradient-based or Bayesian optimization of molecules within this latent space.
The strategy of constructing simple, continuous features before applying supervised learning techniques is reminiscent of autoencoders trained on high-dimensional EHR data [@tag:BeaulieuJones2016_ehr_encode].
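The gradient-based optimization mentioned above can be sketched with a toy example. This is a minimal, dependency-free illustration, not code from the manuscript: the quadratic `predict_property` is a hypothetical stand-in for a differentiable property predictor (e.g. a network trained to predict logP or binding affinity), and a real pipeline would decode the optimized latent vector back into a molecule.

```python
# Toy sketch of gradient-based optimization in a learned latent space.
# predict_property is a hypothetical smooth surrogate model; its
# optimum is placed at z = (1.0, -2.0) purely for illustration.

def predict_property(z):
    # Stand-in for a trained, differentiable property predictor.
    return -((z[0] - 1.0) ** 2 + (z[1] + 2.0) ** 2)

def gradient(f, z, eps=1e-6):
    # Central-difference gradient, so the sketch stays dependency-free;
    # a real model would supply analytic gradients via backpropagation.
    g = []
    for i in range(len(z)):
        zp, zm = list(z), list(z)
        zp[i] += eps
        zm[i] -= eps
        g.append((f(zp) - f(zm)) / (2 * eps))
    return g

def optimize_latent(z, steps=200, lr=0.1):
    # Plain gradient ascent on the predicted property.
    for _ in range(steps):
        g = gradient(predict_property, z)
        z = [zi + lr * gi for zi, gi in zip(z, g)]
    return z

z_opt = optimize_latent([0.0, 0.0])
# z_opt approaches (1.0, -2.0), the optimum of the toy surrogate;
# in practice z_opt would then be decoded into a candidate molecule.
```

Bayesian optimization replaces the gradient steps with a surrogate-guided search, which is preferred when the property model is expensive or non-differentiable.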

In the past few years a large number of techniques for the generative modeling and optimization of molecules with deep learning have been explored, including recursive neural networks, variational autoencoders, generative adversarial networks, and reinforcement learning -- for a review see Elton, et al.[@tag:Elton_molecular_design_review]
Member

Suggested change
In the past few years a large number of techniques for the generative modeling and optimization of molecules with deep learning have been explored, including recursive neural networks, variational autoencoders, generative adversarial networks, and reinforcement learning -- for a review see Elton, et al.[@tag:Elton_molecular_design_review]
In the past few years a large number of techniques for the generative modeling and optimization of molecules with deep learning have been explored, including recursive neural networks, variational autoencoders, generative adversarial networks, and reinforcement learning -- for a review see Elton, et al.[@tag:Elton_molecular_design_review].

Collaborator

@stephenra stephenra Feb 14, 2020

There are a few other categories that might be worth mentioning. Within generative models, there are at least a couple of normalizing flow-based methods (GraphAF and GraphNVP). Additionally, there are approaches using solely Bayesian optimization (ChemBO).

Building off the large amount of work that has already gone into text generation,[@arxiv:1308.0850] many generative neural networks for drug design represent chemicals with the simplified molecular-input line-entry system (SMILES), a standard string-based representation with characters that represent atoms, bonds, and rings [@tag:Segler2017_drug_design].

The first successful demonstration of a deep learning-based approach for molecular optimization occurred in 2016 with the development of a SMILES-to-SMILES autoencoder capable of learning a continuous latent feature space for molecules [@tag:Gomezb2016_automatic].
In this learned continuous space it is possible to interpolate between molecular structures in a manner that is not possible with discrete representations.
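The interpolation described above can be sketched in a few lines. This is an illustrative, self-contained example, not code from the manuscript: the endpoint vectors stand in for encodings of two molecules produced by a trained autoencoder's (hypothetical) encoder, and each intermediate vector would be passed to the decoder to yield a candidate structure "between" the endpoints.

```python
# Minimal sketch of linear interpolation in a continuous latent space.

def interpolate(z_start, z_end, n_points):
    # Evenly spaced convex combinations of the two latent vectors.
    path = []
    for k in range(n_points):
        t = k / (n_points - 1)
        path.append([(1 - t) * a + t * b for a, b in zip(z_start, z_end)])
    return path

z_a = [0.0, 1.0, -0.5]   # latent code of molecule A (illustrative values)
z_b = [2.0, -1.0, 0.5]   # latent code of molecule B (illustrative values)
steps = interpolate(z_a, z_b, 5)
# steps[0] equals z_a and steps[-1] equals z_b, with three
# intermediate latent codes in between for the decoder to realize.
```

No such path exists between discrete representations such as SMILES strings, which is what makes the continuous latent space useful for optimization.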
Member

Can you make this part one sentence per line so it's easier to see changes?

Unfortunately, this often leads to overfit, "weird" molecules, which are difficult to synthesize in the lab.
Current programs have settled on rule-based virtual chemical reactions to generate molecular structures [@doi:10.1021/acs.jmedchem.5b01849].
Deep learning models that generate realistic, synthesizable molecules have been proposed as an alternative.
In contrast to the classical, symbolic approaches, generative models learned from data would not depend on laboriously encoded expert knowledge.
Member

Did you mean to get rid of this information? Is it offloaded to the newly cited review?

Contributor Author

Not sure what I was thinking. The first two sentences are both debatable but also summarize general sentiments. I'm restoring the sentences and this will be in my commit later today.

The initial RNN is then fine-tuned to generate molecules that are likely to be active against a specific target by either continuing training on a small set of positive examples [@tag:Segler2017_drug_design] or adopting reinforcement learning strategies [@tag:Olivecrona2017_drug_design; @arxiv:1611.02796].
Both the fine-tuning and reinforcement learning approaches can rediscover known, held-out active molecules.
The great flexibility of neural networks, and progress in generative models offers many opportunities for deep architectures in *de novo* design (e.g. the adaptation of GANs for molecules).
These generative models successfully learn the grammar of compound representations, with 94% [@tag:Olivecrona2017_drug_design] or nearly 98% [@tag:Segler2017_drug_design] of generated SMILES corresponding to valid molecular structures.
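Validity rates like the 94% and 98% figures above are computed by sampling strings from the model, attempting to parse each one, and reporting the fraction that parse. The sketch below is illustrative only: real evaluations use a cheminformatics toolkit such as RDKit (`Chem.MolFromSmiles`), whereas the toy checker here merely verifies balanced brackets as a hypothetical stand-in for a full SMILES parser.

```python
# Sketch of computing a generated-SMILES validity rate.

def toy_is_valid(smiles):
    # Hypothetical stand-in for a real SMILES parser: only checks
    # that parentheses and brackets are balanced.
    pairs = {')': '(', ']': '['}
    stack = []
    for ch in smiles:
        if ch in '([':
            stack.append(ch)
        elif ch in pairs:
            if not stack or stack.pop() != pairs[ch]:
                return False
    return not stack

def validity_rate(samples, is_valid=toy_is_valid):
    # Fraction of sampled strings that parse as well-formed.
    return sum(is_valid(s) for s in samples) / len(samples)

generated = ['CCO', 'c1ccccc1', 'CC(=O)O', 'CC(C']  # last is malformed
rate = validity_rate(generated)  # 3 of 4 parse -> 0.75
```

Note that syntactic validity is a weak criterion on its own; the surrounding discussion of fine-tuning and reinforcement learning concerns whether the valid molecules are also active against a target.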
Member

Can you make this one sentence per line so it's easier to review and track changes?

Methods for adjusting the flexibility of deep learning models include dropout, reduced data projections, and transfer learning (described below).
One way of understanding such model optimizations is that they incorporate external information to limit model flexibility and thereby improve predictions.
This balance is formally described as a tradeoff between "bias and variance" [@url:http://www.deeplearningbook.org/].
In particular, many problem-specific optimizations described in this review reflect a recurring universal tradeoff---controlling the flexibility of a model in order to maximize generalizability and prevent overfitting.
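For readers unfamiliar with the term, the tradeoff can be made precise through the standard decomposition of expected squared prediction error (a textbook result, stated here for context rather than quoted from the manuscript):

$$
\mathbb{E}\big[(y - \hat{f}(x))^2\big]
= \underbrace{\big(\mathbb{E}[\hat{f}(x)] - f(x)\big)^2}_{\text{bias}^2}
+ \underbrace{\mathbb{E}\big[\big(\hat{f}(x) - \mathbb{E}[\hat{f}(x)]\big)^2\big]}_{\text{variance}}
+ \sigma^2
$$

where $f$ is the true function, $\hat{f}$ the fitted model, and $\sigma^2$ the irreducible noise; more flexible models typically lower the bias term at the cost of higher variance.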
Member

Suggested change
In particular, many problem-specific optimizations described in this review reflect a recurring universal tradeoff---controlling the flexibility of a model in order to maximize generalizability and prevent overfitting.
Many problem-specific optimizations described in this review reflect a recurring universal tradeoff---controlling the flexibility of a model in order to maximize generalizability and prevent overfitting.


Although the bias-variance tradeoff is common to all machine learning applications, recent empirical and theoretical observations suggest that deep learning models may have uniquely advantageous generalization properties [@tag:Zhang2017_generalization; @tag:Lin2017_why_dl_works].
Nevertheless, additional advances will be needed to establish a coherent theoretical foundation that enables practitioners to better reason about their models from first principles.
Although the bias-variance tradeoff is important to take into account in many machine learning tasks, recent empirical and theoretical observations suggest that deep neural networks have uniquely advantageous generalization properties and do not obey the tradeoff as expected [@tag:Belkin2019_PNAS; @tag:Zhang2017_generalization; @tag:Lin2017_why_dl_works].
According to bias-variance theory, many of the most successful deep neural networks have so many free parameters that they should overfit [@tag:Belkin2019_PNAS].
It has been shown that deep neural networks operate in a regime where they can exactly interpolate their training data yet are still able to generalize [@tag:Belkin2019_PNAS].
Thus, poor generalizability can often be remedied by adding more layers and increasing the number of free parameters, in conflict with classic bias-variance theory.
Additional advances will be needed to establish a coherent theoretical foundation that enables practitioners to better reason about their models from first principles.
Member

Can you make this one sentence per line? Also, our citation style places citations inside the sentence, before the period.


As the concept of interpretability is quite broad, many methods described as improving the interpretability of deep learning models take disparate and often complementary approaches.
Firstly, a model that achieves breakthrough performance may have identified patterns in the data that practitioners in the field would like to understand.
Member

One sentence per line here, but I also might note that one of the major hazards of noninterpretable models with high predictive accuracy would be learning patterns in the data that capture the data selection process as opposed to the underlying data generating process (i.e., ascertainment biases, etc).

Collaborator

@agitter agitter left a comment

I can give more domain-specific comments on the generative molecular models once this is updated to one sentence per line. It's hard to comment on specific papers currently and to see exactly what has changed in the interpretability section.

There is some newer work on generative models for lead optimization we could mention here as a slightly different application. @delton137 were those already covered in your review?

The great flexibility of neural networks, and progress in generative models offers many opportunities for deep architectures in *de novo* design (e.g. the adaptation of GANs for molecules).
These generative models successfully learn the grammar of compound representations, with 94% [@tag:Olivecrona2017_drug_design] or nearly 98% [@tag:Segler2017_drug_design] of generated SMILES corresponding to valid molecular structures.

Reinforcement learning approaches where operations are performed directly on the molecular graph bypass the need to learn the details of SMILES syntax, allowing the model to focus purely on chemistry.
Additionally, they seem to require less training data and generate more valid molecules, since they are constrained by design to graph operations that satisfy chemical valence rules [@tag:Elton_molecular_design_review].
A reinforcement learning agent developed by Zhou et al. demonstrated superior molecular optimization performance on certain easy-to-compute metrics when compared with other deep learning-based approaches such as the Junction Tree VAE, Objective-Reinforced Generative Adversarial Network, and Graph Convolutional Policy Network [@doi:10.1038/s41598-019-47148-x].
As another example, Zhavoronkov et al. used generative tensorial reinforcement learning to discover potent inhibitors of discoidin domain receptor 1 (DDR1) [@tag:Zhavoronkov2019_drugs].
Their work is unique in that six lead candidates discovered using their approach were synthesized and tested in the lab, with 4/6 achieving some degree of binding to DDR1 [@tag:Zhavoronkov2019_drugs].
Collaborator

If we're discussing the DDR1 example, we should also consider the new letter from Walters and Murcko:
Assessing the impact of generative AI on medicinal chemistry
https://doi.org/10.1038/s41587-020-0418-2

In general, should we add a couple more caveats or areas where these generative models still struggle?

Contributor Author

I will have time to respond to your points and make a number of improvements starting on Friday afternoon.

Incidentally, there has also been some back and forth recently about the GENTRL reinforcement learning method, with some skepticism from the pharmaceutical world (see the response here and references).

Contributor Author

@delton137 delton137 Feb 14, 2020

I read the paper by Walters and Murcko and the response paper I linked to, but I'm not sure how much of that needs to go in the review, as it gets a bit "in the weeds". The basic critique is that a lot of the molecules being generated are variations on structures in the training data. They argue for standards such as publishing the training data used with the paper and a ranking of the closest molecules in the training data, so that novelty can be discerned.

It may be worth discussing the need for metrics / criteria for scoring generative models or deep learning-based systems for optimization more generally. I discussed this in my review article on deep learning for molecular design (section 3.1). If you can't access the Royal Society of Chemistry version, it's on the arXiv here. Note the criteria discussed are more for generation (i.e., ensuring a large, diverse set of "interesting" molecules) rather than optimization for a specific target.

@cgreene
Member

cgreene commented Feb 17, 2020

@delton137 : did you mean to close this?

@cgreene
Member

cgreene commented Feb 17, 2020

Oh - I see this got re-filed as #988


5 participants