Refactor interpretability notebook, add comments, and add pure Jacobian formulation #21
Conversation
Check out this pull request on ReviewNB to see visual diffs and provide feedback on Jupyter Notebooks.
Are there any references on interpreting NNs for regression using saliency maps?
I have not carefully looked through the references here, but reading the abstracts they all seem to be for classification problems. Maybe the extension to regression is trivial, but I (being an ML novice) am finding this notebook slightly hard to follow; put another way, I am not sure what to do with the gradient and LRP plots at the end, or what insight is actually being derived there.
I tried to go back and listen to Pierre's talk, and that didn't really help me either. Is there a very simple, intro-level reading that can be attached here?
Good point -- I'll look for one. As elaborated in my comment below, the right way to think about gradients is as the local linear approximation of the model (i.e. the linear model that best approximates the NN at a given point; in our plots we've averaged this model over 200 points). The right way to think about LRP or gradient * input is as that same linear model multiplied by the input, so it gives the actual contributing evidence of each input to the output.
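To make that concrete, here is a minimal sketch (not the notebook's code; `net` and `X` are hypothetical placeholders) of computing input gradients and gradient * input for a regression network with torch.autograd, averaged over a batch of points:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
net = nn.Sequential(nn.Linear(8, 32), nn.ReLU(), nn.Linear(32, 1))  # placeholder model
X = torch.randn(200, 8, requires_grad=True)                         # 200 sample points

# Each row of X only affects its own output, so summing before backward()
# yields per-sample input gradients in X.grad.
net(X).sum().backward()

grads = X.grad                                # local linear approximation at each point
avg_linear_model = grads.mean(dim=0)          # the "average" linear model over 200 points
grad_times_input = (grads * X.detach()).mean(dim=0)  # per-feature contributing evidence

print(avg_linear_model)
print(grad_times_input)
```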
The interpretation might actually be easier and more intuitive for regression than for classification, because in classification you have to take one additional step and relate the thing being output by the model (the logits / log-odds) back to the thing you actually care about (the probabilities).
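As a hedged illustration of that extra step (again, not from the notebook; the names here are made up): for a classifier, input gradients of the logit have to be related back to probabilities via the chain rule, whereas in regression the output is already the quantity of interest.

```python
import torch

x = torch.randn(5, requires_grad=True)
w = torch.randn(5)

z = w @ x                      # logit (what a classifier typically outputs)
p = torch.sigmoid(z)           # probability (what we actually care about)
p.backward()

# By the chain rule, dp/dx = p * (1 - p) * dz/dx = p * (1 - p) * w
print(x.grad)
print(p.detach() * (1 - p.detach()) * w)   # same values
```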
I added a bit more explanation to the notebook; let me know if this is OK!
Since this is the first time I am seeing this, I am not 100% sure what equations are being referred to here. Do you mean the L-96 equations?
Hmm, on further reflection, my comment doesn't make sense. What I was remembering, though, was that when I trained a linear model to predict subgrid forcing (and when Anastasia did), it learned forcing prediction coefficients for all X_i that were equal and about -0.8 (or, alternatively, changing the corresponding differential equation weight from -1 to -1.8 in Anastasia's SINDy case).
The fact that a neural network's input gradients are all about -0.8 makes a lot of sense given those results (the average local linear approximation of the NN over many points is approximately the linear model), but I can't relate that back to the original equations, since we don't have any actual equations for the subgrid forcing in terms of the large-scale variables. I need to think about how to describe that succinctly in the notebook (with a reference to something public, since we can't link to m2lines presentations); let me know if you have suggestions.
In lieu of a satisfactory physical explanation, I updated the notebook to actually train and visualize the linear model for comparison!
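Something along these lines (a rough sketch with made-up placeholder data and an assumed already-trained `net`, not the notebook's actual code) shows the comparison: fit a linear model to the same inputs and targets, then compare its coefficients to the NN's averaged input gradients.

```python
import numpy as np
import torch
import torch.nn as nn

torch.manual_seed(0)
K = 8
X = torch.randn(500, K, requires_grad=True)
y = -0.8 * X.detach().sum(dim=1)              # stand-in subgrid-forcing targets
net = nn.Sequential(nn.Linear(K, 32), nn.ReLU(), nn.Linear(32, 1))  # assume already trained

# Ordinary least-squares linear fit (with an intercept term)
A = np.column_stack([X.detach().numpy(), np.ones(len(X))])
coef, *_ = np.linalg.lstsq(A, y.numpy(), rcond=None)

# Average input gradient of the NN = its average local linear approximation
net(X).sum().backward()
avg_grad = X.grad.mean(dim=0)

print("linear-model coefficients:", coef[:-1])  # ~ -0.8 for each X_i on the real data
print("mean NN input gradients:  ", avg_grad)   # should roughly agree for a trained net
```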
I have reconciled this PR with the main branch. Were the discussions above resolved, or did @asross want to make any more changes in response to @dhruvbalwada's review before we merge?
Updated the PR, and also fixed a few issues that prevented it from running in the restructured repository. I also changed the name since it's not really just about LRP!
Per #12, I've reviewed the notebook on interpretability, and actually made some refactors and improvements:

- Added a pure Jacobian formulation of the input gradients using torch.autograd (comparing it to the previous finite-difference approximation, which also had a small bug with calculating the perturbation).

See the updated notebook for more details.
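For reference, a minimal sketch of the two approaches (with a placeholder `net` and `x`, not the notebook's actual model): the exact Jacobian via `torch.autograd.functional.jacobian` versus a central finite-difference approximation with an explicit perturbation size.

```python
import torch
import torch.nn as nn
from torch.autograd.functional import jacobian

torch.manual_seed(0)
net = nn.Sequential(nn.Linear(8, 32), nn.Tanh(), nn.Linear(32, 8))  # placeholder network
x = torch.randn(8)                                                   # one input sample

# Exact Jacobian d net(x) / d x via autograd, shape [8, 8]
J_autograd = jacobian(net, x)

# Central finite-difference approximation with an explicit perturbation size
eps = 1e-3
J_fd = torch.zeros(8, 8)
with torch.no_grad():
    for i in range(8):
        dx = torch.zeros(8)
        dx[i] = eps
        J_fd[:, i] = (net(x + dx) - net(x - dx)) / (2 * eps)

print((J_autograd - J_fd).abs().max())  # should be very small
```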