
Access self-attention matrix of vision transformer #6032

Closed
sophmrtn opened this issue Feb 21, 2023 · 5 comments · Fixed by #6271 or #6308

Comments


sophmrtn commented Feb 21, 2023

Is your feature request related to a problem? Please describe.
I am training a vision transformer using monai and would like to carry out some interpretability analysis. However, the current model code does not save the self-attention matrix during training, and it is not straightforward to pass it from the self-attention block to the model output.

def forward(self, x):
    output = self.input_rearrange(self.qkv(x))
    q, k, v = output[0], output[1], output[2]
    # scaled dot-product attention weights (softmax over the key dimension)
    att_mat = (torch.einsum("blxd,blyd->blxy", q, k) * self.scale).softmax(dim=-1)
    att_mat = self.drop_weights(att_mat)
    # att_mat is used only here and then discarded, so it never reaches the caller
    x = torch.einsum("bhxy,bhyd->bhxd", att_mat, v)
    x = self.out_rearrange(x)
    x = self.out_proj(x)
    x = self.drop_output(x)
    return x

Describe the solution you'd like
An option to output the att_mat from the self-attention block in the model forward pass (before it is multiplied with the values), or to access it after training as a class attribute.
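For illustration, a minimal sketch of the requested behaviour, written as a toy single-head block rather than MONAI's actual SABlock; the `save_attn` flag and `att_mat` attribute are placeholder names, not an existing API:

```python
import torch
import torch.nn as nn


class AttentionWithOptionalSave(nn.Module):
    """Toy single-head self-attention that can optionally keep its attention matrix."""

    def __init__(self, dim: int, save_attn: bool = False):
        super().__init__()
        self.qkv = nn.Linear(dim, dim * 3, bias=False)
        self.out_proj = nn.Linear(dim, dim)
        self.scale = dim**-0.5
        self.save_attn = save_attn
        self.att_mat = torch.Tensor()  # populated only when save_attn is True

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        att_mat = (q @ k.transpose(-2, -1) * self.scale).softmax(dim=-1)
        if self.save_attn:
            # detach so the stored copy does not keep the autograd graph alive
            self.att_mat = att_mat.detach()
        return self.out_proj(att_mat @ v)
```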

@a-parida12
Contributor

@wyli I would like to help with this issue.

I was thinking the easiest way to achieve this without changing the API would be to store att_mat (before dropout) as a class attribute, so it could be accessed via SABlock().att_mat. We could also add a parameter such as store_attn: bool to __init__ to avoid the memory overhead when the feature is not required.

Let me know your thoughts.
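As a usage example with the toy block sketched above (again, the flag and attribute names are illustrative, not the final API):

```python
import torch

block = AttentionWithOptionalSave(dim=64, save_attn=True)
tokens = torch.randn(2, 16, 64)  # (batch, sequence length, embedding dim)

_ = block(tokens)
print(block.att_mat.shape)  # torch.Size([2, 16, 16]): one weight per token pair
```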

@wyli
Contributor

wyli commented Apr 2, 2023

sounds good, please make sure it's backward compatible, e.g. previously saved checkpoints can still be loaded by default.

@wyli wyli closed this as completed in #6271 Apr 3, 2023
wyli added a commit that referenced this issue Apr 3, 2023
Fixes #6032.

### Description

A few sentences describing the changes proposed in this pull request.

### Types of changes
- [x] Non-breaking change (fix or new feature that would not break
existing functionality).
- [x] Breaking change (fix or new feature that would cause existing
functionality to change).
- [x] New tests added to cover the changes.
- [ ] Integration tests passed locally by running `./runtests.sh -f -u
--net --coverage`.
- [ ] Quick tests passed locally by running `./runtests.sh --quick
--unittests --disttests`.
- [x] In-line docstrings updated.
- [ ] Documentation updated, tested `make html` command in the `docs/`
folder.

---------

Signed-off-by: Ben Murray <ben.murray@gmail.com>
Signed-off-by: a-parida12 <abhijeet.parida@tum.de>
Signed-off-by: YanxuanLiu <yanxuanl@nvidia.com>
Signed-off-by: monai-bot <monai.miccai2019@gmail.com>
Signed-off-by: Wenqi Li <wenqil@nvidia.com>
Co-authored-by: Ben Murray <ben.murray@gmail.com>
Co-authored-by: YanxuanLiu <104543031+YanxuanLiu@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Wenqi Li <831580+wyli@users.noreply.github.com>
Co-authored-by: monai-bot <monai.miccai2019@gmail.com>
Co-authored-by: Wenqi Li <wenqil@nvidia.com>
@wyli wyli reopened this Apr 4, 2023
@a-parida12
Contributor

@wyli so #6271 is merged? Should I open a new pull request with a potential solution?

wyli pushed a commit that referenced this issue Apr 11, 2023
Fixes #6032.

### Description

Specified the data type of the `att_matrix` attribute to be compliant with
the `torch.jit` scripting requirements for conditionals.
### Types of changes
- [x] Non-breaking change (fix or new feature that would not break
existing functionality).
- [ ] Breaking change (fix or new feature that would cause existing
functionality to change).
- [x] New tests added to cover the changes.
- [ ] Integration tests passed locally by running `./runtests.sh -f -u
--net --coverage`.
- [ ] Quick tests passed locally by running `./runtests.sh --quick
--unittests --disttests`.
- [x] In-line docstrings updated.
- [ ] Documentation updated, tested `make html` command in the `docs/`
folder.

---------

Signed-off-by: a-parida12 <abhijeet.parida@tum.de>
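For context, the conditional storage pattern that TorchScript accepts looks roughly like this (a sketch, not the exact MONAI code): initialising the attribute as an empty tensor rather than None gives it a fixed type, so the `if` branch in forward can be scripted.

```python
import torch
import torch.nn as nn


class SaveAttnExample(nn.Module):
    """Illustrative only: a jit-friendly pattern for optionally storing a tensor."""

    def __init__(self, save_attn: bool = False):
        super().__init__()
        self.save_attn = save_attn
        # an empty tensor (not None) keeps the attribute type fixed for TorchScript
        self.att_mat = torch.Tensor()

    def forward(self, att_mat: torch.Tensor) -> torch.Tensor:
        if self.save_attn:
            self.att_mat = att_mat.detach()
        return att_mat


scripted = torch.jit.script(SaveAttnExample(save_attn=True))  # scripts without a type error
```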
@a-parida12
Contributor

@wyli I just realized that the ViT backbone is used by UNETR and ViTAutoEnc; ideally they should also expose the option to access the att_mat, otherwise it will always be left at its default value with no way for a user of UNETR or ViTAutoEnc to change it. What do you think?
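As a possible interim workaround (a sketch under the assumption that the SABlock change from #6271 exposes a save_attn flag and an att_mat attribute; the module path below follows MONAI's UNETR/ViT layout and may differ between versions), a user could enable saving on an inner block directly:

```python
import torch
from monai.networks.nets import UNETR

model = UNETR(in_channels=1, out_channels=2, img_size=(96, 96, 96))

# reach into the ViT backbone and turn on attention saving for one block
attn_block = model.vit.blocks[0].attn  # assumed path: UNETR -> ViT -> TransformerBlock -> SABlock
attn_block.save_attn = True

_ = model(torch.randn(1, 1, 96, 96, 96))
print(attn_block.att_mat.shape)  # attention weights of the first transformer block
```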

@wyli
Contributor

wyli commented Apr 18, 2023

Sure, please help create another feature request to follow up.
