Bug in forward of attention_augmented_conv.py #21

Open
dhananjaisharma10 opened this issue Dec 29, 2020 · 2 comments

@dhananjaisharma10

Hi! I think there's a bug at this line in the forward function. Specifically, suppose the attention tensor attn_out looks like the following for an input image of shape (channels, h=2, w=3) with dv = 2 self-attention channels:

# attention values of the 6 pixels
Att tensor([[-3.5002, -1.2102],
        [-4.3694, -1.5107],
        [-4.7621, -1.6465],
        [-4.9178, -1.7003],
        [-2.2335, -0.7722],
        [-5.0056, -1.7307]], grad_fn=<SliceBackward>)

you should not reshape it directly using

attn_out = torch.reshape(attn_out, (batch, Nh, dv // Nh, height, width)) # Method 1

but instead you should use

attn_out = torch.reshape(attn_out.permute(0, 1, 3, 2), (batch, Nh, dv // Nh, height, width)) # Method 2

The output difference:

# Method 1
Att tensor([[[-3.5002, -1.2102, -4.3694],
         [-1.5107, -4.7621, -1.6465]],

        [[-4.9178, -1.7003, -2.2335],
         [-0.7722, -5.0056, -1.7307]]], grad_fn=<SliceBackward>)

vs.

# Method 2
Att tensor([[[-3.5002, -4.3694, -4.7621],
         [-4.9178, -2.2335, -5.0056]],

        [[-1.2102, -1.5107, -1.6465],
         [-1.7003, -0.7722, -1.7307]]], grad_fn=<SliceBackward>)
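
For reference, here is a minimal standalone sketch that reproduces the difference (an illustration, not the repo's code; it assumes attn_out has shape (batch, Nh, H*W, dv // Nh) right before the reshape):

import torch

# Toy setup matching the example above: Nh = 1 head, dv = 2 channels, a 2x3 image.
batch, Nh, dv, height, width = 1, 1, 2, 2, 3
attn_out = torch.arange(height * width * dv, dtype=torch.float32)
attn_out = attn_out.reshape(batch, Nh, height * width, dv // Nh)  # one row per pixel

# Method 1: a direct reshape reads elements in row-major order, so values from
# different channels get interleaved across the (height, width) plane.
method1 = torch.reshape(attn_out, (batch, Nh, dv // Nh, height, width))

# Method 2: swap the pixel and channel axes first, so each channel's H*W values
# are consecutive and map cleanly onto (height, width).
method2 = torch.reshape(attn_out.permute(0, 1, 3, 2), (batch, Nh, dv // Nh, height, width))

print(method1[0, 0])  # mixes channel values within each spatial plane
print(method2[0, 0])  # each plane holds one channel's six per-pixel values in order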

Hope it helps!

@JonathanCMitchell

It looks like you are just moving the width and height around. What is the purpose behind this?

@JonathanCMitchell

I can confirm that this is a bug and that you solved it. You can disregard my original question. The issue is that the reshape needs H*W as the last dimension.
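
(A tiny illustration of why, as a sketch rather than the repo's code: reshape traverses elements in row-major order, so the flat pixel axis of length H*W has to be the trailing one before it is split into (H, W).)

import torch

x = torch.arange(6).reshape(6, 2)         # (H*W, channels) per head, as in the example above
wrong = x.reshape(2, 2, 3)                # channel values leak across the spatial plane
right = x.permute(1, 0).reshape(2, 2, 3)  # channels first, then split H*W into (height, width)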
