It seems that the way the attention weight is calculated here is different from the original paper, `softmax(v * tanh(W*[s, h]))`: a ReLU is used after the softmax here. Can you give some reasons or a reference?
```python
def forward(self, hidden, encoder_outputs):
    # hidden: [1, B, H] decoder state, encoder_outputs: [T, B, H]
    timestep = encoder_outputs.size(0)
    # repeat the decoder state across all encoder time steps -> [B, T, H]
    h = hidden.repeat(timestep, 1, 1).transpose(0, 1)
    encoder_outputs = encoder_outputs.transpose(0, 1)  # [B, T, H]
    attn_energies = self.score(h, encoder_outputs)     # [B, T]
    return F.relu(attn_energies).unsqueeze(1)          # [B, 1, T]
```
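For comparison, here is a minimal sketch of the additive scoring described in the original paper, `softmax(v * tanh(W*[s, h]))`. The class name, layer setup, and tensor shapes are my assumptions for illustration, not this repository's code:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class BahdanauAttention(nn.Module):
    """Additive attention: softmax over v . tanh(W[s; h]) (assumed sketch)."""
    def __init__(self, hidden_size):
        super().__init__()
        self.attn = nn.Linear(2 * hidden_size, hidden_size)  # W
        self.v = nn.Parameter(torch.rand(hidden_size))        # v

    def forward(self, hidden, encoder_outputs):
        # hidden: [1, B, H] decoder state, encoder_outputs: [T, B, H]
        timestep = encoder_outputs.size(0)
        h = hidden.repeat(timestep, 1, 1).transpose(0, 1)      # [B, T, H]
        encoder_outputs = encoder_outputs.transpose(0, 1)      # [B, T, H]
        energy = torch.tanh(self.attn(torch.cat([h, encoder_outputs], dim=2)))  # [B, T, H]
        scores = energy @ self.v                               # [B, T]
        return F.softmax(scores, dim=1).unsqueeze(1)           # [B, 1, T]
```

In this formulation the softmax alone normalizes the scores into attention weights, so no extra ReLU is needed, which is why I'm asking about the motivation for the ReLU in this repo.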