how to output sentence's probability? #96
Same question here! @tsungruihon did you find a solution? |
@wolflo no I haven't, it's still a work in progress. |
@tsungruihon I calculate the likelihood of an input sentence by summing the log-probabilities output by the model for each word of the input sentence. It looks like this:
|
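[Editor's note: the code block from this comment was not preserved. Below is a minimal sketch of the idea, with a toy bigram lookup table standing in for the trained LSTM; `bigram` and `sentence_logprob` are illustrative names, not the repo's API.]

```python
import math

# Toy stand-in for the trained language model:
# P(next word | previous word) as a lookup table.
bigram = {
    ("the", "cat"): 0.2,
    ("cat", "sat"): 0.5,
}

def sentence_logprob(words):
    """Sum the log-probabilities the model assigns to each word,
    conditioned on the word that precedes it."""
    total = 0.0
    for prev, w in zip(words, words[1:]):
        total += math.log(bigram[(prev, w)])
    return total

score = sentence_logprob(["the", "cat", "sat"])  # log(0.2) + log(0.5)
```

Note that, as written, this scores every word given its predecessors but assigns nothing to the first word of the sentence, which is exactly what the discussion below turns on.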
@wolflo thanks my friend! Nice work! |
Hi, thanks @wolflo ! One thing is confusing me - does this also take into account the probability of the first token in the sentence? (i.e., the probability the model assigns to the first token when in the state given by model.initHidden?) |
Hi @gailweiss , an approximation of the log probability of the first token in the sentence should be given by |
Hi @wolflo, thanks for the quick response! But isn't it available more directly: isn't the distribution the model outputs after reading an `<eos>` token exactly the distribution over the first token of the next sentence? |
Oh, I see! That's an excellent remark. Then, I think you could rewrite the above scoring function as:
What do you think? |
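[Editor's note: the rewritten scoring function was also lost from this comment. A sketch under the same toy stand-in as above, now also scoring the first word from the state after `<eos>`; all names are illustrative, not the repo's API.]

```python
import math

# Toy stand-in for the trained language model, now including
# transitions out of the <eos> tag that separates sentences.
bigram = {
    ("<eos>", "the"): 0.4,
    ("the", "cat"): 0.2,
    ("cat", "sat"): 0.5,
}

def sentence_logprob(words):
    """Score every word, including the first one: the context starts
    at <eos>, so P(first word | <eos>) is counted too."""
    total = 0.0
    prev = "<eos>"
    for w in words:
        total += math.log(bigram[(prev, w)])
        prev = w
    return total

score = sentence_logprob(["the", "cat", "sat"])
```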
This seems to make sense :) thank you for taking the time to get into this! I assume/hope the way the models here are trained, one sequence begins after the `<eos>` of the previous, i.e. I hope that the training in this repository also trains the distribution after `<eos>`. But at any rate this is a consistent solution and it's just a question of whether the model optimises appropriately, which is something else. Thank you! |
Indeed, training in this repo is performed over a long tensor representing the concatenation of all the sentences of the corpus, separated by the `<eos>` tag. Thank you for pointing out this issue! |
Hi @wolflo, thanks for the code. I have one issue related to next-word prediction: given a word and the previous hidden state, we could try to predict the next most probable word according to the softmax probability distribution. Did you try to do this with your function?
Maybe you have run into this issue before? Thanks. |
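[Editor's note: a minimal sketch of the greedy next-word prediction described in this question, with a toy distribution standing in for the model's softmax output; `next_word_probs` and `predict_next` are illustrative names, not part of the repo.]

```python
# Toy softmax output: P(next word | current word), standing in for
# the distribution the model produces from a word and hidden state.
next_word_probs = {
    "the": {"cat": 0.6, "dog": 0.3, "sat": 0.1},
}

def predict_next(word):
    """Greedy decoding: pick the word with the highest probability."""
    dist = next_word_probs[word]
    return max(dist, key=dist.get)

prediction = predict_next("the")  # "cat"
```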
I tried sentence generation some time ago with the awd-lstm model trained on wikitext-2. Results were pretty poor for me too. You might improve generation quality by adjusting the temperature, by using some tricks like beam search or by training the model on bigger datasets. Unfortunately, I do not have time to dig further into this right now. Should I work on this in the future, I will let you know ! Have a good day :) |
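[Editor's note: a sketch of the temperature adjustment mentioned above; `softmax_with_temperature` is an illustrative helper applied to raw model scores (logits), not a function from the repo.]

```python
import math

def softmax_with_temperature(logits, temperature=1.0):
    """Temperature < 1 sharpens the distribution (closer to greedy);
    temperature > 1 flattens it (more diverse samples)."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    z = sum(exps)
    return [e / z for e in exps]

logits = [2.0, 1.0, 0.5]
sharp = softmax_with_temperature(logits, temperature=0.5)
flat = softmax_with_temperature(logits, temperature=2.0)
```

Sampling from the sharpened distribution concentrates mass on the model's top choices, which often improves fluency at the cost of diversity.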
May I ask how to use awd-lstm-lm to output a sentence's probability?