You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello,
I would like to use this LM for classification and therefore I need to
calculate the log probability of an entire document.
One of the getLogProb() methods state:
"Calculate language model score of an n-gram. <b>Warning:</b> if you
* pass in an n-gram of length greater than <code>getLmOrder()</code>,
* this call will silently ignore the extra words of context. In other
* words, if you pass in a 5-gram (<code>endPos-startPos == 5</code>) to
* a 3-gram model, it will only score the words from <code>startPos + 2</code>
* to <code>endPos</code>."
Is it correct to assume that the only way to get the log probability score for
an entire document (sentence that contains more than LMOrder words) is to split
up the document in separate n-grams and query the log probability score for
these separately?
Original issue reported on code.google.com by Dirkcvan...@gmail.com on 26 May 2015 at 7:49
The text was updated successfully, but these errors were encountered:
Original issue reported on code.google.com by
Dirkcvan...@gmail.com
on 26 May 2015 at 7:49The text was updated successfully, but these errors were encountered: