The ngram size parameter (n) in redundancy calculation seems not used #2

rordi · 2023-12-03T16:51:23Z

Thank you for the wonderful work on the MoreThanSentiments package.

The parameter n of the Redundancy function does not appear to control the ngram size, because the ngram size is hardcoded to 10 in the following line:

morethansentiments/src/MoreThanSentiments.py

Line 203 in ebb2837

ngram[i][j] = list(nltk.ngrams(input_data[i][j].split(),10))

The original publication (Cazier and Pfeiffer, 2015) was based on rather long documents, so a 10-gram was probably fine in that context. When dealing with shorter documents, it would be useful for users to be able to work with a smaller ngram size.
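
For illustration, here is a minimal sketch of what passing n through could look like. It is not the package's actual code: the helper names `word_ngrams` and `sentence_ngrams` are hypothetical, and the nested documents-then-sentences input shape is only inferred from the `input_data[i][j]` indexing in the line quoted above. The pure-Python helper is meant to produce the same tuples as `list(nltk.ngrams(text.split(), n))`, so the sketch runs without NLTK installed.

```python
from typing import List, Tuple


def word_ngrams(text: str, n: int) -> List[Tuple[str, ...]]:
    """Split `text` on whitespace and return its word n-grams as tuples."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    tokens = text.split()
    # zip stops at the shortest slice, so texts with fewer than n tokens yield [].
    return list(zip(*(tokens[i:] for i in range(n))))


def sentence_ngrams(docs: List[List[str]], n: int = 10) -> List[List[List[Tuple[str, ...]]]]:
    """Build n-grams per sentence, using the caller's n instead of a fixed 10.

    `docs` is assumed to be a list of documents, each a list of sentence strings.
    """
    return [[word_ngrams(sentence, n) for sentence in doc] for doc in docs]


docs = [["the quick brown fox jumps", "over the lazy dog"]]
print(sentence_ngrams(docs, n=3)[0][1])   # trigrams of the second sentence
print(sentence_ngrams(docs, n=10)[0][1])  # empty: the sentence has fewer than 10 words
```

The second call shows why this matters for short texts: with the size fixed at 10, a sentence of fewer than ten words produces no ngrams at all.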


jinhangjiang · 2023-12-04T14:31:45Z

@rordi you are right. We will fix it soon. Thank you for pointing that out! Will get back to you on this thread once the bug is fixed.
