PEP8 Fixes for Summarization. #1017

souravsingh · 2016-11-13T16:37:57Z

No description provided.

tmylk · 2016-11-14T17:37:16Z

gensim/summarization/bm25.py

@@ -18,7 +18,8 @@ class BM25(object):

    def __init__(self, corpus):
        self.corpus_size = len(corpus)
-        self.avgdl = sum(map(lambda x: float(len(x)), corpus)) / self.corpus_size
+        self.avgdl = sum(map(lambda x: float(len(x)), corpus)


what is the reason for this change?

# Arguments on first line forbidden when not using vertical alignment. foo = long_function_name(var_one, var_two, var_three, var_four)```

PEP8 error E501:Line longer than 79 characters.

Does this line need modification?

We ignore the 79 chars guideline, it is silly.

Put line breaks in places that are conceptually and visually meaningful. Try not to exceed 120 characters per line (unless absolutely necessary).

piskvorky · 2016-11-15T08:22:27Z

This PR has the same issues as the previous ones: makes code harder to read and maintain (unwanted line breaks, vertical indent).

tmylk

please remove length of line changes

tmylk · 2016-11-22T19:43:00Z

gensim/summarization/commons.py

@@ -16,5 +16,6 @@ def build_graph(sequence):

 def remove_unreachable_nodes(graph):
    for node in graph.nodes():
-        if sum(graph.edge_weight((node, other)) for other in graph.neighbors(node)) == 0:
+        if sum(graph.edge_weight((node, other))


we don't check for lenght of line

tmylk · 2016-11-22T19:43:14Z

gensim/summarization/keywords.py

@@ -161,7 +161,8 @@ def _get_combined_keywords(_keywords, split_text):
        if word in _keywords:
            combined_word = [word]
            if i + 1 == len_text:
-                result.append(word)   # appends last word if keyword and doesn't iterate


tmylk · 2016-11-24T17:06:06Z

do you know how to disable the 'line length' rule in a PEP8 check? Many lines are still unnecessarily corrected.

souravsingh · 2016-11-24T17:27:10Z

There is a command line argument called -max-line-length which can set the maximum line length for scripts(like 120 chars or so, default is 79).

piskvorky · 2016-11-28T06:31:52Z

There are syntax errors in this PR (\ probability_matrix, indent of diff's last line...), as well as unwanted changes (vertical indent). How was this tested?

I am really uneasy about all these "PEP8 fix" pull requests, it seems we're going in circles.

piskvorky · 2016-11-28T06:34:37Z

gensim/summarization/bm25.py


    def get_score(self, document, index, average_idf):
        score = 0
        for word in document:
            if word not in self.f[index]:
                continue
            idf = self.idf[word] if self.idf[word] >= 0 else EPSILON * average_idf
-            score += (idf*self.f[index][word]*(PARAM_K1+1)
-                      / (self.f[index][word] + PARAM_K1*(1 - PARAM_B+PARAM_B*self.corpus_size / self.avgdl)))
+            score += (idf * self.f[index][word] * (PARAM_K1 + 1) /


No vertical indent.
Splitting the mega-expression into something saner (subexpressions) will help both readability and line length.

piskvorky · 2016-11-28T06:34:54Z

gensim/summarization/summarizer.py

@@ -110,12 +110,17 @@ def _get_sentences_with_word_count(sentences, word_count):
    return selected_sentences


-def _extract_important_sentences(sentences, corpus, important_docs, word_count):
+def _extract_important_sentences(


Definitely not.

piskvorky · 2016-11-28T06:35:08Z

gensim/summarization/summarizer.py

    important_sentences = _get_important_sentences(sentences, corpus, important_docs)

    # If no "word_count" option is provided, the number of sentences is
    # reduced by the provided ratio. Else, the ratio is ignored.
-    return important_sentences if word_count is None else _get_sentences_with_word_count(important_sentences, word_count)
+    return important_sentences if word_count is None else _get_sentences_with_word_count(


I have made some changes to the expression for calculation of score to match PEP8 specifications.

PEP8 Fixes for Summarization.

f051cbf

tmylk reviewed Nov 14, 2016

View reviewed changes

souravsingh and others added 3 commits November 22, 2016 00:50

Undo some changes to code.

c3b772e

Fix an error in newline

c2e20c9

Fix indent

fdcbc75

tmylk suggested changes Nov 22, 2016

View reviewed changes

souravsingh added 2 commits November 24, 2016 21:01

Fixes according to the review

482ab89

Some small fixes

1298db6

piskvorky reviewed Nov 28, 2016

View reviewed changes

souravsingh and others added 4 commits November 28, 2016 18:37

Updates to a few scripts

f731f19

Update bm25.py

b21068f

Update keywords.py

fc3a8af

Update expression to match PEP8 specifications

19312fc

I have made some changes to the expression for calculation of score to match PEP8 specifications.

souravsingh closed this Jan 6, 2017

tmylk reopened this Jan 6, 2017

souravsingh mentioned this pull request Jan 15, 2017

Pep8 fixes #1081

Closed

SamriddhiJain mentioned this pull request Mar 10, 2017

pep8/pycodestyle fixes for hanging indents in Summarization module #1202

Merged

souravsingh closed this Apr 20, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PEP8 Fixes for Summarization. #1017

PEP8 Fixes for Summarization. #1017

souravsingh commented Nov 13, 2016

tmylk Nov 14, 2016

souravsingh Nov 14, 2016

piskvorky Nov 15, 2016 •

edited

Loading

piskvorky commented Nov 15, 2016 •

edited

Loading

tmylk left a comment

tmylk Nov 22, 2016

tmylk Nov 22, 2016

tmylk commented Nov 24, 2016 •

edited

Loading

souravsingh commented Nov 24, 2016

piskvorky commented Nov 28, 2016 •

edited

Loading

piskvorky Nov 28, 2016

piskvorky Nov 28, 2016

piskvorky Nov 28, 2016

PEP8 Fixes for Summarization. #1017

PEP8 Fixes for Summarization. #1017

Conversation

souravsingh commented Nov 13, 2016

tmylk Nov 14, 2016

Choose a reason for hiding this comment

souravsingh Nov 14, 2016

Choose a reason for hiding this comment

piskvorky Nov 15, 2016 • edited Loading

Choose a reason for hiding this comment

piskvorky commented Nov 15, 2016 • edited Loading

tmylk left a comment

Choose a reason for hiding this comment

tmylk Nov 22, 2016

Choose a reason for hiding this comment

tmylk Nov 22, 2016

Choose a reason for hiding this comment

tmylk commented Nov 24, 2016 • edited Loading

souravsingh commented Nov 24, 2016

piskvorky commented Nov 28, 2016 • edited Loading

piskvorky Nov 28, 2016

Choose a reason for hiding this comment

piskvorky Nov 28, 2016

Choose a reason for hiding this comment

piskvorky Nov 28, 2016

Choose a reason for hiding this comment

piskvorky Nov 15, 2016 •

edited

Loading

piskvorky commented Nov 15, 2016 •

edited

Loading

tmylk commented Nov 24, 2016 •

edited

Loading

piskvorky commented Nov 28, 2016 •

edited

Loading