Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix stopwords in areatitles #477

Merged
merged 15 commits into from
Sep 7, 2020
Merged

fix stopwords in areatitles #477

merged 15 commits into from
Sep 7, 2020

Conversation

chreman
Copy link
Member

@chreman chreman commented Aug 31, 2020

This PR fixes the root cause of two edge cases, which sometimes intersect: documents with single (or duplicate) documents that have no keywords. In theses cases, two bugs could occur individually or simultaneously: appearance of stopwords in bubble titles that should be excluded; wrongly increased number of bubbles.

The bubble title replacement was not working at the correct level: instead of replacing single bubble titles for one (or multiple) documents, multiple area titles were created according to the count of documents with missing subjects.

This issue has now been fixed, resulting in correct number of bubbles and preventing appearance of stopwords in bubble titles.

@chreman chreman merged commit 88ede04 into master Sep 7, 2020
@chreman chreman deleted the fix-titlestopwords branch September 7, 2020 11:20
chreman added a commit to chreman/Headstart that referenced this pull request Oct 13, 2021
…itlestopwords

fix stopwords in areatitles

Former-commit-id: 88ede04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant