-
Notifications
You must be signed in to change notification settings - Fork 36
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update stopwords_ukr.js #329
Conversation
79736a3
to
2862555
Compare
Nice, thanks @imposerene! |
Sorry, pushed some more changes. I think that's all for now. But yeah, I saw that tests failed before I pushed again... Will look at what fails and try to fix (maybe I just missed coma somewhere) |
Sorry. Used wrong email in my commit (was using work one). Amended to not relate my current company to the commit. Sorry, for slopy commits... |
No problem 😊 |
Manually selected top frequency pronounces, conjunctions, adpositions, grammatical particles and adverbs from Ukrainian corpus and added them to stopwords_ukr.js . Also moved some stopwords from the end of the to their alphabetical position: now whole list is more or less in alphabetical order (first letter for sure, second letter I think is also fine, but I did not check all existing words and did not use automatic sorting to minimize the changes)
Any suggestions on how to quickly run tests locally? I'm not a JS developer, but if there is an easy way to run under linux or in docker, then I can manage to check the tests locally. I already pushed potential fix (test will now expect "на" adposition to be removed), but would be greate if I could actually run the test locally |
All tests are ok after the small fix I introduced earlier |
Yes, I saw that. Just one test with a little hickup on cleaning up. So I'll merge and publish! |
New version out on npm: https://www.npmjs.com/package/stopword/v/3.1.2 Thanks for the PR, @imposeren ! |
Manually selected top frequency pronounces, conjunctions, adpositions, grammatical particles and adverbs from Ukrainian corpus and added them to stopwords_ukr.js . Also moved some stopwords from the end of the list to their alphabetical position: now whole list is more or less in alphabetical order (first letter for sure, second letter I think is also fine, but I did not check all existing words and did not use automatic sorting to minimize the changes)