Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Word Issues #1

Open
timothymarois opened this issue Jan 20, 2013 · 0 comments
Open

Word Issues #1

timothymarois opened this issue Jan 20, 2013 · 0 comments

Comments

@timothymarois
Copy link

There are several word issues, and the PorterStemmer never was able to resolve.
(from what I've noticed)

Words Ending with: (these are just some ive noticed, I wish we had a solution)

  • 'er' (this is a tricky one because 'er' can be removed or kept the e)
    • counter => counter (should equal 'count')
    • crusher => crusher (should be crush)
  • 'le'
    • puzzle => puzzl (should equal puzzle)
    • rumble => rumbl (should equal rumble)
  • 'y'
    (rocky => rocki (should equal rock)
    (communities => communiti) -this one is a well-known issue
    (plays => plai) Im surprised at this one

for some, you can run a spell check program like 'pspell' php ext, and compare if its correct spelling of a 'real' word.

I wonder if for some, you could find a pattern for why there is an e and some arent. Though these algorithms are old and possibly all thats able to do without a 'speech' program.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant