Skip to content

Shoud fuzzy search be supported? #615

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
suzumiyasmith opened this issue Sep 3, 2017 · 10 comments
Closed

Shoud fuzzy search be supported? #615

suzumiyasmith opened this issue Sep 3, 2017 · 10 comments
Assignees

Comments

@suzumiyasmith
Copy link

Now, it is obvious that fuzzy search is not supported. e.g. search rethink won't get rethinkdb package as a result. Sometimes, it can be annoying. :(

Any idea or plan about fuzzy searching?

Thanks.

@tfausak
Copy link

tfausak commented Sep 7, 2017

I would like to see this too. I can't remember the number of times I've searched for a partial string and been surprised by the lack of results. Some recent examples that I remember:

That being said, I'm not holding my breath for a fix. Hackage's search appears to be based on keywords and word stems. And search issues tend to stay open for ... a while.

Have you tried searching on Stackage? It does a better job for all the queries given here:

@gbaz
Copy link
Contributor

gbaz commented Sep 7, 2017

Hackage is under active development at the moment, and search is definitely a place we want to tackle! thanks for the links to related issues, but I wouldn't be so negative about the possibility for progress. More hands and more hackers are always welcome :-)

@gbaz gbaz self-assigned this Sep 22, 2017
@gbaz
Copy link
Contributor

gbaz commented Sep 30, 2017

So I think the simplest thing to do is ad-hoc improve our stemmer since it seems to work just fine except for "weird" computery stems like dropping "db" or "ql" etc.

@gbaz gbaz mentioned this issue Sep 30, 2017
hvr added a commit that referenced this issue Oct 28, 2017
Ad hoc stemming (should resolve the concrete issues posed in #615)
@adamgundry
Copy link
Member

It might be worth switching Hackage to use the full-text-search package. This was derived from the Hackage search implementation (so it provides a very similar interface and switching should be fairly trivial), but it has a few extra features, notably autocomplete search.

@gbaz
Copy link
Contributor

gbaz commented Feb 22, 2018

Good idea. Any volunteers? :-P

@adamgundry
Copy link
Member

I'm hoping to get a chance to look at the full-text-search integration at some point (but no promises, and no objections to someone else jumping in!).

On the original issue, another case I just happened across (see haskell/binary#92) is that searching for "serialisation" yields binary whereas searching for "serialization" yields cereal.

@gbaz gbaz mentioned this issue Apr 1, 2018
@gbaz
Copy link
Contributor

gbaz commented Apr 1, 2018

@adamgundry do you think we should always change "z" to "s" in all our stemming? Might be the simplest way to prevent the pitfalls of britishised spellings (or britishized if you prefer).

@gbaz
Copy link
Contributor

gbaz commented Apr 1, 2018

I do worry about other internationalisation issues that have a different flavour...

@gbaz
Copy link
Contributor

gbaz commented Apr 30, 2018

Ad-hoc stemming now works. That means postgres and influx and rethink searches work. I think that may suffice to close this issue?

@gbaz
Copy link
Contributor

gbaz commented Apr 30, 2018

I'll make a different issue for full-text-search

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants