-
Notifications
You must be signed in to change notification settings - Fork 7
Tips
Vincent Foulon edited this page Feb 12, 2020
·
1 revision
Assuming you haven't configured the types of your engine, you can optimize your index by using advanced tokenizers.
The default configuration allows you to bootstrap a generic search engine, without language specific configuration. If you want to have more accurate results you can use some LanguageTokenizers (actually, 12 languages are supported).
- LanguageStemmingTokenizer : Stem the words so that plural form or conjugations are cropped. Only the useful part of the word is kept.
- LanguageStopWordsTokenizer : Removes all "stop words" from the tokens. A stop word is a word that add nothing to the search, like pronouns (I, you, me, etc...) and many many others.