Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Full-search. No result on some words or characters #6674

Closed
ScharfViktor opened this issue Jun 29, 2023 · 4 comments · Fixed by #7553
Closed

Full-search. No result on some words or characters #6674

ScharfViktor opened this issue Jun 29, 2023 · 4 comments · Fixed by #7553
Labels

Comments

@ScharfViktor
Copy link
Contributor

Start ocis with tika use https://github.com/owncloud/web/blob/master/docker-compose.yml

tika cuts and stores only the words. all characters, numbers, prepositions, and punctuation marks are cut off.
you can't find something using non-words or words with character and it won't show up in the search results

Example:

  1. content with a number user1 shares the file to Marie

  • Search doesn't get result by user1.
  • if we search for user -> <mark>user</mark> user shares file marie -> no numbers, no article the and preposition to
  1. content with a link https://localhost/remote.php/dav/files/admin/Photos/San%20Francisco.jpg.

  • Upload ownCloud manual.pdf. here can download it: https://cloud.owncloud.com/s/5stDcdTW7K7XJio -
  • search for localhost.
  • result -> <oc:highlights>...xml &#39;http <mark>localhost</mark> remote php dav files admin photos san francisco jpg&#39;return…</oc:highlights>
  1. content with a find can should will .... words

  • no search results for words
@ScharfViktor ScharfViktor changed the title Highlighting in search. Highlighting in search. No result on some words or characters Jun 29, 2023
@ScharfViktor
Copy link
Contributor Author

when we use keywords like where, this, what to search the file via full-search then it does not search

@ScharfViktor ScharfViktor changed the title Highlighting in search. No result on some words or characters Full-search. No result on some words or characters Jul 3, 2023
@saw-jan
Copy link
Member

saw-jan commented Sep 7, 2023

TODO (when fixed):

  • add test coverage

@fschade
Copy link
Contributor

fschade commented Oct 20, 2023

confirmed, everything expect 3. is a bug, 3 is intended since those words are part of the stop word list.
Anyway, it makes sense to have a option if stop word cleaning should take place or not, i take care.

@ScharfViktor
Copy link
Contributor Author

TODO (when fixed):

  • add test coverage

#7574

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Archived in project
Development

Successfully merging a pull request may close this issue.

3 participants