CERN: File search #2970

Open
labkode opened this issue Jun 14, 2022 · 8 comments
Comments

@labkode
Member

labkode commented Jun 14, 2022

Allow our users to search for filenames
Provide a list of "recently viewed" files in the main UI

@labkode
Member Author

labkode commented Jun 15, 2022

It should be there.

@dragotin @micbar how has this search feature been stress tested? With how many files?
Does the UI contain the "Recently viewed" files?

@micbar
Member

micbar commented Jun 15, 2022

We implemented a bleve search index in ocis, outside of reva, which answers REPORT requests.
It has not been stress tested yet. We will definitely test this before GA.

I doubt that CERNBox can be treated like any other ownCloud out there ;-) So I expect a different, CERN-specific implementation. We implemented search using the same pattern, with a SearchProvider and a SearchDriver. The driver can be swapped for a different one that does not use bleve.
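To make the provider/driver split concrete, here is a minimal Go sketch. It is not the actual ocis or reva code: the `Match` type, the method signatures, and the in-memory `memoryDriver` are all hypothetical and only illustrate how a provider can stay unchanged while the driver behind it is replaced.

```go
package main

import (
	"fmt"
	"strings"
)

// Match is a hypothetical search hit; the real ocis/reva types differ.
type Match struct {
	SpaceID string
	Path    string
}

// SearchDriver is an illustrative interface for an index backend
// (bleve in ocis, possibly something else at CERN).
type SearchDriver interface {
	Index(spaceID, path string) error
	Search(query string) ([]Match, error)
}

// SearchProvider validates the request and delegates to whichever
// driver it was constructed with.
type SearchProvider struct {
	driver SearchDriver
}

func NewSearchProvider(d SearchDriver) *SearchProvider {
	return &SearchProvider{driver: d}
}

func (p *SearchProvider) Search(query string) ([]Match, error) {
	if strings.TrimSpace(query) == "" {
		return nil, fmt.Errorf("empty query")
	}
	return p.driver.Search(query)
}

// memoryDriver is a toy driver: a case-insensitive substring scan
// over the indexed paths. It stands in for a real index backend.
type memoryDriver struct {
	entries []Match
}

func (m *memoryDriver) Index(spaceID, path string) error {
	m.entries = append(m.entries, Match{SpaceID: spaceID, Path: path})
	return nil
}

func (m *memoryDriver) Search(query string) ([]Match, error) {
	q := strings.ToLower(query)
	var out []Match
	for _, e := range m.entries {
		if strings.Contains(strings.ToLower(e.Path), q) {
			out = append(out, e)
		}
	}
	return out, nil
}

func main() {
	d := &memoryDriver{}
	_ = d.Index("space-1", "/docs/report.pdf")
	_ = d.Index("space-2", "/photos/cern.jpg")

	p := NewSearchProvider(d)
	hits, err := p.Search("report")
	fmt.Println(hits, err)
}
```

Swapping the backend then only means passing a different `SearchDriver` implementation to `NewSearchProvider`; the code that answers the request does not need to change.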

@dragotin
Contributor

Hm, I would not give up on the bleve solution too quickly. At least for personal spaces, it can be a proper solution. For other spaces, it depends on the size of the indexed spaces.

@labkode
Member Author

labkode commented Jun 15, 2022

@micbar can you point me to the interface that needs to be implemented?

@micbar
Member

micbar commented Jul 1, 2022

Current State

  • We store the spaces in one bleve index and have one service running.

What would we need to do to scale that?

Understand the limitations of bleve

  • How big can one index grow?
  • Does it help to open / close indexes on demand?
  • Do we gain performance if we shard spaces across multiple indexes?
  • Can we shard the whole load across multiple search services?
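As one possible reading of the sharding questions above, the following Go sketch assigns each space to one of a fixed number of indexes by hashing its ID. The function name `shardFor`, the shard count, and the space IDs are assumptions made for illustration; nothing here reflects how ocis actually lays out its bleve indexes.

```go
package main

import (
	"fmt"
	"hash/fnv"
)

// shardFor maps a space ID to a shard number in [0, n) using FNV-1a.
// The same ID always lands on the same shard as long as n is unchanged.
func shardFor(spaceID string, n int) int {
	if n < 1 {
		n = 1 // guard against a zero or negative shard count
	}
	h := fnv.New32a()
	h.Write([]byte(spaceID)) // Write on a hash.Hash never returns an error
	return int(h.Sum32() % uint32(n))
}

func main() {
	const shards = 4 // hypothetical number of indexes or search services
	for _, id := range []string{"space-a", "space-b", "space-c"} {
		fmt.Printf("%s -> index-%d\n", id, shardFor(id, shards))
	}
}
```

A plain modulo scheme like this remaps most spaces whenever the shard count changes, so a real deployment would likely need a stable mapping (for example a lookup table or consistent hashing) if indexes are added over time.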

@labkode
Member Author

labkode commented Jul 1, 2022

Work in progress; the bleve stress tests are still to be done.

@labkode
Member Author

labkode commented Jul 15, 2022

We'll implement this on master using the existing search API (Report method).
The bulk of the work is in the driver, so by using the same interface as in edge, the migration should not cause any major headaches.
