Solr: load balancing, fault tolerance, and high availability #2322

Closed
pdurbin opened this issue Jul 7, 2015 · 3 comments

@pdurbin
Member

pdurbin commented Jul 7, 2015

As of Dataverse 4.0, Solr is assumed to run on a single host. In #1083 we added a configuration option so that Solr doesn't have to run on localhost, but there are several problems with our current approach:

Ideally, this ticket would encompass the following deliverables:

  • Fault tolerance: no more single point of failure, greater availability
  • Load balancing: distribution of indexing and search load across multiple servers

From a quick look, it seems like SolrCloud is what we want: https://cwiki.apache.org/confluence/display/solr/SolrCloud. It makes use of ZooKeeper, which we previously discussed briefly at #1083 (comment).
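
To make the two deliverables above concrete, here is a minimal sketch of what they mean from a client's point of view: queries rotate across several Solr hosts (load balancing), and a host that refuses connections is skipped (fault tolerance). This is not Dataverse code and not the SolrJ or SolrCloud API; `RoundRobinSolr`, `AllHostsFailed`, and the `send` callback are hypothetical names invented for illustration, and the host names in the usage note are placeholders.

```python
from typing import Callable, List, Sequence


class AllHostsFailed(Exception):
    """Raised when every configured host was tried and none answered."""


class RoundRobinSolr:
    """Hypothetical client-side sketch: rotate across hosts, skip dead ones."""

    def __init__(self, hosts: Sequence[str], send: Callable[[str, str], str]):
        if not hosts:
            raise ValueError("at least one host is required")
        self._hosts: List[str] = list(hosts)
        # `send` is an assumed callback that performs the real request
        # and raises ConnectionError when the host is unreachable.
        self._send = send
        self._next = 0  # index of the host to try first on the next query

    def query(self, q: str) -> str:
        errors = []
        n = len(self._hosts)
        for offset in range(n):
            index = (self._next + offset) % n
            host = self._hosts[index]
            try:
                result = self._send(host, q)
            except ConnectionError as exc:
                # Fault tolerance: remember the failure and try the next host.
                errors.append(f"{host}: {exc}")
                continue
            # Load balancing: the next query starts after the host that answered.
            self._next = (index + 1) % n
            return result
        raise AllHostsFailed("; ".join(errors))
```

With placeholder hosts such as `["solr1:8983", "solr2:8983", "solr3:8983"]` and a `send` function that raises `ConnectionError` for `solr1`, successive calls to `query` are answered by `solr2`, then `solr3`, then `solr2` again. SolrCloud aims to handle this coordination for you through ZooKeeper, so the sketch only shows the behavior we are after, not how it would be implemented.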

@pdurbin pdurbin added the Type: Suggestion an idea label Jul 7, 2015
@pdurbin
Member Author

pdurbin commented Aug 13, 2015

I just realized that http://lucene.apache.org/solr/quickstart.html is totally new for Solr 5 vs. Solr 4.

To launch Solr, run: bin/solr start -e cloud -noprompt

So maybe it will be easier to use SolrCloud as part of an upgrade from Solr 4.6.0 (#456). It seems like they don't guide you to use java -jar start.jar anymore.

@mercecrosas mercecrosas modified the milestone: In Review Nov 30, 2015
@mheppler mheppler added Feature: Search/Browse Component: Code Infrastructure formerly "Feature: Code Infrastructure" labels Jan 28, 2016
@scolapasta scolapasta removed this from the Not Assigned to a Release milestone Jan 28, 2016
@pdurbin
Member Author

pdurbin commented Feb 2, 2016

@bionary just wrote to the mailing list with a nice writeup having to do with SolrCloud, and I copied and pasted his list of issues to think about into a Google Doc called Integrating Dataverse 4.x with SolrCloud.

@pdurbin
Member Author

pdurbin commented Jun 28, 2017

As I wrote at #2985 (comment), I still feel bad for encouraging @bionary to create pull request #2985 and then being unable to follow up on it and shepherd it through to completion.

No one else seems to have expressed much interest in load balancing, fault tolerance, and high availability for Solr as it's used in Dataverse, so I'm closing this issue. Please get in touch if you're interested after all.

@pdurbin pdurbin closed this as completed Jun 28, 2017

4 participants