-
Notifications
You must be signed in to change notification settings - Fork 277
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support for multi node clusters and logging #3448
Comments
@rishabh6788 could you please look into this. |
Tagging @gaiksaya and @peterzhuamazon as well. |
HI team, can we have some updates on this issue. |
Hi @monusingh-1, We recently enhanced our testing logs. See #3381 for more information. You can now access all logs for given distribution. Also the integration testing code is located at https://github.com/opensearch-project/opensearch-build/blob/main/src/test_workflow/integ_test/distribution_rpm.py Can you look into that if you have the bandwidth and see if the issue in our test workflow or in the distribution itself? You can also try to install manually and see if the cluster for those distribution comes up as expected: https://opensearch.org/docs/latest/install-and-configure/index/ |
Hi @gaiksaya, #3381 does not take into account if there are multiple clusters running according to topology specified. |
For yum and deb we need to run 2 systemctl service for opensearch, one on 9200 and other on 9300 port to be able to run on the same ec2 host. |
@rishabh6788, currently, the test workflow is able to create multiple clusters when specified in the manifest, pain point is unavailability of individual cluster logs. |
I think there is a logging issue that is applied on all distribution types including tarball. The reason is there are two individual clusters and each of them will create a cluster logs ( |
Hi zelinh, is this issue on the roadmap ? |
eg:
|
@gaiksaya could we have some traction on this. |
@monusingh-1 @ankitkala This is not on our priority list. We will try to look in to it when we have bandwidth. Having said that, it would be great if you can contribute a fix for this issue to get it in early. |
@peterzhuamazon @zelinh @gaiksaya Can we test and close this issue since the PR is merged? |
Thanks for the contribution @monusingh-1 the change works as expected. Tested it with 2.11.0 artifact here: https://build.ci.opensearch.org/blue/organizations/jenkins/integ-test/detail/integ-test/6457/pipeline Closing this issue! |
Is your feature request related to a problem? Please describe
During integration tests of cross cluster replication, clusters with following Topology are created
Leader - Data node -2
Follower- Data node -2
The data nodes join to form a cluster in these distribution builds
2.7.0/opensearch-2.7.0-test.yml, 2.7.0, arm64, 7759, tar,
2.7.0/opensearch-2.7.0-test.yml, 2.7.0, x64, 7759, tar
https://build.ci.opensearch.org/blue/organizations/jenkins/integ-test/detail/integ-test/4653/pipeline
The nodes fail to join to form common cluster in these distributions
2.7.0/opensearch-2.7.0-test.yml, 2.7.0, x64, 7764, deb
2.7.0/opensearch-2.7.0-test.yml, 2.7.0, arm64, 7764, rpm
https://build.ci.opensearch.org/blue/organizations/jenkins/integ-test/detail/integ-test/4658/pipeline
https://build.ci.opensearch.org/job/integ-test/4660/execution/node/858/log/?consoleFull
Search for
Node 1 at
2023-04-26 08:51:30
Node 2 at
2023-04-26 08:50:57
We can see
After investigation we were able to check that when the data nodes fail to join, the clusters are not reachable and the integ tests starts throwing
Due to insufficient cluster level logging we are not able to investigate why the nodes fail to join in some distribution.
Describe the solution you'd like
Describe alternatives you've considered
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: