Addition of tests for 3AZ Cluster Scenarios - Netsplit b/w DCs #4156

Status: Open (1 commit to merge into base: master)

@pdhiran pdhiran commented Oct 9, 2024

Description

Adding test coverage for 3 AZ cluster with stretch pools.

New test module added:

  1. tests/rados/test_stretch_n-az_netsplit_scenarios.py -> Test module to induce netsplit scenarios between two DCs in a 3 AZ cluster.

Modified the 3 AZ conf file to include hosts from different subnets.

New methods added in ceph_core:

  1. create_3_az_stretch_pool: method to create a regular pool, create a custom CRUSH rule, and enable stretch mode on the new pool.
  2. get_multi_az_stretch_site_hosts: method to fetch the hosts present in the various CRUSH hierarchies, based on the CRUSH bucket type and the number of such buckets.
  3. enable_nsite_stretch_pool: method to enable stretch mode on the pool.
  4. add_custom_crush_rules: method to add custom CRUSH rules to the cluster by modifying the cluster's crush.bin file.
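
For readers unfamiliar with the crush.bin workflow that add_custom_crush_rules relies on, here is a minimal sketch of the usual fetch, decompile, edit, recompile, inject cycle. It is not the code from this PR: the function names, the file paths, and the example rule text (rule name, id, bucket counts) are assumptions made for illustration. The sketch only builds the command strings and edits the decompiled text, so it can be read without a cluster.

```python
from typing import List

# Illustrative rule only (assumed values): spread replicas across
# 3 datacenter buckets, 2 hosts in each.
EXAMPLE_3AZ_RULE = """rule 3az_rule {
\tid 1
\ttype replicated
\tstep take default
\tstep choose firstn 3 type datacenter
\tstep chooseleaf firstn 2 type host
\tstep emit
}"""


def append_rule_to_crush_text(crush_text: str, rule_text: str) -> str:
    """Insert rule_text just before the '# end crush map' trailer that
    crushtool writes at the end of a decompiled map. If the trailer is
    missing, the rule is appended at the end instead."""
    marker = "# end crush map"
    rule_block = rule_text.strip() + "\n"
    idx = crush_text.rfind(marker)
    if idx == -1:
        return crush_text.rstrip("\n") + "\n" + rule_block
    return crush_text[:idx] + rule_block + "\n" + crush_text[idx:]


def crush_update_commands(
    bin_path: str = "/tmp/crush.bin",        # hypothetical paths
    txt_path: str = "/tmp/crush.txt",
    new_bin_path: str = "/tmp/crush.new.bin",
) -> List[str]:
    """Return the CLI steps, in order, to apply an edited CRUSH map."""
    return [
        f"ceph osd getcrushmap -o {bin_path}",        # fetch binary map
        f"crushtool -d {bin_path} -o {txt_path}",     # decompile to text
        # ... edit txt_path here, e.g. with append_rule_to_crush_text ...
        f"crushtool -c {txt_path} -o {new_bin_path}", # recompile
        f"ceph osd setcrushmap -i {new_bin_path}",    # inject into cluster
    ]
```

In a test framework these commands would be run on an admin node through whatever remote-execution helper the framework provides; that part is left out here on purpose.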

@pdhiran pdhiran added RADOS Rados Core 8.0 Squid Automation labels Oct 9, 2024
@pdhiran pdhiran self-assigned this Oct 9, 2024

openshift-ci bot commented Oct 9, 2024

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: pdhiran

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment


pdhiran commented Oct 9, 2024

Run log: http://magna002.ceph.redhat.com/cephci-jenkins/cephci-run-J8QY71/

The test failed due to bug https://bugzilla.redhat.com/show_bug.cgi?id=2316900.

Creating the PR anyway so that all the supporting methods for 3 AZ are merged and available for automating other 3 AZ scenarios.

Signed-off-by: Pawan Dhiran <pdhiran@redhat.com>
        min_size=min_size,
    ):
        log.error(f"Unable to enable stretch mode on the pool : {pool_name}")
        return False

Return True if stretch mode was successfully enabled on the pool.
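
A minimal sketch of the return contract this comment asks for, assuming the enabling step is a callable that returns a bool. The names create_stretch_pool and enable_stretch are hypothetical stand-ins, not the methods in this PR.

```python
import logging
from typing import Callable

log = logging.getLogger(__name__)


def create_stretch_pool(
    pool_name: str,
    enable_stretch: Callable[..., bool],  # hypothetical enabling step
    **stretch_args,
) -> bool:
    """Return False when stretch mode cannot be enabled, True otherwise."""
    if not enable_stretch(pool_name=pool_name, **stretch_args):
        log.error(f"Unable to enable stretch mode on the pool : {pool_name}")
        return False
    log.info(f"Stretch mode enabled on the pool : {pool_name}")
    # Explicit success value, so callers never receive an implicit None.
    return True
```

Without the final return, the function would fall through and return None on success, which callers testing the result with `if not ...` would read as a failure.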

@@ -5,6 +5,8 @@ globals:
  - ceph-cluster:
      name: ceph
      node1:
        networks:

Change the networks to use networks from RHOS-1. The current networks are from rhos-d.

    log.error(
        "Cluster PGs not in active + clean state before starting the tests"
    )
    # raise Exception("Post execution checks failed on the Stretch cluster")

Uncomment this exception.
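
A minimal sketch of what the check looks like with the exception restored, assuming the PG state check is a callable that returns a bool. The names ensure_pgs_active_clean and pgs_are_clean are hypothetical; the log and exception messages are the ones in the diff above.

```python
import logging
from typing import Callable

log = logging.getLogger(__name__)


def ensure_pgs_active_clean(pgs_are_clean: Callable[[], bool]) -> None:
    """Fail the test when PGs are not active + clean, rather than only logging."""
    if not pgs_are_clean():
        log.error(
            "Cluster PGs not in active + clean state before starting the tests"
        )
        raise Exception("Post execution checks failed on the Stretch cluster")
```

With the raise commented out, the error is logged but the test carries on, so an unhealthy cluster can still produce a passing run.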
