Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG]: Replication install using Operator does not work #788

Closed
donatwork opened this issue Apr 28, 2023 · 6 comments
Closed

[BUG]: Replication install using Operator does not work #788

donatwork opened this issue Apr 28, 2023 · 6 comments
Assignees
Labels
area/csm-docs Issue pertains to the CSM Documentation area/csm-operator type/bug Something isn't working. This is the default label associated with a bug issue.
Milestone

Comments

@donatwork
Copy link
Contributor

Bug Description

It is not possible to install CSM Replication in the single stretched cluster mode. The parameter TARGET_CLUSTER_IDS requires at least one value but in the repctl instructions the target cluster is replaced with "self". This does not work when using Operator.

    - name: dell-replication-controller-manager
       # image: Defines controller image. This shouldn't be changed
       # Allowed values: string
       image: dellemc/dell-replication-controller:v1.4.0
       envs:
         # TARGET_CLUSTERS_IDS: comma separated list of clusters id of the targets clusters. DO NOT include the source(wherever CSM Operator is deployed) cluster id
         # Allowed values: string
         - name: "TARGET_CLUSTERS_IDS"
           value: "self"

Also tried with an empty value "" and also saw errors.

Another issue I have with the documentation is that the instructiosn (https://dell.github.io/csm-docs/docs/deployment/csmoperator/modules/replication/#prerequisite) states to follow all of the steps in the install using repctl section but those instructions install an instance of the dell-replication-controller-manager. Why is the Operator installation also installing the same pod?

Logs

Event(v1.ObjectReference{Kind:"ContainerStorageModule", Namespace:"isilon", Name:"isilon", UID:"44538889-bfe8-4dae-aae1-63959c9d73de", APIVersion:"storage.dell.com/v1", ResourceVersion:"892885", FieldPath:""}): type: 'Warning' reason: 'Updated' Failed Prechecks: failed replication validation: TARGET_CLUSTERS_IDS on CR should have more than 0 commma seperated cluster IDs. Got 0

Screenshots

No response

Additional Environment Information

No response

Steps to Reproduce

Attempt to follow the documentation to install CSM-Replication on a single "stretched" cluster.

Expected Behavior

I expect either "self" or "" to be accepted as valid when installing in stretched clsuter mode.

CSM Driver(s)

CSI Driver for PowerScale v2.6.0

Installation Type

CSM-Operater 1.1.0

Container Storage Modules Enabled

No response

Container Orchestrator

Kubernetes v1.25.6

Operating System

RHEL 8.7-2

@donatwork donatwork added needs-triage Issue requires triage. type/bug Something isn't working. This is the default label associated with a bug issue. area/csm-operator labels Apr 28, 2023
@csmbot
Copy link
Collaborator

csmbot commented Apr 28, 2023

@donatwork: Thank you for submitting this issue!

The issue is currently awaiting triage. Please make sure you have given us as much context as possible.

If the maintainers determine this is a relevant issue, they will remove the needs-triage label and assign an appropriate priority label.


We want your feedback! If you have any questions or suggestions regarding our contributing process/workflow, please reach out to us at container.storage.modules@dell.com.

@donatwork
Copy link
Contributor Author

There is a value, skip-replication-cluster-check which is in the code which gets past the issue int he precheck and all pods end up getting installed. Still need to verify if the install will actually work.

@donatwork
Copy link
Contributor Author

Another thing. The replication controller is being attempted at install although the Operator install instructions points to the repctl instructions which also installs the replication controller. This may be causing errors int he Operator installation:

Event(v1.ObjectReference{Kind:"ContainerStorageModule", Namespace:"isilon", Name:"isilon", UID:"7cbff48b-c634-4540-ada9-915f5acabfe8", APIVersion:"storage.dell.com/v1", ResourceVersion:"2414290", FieldPath:""}): type: 'Warning' reason: 'Updated' Failed install: failed to deploy replication controller: clusterroles.rbac.authorization.k8s.io "dell-replication-manager-role" is forbidden: user "system:serviceaccount:dell-csm-operator:dell-csm-operator-manager-service-account" (groups=["system:serviceaccounts" "system:serviceaccounts:dell-csm-operator" "system:authenticated"]) is attempting to grant RBAC permissions not currently held:
{APIGroups:[""], Resources:["namespaces"], Verbs:["create"]}

Suggest that Operator not attempt to install the replication controller?

@harshaatdell harshaatdell added the area/csm-docs Issue pertains to the CSM Documentation label May 2, 2023
@donatwork donatwork self-assigned this May 12, 2023
@donatwork donatwork changed the title [BUG]: Not possible to install CSM-Replication via Operator in stretched cluster mode [BUG]: Replication install instructions using Operator is incorrect May 12, 2023
@donatwork
Copy link
Contributor Author

donatwork commented May 12, 2023

Updated headline to reflect that there are problems with both the documentation and Operator when installing Replication in the current release.

@donatwork donatwork changed the title [BUG]: Replication install instructions using Operator is incorrect [BUG]: Replication install using Operator does not work May 12, 2023
@donatwork
Copy link
Contributor Author

Documentation for Operator installation has been updated in dell/csm-docs#612.

@shaynafinocchiaro shaynafinocchiaro added this to the v1.7.0 milestone May 15, 2023
@harshaatdell harshaatdell added backlog and removed needs-triage Issue requires triage. labels May 15, 2023
@rensyct
Copy link

rensyct commented May 16, 2023

Issues are fixed and are available at main

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/csm-docs Issue pertains to the CSM Documentation area/csm-operator type/bug Something isn't working. This is the default label associated with a bug issue.
Projects
None yet
Development

No branches or pull requests

5 participants