Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG]: Multi Controller defect - sidecars timeout #1110

Closed
adamginna-dell opened this issue Jan 23, 2024 · 2 comments
Closed

[BUG]: Multi Controller defect - sidecars timeout #1110

adamginna-dell opened this issue Jan 23, 2024 · 2 comments
Labels
area/csi-powerflex Issue pertains to the CSI Driver for Dell EMC PowerFlex area/csi-powermax Issue pertains to the CSI Driver for Dell EMC PowerMax area/csi-powerscale Issue pertains to the CSI Driver for Dell EMC PowerScale area/csi-powerstore Issue pertains to the CSI Driver for Dell EMC PowerStore area/csi-unity Issue pertains to the CSI Driver for Dell EMC Unity area/csm-operator type/bug Something isn't working. This is the default label associated with a bug issue.
Milestone

Comments

@adamginna-dell
Copy link

adamginna-dell commented Jan 23, 2024

Bug Description

The first controller pods start without any problems. Second and next controller pods are failing.
Link to Change in a csi-lib-utils library that add timeout for connection with socket. New versions of sidecars use new version of csi-lib-utils and timeout cause controller pods fail.

Logs

level=info msg="Attempting driver connection at: unix:/var/run/csi/csi.sock"
level=debug msg="grpc.Dial returned context deadline exceeded"
level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"
level=debug msg="grpc.Dial returned context deadline exceeded"
level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"

Screenshots

No response

Additional Environment Information

No response

Steps to Reproduce

Setup controller replicas in to number more then 1.

Expected Behavior

All controller pods are in Running state.

CSM Driver(s)

CSI Driver - Unity 2.9.0, Powerscale 2.9.0, Powerflex 2.9.0, Powermax 2.9.0

Installation Type

No response

Container Storage Modules Enabled

No response

Container Orchestrator

All

Operating System

All

@adamginna-dell adamginna-dell added needs-triage Issue requires triage. type/bug Something isn't working. This is the default label associated with a bug issue. labels Jan 23, 2024
@adamginna-dell
Copy link
Author

Standby controller pod is in crashloopbackoff state. As a workaround scale down the replica count of the controller pod’s deployment to 1 using
kubectl scale deployment <deployment_name> --replicas=1 -n <driver_namespace>

@gallacher gallacher added this to the v1.9.1 milestone Jan 24, 2024
@shaynafinocchiaro shaynafinocchiaro added area/csm-operator area/csi-powerflex Issue pertains to the CSI Driver for Dell EMC PowerFlex area/csi-powerstore Issue pertains to the CSI Driver for Dell EMC PowerStore area/csi-powerscale Issue pertains to the CSI Driver for Dell EMC PowerScale area/csi-unity Issue pertains to the CSI Driver for Dell EMC Unity area/csi-powermax Issue pertains to the CSI Driver for Dell EMC PowerMax and removed needs-triage Issue requires triage. labels Feb 1, 2024
@gallacher
Copy link
Contributor

link: 20348

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/csi-powerflex Issue pertains to the CSI Driver for Dell EMC PowerFlex area/csi-powermax Issue pertains to the CSI Driver for Dell EMC PowerMax area/csi-powerscale Issue pertains to the CSI Driver for Dell EMC PowerScale area/csi-powerstore Issue pertains to the CSI Driver for Dell EMC PowerStore area/csi-unity Issue pertains to the CSI Driver for Dell EMC Unity area/csm-operator type/bug Something isn't working. This is the default label associated with a bug issue.
Projects
None yet
Development

No branches or pull requests

3 participants