amazon-efs-mount-watchdog may crash loop if restarted shortly after a volume is unmounted #74
Comments
Thanks for the detailed feedback; we will have someone work on this.
Oops, the crucial step I was missing: in step 3 it is not enough to restart the watchdog; you must also kill the stunnel process of the mount. Since I found this in Kubernetes using the CSI driver, here are the exact steps I took, translated into Kubernetes terms:
Plus the exact steps on a vanilla EC2 instance:
Hey @wongma7, the fix is in v1.26.3. Thanks for the report.
This bug only affects the latest release v1.26.2.
Steps to reproduce:
/var/run/efs/stunnel-config.fs-12345678.mountpoint.20238
efs-utils/src/mount_efs/__init__.py, line 894 in 914889d
efs-utils/src/watchdog/__init__.py, line 472 in 914889d
/var/run/efs/stunnel-config.fs-12345678.mountpoint.20238
efs-utils/src/watchdog/__init__.py, line 468 in 914889d
6. amazon-efs-mount-watchdog crash loops
*By default, poll_interval_sec is 1 second, so check_efs_mounts is called every second.
**By default, unmount_grace_period_sec is 30 seconds, so the call of check_efs_mounts that would delete the state file occurs at least 30 seconds after the unmount. The window to trigger this bug is therefore the 30 seconds after the unmount.
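To make the timing window concrete, here is a minimal sketch in Python. It is not code from efs-utils: the helper name in_bug_window and the constant are hypothetical, and it assumes the state file is cleaned up by the first check_efs_mounts call at or after the grace period, with the default value quoted above.

```python
# Hypothetical illustration only; none of these names exist in efs-utils.
# Default taken from the report above.
UNMOUNT_GRACE_PERIOD_SEC = 30  # unmount_grace_period_sec


def in_bug_window(restart_offset_sec, grace_period_sec=UNMOUNT_GRACE_PERIOD_SEC):
    """Return True if a watchdog restart (plus stunnel kill) this many seconds
    after the unmount falls inside the window described in the report.

    Assumption: the state file still exists until the grace period has elapsed,
    and is removed by the first poll at or after that point.
    """
    return 0 <= restart_offset_sec < grace_period_sec


if __name__ == "__main__":
    for offset in (5, 29, 30, 45):
        print(offset, in_bug_window(offset))
```

With the defaults, a restart 5 or 29 seconds after the unmount lands inside the window, while a restart at 30 seconds or later does not under this model. Raising or lowering unmount_grace_period_sec would widen or narrow the window accordingly.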