Use force unmount and explicitly unmount bad mount points #183

dabradley · 2024-10-16T12:04:52Z

There have been cases where the logic to clean up a mount point has put the driver into a bad state. This is most obvious when a subdirectory is mounted to a volume and a parent directory of that subdirectory is deleted. The Lustre driver doesn't handle that case the way Kubernetes expects and returns invalid data. To keep this scenario from putting our driver into a bad state, leaking mount points, and so on, we must explicitly check that we can read the necessary information about the mount point and, if we cannot, explicitly unmount it before allowing Kubernetes to clean up the directory. To make sure we can always recover, this change also enables force unmounting. The force unmount only occurs after a timeout has expired, since force unmounts can cause issues with the Lustre driver. However, when the situation is bad enough to need one, it is better to eventually return to a good state than to require manual intervention.
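For readers who want the shape of the flow described above, here is a minimal sketch. It is not the code in this PR: the `unmounter` interface, `cleanupBadMount`, `statCheck`, `errUnreadable`, and `fakeUnmounter` are all hypothetical names introduced for illustration, and the choice to return a plain unmount error without forcing is an assumption. The sketch only shows the ordering: check that the mount point is readable, unmount explicitly if it is not, and fall back to a force unmount only once the timeout has expired.

```go
package main

import (
	"errors"
	"fmt"
	"os"
	"time"
)

// unmounter is a hypothetical interface for this sketch, not the driver's API.
type unmounter interface {
	Unmount(target string) error
	ForceUnmount(target string) error
}

// errUnreadable stands in for whatever invalid result a broken mount returns.
var errUnreadable = errors.New("cannot read mount point")

// statCheck is one possible readability check (an assumption): it returns nil
// when the path can be read or is already gone, and the error otherwise.
func statCheck(target string) error {
	_, err := os.Stat(target)
	if err == nil || errors.Is(err, os.ErrNotExist) {
		return nil
	}
	return err
}

// cleanupBadMount explicitly unmounts target when check reports it unreadable.
// A force unmount is attempted only if the regular unmount has not finished
// before timeout expires.
func cleanupBadMount(m unmounter, check func(string) error, target string, timeout time.Duration) error {
	if check(target) == nil {
		return nil // readable: leave it to the normal cleanup path
	}
	done := make(chan error, 1) // buffered so a late Unmount cannot block forever
	go func() { done <- m.Unmount(target) }()
	select {
	case err := <-done:
		return err // finished in time, successfully or not; no force needed
	case <-time.After(timeout):
		return m.ForceUnmount(target)
	}
}

// fakeUnmounter simulates a regular unmount that takes delay to complete.
type fakeUnmounter struct {
	delay  time.Duration
	forced bool
}

func (f *fakeUnmounter) Unmount(string) error {
	time.Sleep(f.delay)
	return nil
}

func (f *fakeUnmounter) ForceUnmount(string) error {
	f.forced = true
	return nil
}

func main() {
	broken := func(string) error { return errUnreadable }
	hung := &fakeUnmounter{delay: 200 * time.Millisecond}
	err := cleanupBadMount(hung, broken, "/tmp/example-target", 20*time.Millisecond)
	fmt.Println("forced:", hung.forced, "err:", err)
}
```

In a real driver the check would be swapped for something like `statCheck` and the fake for an actual mounter. The timeout keeps the force unmount as a last resort, which matches the concern above that force unmounts can cause issues with the Lustre driver.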

What type of PR is this?

/kind bug

coveralls · 2024-10-16T12:11:55Z

coverage: 81.844% (-0.9%) from 82.773%
when pulling 028a78b on dabradley:personal/dabradley/cleanupbadmounts
into 4613c77 on kubernetes-sigs:development.

t-mialve

/lgtm

k8s-ci-robot · 2024-11-25T19:48:56Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dabradley, t-mialve

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [dabradley,t-mialve]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot · 2024-11-25T20:28:10Z

New changes are detected. LGTM label has been removed.

dabradley requested a review from t-mialve October 16, 2024 12:04

k8s-ci-robot added the do-not-merge/work-in-progress, kind/bug, and cncf-cla: yes labels Oct 16, 2024

k8s-ci-robot requested review from andyzhangx and vinli-cn October 16, 2024 12:05

k8s-ci-robot added the size/M and approved labels Oct 16, 2024

dabradley removed request for andyzhangx and vinli-cn October 16, 2024 12:08

t-mialve approved these changes Nov 25, 2024

k8s-ci-robot assigned t-mialve Nov 25, 2024

k8s-ci-robot added the lgtm label Nov 25, 2024

dabradley force-pushed the personal/dabradley/cleanupbadmounts branch from 186ee0a to 028a78b November 25, 2024 20:28

k8s-ci-robot removed the lgtm label Nov 25, 2024

dabradley marked this pull request as ready for review November 25, 2024 20:28

k8s-ci-robot removed the do-not-merge/work-in-progress label Nov 25, 2024

k8s-ci-robot requested review from t-mialve and vinli-cn November 25, 2024 20:28

dabradley merged commit abc5396 into kubernetes-sigs:development Nov 25, 2024
9 of 10 checks passed

dabradley deleted the personal/dabradley/cleanupbadmounts branch November 25, 2024 21:43