Fix sequential resilver drive failure race condition #14063
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Motivation and Context
Backport of #14050 for the 2.1.7 staging branch.
Description
This patch handles the race condition on simultaneous failure of 2 drives, which misses the vdev_rebuild_reset_wanted signal in vdev_rebuild_thread. We retry to catch this inside the vdev_rebuild_complete_sync function.
Reviewed-by: Brian Behlendorf behlendorf1@llnl.gov
Reviewed-by: Richard Yao richard.yao@alumni.stonybrook.edu
Reviewed-by: Dipak Ghosh dipak.ghosh@hpe.com
Reviewed-by: Akash B akash-b@hpe.com
Signed-off-by: Samuel Wycliffe J samwyc@hpe.com
How Has This Been Tested?
Manually tested with the test case described in the original issue.
Types of changes
Checklist:
Signed-off-by
.