-
Notifications
You must be signed in to change notification settings - Fork 1.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Linux 4.7: fix deadlock during lookup on case-insensitive #5141
Conversation
We must not use d_add_ci if the dentry already has the real name. Otherwise, d_add_ci()->d_alloc_parallel() will find itself on the lookup hash and wait on itself causing deadlock. Signed-off-by: Chunwei Chen <david.chen@osnexus.com>
I was going to suggest this would be a good opportunity to enable all 14 casenorm tests in the ZFS Test Suite. However, based on the test suite results it might take a little work. |
I think it means that there's no such thing as $ZLOOK. |
That would make sense. It's definitely referring to zlook.c which was never ported to Linux, Nor was its kernel space counterpart zut.c. We could port this probably fairly easily, it's not that much code. But that might be overkill, it's only used for these casenorm tests and there may be a simpler way to achieve the same thing. |
@behlendorf |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, but I'd like to hear from @cgaspar that this in fact resolves the issue before merging it.
Is there a version of this patch that works for the 0.6.5.x branch? |
@satmandu |
@tuxoko That doesn't seem to work. It compiles fine. But I'm getting the same hang as before trying to mount the pool. |
@satmandu Either way, please see if there's anything in dmesg after a couple minutes of hang. Or do sysrq-l, sysrq-t. |
@tuxoko I stand corrected. This definitely works. Maybe it was a signed version issue. Installing a mainline kernel-ppa package with the patched file used for an install appears to allow proper booting and mounting. It would be great if this gets added to a stable point release so it can get pulled into ubuntu. |
@satmandu thanks for testing the fix. It's been merged to master and I added this PR to the 0.6.5.9 milestone so we remember to apply it to the next point release. |
We must not use d_add_ci if the dentry already has the real name. Otherwise, d_add_ci()->d_alloc_parallel() will find itself on the lookup hash and wait on itself causing deadlock. Tested-by: satmandu Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov> Signed-off-by: Chunwei Chen <david.chen@osnexus.com> Closes openzfs#5124 Closes openzfs#5141 Closes openzfs#5147 Closes openzfs#5148
We must not use d_add_ci if the dentry already has the real name. Otherwise, d_add_ci()->d_alloc_parallel() will find itself on the lookup hash and wait on itself causing deadlock. Tested-by: satmandu Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov> Signed-off-by: Chunwei Chen <david.chen@osnexus.com> Closes openzfs#5124 Closes openzfs#5141 Closes openzfs#5147 Closes openzfs#5148
We must not use d_add_ci if the dentry already has the real name. Otherwise, d_add_ci()->d_alloc_parallel() will find itself on the lookup hash and wait on itself causing deadlock. Tested-by: satmandu Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov> Signed-off-by: Chunwei Chen <david.chen@osnexus.com> Closes openzfs#5124 Closes openzfs#5141 Closes openzfs#5147 Closes openzfs#5148
Any chance 0.6.5.9 might get tagged before 0.7.0? (Ubuntu seems to following 0.6.5.x at the moment, and I'd hate for this to not get into the next stable release.) |
@satmandu thanks for the reminder, this fix was already slatted for 0.6.5.9. See https://github.com/zfsonlinux/zfs/projects/4. |
I saw that. 👍 |
Question: I see that the fix adds a strcmp operation which ought to be unnecessary in the majority of cases presuming case-insensitive datasets are relatively rare. Isn't there a cheap test of the case sensitivity property one could do before blindly invoking strcmp? |
@RJVB doesn't that |
Sorry, my bad, I should have read the full function instead of only the patch. There's probably still room for improvement by checking a (precomputed) cheap hash first, but I can live with slower performance in the rare cases I need a case-insensitive FS :) |
Sigh. I'd also be happy with finding a way to convert my filesystem from case-insensitive to case-sensitive without copying everything to another filesystem and back. |
Sadly you can only go from sensitive to insensitive with a guarantee of not losing anything due to aliasing. I don't know to what extent zfs refusal to do this conversion is only due to a decision that it shouldn't be supported or on the contrary, because it is not just a matter of how filenames are handled. The nice thing with ZFS is that you don't need to copy things back and forth. Just create a new, case-sensitive dataset (as a child of the insensitive one?), mount it on a temporary location, move your data to that location, destroy the old dataset and remount the new one on the intended location. No need to fool around with partitions or external devices. |
Thanks! That's on my list of things to try now when I have a moment. :) |
We must not use d_add_ci if the dentry already has the real name. Otherwise, d_add_ci()->d_alloc_parallel() will find itself on the lookup hash and wait on itself causing deadlock. Tested-by: satmandu Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov> Signed-off-by: Chunwei Chen <david.chen@osnexus.com> Closes openzfs#5124 Closes openzfs#5141 Closes openzfs#5147 Closes openzfs#5148
We must not use d_add_ci if the dentry already has the real name. Otherwise, d_add_ci()->d_alloc_parallel() will find itself on the lookup hash and wait on itself causing deadlock. Tested-by: satmandu Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov> Signed-off-by: Chunwei Chen <david.chen@osnexus.com> Closes openzfs#5124 Closes openzfs#5141 Closes openzfs#5147 Closes openzfs#5148
We must not use d_add_ci if the dentry already has the real name. Otherwise, d_add_ci()->d_alloc_parallel() will find itself on the lookup hash and wait on itself causing deadlock. Tested-by: satmandu Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov> Signed-off-by: Chunwei Chen <david.chen@osnexus.com> Closes openzfs#5124 Closes openzfs#5141 Closes openzfs#5147 Closes openzfs#5148
We must not use d_add_ci if the dentry already has the real name. Otherwise, d_add_ci()->d_alloc_parallel() will find itself on the lookup hash and wait on itself causing deadlock. Tested-by: satmandu Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov> Signed-off-by: Chunwei Chen <david.chen@osnexus.com> Closes openzfs#5124 Closes openzfs#5141 Closes openzfs#5147 Closes openzfs#5148
We must not use d_add_ci if the dentry already has the real name. Otherwise, d_add_ci()->d_alloc_parallel() will find itself on the lookup hash and wait on itself causing deadlock. Tested-by: satmandu Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov> Signed-off-by: Chunwei Chen <david.chen@osnexus.com> Closes openzfs#5124 Closes openzfs#5141 Closes openzfs#5147 Closes openzfs#5148
We must not use d_add_ci if the dentry already has the real name. Otherwise, d_add_ci()->d_alloc_parallel() will find itself on the lookup hash and wait on itself causing deadlock. Tested-by: satmandu Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov> Signed-off-by: Chunwei Chen <david.chen@osnexus.com> Closes openzfs#5124 Closes openzfs#5141 Closes openzfs#5147 Closes openzfs#5148
We must not use d_add_ci if the dentry already has the real name. Otherwise, d_add_ci()->d_alloc_parallel() will find itself on the lookup hash and wait on itself causing deadlock. Tested-by: satmandu Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov> Signed-off-by: Chunwei Chen <david.chen@osnexus.com> Closes openzfs#5124 Closes openzfs#5141 Closes openzfs#5147 Closes openzfs#5148 Requires-builders: style
We must not use d_add_ci if the dentry already has the real name. Otherwise, d_add_ci()->d_alloc_parallel() will find itself on the lookup hash and wait on itself causing deadlock. Tested-by: satmandu Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov> Signed-off-by: Chunwei Chen <david.chen@osnexus.com> Closes #5124 Closes #5141 Closes #5147 Closes #5148
We must not use d_add_ci if the dentry already has the real name. Otherwise,
d_add_ci()->d_alloc_parallel() will find itself on the lookup hash and wait
on itself causing deadlock.
Signed-off-by: Chunwei Chen david.chen@osnexus.com