Kernel NULL pointer dereference #13430
Comments
What are the non-default settings on the datasets on the pool? Have you been using send/recv at all? Did it work previously and only break like this after a recent upgrade, or is it a recent setup and you have no prior data one way or another?
No send/recv at all. The system has been unstable for a while; I have been trying to switch kernels and versions without success. The system hangs and complains about a core that was locked. It worked without any issue a month ago, and I'm not sure what changed. I hoped this stack trace would help in some way.
I see now that the pool is degraded; I had not seen any issues until now. An SSD holding the cache and log has failed. I'm pretty sure this happened after the crashes, but it is probably related. I will try to replace the drive and see if anything changes.
l2arc and slog (cache and log) devices being marked failed shouldn't be actively harmful, though if they were misbehaving while not marked failed, who knows. I'm wondering if 5.17.5 or something shipped something exciting. I may go try a build of it on a testbed.
Huh, 5.17.5 is running fine with my single NVMe vdev under a LUKS device using the default Arch 5.17.5 kernel .config. We'll see in a few days if it lets the magic smoke out. Did you use any special settings for LUKS?
I have been experiencing the same issue on a Dell PowerEdge R610 server / Gentoo / Linux 5.17.5 / ZFS 2.1.4. I don't use LUKS. It started happening after upgrading the kernel to 5.17.5 and glibc to glibc-2.35-r4.
r610 /home/dell # zfs get all | grep -v default
...and this is the change between -r3 and -r4 of glibc in Gentoo:
@mapmot, can you please share one or more of the BUG: messages and stack traces from your logs when this happens?
@rincebrain, here is the log. What triggered it was a
[735471.464489] BUG: kernel NULL pointer dereference, address: 000000000000000b
@rincebrain, another one:
May 02 10:57:39 r610.maze kernel: BUG: kernel NULL pointer dereference, address: 000000000000000b
@rincebrain, the last one; this happened while trying to gracefully reboot (
May 02 12:02:42 r610.maze kernel: BUG: kernel NULL pointer dereference, address: 000000000000000b
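All three reports above show the same faulting address, 000000000000000b. For anyone comparing logs from several crashes, a minimal sketch like the following pulls the address out of each BUG: line so repeated faults are easy to spot. The function name `null_deref_addresses` and the regular expression are illustrative assumptions, not part of any ZFS or kernel tooling; the sample text is just the two line formats quoted in this thread.

```python
import re

# Assumption: we only care about the "BUG: kernel NULL pointer dereference"
# lines, in either dmesg-style or journald-style framing as quoted above.
BUG_RE = re.compile(
    r"BUG: kernel NULL pointer dereference, address: (?P<addr>[0-9a-fA-F]+)"
)


def null_deref_addresses(log_text):
    """Return the faulting addresses (as ints) found in kernel log text."""
    return [int(m.group("addr"), 16) for m in BUG_RE.finditer(log_text)]


sample = """\
[735471.464489] BUG: kernel NULL pointer dereference, address: 000000000000000b
May 02 10:57:39 r610.maze kernel: BUG: kernel NULL pointer dereference, address: 000000000000000b
"""

addresses = null_deref_addresses(sample)
print([hex(a) for a in addresses])  # prints ['0xb', '0xb']
```

An identical small address across crashes is consistent with the same code path faulting each time, though the full stack trace is still needed to say which one.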
What storage is backing this pool, and in what configuration? I have some recent suspicions about one or two places that might be doing things incorrectly that this might align with.
6 x 1 TB SAS drives in raidz1 on a Broadcom / LSI SAS2008 PCI-Express Fusion-MPT SAS-2 in IT/HBA mode.
This issue has been automatically marked as "stale" because it has not had any activity for a while. It will be closed in 90 days if no further activity occurs. Thank you for your contributions.
System information
Describe the problem you're observing
Server crashes during almost idle operation
Describe how to reproduce the problem
Wait a few days.