page fault after arc_write_done / dbuf_write_done / abd_free_linear_page #15745
Comments
Please try reproducing this with 2.2.2; there were a lot of fixes between 2.2.0 and 2.2.2.
This is version 2.2.2:
#14980 suggests that having KSM enabled may be causing you issues. I was also wondering about #15140, but that stack trace and failure mode are different. My suspicion, based on it saying the ZFS thread is dying in the middle of a non-preemptible section, is that either ZFS or something else is messing with SIMD without proper locking around it, and boom goes the dynamite. But that's a guess. Thinking harder, it seems possible that it's calling kfree when it should be calling vfree, for some reason, but I'm not really sure how that could be the case, because it seems like [...] My guess would be some weird edge case with virtio page handling if it's not a SIMD problem.
It's most probably not a software-related bug: I ran a memory test with memtest86 and it failed reproducibly. After a series of experiments with different BIOS settings, and with removing or repositioning the DDR5 RAM modules, the problem now seems to be gone. All subsequent memory tests pass flawlessly. In fact, the BIOS settings are now exactly the same as before, but some RAM modules may now be seated in different slots than before. My educated guess is that the problem was most probably caused by bad DDR5 training, which is executed only once and then again only after a change in BIOS settings or hardware.
System information
Describe the problem you're observing
ASUS ROG STRIX X670E-E GAMING WIFI
AMD Ryzen 9 7950X3D 16-Core Processor
128GB DDR5 RAM
The system is equipped with these physical storage media and ZFS pools:
The problem occurs only when accessing the raidz2 pool 'storage' via virtiofsd (see https://gitlab.com/virtio-fs/virtiofsd) from a virtual machine. It does not seem to occur when accessing the other (mirror) pool or when accessing any of the pools locally on the host. The crash occurs on the host while running bonnie++ inside the virtual machine, almost every time the "rewriting" stage is reached.
Describe how to reproduce the problem
Include any warning/errors/backtraces from the system logs
zpool list
NAME      SIZE  ALLOC   FREE  CKPOINT  EXPANDSZ  FRAG  CAP  DEDUP  HEALTH  ALTROOT
rpool    3.62T  26.2G  3.60T        -         -    0%   0%  1.00x  ONLINE  -
storage  98.2T  62.6G  98.2T        -         -    0%   0%  1.00x  ONLINE  -
zfs list
NAME                       USED  AVAIL  REFER  MOUNTPOINT
rpool                     26.2G  3.49T    25K  /rpool
rpool/ROOT                1.78G  3.49T    24K  /rpool/ROOT
rpool/ROOT/pve-1          1.78G  3.49T  1.78G  /
rpool/data                15.9G  3.49T    31K  /rpool/data
rpool/data/vm-100-disk-0    42K  3.49T    42K  -
rpool/data/vm-100-disk-1  14.0G  3.49T  14.0G  -
rpool/data/vm-100-disk-2  18.5K  3.49T  18.5K  -
rpool/data/vm-101-disk-0  1.84G  3.49T  1.84G  -
rpool/var-lib-vz          8.47G  3.49T  8.47G  /var/lib/vz
storage                   41.6G  65.2T   208K  /storage
storage/filespace         41.6G  65.2T  41.6G  /storage/filespace
[ 905.590301] BUG: unable to handle page fault for address: ffff998480042ac8
[ 905.590335] #PF: supervisor read access in kernel mode
[ 905.590353] #PF: error_code(0x0000) - not-present page
[ 905.590368] PGD 3df001067 P4D 3df001067 PUD 0
[ 905.590384] Oops: 0000 [#1] PREEMPT SMP NOPTI
[ 905.590943] CPU: 15 PID: 6812 Comm: z_wr_int_0 Tainted: P O 6.5.11-4-pve #1
[ 905.591289] Hardware name: ASUS System Product Name/ROG STRIX X670E-E GAMING WIFI, BIOS 1709 09/28/2023
[ 905.591564] RIP: 0010:__kmem_cache_free+0xb7/0x350
[ 905.591832] Code: ff 03 0f 86 f1 00 00 00 48 89 da 49 83 e7 fc 48 29 c2 48 81 fa ff ff 1f 00 77 09 48 85 c0 0f 85 80 02 00 00 4c 89 e8 48 89 de <45> 0f b6 74 24 25 48 29 c8 41 0f b6 4c 24 24 48 c1 f8 06 48 c1 e0
[ 905.592111] RSP: 0018:ffffaa444fe4fc10 EFLAGS: 00010292
[ 905.592384] RAX: fffffb8f84530f00 RBX: ffff99ba94c3da00 RCX: fffffb8f80000000
[ 905.592654] RDX: ffff99ba94c3da00 RSI: ffff99ba94c3da00 RDI: ffff998480042aa3
[ 905.592922] RBP: ffffaa444fe4fc58 R08: 0000000000000000 R09: 0000000000000000
[ 905.593188] R10: 0000000000000000 R11: 0000000000000000 R12: ffff998480042aa3
[ 905.593454] R13: fffffb8f84530f00 R14: ffff99ba83d8e000 R15: 000000cb000000dc
[ 905.593723] FS: 0000000000000000(0000) GS:ffff99d95e3c0000(0000) knlGS:0000000000000000
[ 905.593985] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 905.594239] CR2: ffff998480042ac8 CR3: 00000003dd634000 CR4: 0000000000750ee0
[ 905.594504] PKRU: 55555554
[ 905.594758] Call Trace:
[ 905.595008] <TASK>
[ 905.595276] ? show_regs+0x6d/0x80
[ 905.595548] ? __die+0x24/0x80
[ 905.595799] ? page_fault_oops+0x176/0x500
[ 905.596053] ? srso_alias_return_thunk+0x5/0x7f
[ 905.596307] ? srso_alias_return_thunk+0x5/0x7f
[ 905.596556] ? __kmem_cache_free+0xb7/0x350
[ 905.596822] ? kernelmode_fixup_or_oops+0xb2/0x140
[ 905.597095] ? __bad_area_nosemaphore+0x1a5/0x280
[ 905.597381] ? srso_alias_return_thunk+0x5/0x7f
[ 905.597654] ? bad_area_nosemaphore+0x16/0x30
[ 905.597920] ? do_kern_addr_fault+0x7b/0xa0
[ 905.598197] ? exc_page_fault+0x10d/0x1b0
[ 905.598466] ? asm_exc_page_fault+0x27/0x30
[ 905.598775] ? __kmem_cache_free+0xb7/0x350
[ 905.599052] ? srso_alias_return_thunk+0x5/0x7f
[ 905.599332] ? spl_kmem_free+0x31/0x40 [spl]
[ 905.599634] kfree+0x78/0x120
[ 905.599904] spl_kmem_free+0x31/0x40 [spl]
[ 905.600183] dbuf_write_done+0x114/0x1e0 [zfs]
[ 905.600534] arc_write_done+0xa7/0x550 [zfs]
[ 905.600874] zio_done+0x289/0x10b0 [zfs]
[ 905.601216] ? srso_alias_return_thunk+0x5/0x7f
[ 905.601477] ? kfree+0x78/0x120
[ 905.601743] ? srso_alias_return_thunk+0x5/0x7f
[ 905.602022] zio_execute+0x88/0x130 [zfs]
[ 905.602375] taskq_thread+0x27f/0x490 [spl]
[ 905.602648] ? __pfx_default_wake_function+0x10/0x10
[ 905.602951] ? __pfx_zio_execute+0x10/0x10 [zfs]
[ 905.603286] ? __pfx_taskq_thread+0x10/0x10 [spl]
[ 905.603582] kthread+0xef/0x120
[ 905.603859] ? __pfx_kthread+0x10/0x10
[ 905.604138] ret_from_fork+0x44/0x70
[ 905.604408] ? __pfx_kthread+0x10/0x10
[ 905.604664] ret_from_fork_asm+0x1b/0x30
[ 905.604940] </TASK>
[ 905.605191] Modules linked in: tcp_diag inet_diag veth ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables iptable_filter bpfilter softdog nf_tables sunrpc binfmt_misc bonding tls nfnetlink_log nfnetlink snd_usb_audio iwlmvm snd_usbmidi_lib snd_ump snd_rawmidi snd_seq_device mc mac80211 libarc4 btusb iwlwifi btrtl btbcm btintel btmtk bluetooth cfg80211 ecdh_generic ecc intel_rapl_msr intel_rapl_common joydev input_leds amdgpu edac_mce_amd kvm_amd snd_hda_codec_hdmi amdxcp iommu_v2 drm_buddy kvm snd_hda_intel gpu_sched drm_suballoc_helper snd_intel_dspcfg drm_ttm_helper snd_intel_sdw_acpi ttm snd_hda_codec irqbypass crct10dif_pclmul drm_display_helper polyval_clmulni snd_hda_core polyval_generic ghash_clmulni_intel snd_hwdep aesni_intel cec crypto_simd snd_pcm rc_core cryptd eeepc_wmi asus_nb_wmi asus_wmi snd_timer rapl drm_kms_helper ledtrig_audio snd sparse_keymap platform_profile pcspkr i2c_algo_bit wmi_bmof soundcore ccp k10temp mac_hid hid_logitech_hidpp vhost_net vhost vhost_iotlb
[ 905.605263] tap drm efi_pstore dmi_sysfs ip_tables x_tables autofs4 hid_logitech_dj hid_generic usbkbd usbmouse usbhid hid zfs(PO) spl(O) btrfs blake2b_generic xor raid6_pq xhci_pci nvme xhci_pci_renesas crc32_pclmul bnx2x mpt3sas ahci igc xhci_hcd nvme_core i2c_piix4 raid_class libahci mdio scsi_transport_sas nvme_common libcrc32c video wmi gpio_amdpt
[ 905.607381] CR2: ffff998480042ac8
[ 905.607697] ---[ end trace 0000000000000000 ]---
[ 905.915763] RIP: 0010:__kmem_cache_free+0xb7/0x350
[ 905.916174] Code: ff 03 0f 86 f1 00 00 00 48 89 da 49 83 e7 fc 48 29 c2 48 81 fa ff ff 1f 00 77 09 48 85 c0 0f 85 80 02 00 00 4c 89 e8 48 89 de <45> 0f b6 74 24 25 48 29 c8 41 0f b6 4c 24 24 48 c1 f8 06 48 c1 e0
[ 905.916552] RSP: 0018:ffffaa444fe4fc10 EFLAGS: 00010292
[ 905.916908] RAX: fffffb8f84530f00 RBX: ffff99ba94c3da00 RCX: fffffb8f80000000
[ 905.917274] RDX: ffff99ba94c3da00 RSI: ffff99ba94c3da00 RDI: ffff998480042aa3
[ 905.917637] RBP: ffffaa444fe4fc58 R08: 0000000000000000 R09: 0000000000000000
[ 905.917996] R10: 0000000000000000 R11: 0000000000000000 R12: ffff998480042aa3
[ 905.918363] R13: fffffb8f84530f00 R14: ffff99ba83d8e000 R15: 000000cb000000dc
[ 905.918725] FS: 0000000000000000(0000) GS:ffff99d95e3c0000(0000) knlGS:0000000000000000
[ 905.919078] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 905.919422] CR2: ffff998480042ac8 CR3: 0000000188aee000 CR4: 0000000000750ee0
[ 905.919745] PKRU: 55555554
[ 905.920060] note: z_wr_int_0[6812] exited with irqs disabled
[14660.887337] BUG: unable to handle page fault for address: ffff932a6860f1ad
[14660.888476] #PF: supervisor read access in kernel mode
[14660.889086] #PF: error_code(0x0000) - not-present page
[14660.889518] PGD 74a601067 P4D 74a601067 PUD 0
[14660.889925] Oops: 0000 [#1] PREEMPT SMP NOPTI
[14660.890322] CPU: 14 PID: 563 Comm: dbuf_evict Tainted: P O 6.5.11-4-pve #1
[14660.890732] Hardware name: ASUS System Product Name/ROG STRIX X670E-E GAMING WIFI, BIOS 1709 09/28/2023
[14660.891130] RIP: 0010:arc_buf_destroy_impl+0x32/0x330 [zfs]
[14660.891710] Code: e5 41 56 41 55 41 54 53 4c 8b 77 10 48 89 fb 4c 8b 2f 4d 85 f6 0f 84 e2 00 00 00 49 8b bd b0 00 00 00 4c 89 e8 48 85 ff 74 09 07 01 0f 85 32 02 00 00 f6 43 18 02 0f 85 ec 01 00 00 44 0f b7
[14660.892154] RSP: 0018:ffffbd5103d0fdc0 EFLAGS: 00010282
[14660.892603] RAX: ffff932f62f39860 RBX: ffff93338760d260 RCX: 0000000000000000
[14660.893056] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff932a6860f1ad
[14660.893486] RBP: ffffbd5103d0fde0 R08: 0000000000000000 R09: 0000000000000000
[14660.893890] R10: 0000000000000000 R11: 0000000000000000 R12: ffff932f62f39860
[14660.894293] R13: ffff932f62f39860 R14: ffffbd5194377000 R15: ffffffffc03ec580
[14660.894687] FS: 0000000000000000(0000) GS:ffff934d1e380000(0000) knlGS:0000000000000000
[14660.895096] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[14660.895499] CR2: ffff932a6860f1ad CR3: 0000000748c34000 CR4: 0000000000750ee0
[14660.895920] PKRU: 55555554
[14660.896324] Call Trace:
[14660.896729] <TASK>
[14660.897131] ? show_regs+0x6d/0x80
[14660.897544] ? __die+0x24/0x80
[14660.897947] ? page_fault_oops+0x176/0x500
[14660.898360] ? srso_alias_return_thunk+0x5/0x7f
[14660.898764] ? srso_alias_return_thunk+0x5/0x7f
[14660.899163] ? arc_buf_destroy_impl+0x32/0x330 [zfs]
[14660.899627] ? kernelmode_fixup_or_oops+0xb2/0x140
[14660.900019] ? __bad_area_nosemaphore+0x1a5/0x280
[14660.900422] ? bad_area_nosemaphore+0x16/0x30
[14660.900814] ? do_kern_addr_fault+0x7b/0xa0
[14660.901206] ? exc_page_fault+0x10d/0x1b0
[14660.901605] ? asm_exc_page_fault+0x27/0x30
[14660.901995] ? __pfx_thread_generic_wrapper+0x10/0x10 [spl]
[14660.902447] ? arc_buf_destroy_impl+0x32/0x330 [zfs]
[14660.902944] arc_buf_destroy+0x7a/0x100 [zfs]
[14660.903385] ? __pfx_thread_generic_wrapper+0x10/0x10 [spl]
[14660.903747] dbuf_destroy+0x31/0x4a0 [zfs]
[14660.904165] ? srso_alias_return_thunk+0x5/0x7f
[14660.904522] ? __pfx_thread_generic_wrapper+0x10/0x10 [spl]
[14660.904885] dbuf_evict_one+0x118/0x180 [zfs]
[14660.905301] dbuf_evict_thread+0x146/0x210 [zfs]
[14660.905709] ? kfree+0x78/0x120
[14660.906087] ? __pfx_dbuf_evict_thread+0x10/0x10 [zfs]
[14660.906509] thread_generic_wrapper+0x5c/0x70 [spl]
[14660.906878] kthread+0xef/0x120
[14660.907255] ? __pfx_kthread+0x10/0x10
[14660.907624] ret_from_fork+0x44/0x70
[14660.908020] ? __pfx_kthread+0x10/0x10
[14660.908386] ret_from_fork_asm+0x1b/0x30
[14660.908752] </TASK>
[14660.909128] Modules linked in: veth tcp_diag inet_diag ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables iptable_filter bpfilter softdog nf_tables sunrpc binfmt_misc bonding tls nfnetlink_log nfnetlink hid_logitech_hidpp joydev input_leds snd_usb_audio snd_usbmidi_lib snd_ump snd_rawmidi snd_seq_device hid_logitech_dj mc intel_rapl_msr intel_rapl_common edac_mce_amd iwlmvm kvm_amd amdgpu snd_hda_codec_hdmi amdxcp iommu_v2 mac80211 drm_buddy kvm snd_hda_intel gpu_sched snd_intel_dspcfg drm_suballoc_helper snd_intel_sdw_acpi drm_ttm_helper snd_hda_codec irqbypass ttm btusb crct10dif_pclmul libarc4 snd_hda_core btrtl polyval_clmulni drm_display_helper polyval_generic iwlwifi ghash_clmulni_intel snd_hwdep btbcm aesni_intel cec btintel snd_pcm rc_core btmtk eeepc_wmi asus_nb_wmi crypto_simd snd_timer cryptd bluetooth asus_wmi drm_kms_helper ledtrig_audio snd rapl cfg80211 sparse_keymap pcspkr platform_profile i2c_algo_bit wmi_bmof soundcore ecdh_generic ccp k10temp ecc mac_hid vhost_net
[14660.909206] vhost vhost_iotlb tap drm efi_pstore dmi_sysfs ip_tables x_tables autofs4 hid_generic usbkbd usbmouse usbhid hid zfs(PO) spl(O) btrfs blake2b_generic xor raid6_pq xhci_pci xhci_pci_renesas nvme mpt3sas bnx2x crc32_pclmul ahci igc xhci_hcd nvme_core raid_class i2c_piix4 libahci mdio scsi_transport_sas nvme_common libcrc32c video wmi gpio_amdpt
[14660.912285] CR2: ffff932a6860f1ad
[14660.912692] ---[ end trace 0000000000000000 ]---
[14661.231526] RIP: 0010:arc_buf_destroy_impl+0x32/0x330 [zfs]
[14661.232199] Code: e5 41 56 41 55 41 54 53 4c 8b 77 10 48 89 fb 4c 8b 2f 4d 85 f6 0f 84 e2 00 00 00 49 8b bd b0 00 00 00 4c 89 e8 48 85 ff 74 09 07 01 0f 85 32 02 00 00 f6 43 18 02 0f 85 ec 01 00 00 44 0f b7
[14661.232590] RSP: 0018:ffffbd5103d0fdc0 EFLAGS: 00010282
[14661.232978] RAX: ffff932f62f39860 RBX: ffff93338760d260 RCX: 0000000000000000
[14661.233408] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff932a6860f1ad
[14661.233789] RBP: ffffbd5103d0fde0 R08: 0000000000000000 R09: 0000000000000000
[14661.234183] R10: 0000000000000000 R11: 0000000000000000 R12: ffff932f62f39860
[14661.234567] R13: ffff932f62f39860 R14: ffffbd5194377000 R15: ffffffffc03ec580
[14661.234946] FS: 0000000000000000(0000) GS:ffff934d1e380000(0000) knlGS:0000000000000000
[14661.235354] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[14661.235738] CR2: ffff932a6860f1ad CR3: 00000002fcf5a000 CR4: 0000000000750ee0
[14661.236160] PKRU: 55555554
[14661.236548] note: dbuf_evict[563] exited with irqs disabled
[ 826.964554] BUG: unable to handle page fault for address: ffff8b016e31a4df
[ 826.964907] #PF: supervisor read access in kernel mode
[ 826.965168] #PF: error_code(0x0000) - not-present page
[ 826.965394] PGD a88a01067 P4D a88a01067 PUD 0
[ 826.965630] Oops: 0000 [#1] PREEMPT SMP NOPTI
[ 826.965853] CPU: 9 PID: 2396 Comm: z_wr_iss Tainted: P O 6.5.11-4-pve #1
[ 826.966076] Hardware name: ASUS System Product Name/ROG STRIX X670E-E GAMING WIFI, BIOS 1709 09/28/2023
[ 826.966303] RIP: 0010:__kmem_cache_alloc_node+0x1f7/0x360
[ 826.966532] Code: 5e 41 5f 5d 31 d2 31 c9 31 f6 31 ff 45 31 c0 45 31 c9 e9 cc fc c5 00 41 8b 44 24 28 4d 8b 0c 24 49 8d 88 00 20 00 00 48 01 f8 <48> 8b 18 48 89 c2 49 33 9c 24 b8 00 00 00 48 89 f8 48 0f ca 48 31
[ 826.966778] RSP: 0018:ffffa4bf738dba60 EFLAGS: 00010282
[ 826.967025] RAX: ffff8b016e31a4df RBX: 0000000000042d00 RCX: 00000001b4b20009
[ 826.967273] RDX: 0000000000000000 RSI: 0000000000042d00 RDI: ffff8b016e31a4cf
[ 826.967522] RBP: ffffa4bf738dbab0 R08: 00000001b4b1e009 R09: 0000000000038080
[ 826.967771] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8b1300042400
[ 826.968021] R13: 0000000000042d00 R14: 00000000ffffffff R15: 0000000000000020
[ 826.968272] FS: 0000000000000000(0000) GS:ffff8b31de240000(0000) knlGS:0000000000000000
[ 826.968526] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 826.968779] CR2: ffff8b016e31a4df CR3: 0000000a87034000 CR4: 0000000000750ee0
[ 826.969036] PKRU: 55555554
[ 826.969289] Call Trace:
[ 826.969541] <TASK>
[ 826.969796] ? show_regs+0x6d/0x80
[ 826.970059] ? __die+0x24/0x80
[ 826.970315] ? page_fault_oops+0x176/0x500
[ 826.970567] ? srso_alias_return_thunk+0x5/0x7f
[ 826.970820] ? srso_alias_return_thunk+0x5/0x7f
[ 826.971069] ? __kmem_cache_alloc_node+0x1f7/0x360
[ 826.971320] ? kernelmode_fixup_or_oops+0xb2/0x140
[ 826.971594] ? __bad_area_nosemaphore+0x1a5/0x280
[ 826.971844] ? srso_alias_return_thunk+0x5/0x7f
[ 826.972119] ? bad_area_nosemaphore+0x16/0x30
[ 826.972386] ? do_kern_addr_fault+0x7b/0xa0
[ 826.972658] ? exc_page_fault+0x10d/0x1b0
[ 826.972912] ? asm_exc_page_fault+0x27/0x30
[ 826.973176] ? __kmem_cache_alloc_node+0x1f7/0x360
[ 826.973450] ? spl_kvmalloc+0x84/0xc0 [spl]
[ 826.973713] ? spl_kvmalloc+0x84/0xc0 [spl]
[ 826.973981] __kmalloc_node+0x52/0xe0
[ 826.974267] spl_kvmalloc+0x84/0xc0 [spl]
[ 826.974526] spl_kmem_zalloc+0xf2/0x130 [spl]
[ 826.974800] vdev_raidz_map_alloc+0x80/0x600 [zfs]
[ 826.975224] ? srso_alias_return_thunk+0x5/0x7f
[ 826.975479] ? spl_kmem_cache_alloc+0x94/0x660 [spl]
[ 826.975729] ? sg_alloc_table+0x27/0xc0
[ 826.975978] vdev_raidz_io_start+0x38/0x310 [zfs]
[ 826.976294] ? zio_create+0x3e8/0x660 [zfs]
[ 826.976607] zio_vdev_io_start+0x14c/0x340 [zfs]
[ 826.976930] ? __pfx_vdev_mirror_child_done+0x10/0x10 [zfs]
[ 826.977239] zio_nowait+0xd2/0x1c0 [zfs]
[ 826.977546] vdev_mirror_io_start+0xa7/0x270 [zfs]
[ 826.977849] zio_vdev_io_start+0x2a5/0x340 [zfs]
[ 826.978156] zio_execute+0x88/0x130 [zfs]
[ 826.978459] taskq_thread+0x27f/0x490 [spl]
[ 826.978709] ? __pfx_default_wake_function+0x10/0x10
[ 826.978956] ? __pfx_zio_execute+0x10/0x10 [zfs]
[ 826.979261] ? __pfx_taskq_thread+0x10/0x10 [spl]
[ 826.979509] kthread+0xef/0x120
[ 826.979754] ? __pfx_kthread+0x10/0x10
[ 826.979998] ret_from_fork+0x44/0x70
[ 826.980242] ? __pfx_kthread+0x10/0x10
[ 826.980486] ret_from_fork_asm+0x1b/0x30
[ 826.980733] </TASK>
[ 826.980974] Modules linked in: veth ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables iptable_filter bpfilter softdog nf_tables sunrpc binfmt_misc bonding tls nfnetlink_log nfnetlink hid_logitech_hidpp input_leds joydev hid_logitech_dj hid_generic usbkbd usbmouse snd_usb_audio snd_usbmidi_lib snd_ump usbhid snd_rawmidi snd_seq_device hid mc intel_rapl_msr intel_rapl_common iwlmvm edac_mce_amd mac80211 amdgpu kvm_amd libarc4 kvm snd_hda_codec_hdmi amdxcp iommu_v2 drm_buddy gpu_sched snd_hda_intel drm_suballoc_helper irqbypass drm_ttm_helper crct10dif_pclmul snd_intel_dspcfg polyval_clmulni ttm snd_intel_sdw_acpi polyval_generic snd_hda_codec btusb ghash_clmulni_intel drm_display_helper aesni_intel btrtl iwlwifi btbcm snd_hda_core cec btintel snd_hwdep rc_core btmtk eeepc_wmi crypto_simd asus_nb_wmi snd_pcm cryptd bluetooth snd_timer asus_wmi drm_kms_helper rapl ledtrig_audio snd sparse_keymap cfg80211 platform_profile pcspkr wmi_bmof i2c_algo_bit soundcore ecdh_generic ccp k10temp ecc
[ 826.981044] mac_hid vhost_net vhost vhost_iotlb tap drm efi_pstore dmi_sysfs ip_tables x_tables autofs4 zfs(PO) spl(O) btrfs blake2b_generic xor raid6_pq xhci_pci nvme xhci_pci_renesas crc32_pclmul bnx2x mpt3sas xhci_hcd igc ahci i2c_piix4 nvme_core raid_class libahci mdio scsi_transport_sas nvme_common libcrc32c video wmi gpio_amdpt
[ 826.983097] CR2: ffff8b016e31a4df
[ 826.983427] ---[ end trace 0000000000000000 ]---
[ 828.466288] RIP: 0010:__kmem_cache_alloc_node+0x1f7/0x360
[ 828.466684] Code: 5e 41 5f 5d 31 d2 31 c9 31 f6 31 ff 45 31 c0 45 31 c9 e9 cc fc c5 00 41 8b 44 24 28 4d 8b 0c 24 49 8d 88 00 20 00 00 48 01 f8 <48> 8b 18 48 89 c2 49 33 9c 24 b8 00 00 00 48 89 f8 48 0f ca 48 31
[ 828.467018] RSP: 0018:ffffa4bf738dba60 EFLAGS: 00010282
[ 828.467356] RAX: ffff8b016e31a4df RBX: 0000000000042d00 RCX: 00000001b4b20009
[ 828.467690] RDX: 0000000000000000 RSI: 0000000000042d00 RDI: ffff8b016e31a4cf
[ 828.468002] RBP: ffffa4bf738dbab0 R08: 00000001b4b1e009 R09: 0000000000038080
[ 828.468311] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8b1300042400
[ 828.468621] R13: 0000000000042d00 R14: 00000000ffffffff R15: 0000000000000020
[ 828.468943] FS: 0000000000000000(0000) GS:ffff8b31de240000(0000) knlGS:0000000000000000
[ 828.469252] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 828.469562] CR2: ffff8b016e31a4df CR3: 000000013915a000 CR4: 0000000000750ee0
[ 828.469873] PKRU: 55555554
[ 828.470180] note: z_wr_iss[2396] exited with irqs disabled
[ 828.471008] BUG: unable to handle page fault for address: ffff8b016e31a4df
[ 828.471526] #PF: supervisor read access in kernel mode
[ 828.471856] #PF: error_code(0x0000) - not-present page
[ 828.472178] PGD a88a01067 P4D a88a01067 PUD 0
[ 828.472500] Oops: 0000 [#2] PREEMPT SMP NOPTI
[ 828.472817] CPU: 9 PID: 246 Comm: kworker/9:1 Tainted: P D O 6.5.11-4-pve #1
[ 828.473143] Hardware name: ASUS System Product Name/ROG STRIX X670E-E GAMING WIFI, BIOS 1709 09/28/2023
[ 828.473468] Workqueue: events drm_fb_helper_damage_work [drm_kms_helper]
[ 828.473806] RIP: 0010:__kmem_cache_alloc_node+0x1f7/0x360
[ 828.474136] Code: 5e 41 5f 5d 31 d2 31 c9 31 f6 31 ff 45 31 c0 45 31 c9 e9 cc fc c5 00 41 8b 44 24 28 4d 8b 0c 24 49 8d 88 00 20 00 00 48 01 f8 <48> 8b 18 48 89 c2 49 33 9c 24 b8 00 00 00 48 89 f8 48 0f ca 48 31
[ 828.474483] RSP: 0018:ffffa4bf40ab3990 EFLAGS: 00010282
[ 828.474846] RAX: ffff8b016e31a4df RBX: 0000000000000cc0 RCX: 00000001b4b20009
[ 828.475223] RDX: 0000000000000000 RSI: 0000000000000cc0 RDI: ffff8b016e31a4cf
[ 828.475598] RBP: ffffa4bf40ab39d8 R08: 00000001b4b1e009 R09: 0000000000038080
[ 828.475971] R10: ffff8b133ec80000 R11: ffff8b1316d43000 R12: ffff8b1300042400
[ 828.476330] R13: 0000000000000cc0 R14: 00000000ffffffff R15: 0000000000000020
[ 828.476701] FS: 0000000000000000(0000) GS:ffff8b31de240000(0000) knlGS:0000000000000000
[ 828.477068] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 828.477458] CR2: ffff8b016e31a4df CR3: 000000013915a000 CR4: 0000000000750ee0
[ 828.477842] PKRU: 55555554
[ 828.478234] Call Trace:
[ 828.478610] <TASK>
[ 828.478976] ? show_regs+0x6d/0x80
[ 828.479383] ? __die+0x24/0x80
[ 828.479786] ? page_fault_oops+0x176/0x500
[ 828.480172] ? srso_alias_return_thunk+0x5/0x7f
[ 828.480567] ? srso_alias_return_thunk+0x5/0x7f
[ 828.480948] ? __kmem_cache_alloc_node+0x1f7/0x360
[ 828.481355] ? kernelmode_fixup_or_oops+0xb2/0x140
[ 828.481752] ? __bad_area_nosemaphore+0x1a5/0x280
[ 828.482137] ? srso_alias_return_thunk+0x5/0x7f
[ 828.482508] ? bad_area_nosemaphore+0x16/0x30
[ 828.482911] ? do_kern_addr_fault+0x7b/0xa0
[ 828.483293] ? exc_page_fault+0x10d/0x1b0
[ 828.483681] ? asm_exc_page_fault+0x27/0x30
[ 828.484066] ? __kmem_cache_alloc_node+0x1f7/0x360
[ 828.484461] ? drm_atomic_get_private_obj_state+0xbb/0x1c0 [drm]
[ 828.484882] ? drm_atomic_get_private_obj_state+0xbb/0x1c0 [drm]
[ 828.485281] __kmalloc_node_track_caller+0x51/0xe0
[ 828.485667] krealloc+0x70/0x100
[ 828.486075] drm_atomic_get_private_obj_state+0xbb/0x1c0 [drm]
[ 828.486463] dm_update_plane_state+0x17e/0x670 [amdgpu]
[ 828.487032] amdgpu_dm_atomic_check+0x647/0x14d0 [amdgpu]
[ 828.487590] ? srso_alias_return_thunk+0x5/0x7f
[ 828.487969] ? find_busiest_group+0x48/0x370
[ 828.488345] drm_atomic_check_only+0x668/0xb20 [drm]
[ 828.488724] drm_atomic_commit+0x58/0xd0 [drm]
[ 828.489110] ? __pfx___drm_printfn_info+0x10/0x10 [drm]
[ 828.489469] drm_atomic_helper_dirtyfb+0x1a2/0x290 [drm_kms_helper]
[ 828.489799] amdgpu_dirtyfb+0x13/0x40 [amdgpu]
[ 828.490213] drm_fbdev_generic_helper_fb_dirty+0x268/0x340 [drm_kms_helper]
[ 828.490517] ? finish_task_switch.isra.0+0x85/0x2c0
[ 828.490813] drm_fb_helper_damage_work+0x94/0x180 [drm_kms_helper]
[ 828.491114] process_one_work+0x23b/0x450
[ 828.491410] worker_thread+0x50/0x3f0
[ 828.491693] ? __pfx_worker_thread+0x10/0x10
[ 828.491964] kthread+0xef/0x120
[ 828.492233] ? __pfx_kthread+0x10/0x10
[ 828.492501] ret_from_fork+0x44/0x70
[ 828.492768] ? __pfx_kthread+0x10/0x10
[ 828.493035] ret_from_fork_asm+0x1b/0x30
[ 828.493303] </TASK>
[ 828.493564] Modules linked in: veth ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables iptable_filter bpfilter softdog nf_tables sunrpc binfmt_misc bonding tls nfnetlink_log nfnetlink hid_logitech_hidpp input_leds joydev hid_logitech_dj hid_generic usbkbd usbmouse snd_usb_audio snd_usbmidi_lib snd_ump usbhid snd_rawmidi snd_seq_device hid mc intel_rapl_msr intel_rapl_common iwlmvm edac_mce_amd mac80211 amdgpu kvm_amd libarc4 kvm snd_hda_codec_hdmi amdxcp iommu_v2 drm_buddy gpu_sched snd_hda_intel drm_suballoc_helper irqbypass drm_ttm_helper crct10dif_pclmul snd_intel_dspcfg polyval_clmulni ttm snd_intel_sdw_acpi polyval_generic snd_hda_codec btusb ghash_clmulni_intel drm_display_helper aesni_intel btrtl iwlwifi btbcm snd_hda_core cec btintel snd_hwdep rc_core btmtk eeepc_wmi crypto_simd asus_nb_wmi snd_pcm cryptd bluetooth snd_timer asus_wmi drm_kms_helper rapl ledtrig_audio snd sparse_keymap cfg80211 platform_profile pcspkr wmi_bmof i2c_algo_bit soundcore ecdh_generic ccp k10temp ecc
[ 828.493630] mac_hid vhost_net vhost vhost_iotlb tap drm efi_pstore dmi_sysfs ip_tables x_tables autofs4 zfs(PO) spl(O) btrfs blake2b_generic xor raid6_pq xhci_pci nvme xhci_pci_renesas crc32_pclmul bnx2x mpt3sas xhci_hcd igc ahci i2c_piix4 nvme_core raid_class libahci mdio scsi_transport_sas nvme_common libcrc32c video wmi gpio_amdpt
[ 828.495772] CR2: ffff8b016e31a4df
[ 828.496096] ---[ end trace 0000000000000000 ]---
[ 828.816025] RIP: 0010:__kmem_cache_alloc_node+0x1f7/0x360
[ 828.816420] Code: 5e 41 5f 5d 31 d2 31 c9 31 f6 31 ff 45 31 c0 45 31 c9 e9 cc fc c5 00 41 8b 44 24 28 4d 8b 0c 24 49 8d 88 00 20 00 00 48 01 f8 <48> 8b 18 48 89 c2 49 33 9c 24 b8 00 00 00 48 89 f8 48 0f ca 48 31
[ 828.816770] RSP: 0018:ffffa4bf738dba60 EFLAGS: 00010282
[ 828.817119] RAX: ffff8b016e31a4df RBX: 0000000000042d00 RCX: 00000001b4b20009
[ 828.817460] RDX: 0000000000000000 RSI: 0000000000042d00 RDI: ffff8b016e31a4cf
[ 828.817789] RBP: ffffa4bf738dbab0 R08: 00000001b4b1e009 R09: 0000000000038080
[ 828.818109] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8b1300042400
[ 828.818428] R13: 0000000000042d00 R14: 00000000ffffffff R15: 0000000000000020
[ 828.818746] FS: 0000000000000000(0000) GS:ffff8b31de240000(0000) knlGS:0000000000000000
[ 828.819068] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 828.819387] CR2: ffff8b016e31a4df CR3: 000000013915a000 CR4: 0000000000750ee0
[ 828.819707] PKRU: 55555554
[ 828.820022] note: kworker/9:1[246] exited with irqs disabled
[ 831.396082] BUG: unable to handle page fault for address: ffff8b016e31a4df
[ 831.396544] #PF: supervisor read access in kernel mode
[ 831.396871] #PF: error_code(0x0000) - not-present page
[ 831.397194] PGD a88a01067 P4D a88a01067 PUD 0
[ 831.397513] Oops: 0000 [#3] PREEMPT SMP NOPTI
[ 831.397832] CPU: 9 PID: 3003 Comm: pve-firewall Tainted: P D O 6.5.11-4-pve #1
[ 831.398155] Hardware name: ASUS System Product Name/ROG STRIX X670E-E GAMING WIFI, BIOS 1709 09/28/2023
[ 831.398481] RIP: 0010:__kmem_cache_alloc_node+0x1f7/0x360
[ 831.398815] Code: 5e 41 5f 5d 31 d2 31 c9 31 f6 31 ff 45 31 c0 45 31 c9 e9 cc fc c5 00 41 8b 44 24 28 4d 8b 0c 24 49 8d 88 00 20 00 00 48 01 f8 <48> 8b 18 48 89 c2 49 33 9c 24 b8 00 00 00 48 89 f8 48 0f ca 48 31
[ 831.399161] RSP: 0018:ffffa4bf6b213bb0 EFLAGS: 00010282
[ 831.399509] RAX: ffff8b016e31a4df RBX: 0000000000000dc0 RCX: 00000001b4b20009
[ 831.399859] RDX: 0000000000000000 RSI: 0000000000000dc0 RDI: ffff8b016e31a4cf
[ 831.400210] RBP: ffffa4bf6b213bf8 R08: 00000001b4b1e009 R09: 0000000000038080
[ 831.400563] R10: 0000000000000000 R11: ffff8b138b462f80 R12: ffff8b1300042400
[ 831.400919] R13: 0000000000000dc0 R14: 00000000ffffffff R15: 0000000000000020
[ 831.401274] FS: 00007fcde51ecb80(0000) GS:ffff8b31de240000(0000) knlGS:0000000000000000
[ 831.401631] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 831.401988] CR2: ffff8b016e31a4df CR3: 000000014cc4e000 CR4: 0000000000750ee0
[ 831.402347] PKRU: 55555554
[ 831.402706] Call Trace:
[ 831.403064] <TASK>
[ 831.403422] ? show_regs+0x6d/0x80
[ 831.403782] ? __die+0x24/0x80
[ 831.404137] ? page_fault_oops+0x176/0x500
[ 831.404498] ? srso_alias_return_thunk+0x5/0x7f
[ 831.404860] ? srso_alias_return_thunk+0x5/0x7f
[ 831.405215] ? __kmem_cache_alloc_node+0x1f7/0x360
[ 831.405571] ? kernelmode_fixup_or_oops+0xb2/0x140
[ 831.405926] ? __bad_area_nosemaphore+0x1a5/0x280
[ 831.406283] ? bad_area_nosemaphore+0x16/0x30
[ 831.406636] ? do_kern_addr_fault+0x7b/0xa0
[ 831.406989] ? exc_page_fault+0x10d/0x1b0
[ 831.407341] ? asm_exc_page_fault+0x27/0x30
[ 831.407693] ? __kmem_cache_alloc_node+0x1f7/0x360
[ 831.408043] ? __vmalloc_node_range+0x1c8/0x8c0
[ 831.408391] ? __vmalloc_node_range+0x1c8/0x8c0
[ 831.408742] __kmalloc_node+0x52/0xe0
[ 831.409091] __vmalloc_node_range+0x1c8/0x8c0
[ 831.409438] ? kernel_clone+0xbd/0x440
[ 831.409785] ? copy_process+0x21c/0x1d50
[ 831.410131] copy_process+0x6da/0x1d50
[ 831.410478] ? kernel_clone+0xbd/0x440
[ 831.410823] ? srso_alias_return_thunk+0x5/0x7f
[ 831.411170] ? kmem_cache_alloc+0x1a4/0x380
[ 831.411518] kernel_clone+0xbd/0x440
[ 831.411855] __do_sys_clone+0x66/0xa0
[ 831.412183] __x64_sys_clone+0x25/0x40
[ 831.412500] do_syscall_64+0x58/0x90
[ 831.412808] ? srso_alias_return_thunk+0x5/0x7f
[ 831.413109] ? exc_page_fault+0x94/0x1b0
[ 831.413398] entry_SYSCALL_64_after_hwframe+0x6e/0xd8
[ 831.413678] RIP: 0033:0x7fcde52fe193
[ 831.413975] Code: 00 00 00 00 00 66 90 64 48 8b 04 25 10 00 00 00 45 31 c0 31 d2 31 f6 bf 11 00 20 01 4c 8d 90 d0 02 00 00 b8 38 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 35 89 c2 85 c0 75 2c 64 48 8b 04 25 10 00 00
[ 831.414288] RSP: 002b:00007ffea8ac4078 EFLAGS: 00000246 ORIG_RAX: 0000000000000038
[ 831.414589] RAX: ffffffffffffffda RBX: 0000000000000002 RCX: 00007fcde52fe193
[ 831.414879] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000001200011
[ 831.415159] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
[ 831.415436] R10: 00007fcde51ece50 R11: 0000000000000246 R12: 0000000000000001
[ 831.415713] R13: 00007ffea8ac4190 R14: 00007ffea8ac4210 R15: 00007fcde5525020
[ 831.415991] </TASK>
[ 831.416264] Modules linked in: veth ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables iptable_filter bpfilter softdog nf_tables sunrpc binfmt_misc bonding tls nfnetlink_log nfnetlink hid_logitech_hidpp input_leds joydev hid_logitech_dj hid_generic usbkbd usbmouse snd_usb_audio snd_usbmidi_lib snd_ump usbhid snd_rawmidi snd_seq_device hid mc intel_rapl_msr intel_rapl_common iwlmvm edac_mce_amd mac80211 amdgpu kvm_amd libarc4 kvm snd_hda_codec_hdmi amdxcp iommu_v2 drm_buddy gpu_sched snd_hda_intel drm_suballoc_helper irqbypass drm_ttm_helper crct10dif_pclmul snd_intel_dspcfg polyval_clmulni ttm snd_intel_sdw_acpi polyval_generic snd_hda_codec btusb ghash_clmulni_intel drm_display_helper aesni_intel btrtl iwlwifi btbcm snd_hda_core cec btintel snd_hwdep rc_core btmtk eeepc_wmi crypto_simd asus_nb_wmi snd_pcm cryptd bluetooth snd_timer asus_wmi drm_kms_helper rapl ledtrig_audio snd sparse_keymap cfg80211 platform_profile pcspkr wmi_bmof i2c_algo_bit soundcore ecdh_generic ccp k10temp ecc
[ 831.416328] mac_hid vhost_net vhost vhost_iotlb tap drm efi_pstore dmi_sysfs ip_tables x_tables autofs4 zfs(PO) spl(O) btrfs blake2b_generic xor raid6_pq xhci_pci nvme xhci_pci_renesas crc32_pclmul bnx2x mpt3sas xhci_hcd igc ahci i2c_piix4 nvme_core raid_class libahci mdio scsi_transport_sas nvme_common libcrc32c video wmi gpio_amdpt
[ 831.418563] CR2: ffff8b016e31a4df
[ 831.418900] ---[ end trace 0000000000000000 ]---
[ 831.737360] RIP: 0010:__kmem_cache_alloc_node+0x1f7/0x360
[ 831.737703] Code: 5e 41 5f 5d 31 d2 31 c9 31 f6 31 ff 45 31 c0 45 31 c9 e9 cc fc c5 00 41 8b 44 24 28 4d 8b 0c 24 49 8d 88 00 20 00 00 48 01 f8 <48> 8b 18 48 89 c2 49 33 9c 24 b8 00 00 00 48 89 f8 48 0f ca 48 31
[ 831.738057] RSP: 0018:ffffa4bf738dba60 EFLAGS: 00010282
[ 831.738411] RAX: ffff8b016e31a4df RBX: 0000000000042d00 RCX: 00000001b4b20009
[ 831.738764] RDX: 0000000000000000 RSI: 0000000000042d00 RDI: ffff8b016e31a4cf
[ 831.739114] RBP: ffffa4bf738dbab0 R08: 00000001b4b1e009 R09: 0000000000038080
[ 831.739461] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8b1300042400
[ 831.739793] R13: 0000000000042d00 R14: 00000000ffffffff R15: 0000000000000020
[ 831.740124] FS: 00007fcde51ecb80(0000) GS:ffff8b31de240000(0000) knlGS:0000000000000000
[ 831.740457] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 831.740789] CR2: ffff8b016e31a4df CR3: 000000014cc4e000 CR4: 0000000000750ee0
[ 831.741124] PKRU: 55555554
[ 831.741455] note: pve-firewall[3003] exited with irqs disabled
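The traces above are long, so a short summary can make the pattern easier to see. The following is a minimal Python sketch, not part of the original report, that pulls the faulting address, task name, and faulting function out of each "BUG: unable to handle page fault" report and decodes the x86 page-fault error code. The helper names `summarize_oopses` and `decode_pf_error` and the regular expressions are assumptions made for illustration; they are written only against the dmesg format shown in this issue. Run over the full log, it should list five reports across z_wr_int_0, dbuf_evict, z_wr_iss, kworker/9:1 and pve-firewall, three of which share the address ffff8b016e31a4df.

```python
import re

# Patterns written against the dmesg lines shown in this report (assumed format).
BUG_RE = re.compile(r"BUG: unable to handle page fault for address: ([0-9a-f]+)")
ERR_RE = re.compile(r"#PF: error_code\((0x[0-9a-fA-F]+)\)")
COMM_RE = re.compile(r"CPU: \d+ PID: (\d+) Comm: (\S+)")
RIP_RE = re.compile(r"RIP: 0010:([\w.]+)\+0x")


def decode_pf_error(code):
    """Decode the low five bits of an x86 page-fault error code."""
    return {
        "present": bool(code & 0x01),            # 0 means not-present page
        "write": bool(code & 0x02),              # 0 means read access
        "user": bool(code & 0x04),               # 0 means supervisor mode
        "reserved_bit": bool(code & 0x08),
        "instruction_fetch": bool(code & 0x10),
    }


def summarize_oopses(log_text):
    """Return one dict per page-fault report found in log_text."""
    reports = []
    current = None
    for line in log_text.splitlines():
        m = BUG_RE.search(line)
        if m:
            # A new report starts at every "BUG:" line.
            current = {"address": m.group(1), "error": None,
                       "pid": None, "comm": None, "rip": None}
            reports.append(current)
            continue
        if current is None:
            continue
        # Only the first match of each field after the "BUG:" line is kept,
        # so the register dump repeated after "end trace" is ignored.
        m = ERR_RE.search(line)
        if m and current["error"] is None:
            current["error"] = decode_pf_error(int(m.group(1), 16))
        m = COMM_RE.search(line)
        if m and current["comm"] is None:
            current["pid"] = int(m.group(1))
            current["comm"] = m.group(2)
        m = RIP_RE.search(line)
        if m and current["rip"] is None:
            current["rip"] = m.group(1)
    return reports


# A few lines copied from the log above, used as sample input.
SAMPLE = """\
[ 905.590301] BUG: unable to handle page fault for address: ffff998480042ac8
[ 905.590353] #PF: error_code(0x0000) - not-present page
[ 905.590943] CPU: 15 PID: 6812 Comm: z_wr_int_0 Tainted: P O 6.5.11-4-pve #1
[ 905.591564] RIP: 0010:__kmem_cache_free+0xb7/0x350
[ 831.396082] BUG: unable to handle page fault for address: ffff8b016e31a4df
[ 831.396871] #PF: error_code(0x0000) - not-present page
[ 831.397832] CPU: 9 PID: 3003 Comm: pve-firewall Tainted: P D O 6.5.11-4-pve #1
[ 831.398481] RIP: 0010:__kmem_cache_alloc_node+0x1f7/0x360
"""

if __name__ == "__main__":
    for report in summarize_oopses(SAMPLE):
        print(report["address"], report["comm"], report["rip"])
```

To use it on a real log, pass the saved dmesg text to `summarize_oopses` instead of `SAMPLE`. Faults in the slab allocator that hit unrelated tasks such as a DRM worker and pve-firewall fit the memory-hardware explanation given in the comments, although the summary alone does not prove it.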