Use new -P to get list of zpool devices #13

b333z · 2016-05-04T10:31:14Z

Have adjusted get_pool_devices to use the -P switch, here is (with a dodgy stdout dinfo replacement) an example on pool with a mixture (let us know if any idea's for cleanup)

# zpool status system
  pool: system
 state: ONLINE
status: Some supported features are not enabled on the pool. The pool can
        still be used, but some features are unavailable.
action: Enable all features using 'zpool upgrade'. Once this is done,
        the pool may no longer be accessible by software that does not support
        the features. See zpool-features(5) for details.
  scan: scrub canceled on Sun May  1 17:20:59 2016
config:

        NAME                                                   STATE     READ WRITE CKSUM
        system                                                 ONLINE       0     0     0
          mirror-0                                             ONLINE       0     0     0
            sdh                                                ONLINE       0     0     0
            sdf                                                ONLINE       0     0     0
            sdi                                                ONLINE       0     0     0
        logs
          mirror-1                                             ONLINE       0     0     0
            sdd3                                               ONLINE       0     0     0
            sde3                                               ONLINE       0     0     0
        cache
          ata-Samsung_SSD_840_EVO_120GB_S1D5NSBDA42933Z-part5  ONLINE       0     0     0
          ata-Samsung_SSD_840_EVO_120GB_S1D5NSBDA20279M-part5  ONLINE       0     0     0

errors: No known data errors

# get_pool_devices system
dinfo: zfsexpandknowledge: pool system has device /dev/sdh1 (which resolves to /dev/sdh1)
/dev/sdh1
dinfo: zfsexpandknowledge: pool system has device /dev/sdf1 (which resolves to /dev/sdf1)
/dev/sdf1
dinfo: zfsexpandknowledge: pool system has device /dev/sdi1 (which resolves to /dev/sdi1)
/dev/sdi1
dinfo: zfsexpandknowledge: pool system has device /dev/sdd3 (which resolves to /dev/sdd3)
/dev/sdd3
dinfo: zfsexpandknowledge: pool system has device /dev/sde3 (which resolves to /dev/sde3)
/dev/sde3
dinfo: zfsexpandknowledge: pool system has device /dev/disk/by-id/ata-Samsung_SSD_840_EVO_120GB_S1D5NSBDA42933Z-part5 (which resolves to /dev/sdd5)
/dev/sdd5
dinfo: zfsexpandknowledge: pool system has device /dev/disk/by-id/ata-Samsung_SSD_840_EVO_120GB_S1D5NSBDA20279M-part5 (which resolves to /dev/sde5)
/dev/sde5

Under certain loads, the following panic is hit: panic: page fault KDB: stack backtrace: #0 0xffffffff805db025 at kdb_backtrace+0x65 #1 0xffffffff8058e86f at vpanic+0x17f #2 0xffffffff8058e6e3 at panic+0x43 #3 0xffffffff808adc15 at trap_fatal+0x385 #4 0xffffffff808adc6f at trap_pfault+0x4f #5 0xffffffff80886da8 at calltrap+0x8 #6 0xffffffff80669186 at vgonel+0x186 #7 0xffffffff80669841 at vgone+0x31 #8 0xffffffff8065806d at vfs_hash_insert+0x26d #9 0xffffffff81a39069 at sfs_vgetx+0x149 #10 0xffffffff81a39c54 at zfsctl_snapdir_lookup+0x1e4 #11 0xffffffff8065a28c at lookup+0x45c #12 0xffffffff806594b9 at namei+0x259 #13 0xffffffff80676a33 at kern_statat+0xf3 #14 0xffffffff8067712f at sys_fstatat+0x2f openzfs#15 0xffffffff808ae50c at amd64_syscall+0x10c openzfs#16 0xffffffff808876bb at fast_syscall_common+0xf8 The page fault occurs because vgonel() will call VOP_CLOSE() for active vnodes. For this reason, define vop_close for zfsctl_ops_snapshot. While here, define vop_open for consistency. After adding the necessary vop, the bug progresses to the following panic: panic: VERIFY3(vrecycle(vp) == 1) failed (0 == 1) cpuid = 17 KDB: stack backtrace: #0 0xffffffff805e29c5 at kdb_backtrace+0x65 #1 0xffffffff8059620f at vpanic+0x17f #2 0xffffffff81a27f4a at spl_panic+0x3a #3 0xffffffff81a3a4d0 at zfsctl_snapshot_inactive+0x40 #4 0xffffffff8066fdee at vinactivef+0xde #5 0xffffffff80670b8a at vgonel+0x1ea #6 0xffffffff806711e1 at vgone+0x31 #7 0xffffffff8065fa0d at vfs_hash_insert+0x26d #8 0xffffffff81a39069 at sfs_vgetx+0x149 #9 0xffffffff81a39c54 at zfsctl_snapdir_lookup+0x1e4 #10 0xffffffff80661c2c at lookup+0x45c #11 0xffffffff80660e59 at namei+0x259 #12 0xffffffff8067e3d3 at kern_statat+0xf3 #13 0xffffffff8067eacf at sys_fstatat+0x2f #14 0xffffffff808b5ecc at amd64_syscall+0x10c openzfs#15 0xffffffff8088f07b at fast_syscall_common+0xf8 This is caused by a race condition that can occur when allocating a new vnode and adding that vnode to the vfs hash. If the newly created vnode loses the race when being inserted into the vfs hash, it will not be recycled as its usecount is greater than zero, hitting the above assertion. Fix this by dropping the assertion. FreeBSD-issue: https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=252700 Reviewed-by: Andriy Gapon <avg@FreeBSD.org> Reviewed-by: Mateusz Guzik <mjguzik@gmail.com> Reviewed-by: Alek Pinchuk <apinchuk@axcient.com> Reviewed-by: Ryan Moeller <ryan@iXsystems.com> Signed-off-by: Rob Wing <rob.wing@klarasystems.com> Co-authored-by: Rob Wing <rob.wing@klarasystems.com> Submitted-by: Klara, Inc. Sponsored-by: rsync.net Closes openzfs#14501

Use new -P to get list of zpool devices

eb05c22

b333z mentioned this pull request May 4, 2016

dracut fails when pool members are /dev/sd* openzfs/zfs#4577

Closed

Rudd-O merged commit bd5aba7 into Rudd-O:topic-fixboot May 10, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use new -P to get list of zpool devices #13

Use new -P to get list of zpool devices #13

b333z commented May 4, 2016 •

edited

Loading

Use new -P to get list of zpool devices #13

Use new -P to get list of zpool devices #13

Conversation

b333z commented May 4, 2016 • edited Loading

b333z commented May 4, 2016 •

edited

Loading