ZFS suspends my pool On I/O disk failure.. #7118

Closed
morphinz opened this issue Feb 2, 2018 · 20 comments

@morphinz

morphinz commented Feb 2, 2018

System information

Type                  Version/Name
Distribution Name     ARCH Linux
Distribution Version  SMP PREEMPT Wed Jan 10 11:14:50 UTC 2018
Linux Kernel          4.14.12-1-ARCH
Architecture          x86_64
ZFS Version           zfs-linux 0.7.5.4.14.12.1-1
SPL Version           spl-linux 0.7.5.4.14.12.1-1

Describe the problem you're observing

I have a pool with the multihost=on property.
Twice in three days ZFS has suspended my pool. I can't find any log entry explaining why; it just suspends without saying anything else. This is the only log message about it:

[Tue Jan 30 09:16:52 2018] WARNING: Pool 'clspool' has encountered an uncorrectable I/O failure and has been suspended.

After the first failure I saw a few disk I/O failures, but "zpool status" showed no faults on any disk.
I then rebooted the server and got my pool back. Everything looked clean, I didn't see anything bad in dmesg, and I moved on.
But three days later, this morning, ZFS suspended my pool again. I saw disk I/O failures on the same disk, and zpool status reported 71 faults on it. Every vdev in my pool has 10 disks spread across 5 different JBODs, and I use raidz2. A single disk failure should not be able to break my 226-disk pool. I also have 5 spares. I have even lost a whole JBOD on this system without any problem. But one disk failure ruins it? What is going on?

I'm afraid ZFS can't handle a single disk failure right now, and I think the problem may be "multihost=on".
On other pools, kernels, Unix systems and so on, disk errors have caused me serious trouble such as kernel freezes. But this is the first time I have seen a pool suspend, and the difference here is the multihost property.

If you need anything else, just ask.

Include any warning/errors/backtraces from the system logs

Edit: "Adding logs"

My pool: https://paste.ubuntu.com/26511690/

And this is the suspend log:

Jan 29 17:05:33 Server1 kernel: mpt3sas_cm2: log_info(0x30030101): originator(IOP), code(0x03), sub_code(0x0101)
Jan 29 17:05:33 Server1 kernel: mpt3sas_cm5: log_info(0x30030101): originator(IOP), code(0x03), sub_code(0x0101)
Jan 29 17:05:33 Server1 kernel: mpt3sas_cm4: log_info(0x30030101): originator(IOP), code(0x03), sub_code(0x0101)
Jan 30 09:17:35 Server1 kernel: WARNING: Pool 'clspool' has encountered an uncorrectable I/O failure and has been suspended.
Jan 30 09:56:15 Server1 kernel: watchdog: watchdog0: watchdog did not stop!
Jan 30 09:56:15 Server1 kernel: systemd-shutdow: 51 output lines suppressed due to ratelimiting
Jan 30 10:00:43 Server1 systemd-journald[5568]: Missed 4920 kernel messages
Jan 30 10:00:43 Server1 kernel: scsi 13:0:153:0: SSP: enclosure_logical_id(0x50030480092690bf), slot(31)
Jan 30 10:00:43 Server1 kernel: scsi 13:0:153:0: SSP: enclosure level(0x0001), connector name( 0   )
Jan 30 10:00:43 Server1 kernel: scsi 15:0:143:0: Direct-Access     SEAGATE  ST6000NM0095     E003 PQ: 0 ANSI: 6
Jan 30 10:00:43 Server1 kernel: scsi 15:0:143:0: SSP: handle(0x009e), sas_addr(0x5000c50093d4fb7a), phy(22), de

These are the logs I see before the suspend. As you can see, one disk has I/O failures.
Dmesg:

[Tue Jan 30 15:46:19 2018] mpt3sas_cm1: log_info(0x30030101): originator(IOP), code(0x03), sub_code(0x0101)
[Tue Jan 30 15:46:22 2018] mpt3sas_cm4: log_info(0x30030101): originator(IOP), code(0x03), sub_code(0x0101)
[Tue Jan 30 15:46:25 2018] mpt3sas_cm2: log_info(0x30030101): originator(IOP), code(0x03), sub_code(0x0101)
[Tue Jan 30 15:46:25 2018] mpt3sas_cm3: log_info(0x30030101): originator(IOP), code(0x03), sub_code(0x0101)
[Tue Jan 30 15:46:25 2018] mpt3sas_cm5: log_info(0x30030101): originator(IOP), code(0x03), sub_code(0x0101)
[Wed Jan 31 16:10:54 2018] sd 14:0:152:0: attempting task abort! scmd(ffff94f8c94d2448)
[Wed Jan 31 16:10:54 2018] sd 14:0:152:0: [sdwt] tag#7137 CDB: opcode=0x0 00 00 00 00 00 00
[Wed Jan 31 16:10:54 2018] scsi target14:0:152: handle(0x0053), sas_address(0x5000c50093cb92b1), phy(31)
[Wed Jan 31 16:10:54 2018] scsi target14:0:152: enclosure_logical_id(0x5003048009268cbf), slot(30)
[Wed Jan 31 16:10:54 2018] scsi target14:0:152: enclosure level(0x0001),connector name(1   )
[Wed Jan 31 16:10:54 2018] sd 14:0:152:0: task abort: SUCCESS scmd(ffff94f8c94d2448)
[Wed Jan 31 16:10:54 2018] sd 14:0:71:0: attempting task abort! scmd(ffff94f8e3cd4748)
[Wed Jan 31 16:10:54 2018] sd 14:0:71:0: [sdtt] tag#7133 CDB: opcode=0x0 00 00 00 00 00 00
[Wed Jan 31 16:10:54 2018] scsi target14:0:71: handle(0x00a7), sas_address(0x5000c50093cb92b2), phy(31)
[Wed Jan 31 16:10:54 2018] scsi target14:0:71: enclosure_logical_id(0x5003048009268cbf), slot(30)
[Wed Jan 31 16:10:54 2018] scsi target14:0:71: enclosure level(0x0001),connector name(0   )
[Wed Jan 31 16:10:54 2018] sd 14:0:71:0: task abort: SUCCESS scmd(ffff94f8e3cd4748)
[Wed Jan 31 16:10:54 2018] device-mapper: multipath: Failing path 65:688.
[Wed Jan 31 16:10:54 2018] device-mapper: multipath: Failing path 70:656.
[Wed Jan 31 16:10:55 2018] print_req_error: I/O error, dev dm-285, sector 11721044992
[Wed Jan 31 16:10:55 2018] print_req_error: I/O error, dev dm-285, sector 11721044992
[Wed Jan 31 16:10:55 2018] Buffer I/O error on dev dm-285, logical block 1465130624, async page read
[Wed Jan 31 16:11:25 2018] device-mapper: multipath: Reinstating path 65:688.
[Wed Jan 31 16:11:25 2018] device-mapper: multipath: Reinstating path 70:656.
[Fri Feb  2 04:54:50 2018] sd 14:0:152:0: attempting task abort! scmd(ffff94f8c94d2448)
[Fri Feb  2 04:54:50 2018] sd 14:0:152:0: [sdwt] tag#9272 CDB: opcode=0x0 00 00 00 00 00 00
[Fri Feb  2 04:54:50 2018] scsi target14:0:152: handle(0x0053), sas_address(0x5000c50093cb92b1), phy(31)
[Fri Feb  2 04:54:50 2018] scsi target14:0:152: enclosure_logical_id(0x5003048009268cbf), slot(30)
[Fri Feb  2 04:54:50 2018] scsi target14:0:152: enclosure level(0x0001),connector name(1   )
[Fri Feb  2 04:54:50 2018] sd 14:0:152:0: task abort: SUCCESS scmd(ffff94f8c94d2448)
[Fri Feb  2 04:54:50 2018] sd 14:0:71:0: attempting task abort! scmd(ffff94f8e3cd4748)
[Fri Feb  2 04:54:50 2018] sd 14:0:71:0: [sdtt] tag#9270 CDB: opcode=0x0 00 00 00 00 00 00
[Fri Feb  2 04:54:50 2018] scsi target14:0:71: handle(0x00a7), sas_address(0x5000c50093cb92b2), phy(31)
[Fri Feb  2 04:54:50 2018] scsi target14:0:71: enclosure_logical_id(0x5003048009268cbf), slot(30)
[Fri Feb  2 04:54:50 2018] scsi target14:0:71: enclosure level(0x0001),connector name(0   )
[Fri Feb  2 04:54:50 2018] sd 14:0:71:0: task abort: SUCCESS scmd(ffff94f8e3cd4748)
[Fri Feb  2 04:54:51 2018] device-mapper: multipath: Failing path 65:688.
[Fri Feb  2 04:54:51 2018] device-mapper: multipath: Failing path 70:656.
[Fri Feb  2 04:55:20 2018] print_req_error: I/O error, dev dm-285, sector 11721045152
[Fri Feb  2 04:55:20 2018] print_req_error: I/O error, dev dm-285, sector 11721045152
[Fri Feb  2 04:55:20 2018] Buffer I/O error on dev dm-285, logical block 1465130644, async page read
[Fri Feb  2 04:55:36 2018] device-mapper: multipath: Reinstating path 65:688.
[Fri Feb  2 04:55:36 2018] device-mapper: multipath: Reinstating path 70:656.
[Fri Feb  2 04:56:06 2018] sd 14:0:71:0: attempting task abort! scmd(ffff94f8e3d38d48)
[Fri Feb  2 04:56:06 2018] sd 14:0:71:0: [sdtt] tag#1304 CDB: opcode=0x88 88 00 00 00 00 02 ba a0 f4 a0 00 00 00 08 00 00
[Fri Feb  2 04:56:06 2018] scsi target14:0:71: handle(0x00a7), sas_address(0x5000c50093cb92b2), phy(31)
[Fri Feb  2 04:56:06 2018] scsi target14:0:71: enclosure_logical_id(0x5003048009268cbf), slot(30)
[Fri Feb  2 04:56:06 2018] scsi target14:0:71: enclosure level(0x0001),connector name(0   )
[Fri Feb  2 04:56:06 2018] sd 14:0:71:0: task abort: SUCCESS scmd(ffff94f8e3d38d48)
[Fri Feb  2 09:09:40 2018] mpt3sas_cm2: log_info(0x30030101): originator(IOP), code(0x03), sub_code(0x0101)
[Fri Feb  2 09:09:42 2018] mpt3sas_cm4: log_info(0x30030101): originator(IOP), code(0x03), sub_code(0x0101)
[Fri Feb  2 09:09:42 2018] mpt3sas_cm1: log_info(0x30030101): originator(IOP), code(0x03), sub_code(0x0101)
[Fri Feb  2 09:09:42 2018] mpt3sas_cm5: log_info(0x30030101): originator(IOP), code(0x03), sub_code(0x0101)
[Fri Feb  2 09:09:42 2018] mpt3sas_cm3: log_info(0x30030101): originator(IOP), code(0x03), sub_code(0x0101)
[Fri Feb  2 19:45:26 2018] rpc-srv/tcp: nfsd: sent only 606964 when sending 1048708 bytes - shutting down socket
[Fri Feb  2 21:48:17 2018] rpc-srv/tcp: nfsd: sent only 779204 when sending 1048708 bytes - shutting down socket
[Sat Feb  3 12:34:54 2018] rpc-srv/tcp: nfsd: sent only 389668 when sending 1048708 bytes - shutting down socket
@rincebrain
Contributor

Please don't open a new issue for something you already reported in #7097. Close this one and ask around on IRC or zfs-discuss, or just wait for someone to respond with more insight into the problem on the bug.

@morphinz
Author

morphinz commented Feb 2, 2018

@rincebrain sorry about that, but this is not the same issue. I posted it by mistake before I had finished editing it.
As you can see now, this is a totally different issue. And I'm on IRC almost every day.

@rincebrain
Contributor

@morphinz Ah, yes, I see, I was confused why the subject didn't match the contents.

This also sounds like it's not a bug, but something to ask about on IRC or zfs-discuss, unless you have good reason to think it's not related to disk IO errors. I suppose the logs you share will answer that.

@loli10K
Contributor

loli10K commented Feb 3, 2018

And i think may be the problem is "multihost=on".

@morphinz it should be easy enough to verify this hypothesis by cherry-picking 51d1b58 on top of your current version.

@behlendorf
Contributor

@morphinz yes, can you please cherry-pick 51d1b58, or wait for 0.7.6, to determine if this is multihost related. It is possible that the single drive failure is preventing multihost from writing to all of the drives for a few seconds; in that case it will suspend the pool as a safety precaution. This behavior can be disabled by setting zfs_multihost_fail_intervals=0, but that is not recommended for a production configuration.
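
As a rough illustration of the safety check described above, here is a minimal Python sketch. It assumes the rule given in the ZFS module-parameter documentation: the pool is suspended once zfs_multihost_fail_intervals * zfs_multihost_interval milliseconds pass without a successful multihost write, and a value of 0 disables the check. The function name and the 1000 ms interval in the usage note are illustrative assumptions, not values confirmed in this issue.

```python
def mmp_suspend_window_ms(fail_intervals, interval_ms):
    """Return how long (in ms) MMP writes may fail before the pool is suspended.

    Assumed rule: window = zfs_multihost_fail_intervals * zfs_multihost_interval.
    Returns None when fail_intervals is 0, i.e. the suspend check is disabled.
    """
    if fail_intervals < 0 or interval_ms <= 0:
        raise ValueError("fail_intervals must be >= 0 and interval_ms must be > 0")
    if fail_intervals == 0:
        return None  # check disabled: failed MMP writes never suspend the pool
    return fail_intervals * interval_ms
```

For example, `mmp_suspend_window_ms(5, 1000)` gives 5000, and `mmp_suspend_window_ms(0, 1000)` gives None because the check is off.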

@morphinz
Author

morphinz commented Feb 8, 2018

@behlendorf I have a pool of 226 disks in a two-node clustered architecture. Whenever one of them appears to fail, my pool is suspended and all the dependent services fail as well.
So I would like to find a workaround until the root cause is found.

As far as I understand, by setting zfs_multihost_fail_intervals=0 the risk I would be taking is limited to a fairly short time frame. In detail, I suppose that if a multihost write fails it will be retried. And in my circumstances the next retry has a high chance of success, since disk flickering is rare and ends within a few seconds.
If my understanding is correct, I can go with zfs_multihost_fail_intervals=0 until the detailed analysis mentioned here is done.

BTW, 51d1b58 is not included in 0.7.6, if I am not mistaken.

@behlendorf
Contributor

Yes, the risks involved in setting zfs_multihost_fail_intervals=0 are pretty low. The check it disables is intended to ensure that your pool gets suspended if there ever exists a window of time where the partner node would assess (correctly) that the pool can be safely imported because no IO was observed.

If that's a risk you're OK with, then you can absolutely use this as a workaround until @ofaaland root causes what's happening.

But if I understand correctly, you're saying that a single disk failure can cause this; that shouldn't be the case. The MMP code is heartbeating all of those drives and only a single one needs to succeed. Do you know whether, when a drive fails, the others become unavailable in your configuration for some reason?

BTW 51d1b58 is not included 0.7.6 if I am not mistaken.

Indeed, I've added it to the queue for 0.7.7. Sorry about that.

@morphinz
Author

morphinz commented Feb 9, 2018

This risk seems OK to me. Thanks for helping.

I see no sign that all of the disks disappeared. My pool reports as healthy and there are no read, write, or checksum errors, but I have not run a scrub since this issue.
I suspected my multipath configuration, but it is pretty simple, as shown below.

defaults {
    find_multipaths yes
    user_friendly_names no
}

blacklist {
}

My pool is created with devices under /dev/mapper.

However, I am fairly sure that a single disk disappeared and reappeared within a few seconds. This system has been up and running for more than 6 months and a pool suspend had never occurred before.
As soon as this single disk appears to fail, the pool is suspended. It has happened twice. At first I didn't realize the disk was to blame. A few days later the same disk failed again and the pool was suspended again. I have now replaced it.

I'd like to help with further investigation. This is a production site, but I can help with my test systems if it is possible to reproduce the problem there.

@arturpzol

I have experienced a similar issue to @morphinz.

My zpool structure was:

zpool status
pool: Pool-0
state: ONLINE
scan: resilvered 324K in 0h0m with 0 errors on Fri Feb 9 12:42:05 2018
config:

    NAME                                  STATE     READ WRITE CKSUM
    Pool-0                                ONLINE       0     0     0
      mirror-0                            ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27106-3  ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27105-2  ONLINE       0     0     0
      mirror-1                            ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27106-5  ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27105-3  ONLINE       0     0     0
      mirror-2                            ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27106-6  ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27105-4  ONLINE       0     0     0
      mirror-3                            ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27106-4  ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27105-6  ONLINE       0     0     0
      mirror-4                            ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27105-1  ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27106-2  ONLINE       0     0     0
    logs
      mirror-5                            ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27105-5  ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27106-1  ONLINE       0     0     0

errors: No known data errors

Multihost was enabled:

zpool get multihost
NAME PROPERTY VALUE SOURCE
Pool-0 multihost on local

After removing one of the disks and running high I/O, the pool's I/O was suspended:

echo 1 > /sys/block/sdm/device/delete

zpool status
pool: Pool-0
state: DEGRADED
status: One or more devices could not be used because the label is missing or
invalid. Sufficient replicas exist for the pool to continue
functioning in a degraded state.
action: Replace the device using 'zpool replace'.
see: http://zfsonlinux.org/msg/ZFS-8000-4J
scan: resilvered 324K in 0h0m with 0 errors on Fri Feb 9 12:42:05 2018
config:

    NAME                                  STATE     READ WRITE CKSUM
    Pool-0                                DEGRADED     0     0     0
      mirror-0                            ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27106-3  ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27105-2  ONLINE       0     0     0
      mirror-1                            ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27106-5  ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27105-3  ONLINE       0     0     0
      mirror-2                            ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27106-6  ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27105-4  ONLINE       0     0     0
      mirror-3                            DEGRADED     0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27106-4  UNAVAIL      6     4     0
        scsi-SQEMU_QEMU_HARDDISK_27105-6  ONLINE       0     0     0
      mirror-4                            ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27105-1  ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27106-2  ONLINE       0     0     0
    logs
      mirror-5                            ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27105-5  ONLINE       0     0     0
        scsi-SQEMU_QEMU_HARDDISK_27106-1  ONLINE       0     0     0

errors: No known data errors

dmesg

Feb 9 12:47:37 [kern.info] [ 1110.136797] scst: Detached from scsi5, channel 0, id 0, lun 0, type 0
Feb 9 12:47:37 [kern.notice] [ 1110.136953] sd 5:0:0:0: [sdm] Synchronizing SCSI cache
Feb 9 12:47:37 [kern.info] [ 1110.136983] sd 5:0:0:0: [sdm] Synchronize Cache(10) failed: Result: hostbyte=0x0f driverbyte=0x00
Feb 9 12:50:27 [kern.warning] [ 1280.623787] WARNING: Pool 'Pool-0' has encountered an uncorrectable I/O failure and has been suspended.
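
To measure how long the pool kept running between the disk removal and the suspend, the bracketed uptime stamps in kernel log lines like the ones above can be subtracted. The following is a minimal Python sketch; the function names and the marker strings passed to it are illustrative choices, and it assumes each line carries an uptime stamp of the form `[ 1110.136953]`.

```python
import re

# Matches the kernel uptime stamp, e.g. "[ 1110.136953]" (not "[kern.info]").
STAMP = re.compile(r"\[\s*(\d+\.\d+)\]")

def uptime_seconds(line):
    """Return the uptime stamp of a kernel log line as a float, or None."""
    match = STAMP.search(line)
    return float(match.group(1)) if match else None

def seconds_between(lines, start_marker, end_marker):
    """Seconds from the first line containing start_marker to the next
    line containing end_marker; None if either marker is not found."""
    start = None
    for line in lines:
        stamp = uptime_seconds(line)
        if stamp is None:
            continue
        if start is None:
            if start_marker in line:
                start = stamp
        elif end_marker in line:
            return stamp - start
    return None
```

Called with the markers "Synchronizing SCSI cache" and "has been suspended" on the log lines above, it reports roughly 170.49 seconds.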

So I applied 51d1b58 to confirm that this is multihost related, repeated the test, and got:

Feb 9 13:03:25 [kern.warning] [ 400.474357] WARNING: MMP writes to pool 'Pool-0' have not succeeded in over 5s; suspending pool
Feb 9 13:03:25 [kern.warning] [ 400.474360] WARNING: Pool 'Pool-0' has encountered an uncorrectable I/O failure and has been suspended.

Maybe setting zfs_multihost_fail_intervals to 30 seconds, the same as the SCSI timeout for disks, would be a solution?
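
One detail worth keeping in mind for this idea: per the ZFS module-parameter documentation, zfs_multihost_fail_intervals is a count of intervals rather than a duration, and the tolerated window is that count multiplied by zfs_multihost_interval (in milliseconds). A hedged Python sketch of the conversion follows; the helper name is hypothetical and the 1000 ms interval in the usage note is an assumption, so check the actual value under /sys/module/zfs/parameters before relying on it.

```python
def fail_intervals_to_cover(timeout_ms, interval_ms):
    """Smallest fail_intervals value whose window (fail_intervals * interval_ms)
    is at least timeout_ms. Uses integer ceiling division."""
    if timeout_ms <= 0 or interval_ms <= 0:
        raise ValueError("timeout_ms and interval_ms must be positive")
    return -(-timeout_ms // interval_ms)
```

With an assumed 1000 ms interval, covering a 30-second timeout would take `fail_intervals_to_cover(30_000, 1000)`, which is 30 intervals.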

@arturpzol

The issue can also be reproduced without high I/O, using zfs_multihost_fail_intervals=1 and the disk-removal scenario.

It seems that when one disk in a vdev is marked as REMOVED, FAULTED or UNAVAIL, multihost causes the I/O suspend.

I have a test environment that reproduces this 100% of the time, and I can also help with the investigation.

Logs without high I/O:

zpool status -L
  pool: Pool-0
 state: ONLINE
  scan: resilvered 16K in 0h0m with 0 errors on Fri Feb  9 14:44:09 2018
config:

        NAME        STATE     READ WRITE CKSUM
        Pool-0      ONLINE       0     0     0
          mirror-0  ONLINE       0     0     0
            sdd     ONLINE       0     0     0
            sde     ONLINE       0     0     0
          mirror-1  ONLINE       0     0     0
            sdc     ONLINE       0     0     0
            sdb     ONLINE       0     0     0
        logs
          mirror-2  ONLINE       0     0     0
            sdg     ONLINE       0     0     0
            sdf     ONLINE       0     0     0

errors: No known data errors
echo 1 > /sys/module/zfs/parameters/zfs_multihost_fail_intervals
echo 1 > /sys/block/sdd/device/delete
root@192.168.251.29:~$ zpool status -L
  pool: Pool-0
 state: DEGRADED
status: One or more devices could not be used because the label is missing or
        invalid.  Sufficient replicas exist for the pool to continue
        functioning in a degraded state.
action: Replace the device using 'zpool replace'.
   see: http://zfsonlinux.org/msg/ZFS-8000-4J
  scan: resilvered 16K in 0h0m with 0 errors on Fri Feb  9 14:44:09 2018
config:

        NAME                                  STATE     READ WRITE CKSUM
        Pool-0                                DEGRADED     0     0     0
          mirror-0                            DEGRADED     0     0     0
            scsi-SQEMU_QEMU_HARDDISK_27105-4  UNAVAIL      0     0     0
            sde                               ONLINE       0     0     0
          mirror-1                            ONLINE       0     0     0
            sdc                               ONLINE       0     0     0
            sdb                               ONLINE       0     0     0
        logs
          mirror-2                            ONLINE       0     0     0
            sdg                               ONLINE       0     0     0
            sdf                               ONLINE       0     0     0

errors: No known data errors

Feb  9 14:44:28 [kern.info] [  356.265505] scst: Detached from scsi0, channel 0, id 0, lun 4, type 0
Feb  9 14:44:28 [kern.notice] [  356.266286] sd 0:0:0:4: [sdd] Synchronizing SCSI cache
Feb  9 14:45:00 [kern.warning] [  387.724380] WARNING: MMP writes to pool 'Pool-0' have not succeeded in over 1s; suspending pool
Feb  9 14:45:00 [kern.warning] [  387.724384] WARNING: Pool 'Pool-0' has encountered an uncorrectable I/O failure and has been suspended.
zpool events -v

Feb  9 2018 14:44:34.704894115 resource.fs.zfs.statechange
        version = 0x0
        class = "resource.fs.zfs.statechange"
        pool = "Pool-0"
        pool_guid = 0x43905496e8ec1a1e
        pool_state = 0x0
        pool_context = 0x0
        vdev_guid = 0x2cf2970d62cd5675
        vdev_state = "REMOVED" (0x3)
        vdev_path = "/dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1"
        vdev_laststate = "ONLINE" (0x7)
        time = 0x5a7da5c2 0x2a03d4a3 
        eid = 0x1d

Feb  9 2018 14:44:35.704894115 ereport.fs.zfs.vdev.unknown
        class = "ereport.fs.zfs.vdev.unknown"
        ena = 0x547b8c155300001
        detector = (embedded nvlist)
                version = 0x0
                scheme = "zfs"
                pool = 0x43905496e8ec1a1e
                vdev = 0x2cf2970d62cd5675
        (end detector)
        pool = "Pool-0"
        pool_guid = 0x43905496e8ec1a1e
        pool_state = 0x0
        pool_context = 0x0
        pool_failmode = "wait"
        vdev_guid = 0x2cf2970d62cd5675
        vdev_type = "disk"
        vdev_path = "/dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1"
        vdev_ashift = 0x9
        vdev_complete_ts = 0x5356e40dc8
        vdev_delta_ts = 0x1808
        vdev_read_errors = 0x0
        vdev_write_errors = 0x0
        vdev_cksum_errors = 0x0
        parent_guid = 0x922f6fcb574bcd2a
        parent_type = "mirror"
        vdev_spare_paths = 
        vdev_spare_guids = 
        prev_state = 0x1
        time = 0x5a7da5c3 0x2a03d4a3 
        eid = 0x1e

Feb  9 2018 14:44:35.704894115 resource.fs.zfs.statechange
        version = 0x0
        class = "resource.fs.zfs.statechange"
        pool = "Pool-0"
        pool_guid = 0x43905496e8ec1a1e
        pool_state = 0x0
        pool_context = 0x0
        vdev_guid = 0x2cf2970d62cd5675
        vdev_state = "UNAVAIL" (0x4)
        vdev_path = "/dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1"
        vdev_laststate = "REMOVED" (0x3)
        time = 0x5a7da5c3 0x2a03d4a3 
        eid = 0x1f

Feb  9 2018 14:45:00.394894115 ereport.fs.zfs.io_failure
        class = "ereport.fs.zfs.io_failure"
        ena = 0x5a3b62b6ea00001
        detector = (embedded nvlist)
                version = 0x0
                scheme = "zfs"
                pool = 0x43905496e8ec1a1e
        (end detector)
        pool = "Pool-0"
        pool_guid = 0x43905496e8ec1a1e
        pool_state = 0x0
        pool_context = 0x0
        pool_failmode = "wait"
        time = 0x5a7da5dc 0x17899b23 
        eid = 0x20
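
The `time` fields in the events above can be compared directly once decoded. The sketch below assumes the first hex word is seconds since the Unix epoch and the second is nanoseconds; that reading is consistent with the fractional part printed in each event header (for example .704894115), but it is an inference from this output rather than a documented format. The function names are illustrative.

```python
def decode_event_time(field):
    """Decode a 'time' value such as '0x5a7da5c2 0x2a03d4a3' into integer
    nanoseconds since the epoch (assumed layout: seconds, then nanoseconds)."""
    sec_hex, nsec_hex = field.split()
    return int(sec_hex, 16) * 1_000_000_000 + int(nsec_hex, 16)

def elapsed_seconds(first, second):
    """Seconds elapsed between two 'time' values, as a float."""
    return (decode_event_time(second) - decode_event_time(first)) / 1e9
```

Applied to the first statechange event (vdev_state REMOVED) and the final io_failure event, `elapsed_seconds("0x5a7da5c2 0x2a03d4a3", "0x5a7da5dc 0x17899b23")` comes to about 25.69 seconds.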

@behlendorf
Contributor

@arturpzol it would be very helpful if you could rerun your test case with the multihost logging enabled.
That should give us a much better idea of exactly what is happening for each vdev leading up to the pool being suspended. If you can still reproduce the issue with the default tunings, that would be preferable.

echo 100 >/sys/module/zfs/parameters/zfs_multihost_history
<run test>
cat /proc/spl/kstat/zfs/<pool>/multihost
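
Once the history is captured, a small script can make it easier to read. The Python sketch below assumes the kstat text has a column header line beginning with `txg` (txg, timestamp, mmp_delay, vdev_guid, vdev_label, vdev_path), as in the output posted later in this thread, followed by whitespace-separated rows. The function names are hypothetical, and the caller is expected to pass in the text read from /proc/spl/kstat/zfs/<pool>/multihost.

```python
from collections import Counter

INT_COLUMNS = ("txg", "timestamp", "mmp_delay", "vdev_label")

def parse_multihost(text):
    """Parse multihost kstat text into a list of dicts, one per MMP write."""
    rows, columns = [], None
    for line in text.splitlines():
        fields = line.split()
        if not fields:
            continue
        if fields[0] == "txg":
            columns = fields  # column header line
            continue
        if columns is None or len(fields) != len(columns):
            continue  # kstat preamble or a truncated row
        row = dict(zip(columns, fields))
        for key in INT_COLUMNS:
            row[key] = int(row[key])
        rows.append(row)
    return rows

def largest_gap(rows):
    """Return (gap_seconds, before_ts, after_ts) for the largest timestamp
    gap between consecutive rows; (0, None, None) if there is no gap."""
    best = (0, None, None)
    for prev, cur in zip(rows, rows[1:]):
        gap = cur["timestamp"] - prev["timestamp"]
        if gap > best[0]:
            best = (gap, prev["timestamp"], cur["timestamp"])
    return best

def writes_per_vdev(rows):
    """Count recorded MMP writes per vdev_path."""
    return Counter(row["vdev_path"] for row in rows)
```

A gap from `largest_gap` that approaches the suspend window, or a vdev missing from `writes_per_vdev`, would be the first things to look at.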

behlendorf added this to the 0.8.0 milestone Feb 9, 2018
@arturpzol

@behlendorf logs are below:

Feb  9 22:22:29 [kern.warning] [  377.489617] WARNING: MMP writes to pool 'Pool-0' have not succeeded in over 5s; suspending pool
Feb  9 22:22:29 [kern.warning] [  377.489619] WARNING: Pool 'Pool-0' has encountered an uncorrectable I/O failure and has been suspended.
 zpool status 
  pool: Pool-0
 state: DEGRADED
status: One or more devices are faulted in response to IO failures.
action: Make sure the affected devices are connected, then run 'zpool clear'.
   see: http://zfsonlinux.org/msg/ZFS-8000-HC
  scan: none requested
config:

	NAME                                  STATE     READ WRITE CKSUM
	Pool-0                                DEGRADED     0     0     0
	  mirror-0                            ONLINE       0     0     0
	    scsi-SQEMU_QEMU_HARDDISK_27105-4  ONLINE       0     0     0
	    scsi-SQEMU_QEMU_HARDDISK_27106-1  ONLINE       0     0     0
	  mirror-1                            ONLINE       0     0     0
	    scsi-SQEMU_QEMU_HARDDISK_27106-3  ONLINE       0     0     0
	    scsi-SQEMU_QEMU_HARDDISK_27105-2  ONLINE       0     0     0
	  mirror-2                            ONLINE       0     0     0
	    scsi-SQEMU_QEMU_HARDDISK_27106-2  ONLINE       0     0     0
	    scsi-SQEMU_QEMU_HARDDISK_27105-3  ONLINE       0     0     0
	  mirror-3                            DEGRADED     0     0     0
	    scsi-SQEMU_QEMU_HARDDISK_27105-5  ONLINE       0     0     0
	    scsi-SQEMU_QEMU_HARDDISK_27106-6  UNAVAIL      0     0     0
	  mirror-4                            ONLINE       0     0     0
	    scsi-SQEMU_QEMU_HARDDISK_27106-4  ONLINE       0     0     0
	    scsi-SQEMU_QEMU_HARDDISK_27105-6  ONLINE       0     0     0
	logs
	  mirror-5                            ONLINE       0     0     0
	    scsi-SQEMU_QEMU_HARDDISK_27106-5  ONLINE       0     0     0
	    scsi-SQEMU_QEMU_HARDDISK_27105-1  ONLINE       0     0     0

errors: No known data errors

One of the outputs captured when the suspend occurred:

root@192.168.251.29:~$ cat /proc/spl/kstat/zfs/Pool-0/multihost
42 0 0x01 100 6400 238845868803 382681319766
txg        timestamp  mmp_delay    vdev_guid                vdev_label vdev_path
48         1518211341 141080365    681564880262157605       1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
48         1518211341 140602769    10177757756453132442     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
48         1518211341 140135147    18057160725883311765     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-6-part1
48         1518211342 140135147    18341480158787420334     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
48         1518211342 159493356    17018430076091510625     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
48         1518211342 159241524    910253119764622067       1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
48         1518211342 158252001    10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
48         1518211342 157640315    18341480158787420334     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
48         1518211342 157640315    10177757756453132442     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
48         1518211342 160475326    14570255669995168516     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
48         1518211342 159845575    11256790852081755000     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
48         1518211342 159221078    9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
48         1518211342 158605657    1804647075253825155      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
48         1518211342 157989288    11256790852081755000     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
48         1518211342 157377318    9419569107561384512      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
48         1518211343 156782524    1804647075253825155      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
48         1518211343 156180460    11256790852081755000     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
48         1518211343 155579253    1804647075253825155      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
48         1518211343 154998647    10177757756453132442     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
48         1518211343 154409776    1804647075253825155      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
48         1518211343 153825170    910253119764622067       1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
48         1518211343 153243775    9419569107561384512      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
48         1518211343 152870550    10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
48         1518211343 151298354    10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
48         1518211343 150358968    18341480158787420334     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
48         1518211343 149808409    9419569107561384512      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
48         1518211343 149269959    18341480158787420334     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
48         1518211343 148724915    18341480158787420334     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
48         1518211344 148184659    9419569107561384512      0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
48         1518211344 147659366    910253119764622067       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
49         1518211344 147125962    910253119764622067       3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
49         1518211349 146602266    9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
49         1518211349 5285945086   14570255669995168516     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
49         1518211349 5245469164   15327618201285114584     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
49         1518211349 5204933327   18341480158787420334     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
49         1518211349 5164895650   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
49         1518211349 5125178499   14570255669995168516     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
49         1518211350 5085758106   18341480158787420334     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
49         1518211350 5085758106   681564880262157605       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
49         1518211350 5047271232   14570255669995168516     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
49         1518211350 5008469667   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
49         1518211350 4969963821   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
49         1518211350 4931759194   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
49         1518211350 4856144566   910253119764622067       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
49         1518211350 4818307779   910253119764622067       2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
49         1518211350 4781290391   10399206071513992966     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
50         1518211350 4744560266   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211350 4708137854   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211350 4671966245   910253119764622067       1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
51         1518211350 4636084906   9419569107561384512      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211351 4600522280   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211351 4565176822   9419569107561384512      0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211351 4530208415   9419569107561384512      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211351 4495369357   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211351 4460870918   15327618201285114584     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211351 4426644471   11256790852081755000     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211351 4392686738   15327618201285114584     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211351 4358994010   9419569107561384512      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211351 4325569786   15327618201285114584     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211351 4292395891   681564880262157605       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211351 4259484079   10399206071513992966     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211351 4226834454   15327618201285114584     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211352 4194438851   1804647075253825155      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211352 4130399981   11256790852081755000     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211352 4098756591   9419569107561384512      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211352 4067363836   681564880262157605       1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211352 4036215218   11256790852081755000     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211352 4005299404   910253119764622067       1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
51         1518211352 3974633108   10177757756453132442     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211352 3944211443   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211352 3914017710   15327618201285114584     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211352 3884064412   18341480158787420334     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211352 3854345317   18341480158787420334     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211352 3824858782   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211353 3795601537   17018430076091510625     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211353 3766578452   17018430076091510625     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211353 3737775463   681564880262157605       2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211353 3709194250   10177757756453132442     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211353 3680845714   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211353 3652711005   15327618201285114584     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211353 3624798870   18341480158787420334     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211353 3597104138   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211353 3569626482   11256790852081755000     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211353 3542363971   17018430076091510625     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211353 3515318891   11256790852081755000     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211353 3488477131   17018430076091510625     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211354 3461852221   10177757756453132442     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211354 3435430409   14570255669995168516     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211354 3409217657   18341480158787420334     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211354 3383202329   10399206071513992966     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211354 3357396639   681564880262157605       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211354 3331791677   10177757756453132442     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211354 3306392193   910253119764622067       2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
51         1518211354 3281181084   15327618201285114584     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211354 3256177600   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211354 3206742774   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211354 3182314190   1804647075253825155      0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
root@192.168.251.29:~$ cat /proc/spl/kstat/zfs/Pool-0/multihost
42 0 0x01 100 6400 238845868803 386972415152
txg        timestamp  mmp_delay    vdev_guid                vdev_label vdev_path
51         1518211351 4530208415   9419569107561384512      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211351 4495369357   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211351 4460870918   15327618201285114584     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211351 4426644471   11256790852081755000     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211351 4392686738   15327618201285114584     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211351 4358994010   9419569107561384512      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211351 4325569786   15327618201285114584     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211351 4292395891   681564880262157605       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211351 4259484079   10399206071513992966     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211351 4226834454   15327618201285114584     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211352 4194438851   1804647075253825155      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211352 4162297754   15327618201285114584     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211352 4130399981   11256790852081755000     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211352 4098756591   9419569107561384512      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211352 4067363836   681564880262157605       1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211352 4036215218   11256790852081755000     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211352 4005299404   910253119764622067       1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
51         1518211352 3974633108   10177757756453132442     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211352 3944211443   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211352 3914017710   15327618201285114584     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211352 3884064412   18341480158787420334     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211352 3854345317   18341480158787420334     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211352 3824858782   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211353 3795601537   17018430076091510625     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211353 3766578452   17018430076091510625     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211353 3737775463   681564880262157605       2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211353 3709194250   10177757756453132442     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211353 3680845714   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211353 3652711005   15327618201285114584     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211353 3624798870   18341480158787420334     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211353 3597104138   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211353 3542363971   17018430076091510625     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211353 3515318891   11256790852081755000     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211353 3488477131   17018430076091510625     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211354 3461852221   10177757756453132442     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211354 3435430409   14570255669995168516     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211354 3409217657   18341480158787420334     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211354 3383202329   10399206071513992966     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211354 3357396639   681564880262157605       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211354 3331791677   10177757756453132442     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211354 3306392193   910253119764622067       2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
51         1518211354 3281181084   15327618201285114584     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211354 3256177600   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211354 3231357820   9419569107561384512      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211354 3206742774   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211354 3182314190   1804647075253825155      0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211354 3158077733   14570255669995168516     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211355 3134030821   10177757756453132442     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211355 3110171043   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211355 3086498881   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211355 3063010652   14570255669995168516     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211355 3039705541   1804647075253825155      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211355 3016582512   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211355 2993636305   1804647075253825155      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211355 2970877314   11256790852081755000     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211355 2948288540   14570255669995168516     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211355 2925885527   15327618201285114584     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211355 2903646117   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211355 2881591844   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211356 2860129404   910253119764622067       1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
51         1518211356 2837980040   11256790852081755000     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211356 2816432986   17018430076091510625     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211356 2795059434   18341480158787420334     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211356 2752797229   10177757756453132442     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211356 2731920979   681564880262157605       1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211356 2711198321   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211356 2690641319   14570255669995168516     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211356 2670254207   14570255669995168516     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211356 2650012552   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211356 2629931514   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211356 2610009856   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211357 2590243961   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211357 2570633022   15327618201285114584     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211357 2551174925   681564880262157605       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211357 2531868937   15327618201285114584     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211357 2512712502   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211357 2493707821   18341480158787420334     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211357 2474850891   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211357 2456145419   18341480158787420334     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211357 2437577418   1804647075253825155      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211357 2419164183   18341480158787420334     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211357 2400884645   9419569107561384512      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211357 2382757411   17018430076091510625     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211358 2364765070   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211358 2346916021   10177757756453132442     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211358 2329205609   1804647075253825155      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211358 2311634034   11256790852081755000     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211358 2294195628   17018430076091510625     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211358 2276900903   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211358 2259732862   1804647075253825155      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211358 2242708945   17018430076091510625     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211358 2225812723   17018430076091510625     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211358 2209048575   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211358 2192414801   681564880262157605       3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211358 2175906817   1804647075253825155      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211359 2143286756   1804647075253825155      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211359 2127172165   1804647075253825155      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
root@192.168.251.29:~$ cat /proc/spl/kstat/zfs/Pool-0/multihost
42 0 0x01 100 6400 238845868803 388641128851
txg        timestamp  mmp_delay    vdev_guid                vdev_label vdev_path
51         1518211352 3854345317   18341480158787420334     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211352 3824858782   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211353 3795601537   17018430076091510625     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211353 3766578452   17018430076091510625     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211353 3737775463   681564880262157605       2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211353 3709194250   10177757756453132442     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211353 3680845714   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211353 3652711005   15327618201285114584     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211353 3624798870   18341480158787420334     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211353 3597104138   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211353 3569626482   11256790852081755000     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211353 3542363971   17018430076091510625     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211353 3515318891   11256790852081755000     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211353 3488477131   17018430076091510625     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211354 3461852221   10177757756453132442     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211354 3435430409   14570255669995168516     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211354 3409217657   18341480158787420334     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211354 3383202329   10399206071513992966     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211354 3357396639   681564880262157605       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211354 3331791677   10177757756453132442     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211354 3306392193   910253119764622067       2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
51         1518211354 3281181084   15327618201285114584     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211354 3256177600   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211354 3231357820   9419569107561384512      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211354 3206742774   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211354 3182314190   1804647075253825155      0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211354 3158077733   14570255669995168516     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211355 3134030821   10177757756453132442     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211355 3110171043   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211355 3086498881   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211355 3063010652   14570255669995168516     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211355 3016582512   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211355 2993636305   1804647075253825155      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211355 2970877314   11256790852081755000     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211355 2948288540   14570255669995168516     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211355 2925885527   15327618201285114584     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211355 2903646117   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211355 2881591844   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211356 2860129404   910253119764622067       1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
51         1518211356 2837980040   11256790852081755000     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211356 2816432986   17018430076091510625     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211356 2795059434   18341480158787420334     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211356 2773843302   681564880262157605       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211356 2752797229   10177757756453132442     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211356 2731920979   681564880262157605       1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211356 2711198321   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211356 2690641319   14570255669995168516     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211356 2670254207   14570255669995168516     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211356 2650012552   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211356 2629931514   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211356 2610009856   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211357 2590243961   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211357 2570633022   15327618201285114584     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211357 2551174925   681564880262157605       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211357 2531868937   15327618201285114584     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211357 2512712502   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211357 2493707821   18341480158787420334     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211357 2474850891   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211357 2456145419   18341480158787420334     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211357 2437577418   1804647075253825155      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211357 2419164183   18341480158787420334     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211357 2400884645   9419569107561384512      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211357 2382757411   17018430076091510625     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211358 2346916021   10177757756453132442     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211358 2329205609   1804647075253825155      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211358 2311634034   11256790852081755000     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211358 2294195628   17018430076091510625     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211358 2276900903   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211358 2259732862   1804647075253825155      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211358 2242708945   17018430076091510625     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211358 2225812723   17018430076091510625     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211358 2209048575   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211358 2192414801   681564880262157605       3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211358 2175906817   1804647075253825155      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211358 2159537583   681564880262157605       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211359 2143286756   1804647075253825155      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211359 2127172165   1804647075253825155      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211359 2111176732   11256790852081755000     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211359 2095305694   9419569107561384512      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211359 2079565435   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211359 2063939329   14570255669995168516     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211359 2048444422   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211359 2033061917   10177757756453132442     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211359 2017807655   15327618201285114584     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211359 2002663154   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211359 1987644006   10399206071513992966     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211359 1972740539   17018430076091510625     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211360 1957957689   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211360 1943282278   11256790852081755000     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211360 1928725157   18341480158787420334     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211360 1928725157   14570255669995168516     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211360 1914911053   11256790852081755000     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211360 1900571195   14570255669995168516     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211360 1886352701   1804647075253825155      0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211360 1872239664   910253119764622067       3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
51         1518211360 1844345981   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211360 1830557157   681564880262157605       1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
root@192.168.251.29:~$ cat /proc/spl/kstat/zfs/Pool-0/multihost
42 0 0x01 100 6400 238845868803 389681087719
txg        timestamp  mmp_delay    vdev_guid                vdev_label vdev_path
51         1518211353 3488477131   17018430076091510625     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211354 3461852221   10177757756453132442     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211354 3435430409   14570255669995168516     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211354 3409217657   18341480158787420334     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211354 3383202329   10399206071513992966     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211354 3357396639   681564880262157605       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211354 3331791677   10177757756453132442     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211354 3306392193   910253119764622067       2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
51         1518211354 3281181084   15327618201285114584     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211354 3256177600   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211354 3231357820   9419569107561384512      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211354 3206742774   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211354 3182314190   1804647075253825155      0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211354 3158077733   14570255669995168516     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211355 3134030821   10177757756453132442     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211355 3110171043   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211355 3086498881   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211355 3063010652   14570255669995168516     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211355 3039705541   1804647075253825155      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211355 3016582512   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211355 2993636305   1804647075253825155      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211355 2970877314   11256790852081755000     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211355 2948288540   14570255669995168516     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211355 2925885527   15327618201285114584     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211355 2903646117   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211355 2881591844   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211356 2860129404   910253119764622067       1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
51         1518211356 2837980040   11256790852081755000     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211356 2816432986   17018430076091510625     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211356 2795059434   18341480158787420334     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211356 2773843302   681564880262157605       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211356 2731920979   681564880262157605       1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211356 2711198321   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211356 2690641319   14570255669995168516     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211356 2670254207   14570255669995168516     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211356 2650012552   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211356 2629931514   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211356 2610009856   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211357 2590243961   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211357 2570633022   15327618201285114584     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211357 2551174925   681564880262157605       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211357 2531868937   15327618201285114584     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211357 2512712502   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211357 2493707821   18341480158787420334     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211357 2474850891   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211357 2456145419   18341480158787420334     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211357 2437577418   1804647075253825155      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211357 2419164183   18341480158787420334     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211357 2400884645   9419569107561384512      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211357 2382757411   17018430076091510625     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211358 2364765070   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211358 2346916021   10177757756453132442     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211358 2329205609   1804647075253825155      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211358 2311634034   11256790852081755000     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211358 2294195628   17018430076091510625     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211358 2276900903   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211358 2259732862   1804647075253825155      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211358 2242708945   17018430076091510625     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211358 2225812723   17018430076091510625     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211358 2209048575   9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211358 2192414801   681564880262157605       3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211358 2175906817   1804647075253825155      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211358 2159537583   681564880262157605       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211359 2127172165   1804647075253825155      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211359 2111176732   11256790852081755000     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211359 2095305694   9419569107561384512      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211359 2079565435   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211359 2063939329   14570255669995168516     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211359 2048444422   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211359 2033061917   10177757756453132442     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-4-part1
51         1518211359 2017807655   15327618201285114584     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211359 2002663154   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211359 1987644006   10399206071513992966     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211359 1972740539   17018430076091510625     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211360 1957957689   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211360 1943282278   11256790852081755000     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211360 1928725157   18341480158787420334     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-4-part1
51         1518211360 1928725157   14570255669995168516     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211360 1914911053   11256790852081755000     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211360 1900571195   14570255669995168516     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
51         1518211360 1886352701   1804647075253825155      0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211360 1872239664   910253119764622067       3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
51         1518211360 1858239571   9419569107561384512      1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211360 1844345981   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211360 1830557157   681564880262157605       1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-2-part1
51         1518211360 1816880702   9419569107561384512      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211360 1803316408   10399206071513992966     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211361 1789848216   10399206071513992966     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211361 1776490035   9419569107561384512      0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
51         1518211361 1763242397   17018430076091510625     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211361 1750091511   1804647075253825155      2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-1-part1
51         1518211361 1737043471   10399206071513992966     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211361 1724092574   10399206071513992966     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
51         1518211361 1711248136   11256790852081755000     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-1-part1
51         1518211361 1698503891   17018430076091510625     2          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-5-part1
51         1518211361 1673318277   15327618201285114584     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
51         1518211361 1648430078   10399206071513992966     0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-5-part1
Feb  9 2018 22:22:29.547416339 resource.fs.zfs.statechange
        version = 0x0
        class = "resource.fs.zfs.statechange"
        pool = "Pool-0"
        pool_guid = 0x13397ca7eb4857cc
        pool_state = 0x0
        pool_context = 0x0
        vdev_guid = 0xfa97ec025e68d695
        vdev_state = "REMOVED" (0x3)
        vdev_path = "/dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-6-part1"
        vdev_laststate = "ONLINE" (0x7)
        time = 0x5a7e1115 0x20a0e913 
        eid = 0x30

Feb  9 2018 22:22:29.547416339 ereport.fs.zfs.vdev.unknown
        class = "ereport.fs.zfs.vdev.unknown"
        ena = 0x57d976f43500001
        detector = (embedded nvlist)
                version = 0x0
                scheme = "zfs"
                pool = 0x13397ca7eb4857cc
                vdev = 0xfa97ec025e68d695
        (end detector)
        pool = "Pool-0"
        pool_guid = 0x13397ca7eb4857cc
        pool_state = 0x0
        pool_context = 0x0
        pool_failmode = "wait"
        vdev_guid = 0xfa97ec025e68d695
        vdev_type = "disk"
        vdev_path = "/dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-6-part1"
        vdev_ashift = 0xc
        vdev_complete_ts = 0x56994a601a
        vdev_delta_ts = 0x1d810
        vdev_read_errors = 0x0
        vdev_write_errors = 0x0
        vdev_cksum_errors = 0x0
        parent_guid = 0x60de6e008bcba99c
        parent_type = "mirror"
        vdev_spare_paths = 
        vdev_spare_guids = 
        prev_state = 0x1
        time = 0x5a7e1115 0x20a0e913 
        eid = 0x31

Feb  9 2018 22:22:29.547416339 resource.fs.zfs.statechange
        version = 0x0
        class = "resource.fs.zfs.statechange"
        pool = "Pool-0"
        pool_guid = 0x13397ca7eb4857cc
        pool_state = 0x0
        pool_context = 0x0
        vdev_guid = 0xfa97ec025e68d695
        vdev_state = "UNAVAIL" (0x4)
        vdev_path = "/dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-6-part1"
        vdev_laststate = "REMOVED" (0x3)
        time = 0x5a7e1115 0x20a0e913 
        eid = 0x32

Feb  9 2018 22:22:29.547416339 ereport.fs.zfs.io_failure
        class = "ereport.fs.zfs.io_failure"
        ena = 0x57d979f6d800001
        detector = (embedded nvlist)
                version = 0x0
                scheme = "zfs"
                pool = 0x13397ca7eb4857cc
        (end detector)
        pool = "Pool-0"
        pool_guid = 0x13397ca7eb4857cc
        pool_state = 0x0
        pool_context = 0x0
        pool_failmode = "wait"
        time = 0x5a7e1115 0x20a0e913 
        eid = 0x33

@arturpzol

I tested 891b2e7 and abd17be. With those commits the suspend is very hard to reproduce with my test, but it is still easy to trigger when zfs_multihost_fail_intervals is smaller than the default. The new default value and the MMP delay logic changes look promising and reduce the risk of a suspend.

Is there any chance these commits will be merged to master soon?

@ofaaland
Contributor

@arturpzol I'm working on patches to make the multihost history more useful, and using those to investigate the issue with a removed disk triggering suspend via MMP. I'll get those pushed where you can get them; they should help clarify what's going on.

I see your test involves removing a physical disk. Are you able to reproduce just by offlining or detaching a disk via zpool offline or zpool detach (do this on your test environment, not your production system)?

It seems from what you've described that there's a bug in the way MMP handles vdev changes that needs to be fixed, regardless of any changes to the suspend timeouts/import delay. Also, there are concerns with those two patches: not that they don't work, but that the import time is unbounded. So it's not up to me, but I do not believe those patches are ready to be merged to master.

I'm sorry this isn't going faster; we're doing the best we can.

@ofaaland
Contributor

  48         1518211344 148184659    9419569107561384512      0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
  48         1518211344 147659366    910253119764622067       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
  49         1518211344 147125962    910253119764622067       3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
  49         1518211349 146602266    9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
**49         1518211349 5285945086   14570255669995168516     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
  49         1518211349 5245469164   15327618201285114584     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1

The mmp write marked with asterisks is after the pool was suspended; the timestamp 1518211349 matches the time of the zfs.io_failure event.

Feb  9 2018 22:22:29.547416339 ereport.fs.zfs.io_failure
        class = "ereport.fs.zfs.io_failure"
        ena = 0x57d979f6d800001
        detector = (embedded nvlist)
                version = 0x0
                scheme = "zfs"
                pool = 0x13397ca7eb4857cc
        (end detector)
        pool = "Pool-0"
        pool_guid = 0x13397ca7eb4857cc
        pool_state = 0x0
        pool_context = 0x0
        pool_failmode = "wait"
        time = 0x5a7e1115 0x20a0e913 
        eid = 0x33
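
The match can be checked by decoding the event's time field, which appears to hold seconds and nanoseconds since the epoch in hex. A minimal Python sketch, assuming that two-value layout (the helper name decode_event_time is made up for illustration):

```python
def decode_event_time(field):
    """Decode a 'time = 0xSEC 0xNSEC' value into (seconds, nanoseconds)."""
    sec_hex, nsec_hex = field.split()
    # int(..., 16) accepts the leading "0x" prefix
    return int(sec_hex, 16), int(nsec_hex, 16)


# Value copied from the io_failure event above
event_sec, event_nsec = decode_event_time("0x5a7e1115 0x20a0e913")

# Timestamp column of the multihost row marked with asterisks
kstat_timestamp = 1518211349

print(event_sec, event_nsec)
print("same second as the marked mmp write:", event_sec == kstat_timestamp)
```

The nanosecond part should also agree with the fractional seconds printed in the event header line.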

At that point mmp_delay was over 5 seconds, when it had been 141ms prior to that. And the most recent mmp write was 5 seconds before that. So something caused the mmp thread to stop attempting to write for 5 seconds (multihost history shows each attempt whether it was successful or not).
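
One way to spot such a stall in the multihost history is to look for jumps in the timestamp column between consecutive rows. A rough Python sketch, assuming the six-column layout shown above (txg, timestamp, mmp_delay, vdev_guid, vdev_label, vdev_path); the helper names and the 2-second threshold are arbitrary choices for illustration:

```python
# Rows copied from the excerpt above (the asterisk marker is stripped while parsing)
ROWS = """\
  48         1518211344 148184659    9419569107561384512      0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
  48         1518211344 147659366    910253119764622067       0          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
  49         1518211344 147125962    910253119764622067       3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-3-part1
  49         1518211349 146602266    9419569107561384512      3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-3-part1
**49         1518211349 5285945086   14570255669995168516     1          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27106-2-part1
  49         1518211349 5245469164   15327618201285114584     3          /dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27105-6-part1
"""


def parse_rows(text):
    """Return one dict per data row; header and malformed lines are skipped."""
    rows = []
    for line in text.splitlines():
        fields = line.lstrip("* ").split()
        if len(fields) != 6 or not fields[0].isdigit():
            continue
        txg, timestamp, mmp_delay_ns = (int(f) for f in fields[:3])
        rows.append({
            "txg": txg,
            "timestamp": timestamp,
            "mmp_delay_ns": mmp_delay_ns,
            "path": fields[5],
        })
    return rows


def find_stalls(rows, min_gap_s=2):
    """List (previous_ts, current_ts, gap) where consecutive writes are far apart."""
    stalls = []
    for prev, cur in zip(rows, rows[1:]):
        gap = cur["timestamp"] - prev["timestamp"]
        if gap >= min_gap_s:
            stalls.append((prev["timestamp"], cur["timestamp"], gap))
    return stalls


rows = parse_rows(ROWS)
for prev_ts, cur_ts, gap in find_stalls(rows):
    print(f"no mmp write attempts between {prev_ts} and {cur_ts} ({gap} s)")
```

Since the timestamp column has one-second resolution, the reported gap is only accurate to about a second.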

The other ZFS events all have the same time stamp, including the first one that says vdev_state = "REMOVED" or similar. So something happened 5 seconds before that which caused the problem; perhaps 5 seconds before is when you actually caused the device removal. @arturpzol do you have a way to know if that is the case?

I'm guessing the process of detecting that the device was now gone and trying to gather data on it somehow holds up the mmp thread. I suspect that code has taken the config lock as writer and held it for 5 seconds. I'll look there next.

@arturpzol

@ofaaland thank you for your notes.

I have also triggered a pool suspend with zpool offline, and even with zpool online, but that depends on the I/O load.

@arturpzol do you have a way to know if that is the case?

Unfortunately I do not have the suspend logs from my previous note, so below are logs from zpool offline operations:


zpool offline Pool-0 scsi-SQEMU_QEMU_HARDDISK_27147-4 scsi-SQEMU_QEMU_HARDDISK_27146-2 scsi-SQEMU_QEMU_HARDDISK_27146-3 scsi-SQEMU_QEMU_HARDDISK_27147-3 scsi-SQEMU_QEMU_HARDDISK_27147-1 scsi-SQEMU_QEMU_HARDDISK_27147-5
cannot offline scsi-SQEMU_QEMU_HARDDISK_27147-1: pool I/O is currently suspended
cannot offline scsi-SQEMU_QEMU_HARDDISK_27147-5: pool I/O is currently suspended
zpool status
  pool: Pool-0
 state: DEGRADED
status: One or more devices are faulted in response to IO failures.
action: Make sure the affected devices are connected, then run 'zpool clear'.
   see: http://zfsonlinux.org/msg/ZFS-8000-HC
  scan: resilvered 21.6M in 0h0m with 0 errors on Thu Feb 15 12:00:18 2018
config:

        NAME                                  STATE     READ WRITE CKSUM
        Pool-0                                DEGRADED     0     0     0
          mirror-0                            DEGRADED     0     0     0
            scsi-SQEMU_QEMU_HARDDISK_27147-4  OFFLINE      0     0     0
            scsi-SQEMU_QEMU_HARDDISK_27146-6  ONLINE       0     0     0
          mirror-1                            DEGRADED     0     0     0
            scsi-SQEMU_QEMU_HARDDISK_27146-2  OFFLINE      0     0     0
            scsi-SQEMU_QEMU_HARDDISK_27147-6  ONLINE       0     0     0
          mirror-2                            DEGRADED     0     0     0
            scsi-SQEMU_QEMU_HARDDISK_27146-3  OFFLINE      0     0     0
            scsi-SQEMU_QEMU_HARDDISK_27147-2  ONLINE       0     0     0
          mirror-3                            DEGRADED     0     0     0
            scsi-SQEMU_QEMU_HARDDISK_27147-3  OFFLINE      0     0     0
            scsi-SQEMU_QEMU_HARDDISK_27146-5  ONLINE       0     0     0
          mirror-4                            ONLINE       0     0     0
            scsi-SQEMU_QEMU_HARDDISK_27147-1  ONLINE       0     0     0
            scsi-SQEMU_QEMU_HARDDISK_27146-4  ONLINE       0     0     0
        logs
          mirror-5                            ONLINE       0     0     0
            scsi-SQEMU_QEMU_HARDDISK_27147-5  ONLINE       0     0     0
            scsi-SQEMU_QEMU_HARDDISK_27146-1  ONLINE       0     0     0

zfs multihost history log is available at: https://pastebin.com/U2PR1fSL

zpool events -v
TIME                           CLASS
Feb 15 2018 12:01:24.196313524 resource.fs.zfs.statechange
        version = 0x0
        class = "resource.fs.zfs.statechange"
        pool = "Pool-0"
        pool_guid = 0x59f81b00cfe4f345
        pool_state = 0x0
        pool_context = 0x0
        vdev_guid = 0xfe43ca2576da0c07
        vdev_state = "OFFLINE" (0x2)
        vdev_path = "/dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27147-4-part1"
        vdev_laststate = "ONLINE" (0x7)
        time = 0x5a856884 0xbb381b4 
        eid = 0x59

Feb 15 2018 12:01:30.996313524 sysevent.fs.zfs.config_sync
        version = 0x0
        class = "sysevent.fs.zfs.config_sync"
        pool = "Pool-0"
        pool_guid = 0x59f81b00cfe4f345
        pool_state = 0x0
        pool_context = 0x0
        time = 0x5a85688a 0x3b6289b4 
        eid = 0x5a

Feb 15 2018 12:01:35.916313524 resource.fs.zfs.statechange
        version = 0x0
        class = "resource.fs.zfs.statechange"
        pool = "Pool-0"
        pool_guid = 0x59f81b00cfe4f345
        pool_state = 0x0
        pool_context = 0x0
        vdev_guid = 0xb86e229b480b3df8
        vdev_state = "OFFLINE" (0x2)
        vdev_path = "/dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27146-2-part1"
        vdev_laststate = "ONLINE" (0x7)
        time = 0x5a85688f 0x369dd5b4 
        eid = 0x5b

Feb 15 2018 12:01:48.036313524 sysevent.fs.zfs.config_sync
        version = 0x0
        class = "sysevent.fs.zfs.config_sync"
        pool = "Pool-0"
        pool_guid = 0x59f81b00cfe4f345
        pool_state = 0x0
        pool_context = 0x0
        time = 0x5a85689c 0x22a19b4 
        eid = 0x5c

Feb 15 2018 12:01:48.876313524 resource.fs.zfs.statechange
        version = 0x0
        class = "resource.fs.zfs.statechange"
        pool = "Pool-0"
        pool_guid = 0x59f81b00cfe4f345
        pool_state = 0x0
        pool_context = 0x0
        vdev_guid = 0x4f400d135c30458f
        vdev_state = "OFFLINE" (0x2)
        vdev_path = "/dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27146-3-part1"
        vdev_laststate = "ONLINE" (0x7)
        time = 0x5a85689c 0x343b7bb4 
        eid = 0x5d

Feb 15 2018 12:01:54.756313524 sysevent.fs.zfs.config_sync
        version = 0x0
        class = "sysevent.fs.zfs.config_sync"
        pool = "Pool-0"
        pool_guid = 0x59f81b00cfe4f345
        pool_state = 0x0
        pool_context = 0x0
        time = 0x5a8568a2 0x2d146db4 
        eid = 0x5e

Feb 15 2018 12:02:02.556313524 resource.fs.zfs.statechange
        version = 0x0
        class = "resource.fs.zfs.statechange"
        pool = "Pool-0"
        pool_guid = 0x59f81b00cfe4f345
        pool_state = 0x0
        pool_context = 0x0
        vdev_guid = 0x3df04e55259c0c16
        vdev_state = "OFFLINE" (0x2)
        vdev_path = "/dev/disk/by-id/scsi-SQEMU_QEMU_HARDDISK_27147-3-part1"
        vdev_laststate = "ONLINE" (0x7)
        time = 0x5a8568aa 0x2128abb4 
        eid = 0x5f

Feb 15 2018 12:02:02.636313524 ereport.fs.zfs.io_failure
        class = "ereport.fs.zfs.io_failure"
        ena = 0x7c53c7f94500001
        detector = (embedded nvlist)
                version = 0x0
                scheme = "zfs"
                pool = 0x59f81b00cfe4f345
        (end detector)
        pool = "Pool-0"
        pool_guid = 0x59f81b00cfe4f345
        pool_state = 0x0
        pool_context = 0x0
        pool_failmode = "wait"
        time = 0x5a8568aa 0x25ed5fb4 
        eid = 0x60

Feb 15 2018 12:02:05.686313524 sysevent.fs.zfs.config_sync
        version = 0x0
        class = "sysevent.fs.zfs.config_sync"
        pool = "Pool-0"
        pool_guid = 0x59f81b00cfe4f345
        pool_state = 0x0
        pool_context = 0x0
        time = 0x5a8568ad 0x28e85034 
        eid = 0x61

The kernel logs below were produced with a small code change added to make debugging easier:

+++ mmp.c       (working copy)
@@ -26,6 +26,7 @@
 #include <sys/spa_impl.h>
+#include <sys/time.h>
 #include <sys/vdev.h>
@@ -438,6 +439,11 @@
                 */
                if (!suspended && mmp_fail_intervals && multihost &&
                    (start - mmp->mmp_last_write) > max_fail_ns) {
+                       cmn_err(CE_WARN, "MMP writes to pool '%s' have not "
+                           "succeeded in over %llu - %llu; suspending pool",
+                           spa_name(spa),
+                           (start - mmp->mmp_last_write),
+                           max_fail_ns);
                        zio_suspend(spa, NULL);
                }

[  534.123616] WARNING: MMP writes to pool 'Pool-0' have not succeeded in over 7881101041 - 5000000000; suspending pool
[  534.123620] WARNING: Pool 'Pool-0' has encountered an uncorrectable I/O failure and has been suspended.
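
For reference, the check that fired here can be restated outside the kernel. The following Python sketch mirrors the condition in the patch above. How max_fail_ns is computed is not shown in the diff, so the sketch assumes it is zfs_multihost_fail_intervals multiplied by zfs_multihost_interval (in milliseconds); with values of 5 and 1000 that would give the 5000000000 ns seen in the log. The function names are hypothetical.

```python
NSEC_PER_MSEC = 1_000_000


def max_fail_ns(fail_intervals, interval_ms):
    """Assumed threshold: fail_intervals * interval, converted to nanoseconds."""
    return fail_intervals * interval_ms * NSEC_PER_MSEC


def should_suspend(elapsed_ns, fail_intervals, interval_ms,
                   multihost=True, suspended=False):
    """Mirror of: !suspended && mmp_fail_intervals && multihost &&
    (start - mmp_last_write) > max_fail_ns"""
    if suspended or not multihost or fail_intervals == 0:
        return False
    return elapsed_ns > max_fail_ns(fail_intervals, interval_ms)


# Values taken from the kernel log line above
elapsed_ns = 7881101041
print(max_fail_ns(5, 1000))                  # assumed settings: 5 intervals of 1000 ms
print(should_suspend(elapsed_ns, 5, 1000))
print(should_suspend(elapsed_ns, 0, 1000))   # fail_intervals=0 disables the check
```

With fail_intervals set to 0 the condition can never be true, which is consistent with zfs_multihost_fail_intervals=0 avoiding the suspend.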

@arturpzol

@ofaaland should I repeat the test with zfs 0.7.7?

Is there any chance of a fix for the unexpected suspend on vdev state changes, or is zfs_multihost_fail_intervals=0 the only solution for now?

@ofaaland
Contributor

@arturpzol, sorry, I do not expect that the patches in 0.7.7 fixed your issue. I'm still working on it, but do not have a working fix yet.

@behlendorf
Contributor

Closing. I believe this was resolved in 0.7.7 by c30e716.

@morphinz
Author

@behlendorf I will test again when I have free time.
