Affected versions: 3.1.4.2, 3.1.5
Fix version: 4.0.1
The workarounds listed in the bug details for SUP-664 are as follows:
- Disable zfs-diagnosis in FMA to mitigate the risk of more disks being classified as failed while the disks resilver, then start a scrub. From bash:
#fmadm unload zfs-diagnosis
#zpool scrub <pool name>
- Once the scrub has completed and the disks have resilvered, reload zfs-diagnosis:
#fmadm load zfs-diagnosis
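The unload, scrub, and reload sequence above can be sketched as a small shell helper. This is a minimal sketch and not part of the SUP-664 bug details: the function names scan_in_progress and run_workaround and the 60-second poll interval are hypothetical, and the check assumes that zpool status reports a running scrub or resilver on a "scan:" line containing "in progress". Verify that against the output on the affected release before relying on it.

```shell
# Hypothetical helpers; the names are illustrative, not from SUP-664.

# Reads `zpool status` output on stdin and succeeds while a scrub or
# resilver is still running. Assumes a "scan: ... in progress" line.
scan_in_progress() {
    grep -q 'scan: .* in progress'
}

# Runs the documented steps in order for the pool named in $1.
# $2 is the poll interval in seconds (arbitrary default of 60).
run_workaround() {
    pool=$1
    interval=${2:-60}
    fmadm unload zfs-diagnosis || return 1
    # If the scrub cannot start, still wait for any scan that is
    # already running so that zfs-diagnosis is always reloaded.
    zpool scrub "$pool" || echo "zpool scrub failed for $pool" >&2
    while zpool status "$pool" | scan_in_progress; do
        sleep "$interval"
    done
    fmadm load zfs-diagnosis
}
```

Nothing runs until run_workaround is called with a pool name, for example run_workaround <pool name>.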
Relax the number of timeouts and the size of the window in which timeouts are checked.
Take a checkpoint, then add the following to /etc/system:
#Added as part of a workaround for SUP-664
set zfs:zio_min_timeout_ms = 30000
set zfs:zio_max_timeout_ms = 30000
A reboot is required for these values to take effect; schedule it for your next available maintenance window.
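Before rebooting, it may be worth confirming that both entries are present in /etc/system. The helper below is a minimal sketch: the name has_sup664_tunables is hypothetical, and it only recognises the exact "set zfs:<name> = 30000" form shown above.

```shell
# Hypothetical helper: succeeds only if both SUP-664 tunables are set
# to 30000 in the file given as $1 (defaults to /etc/system).
has_sup664_tunables() {
    file=${1:-/etc/system}
    for name in zio_min_timeout_ms zio_max_timeout_ms; do
        # Optional spaces around "=" and at end of line are tolerated.
        grep -q "^set zfs:${name} *= *30000 *\$" "$file" || return 1
    done
}
```

Called with no argument, has_sup664_tunables checks /etc/system and returns a non-zero status if either line is missing.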
To set the values dynamically on the running system:
echo zio_min_timeout_ms/W0t30000 | mdb -kw
echo zio_max_timeout_ms/W0t30000 | mdb -kw
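The live values can be read back with mdb to confirm that the writes took effect. This is a sketch under assumptions: the helper names mdb_value and read_tunable are hypothetical, the /D format is chosen to match the 4-byte W write used above, and the parser assumes mdb prints the value as the last field of its last non-empty output line.

```shell
# Hypothetical helper: prints the last field of the last non-empty
# line read from stdin, where mdb is assumed to place the value.
mdb_value() {
    awk 'NF { v = $NF } END { print v }'
}

# Reads one tunable from the live kernel as a 4-byte decimal.
# Read-only: mdb is started without -w.
read_tunable() {
    echo "$1/D" | mdb -k | mdb_value
}
```

For example, read_tunable zio_min_timeout_ms is expected to print 30000 after the writes above, provided the output format matches the assumption.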
-----------------\
relevant messages
-----------------/
Sep 26 2013 13:46:41.049898727 ereport.fs.zfs.timeout
Sep 26 2013 13:46:41.049044013 ereport.fs.zfs.timeout
Sep 26 2013 13:46:41.049932158 ereport.fs.zfs.timeout
Sep 26 2013 13:46:41.049964307 ereport.fs.zfs.timeout
Sep 26 2013 13:46:41.049983021 ereport.fs.zfs.timeout
Sep 26 2013 13:46:41.048559945 ereport.fs.zfs.timeout
Sep 26 2013 13:46:41.050028063 ereport.fs.zfs.timeout
Sep 26 2013 12:09:37.070413763 ereport.io.scsi.cmd.disk.dev.rqs.derr
devid = id1,sd@n5000c5003010367f
Sep 26 2013 13:09:17.513816671 ereport.io.scsi.cmd.disk.dev.rqs.derr
devid = id1,sd@n5000c5003010368f
Sep 26 2013 13:09:35.036055029 ereport.io.scsi.cmd.disk.dev.rqs.derr
Sep 26 2013 13:46:41.049932158 ereport.fs.zfs.timeout
vdev_devid = id1,sd@n5000cca01b6a2c98/a
Sep 26 2013 13:46:41.049964307 ereport.fs.zfs.timeout
vdev_devid = id1,sd@n5000cca01b6a2d60/a
Sep 26 2013 13:46:41.049983021 ereport.fs.zfs.timeout
vdev_devid = id1,sd@n5000cca01b57940c/a
Sep 26 2013 13:46:41.048559945 ereport.fs.zfs.timeout
Sep 27 2013 08:09:16.584383276 ereport.io.scsi.cmd.disk.dev.rqs.derr
devid = id1,sd@n5000c5003010368f
Sep 27 2013 08:09:29.268499042 ereport.io.scsi.cmd.disk.dev.rqs.derr
devid = id1,sd@n5000c5003010367f
Sep 27 2013 09:09:15.997154935 ereport.io.scsi.cmd.disk.dev.rqs.derr
devid = id1,sd@n5000c5003010368f
Sep 27 2013 09:09:28.644663402 ereport.io.scsi.cmd.disk.dev.rqs.der
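
When triaging a longer log in the same format as the messages above, a per-class count can help separate the ZFS timeouts from the per-device SCSI errors. The helper below is a minimal sketch: the name count_ereports is hypothetical, and it assumes the ereport class is the last field of each timestamped line.

```shell
# Hypothetical helper: reads log lines on stdin and prints
# "<count> <class>" for each ereport class, sorted by class name.
# Lines whose last field is not an ereport class (devid lines) are skipped.
count_ereports() {
    awk '$NF ~ /^ereport\./ { n[$NF]++ }
         END { for (c in n) print n[c], c }' | sort -k2
}
```

The function reads from stdin, so a saved copy of the messages can be piped into it.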