-
We recently decided to move from Illumos (OpenIndiana, 5.11) to Linux (and ZFS on Linux / OpenZFS) for our storage systems. We're noticing that our disks report errors at a much higher rate than they did on the Illumos-based systems.
The disks we use are enterprise SSDs and HDDs, and SMART monitoring shows no issues with them. Moreover, as a matter of procedure, as soon as SMART shows the slightest issue with a disk, we proactively replace it with a new one.

Logging for the IO errors that we're seeing:
We've turned to the logs to track which errors were occurring, but they make it look like the physical disks are the culprit, as if it's unrelated to OpenZFS and our disks are simply failing. Yet, given that this started immediately after switching from Illumos to Linux on multiple machines, we'd like to rule out OpenZFS first; it feels more likely that tweaking ZFS will yield results than digging into the mpt3sas driver, which we find hard to believe is fundamentally different between Illumos and Linux. Besides, the machines are well-maintained, and machines of similar age that still run Illumos (quite a few) don't show this behaviour, which makes a physical cause (humidity, temperature, cabling, etc.) unlikely. Similar things happen to the HDDs, although their error rate seems somewhat lower (but still much higher than on Illumos).

SMART information about this disk in particular:
We tried changing some parameters, like lowering the

I'm not sure whether collecting perf data is sensible (or how we would match it with the moments the errors occur), or whether there are other tools to resort to when this kind of "random" behaviour occurs. If anyone can point me in the right direction (or to a book / blog / article) on tracing situations like these, I'm happy to learn.

Some more information about the SAS controller and the device (`sas3ircu 0 DISPLAY`):
ZFS `zpool events -v`:
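To line events like these up with the kernel log, one rough approach (only a sketch, assuming a systemd journal is available; the time window is illustrative) is to watch or query both sides over the same interval:

```sh
# follow ZFS events as they are generated, with full details and timestamps
zpool events -vf

# in another terminal, follow the kernel log at the same time
journalctl -kf

# or, after the fact, pull both views for the same window
zpool events -v | tail -n 50
journalctl -k --since "30 minutes ago"
```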
Update: after creating this post, we upgraded the OpenZFS version on the machine and continued testing; we also enabled debugging and have the following logging. Are there (still) "notorious" problems between the LSI SAS 2008 / 3008 HBA controllers and OpenZFS?

Kernel logging of some write errors that occurred:
-
In the meantime, I enabled debug logging on
to see whether, if another error occurs, anything precedes it.
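For reference, a minimal sketch of how driver-level debug logging can be raised on Linux, assuming this refers to the mpt3sas driver; the exact sysfs path and the bitmask value are assumptions to verify against the driver documentation:

```sh
# raise the mpt3sas logging level for one adapter at runtime
# (hostN is the HBA's SCSI host; 0x3f8 is just an example bitmask)
echo 0x3f8 > /sys/class/scsi_host/host0/logging_level

# or set it as a module option so it survives reboots
echo "options mpt3sas logging_level=0x3f8" > /etc/modprobe.d/mpt3sas-debug.conf

# the extra detail ends up in the kernel log
dmesg -w | grep -i mpt3sas
```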
-
Use the zfs PPA that backports newer versions, or upgrade to Ubuntu 22.04. You're using an ancient build of ZFS that no longer receives basic fixes.
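For example (a sketch; the PPA name is a placeholder for whichever backport source you trust):

```sh
# check the userland and kernel-module versions currently in use
zfs version
cat /sys/module/zfs/version

# option 1: install newer OpenZFS packages from a backport PPA (placeholder name)
sudo add-apt-repository ppa:<zfs-backport-ppa>
sudo apt update && sudo apt install zfsutils-linux zfs-dkms

# option 2: upgrade the whole system to Ubuntu 22.04, which ships a newer OpenZFS
sudo do-release-upgrade
```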
-
@maxboone Were these controllers already flashed to IT mode? If not, it would still be interesting why the same setup worked better on illumos. You could experiment with the firmware, as @bghira hinted.
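A quick way to check what the HBA is running versus what the kernel pairs with it (sketch; controller index 0 assumed, matching the `sas3ircu 0 DISPLAY` output above):

```sh
# list the controllers and dump firmware/BIOS details for the first one
sas3ircu LIST
sas3ircu 0 DISPLAY | grep -iE 'firmware|bios|controller type'

# the mpt3sas driver version Linux is using with that firmware
modinfo mpt3sas | grep -i '^version'
dmesg | grep -i mpt3sas | head
```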
-
So, first: are they all the same model of SSD/HBA/firmware in each machine, etc.? My guess would be some feature that illumos doesn't feel the need to implement but Linux does, or some I/O pattern differences. Those errors look like "SSD stopped responding for 30 seconds, so we aborted the IO" - does illumos perhaps default to a different timeout length, or not treat a single timeout as a fatal error to bubble up the OS stack?
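On Linux, the 30-second window mentioned here corresponds to the per-device SCSI command timeout, which can be inspected and raised as an experiment (sketch; `sdX` and the 60-second value are placeholders):

```sh
# current command timeout for one disk (Linux defaults to 30 seconds)
cat /sys/block/sdX/device/timeout

# temporarily raise it to see whether the aborts disappear
echo 60 > /sys/block/sdX/device/timeout

# note: this resets on reboot/hotplug; a udev rule or rc script is needed to make it stick
```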