lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite for Android: free password hash cracker in your pocket
[<prev] [next>] [day] [month] [year] [list]
Message-ID: <20170611154002.02c0dabb@Vantage.cJ>
Date:   Sun, 11 Jun 2017 15:40:02 -0400
From:   Jérôme Carretero <cJ-ko@...gloub.eu>
To:     黃清隆 <ching2048@...ca.com.tw>
Cc:     linux-scsi@...r.kernel.org, linux-kernel@...r.kernel.org,
        billion.wu@...ca.com.tw
Subject: arcmsr: abort device command message weirdness (LUN mismatch)

Hi Ching,


Context: when a drive finally failed in my JBOD array, I discovered that
the whole ARC1880X controller would timeout, disabling access to any drive,
which is kind of sad.
I've performed a firmware upgrade and added back the failing drive to see
what happens with a newer device firmware (to be continued).

While doing "cat /dev/${FAILING_DRIVE}", at some point the command fails
(as expected) and while looking at the system logs, I observed that there
were reports of 2 abort sequences initiated then completed, but the
completion message mentions a LUN that is not the one of the failing drive,
which is curious.

[  959.065760] arcmsr0: abort device command of scsi id = 0 lun = 4
[  961.804842] arcmsr0: abort device command of scsi id = 0 lun = 4

...

[  991.834471] arcmsr0: abort device command of scsi id = 0 lun = 0
[  991.840503] arcmsr0: scsi id = 0 lun = 0 ccb = '0xffff8808594a6b80' poll command abort successfully 
[  991.849675] arcmsr0: scsi id = 0 lun = 4 ccb = '0xffff880859424600' poll command abort successfully 
[  991.858869] arcmsr: executing bus reset eh.....num_resets = 0, num_aborts = 3 
[  991.866199] arcmsr0: executing hw bus reset .....
[ 1005.135825] Areca RAID Controller0: Model ARC-1880, F/W V1.54 2016-11-23
[ 1005.229790] arcmsr: scsi bus reset eh returns with success
[ 1019.145652] sd 0:0:0:4: [sdac] tag#0 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[ 1019.154095] sd 0:0:0:4: [sdac] tag#0 Sense Key : Medium Error [current] 
[ 1019.160876] sd 0:0:0:4: [sdac] tag#0 Add. Sense: Unrecovered read error
[ 1019.167512] sd 0:0:0:4: [sdac] tag#0 CDB: Read(10) 28 00 00 04 36 00 00 02 00 00
[ 1019.174920] blk_update_request: I/O error, dev sdac, sector 275968

(kernel 4.12.0-rc4-00310-g6b7ed4588ce6).


Regards,

-- 
Jérôme

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ