linux-kernel - Re: [bug report] scsi: SATA devices missing after FLR is triggered during HBA suspended

lists.openwall.net		lists / announce owl-users owl-dev john-users john-dev passwdqc-users yescrypt popa3d-users / oss-security kernel-hardening musl sabotage tlsify passwords / crypt-dev xvendor / Bugtraq Full-Disclosure linux-kernel linux-netdev linux-ext4 linux-hardening linux-cve-announce PHC
Open Source and information security mailing list archives
Hash Suite for Android: free password hash cracker in your pocket
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <f27d6fa7-3088-0e60-043e-e71232066b12@huawei.com>
Date: Sat, 22 Jun 2024 11:31:29 +0800
From: Yihang Li <liyihang9@...wei.com>
To: Damien Le Moal <dlemoal@...nel.org>
CC: <cassel@...nel.org>, <James.Bottomley@...senPartnership.com>,
	<martin.petersen@...cle.com>, <john.g.garry@...cle.com>,
	<yanaijie@...wei.com>, <linux-kernel@...r.kernel.org>,
	<linux-scsi@...r.kernel.org>, <linuxarm@...wei.com>,
	<chenxiang66@...ilicon.com>, <prime.zeng@...wei.com>, <liyihang9@...wei.com>
Subject: Re: [bug report] scsi: SATA devices missing after FLR is triggered
 during HBA suspended

Hi Damien,

Thanks for your reply.

On 2024/6/19 7:11, Damien Le Moal wrote:
> On 6/18/24 22:29, Yihang Li wrote:
>> Hi Damien,
>>
>> I found out that two issues is caused by commit 0c76106cb975 ("scsi: sd:
>> Fix TCG OPAL unlock on system resume") and 626b13f015e0 ("scsi: Do not
>> rescan devices with a suspended queue").
>>
>> The two issues as follows for the situation that there are ATA disks
>> connected with SAS controller:
> 
> Which controller ? What is the driver ?

I'm using the hisi_sas_v3_hw driver and it supports HiSilicon's SAS controller.

> 
>> (1) FLR is triggered after all disks and controller are suspended. As a
>> result, the number of disks is abnormal.
> 
> I am assuming here that FLR means PCI "Function Level Reset" ?

Yes, I am talking about the PCI "Function Level Reset"

> FLR and disk/controller suspend execution timing are unrelated. FLR can be
> triggered at any time through sysfs. So please give details here. Why is FLR
> done when the system is being suspended ?

Yes, it is because FLR can be triggered at any time that we are testing the
reliability of executing FLR commands after disk/controller suspended.

Also, the system does not suspended because we have multiple controllers and
we only suspend one of them and the attached disk devices while the system is
running in the other controller.

> 
>> (2) After all disks and controller are suspended, and resuming all disks
>> again, the driver reference counting is not 0 (The value of "Used" in the
>> lsmod command output is not 0).
> 
> Resuming all disks again ? So you mean system resume ?
> Are we talking about system suspend to ram ? Hybernation ? or something else ?
> (e.g. a controller reset through PCI FLR ?)

As mentioned earlier, we have multiple controllers, only suspend one of them and
the attached data disks, and then resuming the disks again.

> 
> Please clarify exactly what your adapter is and the full procedure you do to
> trigger the issue so that we can try to recreate it.

The system has two HiSilicon's SAS controllers. Controller A is connected to the
system disk, and controller B is connected to multiple SATA disks.

The issue 1:
a. Suspend all disks on controller B.
b. Suspend controller B.
c. Trigger the PCI FLR on controller B through sysfs.
d. The SATA disks connected to controller B is disabled by libata layer.

kernel message is as follows:
[root@...alhost]# echo 1 > /sys/bus/pci/devices/0000:b4:02.0/reset		------> trigger PCI FLR
[  270.479991] hisi_sas_v3_hw 0000:b4:02.0: resuming from operating state [D0]	------> resuming SAS controller
[  271.819775] hisi_sas_v3_hw 0000:b4:02.0: waiting up to 25 seconds for 7 phys to resume
[  271.820324] hisi_sas_v3_hw 0000:b4:02.0: phyup: phy7 link_rate=10(sata)
[  271.835183] hisi_sas_v3_hw 0000:b4:02.0: phyup: phy0 link_rate=10(sata)
[  271.835199] hisi_sas_v3_hw 0000:b4:02.0: dev[8:5] found
[  271.835786] hisi_sas_v3_hw 0000:b4:02.0: phyup: phy5 link_rate=10(sata)
[  271.835791] hisi_sas_v3_hw 0000:b4:02.0: phyup: phy6 link_rate=10(sata)
[  271.846911] hisi_sas_v3_hw 0000:b4:02.0: phyup: phy4 link_rate=10(sata)
[  271.851676] hisi_sas_v3_hw 0000:b4:02.0: dev[9:5] found
[  271.851688] sas: Enter sas_scsi_recover_host busy: 0 failed: 0
[  271.851702] sas: ata5: end_device-6:0: dev error handler
[  271.851708] sas: ata6: end_device-6:1: dev error handler
[  271.851710] sas: ata7: end_device-6:2: dev error handler
[  271.851716] sas: ata8: end_device-6:3: dev error handler
[  271.851717] sas: ata9: end_device-6:4: dev error handler
[  271.851718] sas: ata10: end_device-6:5: dev error handler
[  271.855161] hisi_sas_v3_hw 0000:b4:02.0: phyup: phy1 link_rate=10(sata)
[  271.855547] hisi_sas_v3_hw 0000:b4:02.0: dev[10:5] found
[  271.855760] hisi_sas_v3_hw 0000:b4:02.0: phydown: phy7 phy_state=0x73
[  271.855763] hisi_sas_v3_hw 0000:b4:02.0: ignore flutter phy7 down
[  271.899322] hisi_sas_v3_hw 0000:b4:02.0: phyup: phy3 link_rate=11
[  271.902737] hisi_sas_v3_hw 0000:b4:02.0: dev[11:5] found
[  271.950079] hisi_sas_v3_hw 0000:b4:02.0: dev[12:5] found
[  271.955569] hisi_sas_v3_hw 0000:b4:02.0: dev[13:5] found
[  271.961037] hisi_sas_v3_hw 0000:b4:02.0: dev[14:1] found
[  271.961052] hisi_sas_v3_hw 0000:b4:02.0: end of resuming controller	------> end of resuming controller
[  271.973073] hisi_sas_v3_hw 0000:b4:02.0: FLR prepare			------> PCI FLR start
[  272.032623] hisi_sas_v3_hw 0000:b4:02.0: phyup: phy7 link_rate=10(sata)
[  272.039656] sas: sas_form_port: phy7 belongs to port0 already(1)!
[  272.201518] ata5.00: configured for UDMA/133
[  272.207713] sas: --- Exit sas_scsi_recover_host: busy: 0 failed: 0 tries: 1
[  272.217777] sas: Enter sas_scsi_recover_host busy: 0 failed: 0
[  272.227672] sas: ata5: end_device-6:0: dev error handler
[  272.227676] sas: ata6: end_device-6:1: dev error handler
[  272.227682] sas: ata7: end_device-6:2: dev error handler
[  272.227688] sas: ata8: end_device-6:3: dev error handler
[  272.227695] sas: ata10: end_device-6:5: dev error handler
[  272.227694] sas: ata9: end_device-6:4: dev error handler
[  274.888594] hisi_sas_v3_hw 0000:b4:02.0: phyup: phy7 link_rate=10(sata)
[  274.895614] hisi_sas_v3_hw 0000:b4:02.0: phyup: phy5 link_rate=10(sata)
[  274.895616] sas: sas_form_port: phy7 belongs to port0 already(1)!
[  274.900251] hisi_sas_v3_hw 0000:b4:02.0: phyup: phy0 link_rate=10(sata)
[  274.902647] sas: sas_form_port: phy5 belongs to port1 already(1)!
[  274.902833] hisi_sas_v3_hw 0000:b4:02.0: phyup: phy6 link_rate=10(sata)
[  274.903023] sas: sas_form_port: phy0 belongs to port2 already(1)!
[  274.914529] hisi_sas_v3_hw 0000:b4:02.0: phyup: phy1 link_rate=10(sata)
[  274.916099] sas: sas_form_port: phy6 belongs to port3 already(1)!
[  274.916259] hisi_sas_v3_hw 0000:b4:02.0: phyup: phy4 link_rate=10(sata)
[  274.916439] sas: sas_form_port: phy1 belongs to port5 already(1)!
[  274.961013] sas: sas_form_port: phy4 belongs to port4 already(1)!
[  274.967338] hisi_sas_v3_hw 0000:b4:02.0: phyup: phy3 link_rate=11
[  274.983663] sas: sas_form_port: phy3 belongs to port6 already(1)!
[  275.037230] hisi_sas_v3_hw 0000:b4:02.0: FLR done			------> PCI FLR done
[  275.037232] hisi_sas_v3_hw 0000:b4:02.0: phydown: phy0 phy_state=0xfa
[  275.049223] hisi_sas_v3_hw 0000:b4:02.0: ignore flutter phy0 down
[  275.204142] hisi_sas_v3_hw 0000:b4:02.0: phyup: phy0 link_rate=10(sata)
[  275.211001] sas: sas_form_port: phy0 belongs to port2 already(1)!
[  278.223079] hisi_sas_v3_hw 0000:b4:02.0: entering suspend state	------> the controller suspend again
[  280.527655] ata7.00: qc timeout after 5000 msecs (cmd 0x27)		------> revalidate ATA devices
[  280.535667] sas: sas_ata_internal_abort: Task 00000000682de2e7 already finished.
[  280.543483] ata7.00: failed to read native max address (err_mask=0x4)
[  280.551671] ata7.00: HPA support seems broken, skipping HPA handling
[  280.558317] ata7.00: revalidation failed (errno=-5)
[  280.563437] sas: Executing internal abort failed 5000000000000600 (-22)
[  280.571670] hisi_sas_v3_hw 0000:b4:02.0: I_T nexus reset: internal abort (-22)
[  280.579338] sas: ata7: end_device-6:2: Unable to reset ata device?
[  280.751675] sas: lldd_execute_task returned: -22
[  280.759664] ata7.00: failed to IDENTIFY (I/O error, err_mask=0x40)
[  280.766063] ata7.00: revalidation failed (errno=-5)
[  285.911663] sas: Executing internal abort failed 5000000000000600 (-22)
[  285.919667] hisi_sas_v3_hw 0000:b4:02.0: I_T nexus reset: internal abort (-22)
[  285.927353] sas: ata7: end_device-6:2: Unable to reset ata device?
[  286.095677] sas: lldd_execute_task returned: -22
[  286.103666] ata7.00: failed to IDENTIFY (I/O error, err_mask=0x40)
[  286.110078] ata7.00: revalidation failed (errno=-5)			------> revalidation failed due to the controller is suspend state
[  286.119424] ata7.00: disable device					------> disable device due to revalidation failed
[  286.123185] sas: --- Exit sas_scsi_recover_host: busy: 0 failed: 0 tries: 1
[  286.133236] sas: sas_resume_sata: for direct-attached device 5000000000000600 returned -19
...

The issue 2:
a. Suspend all disks on controller B.
b. Suspend controller B.
c. Resuming all disks on controller B.
d. Run the "lsmod" command to check the driver reference counting.

Thanks,
Yihang

> 
>> For the issue 1, After all disks and controller are suspended, FLR command
>> will resuming the controller and all sas ports. libsas layer will call
>> ata_sas_port_resume() to resume ata port and schedule EH to recover it.
>> In libata standard error handler ata_std_error_handler(), it will call ata
>> reset function, revalidate ATA devices and issue ATA device command
>> ATA_CMD_READ_NATIVE_MAX_EXT to read native max address. This command will
>> failed due to the controller enter suspend state again and libata disable
>> the device finally. The controller enter suspend state again because FLR
>> command completes and the runtime PM usage counter is 0.
>>
>> In commit 0c76106cb975 ("scsi: sd: Fix TCG OPAL unlock on system resume")
>> and 626b13f015e0 ("scsi: Do not rescan devices with a suspended queue"),
>> use blk_queue_pm_only() to check the device request queue state, if the
>> device request queue is not running, the device will not be rescanned.
>> Therefore, the runtime PM usage counter of the controller will not
>> increase so that the controller enters the suspended state again.
>>
>> For the issue 2, the cause is unknown.
>>
>> How to solve these two issues?
>>
>> regards,
>> Yihang
>>
>