[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <EE11001F9E5DDD47B7634E2F8A612F2E40BE2681@FRAEML521-MBX.china.huawei.com>
Date: Wed, 30 Aug 2017 08:38:31 +0000
From: Gabriele Paoloni <gabriele.paoloni@...wei.com>
To: Gabriele Paoloni <gabriele.paoloni@...wei.com>,
"helgaas@...nel.org" <helgaas@...nel.org>
CC: Linuxarm <linuxarm@...wei.com>,
"liudongdong (C)" <liudongdong3@...wei.com>,
"linux-pci@...r.kernel.org" <linux-pci@...r.kernel.org>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>
Subject: RE: [PATCH v2] PCIe AER: report uncorrectable errors only to the
functions that logged the errors
ping...
> -----Original Message-----
> From: Gabriele Paoloni
> Sent: 18 August 2017 12:02
> To: helgaas@...nel.org
> Cc: Gabriele Paoloni; Linuxarm; liudongdong (C); linux-
> pci@...r.kernel.org; linux-kernel@...r.kernel.org
> Subject: [PATCH v2] PCIe AER: report uncorrectable errors only to the
> functions that logged the errors
>
> Currently if an uncorrectable error is reported by an EP the AER
> driver walks over all the devices connected to the upstream port
> bus and in turns call the report_error_detected() callback.
> If any of the devices connected to the bus does not implement
> dev->driver->err_handler->error_detected() do_recovery() will fail
> leaving all the bus hierarchy devices unrecovered.
>
> However for non fatal errors the PCIe link should not be considered
> compromised, therefore it makes sense to report the error only to
> all the functions that logged an error.
> This patch implements this new behaviour for non fatal errors.
>
> Signed-off-by: Gabriele Paoloni <gabriele.paoloni@...wei.com>
> Signed-off-by: Dongdong Liu <liudongdong3@...wei.com>
> ---
> Changes from v1:
> - now errors are reported only to the fucntions that logged the
> error
> instead of all the functions in the same device.
> - the patch subject has changed to match the new implementation
> ---
> drivers/pci/pcie/aer/aerdrv_core.c | 9 ++++++++-
> 1 file changed, 8 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/pci/pcie/aer/aerdrv_core.c
> b/drivers/pci/pcie/aer/aerdrv_core.c
> index b1303b3..057465ad 100644
> --- a/drivers/pci/pcie/aer/aerdrv_core.c
> +++ b/drivers/pci/pcie/aer/aerdrv_core.c
> @@ -390,7 +390,14 @@ static pci_ers_result_t
> broadcast_error_message(struct pci_dev *dev,
> * If the error is reported by an end point, we think this
> * error is related to the upstream link of the end point.
> */
> - pci_walk_bus(dev->bus, cb, &result_data);
> + if (state == pci_channel_io_normal)
> + /*
> + * the error is non fatal so the bus is ok, just
> invoke
> + * the callback for the function that logged the
> error.
> + */
> + cb(dev, &result_data);
> + else
> + pci_walk_bus(dev->bus, cb, &result_data);
> }
>
> return result_data.result;
> --
> 2.7.4
>
Powered by blists - more mailing lists