Message-ID: <20250813182141.GA284875@bhelgaas>
Date: Wed, 13 Aug 2025 13:21:41 -0500
From: Bjorn Helgaas <helgaas@...nel.org>
To: Lukas Wunner <lukas@...ner.de>
Cc: Riana Tauro <riana.tauro@...el.com>,
Aravind Iddamsetty <aravind.iddamsetty@...ux.intel.com>,
"Sean C. Dardis" <sean.c.dardis@...el.com>,
Terry Bowman <terry.bowman@....com>,
Sathyanarayanan Kuppuswamy <sathyanarayanan.kuppuswamy@...ux.intel.com>,
Niklas Schnelle <schnelle@...ux.ibm.com>,
Linas Vepstas <linasvepstas@...il.com>,
Mahesh J Salgaonkar <mahesh@...ux.ibm.com>,
Oliver OHalloran <oohall@...il.com>,
Manivannan Sadhasivam <manivannan.sadhasivam@....qualcomm.com>,
linuxppc-dev@...ts.ozlabs.org, linux-pci@...r.kernel.org,
Shahed Shaikh <shshaikh@...vell.com>,
Manish Chopra <manishc@...vell.com>, GR-Linux-NIC-Dev@...vell.com,
Nilesh Javali <njavali@...vell.com>,
GR-QLogic-Storage-Upstream@...vell.com,
Edward Cree <ecree.xilinx@...il.com>, linux-net-drivers@....com,
James Smart <james.smart@...adcom.com>,
Dick Kennedy <dick.kennedy@...adcom.com>,
"James E.J. Bottomley" <James.Bottomley@...senpartnership.com>,
"Martin K. Petersen" <martin.petersen@...cle.com>,
linux-scsi@...smtp.subspace.kernel.org,
er.kernel.org@...ts.ozlabs.org, Andrew Lunn <andrew+netdev@...n.ch>,
"David S. Miller" <davem@...emloft.net>,
Eric Dumazet <edumazet@...gle.com>, Jakub Kicinski <kuba@...nel.org>,
Paolo Abeni <pabeni@...hat.com>, netdev@...r.kernel.org
Subject: Re: [PATCH 0/5] PCI: Reduce AER / EEH deviations

On Wed, Aug 13, 2025 at 07:11:00AM +0200, Lukas Wunner wrote:
> The kernel supports three different PCI error recovery mechanisms:
>
> * AER per PCIe r7.0 sec 6.2 (drivers/pci/pcie/aer.c + err.c)
> * EEH on PowerPC (arch/powerpc/kernel/eeh_driver.c)
> * zPCI on s390 (arch/s390/pci/pci_event.c)
>
> In theory, they should all follow Documentation/PCI/pci-error-recovery.rst
> to afford uniform behavior to drivers across platforms.
>
> In practice, there are deviations which this series seeks to reduce.
>
> One particular pain point is AER not allowing drivers to opt in to a
> Bus Reset on Non-Fatal Errors (patch [1/5]). EEH allows this and the
> "xe" graphics driver would like to take advantage of it on AER-capable
> platforms. Patches [2/5] to [4/5] address various other deviations,
> while patch [5/5] cleans up old gunk in code comments.
>
> I've gone through all drivers implementing pci_error_handlers to ascertain
> that no regressions are introduced by these changes. Nevertheless further
> reviewing and testing would be appreciated to raise the confidence.
> Thanks!
>
> Lukas Wunner (5):
>   PCI/AER: Allow drivers to opt in to Bus Reset on Non-Fatal Errors
>   PCI/ERR: Fix uevent on failure to recover
>   PCI/ERR: Notify drivers on failure to recover
>   PCI/ERR: Update device error_state already after reset
>   PCI/ERR: Remove remnants of .link_reset() callback
>
>  .../ethernet/qlogic/qlcnic/qlcnic_83xx_hw.c   |  1 -
>  .../net/ethernet/qlogic/qlcnic/qlcnic_main.c  |  2 -
>  drivers/net/ethernet/sfc/efx_common.c         |  3 --
>  drivers/net/ethernet/sfc/falcon/efx.c         |  3 --
>  drivers/net/ethernet/sfc/siena/efx_common.c   |  3 --
>  drivers/pci/pcie/err.c                        | 40 ++++++++++++++-----
>  drivers/scsi/lpfc/lpfc_init.c                 |  2 +-
>  drivers/scsi/qla2xxx/qla_os.c                 |  5 ---
>  8 files changed, 32 insertions(+), 27 deletions(-)

Applied to pci/aer for v6.18, thanks, Lukas!