Message-ID: <cover.1755008151.git.lukas@wunner.de>
Date: Wed, 13 Aug 2025 07:11:00 +0200
From: Lukas Wunner <lukas@...ner.de>
To: Bjorn Helgaas <helgaas@...nel.org>
Cc: Riana Tauro <riana.tauro@...el.com>,
Aravind Iddamsetty <aravind.iddamsetty@...ux.intel.com>,
"Sean C. Dardis" <sean.c.dardis@...el.com>,
Terry Bowman <terry.bowman@....com>,
Sathyanarayanan Kuppuswamy <sathyanarayanan.kuppuswamy@...ux.intel.com>,
Niklas Schnelle <schnelle@...ux.ibm.com>,
Linas Vepstas <linasvepstas@...il.com>,
"Mahesh J Salgaonkar" <mahesh@...ux.ibm.com>,
"Oliver OHalloran" <oohall@...il.com>,
Manivannan Sadhasivam <manivannan.sadhasivam@....qualcomm.com>,
linuxppc-dev@...ts.ozlabs.org, linux-pci@...r.kernel.org,
Shahed Shaikh <shshaikh@...vell.com>,
Manish Chopra <manishc@...vell.com>, GR-Linux-NIC-Dev@...vell.com,
Nilesh Javali <njavali@...vell.com>,
GR-QLogic-Storage-Upstream@...vell.com,
Edward Cree <ecree.xilinx@...il.com>, linux-net-drivers@....com,
James Smart <james.smart@...adcom.com>,
Dick Kennedy <dick.kennedy@...adcom.com>,
"James E.J. Bottomley" <James.Bottomley@...senPartnership.com>,
"Martin K. Petersen" <martin.petersen@...cle.com>,
linux-scsi@...smtp.subspace.kernel.org,
er.kernel.org@....codeaurora.org,
Andrew Lunn <andrew+netdev@...n.ch>,
"David S. Miller" <davem@...emloft.net>,
Eric Dumazet <edumazet@...gle.com>, Jakub Kicinski <kuba@...nel.org>,
Paolo Abeni <pabeni@...hat.com>, netdev@...r.kernel.org
Subject: [PATCH 0/5] PCI: Reduce AER / EEH deviations

The kernel supports three different PCI error recovery mechanisms:

* AER per PCIe r7.0 sec 6.2 (drivers/pci/pcie/aer.c + err.c)
* EEH on PowerPC (arch/powerpc/kernel/eeh_driver.c)
* zPCI on s390 (arch/s390/pci/pci_event.c)

In theory, they should all follow Documentation/PCI/pci-error-recovery.rst
to afford uniform behavior to drivers across platforms.

In practice, there are deviations which this series seeks to reduce.

One particular pain point is AER not allowing drivers to opt in to a
Bus Reset on Non-Fatal Errors (patch [1/5]). EEH allows this and the
"xe" graphics driver would like to take advantage of it on AER-capable
platforms. Patches [2/5] to [4/5] address various other deviations,
while patch [5/5] cleans up old gunk in code comments.
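
To illustrate the opt-in described above, here is a minimal userspace
sketch of the decision only. It is not kernel code and not taken from the
patches: every model_* name is hypothetical, the mmio_enabled and resume
steps are left out, and the rule "reset on Fatal Errors, or on Non-Fatal
Errors when the driver asks for it" is an assumption chosen to mirror the
behavior that patch [1/5] aims for.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-ins for a driver's recovery vote (not the kernel enum). */
enum model_result {
	MODEL_CAN_RECOVER,	/* driver can recover without a reset */
	MODEL_NEED_RESET,	/* driver asks for a reset */
	MODEL_DISCONNECT,	/* driver gives up on the device */
	MODEL_RECOVERED,	/* device is usable again */
};

enum model_severity { MODEL_NONFATAL, MODEL_FATAL };

/* Hypothetical, reduced analogue of a driver's error handler callbacks. */
struct model_driver {
	enum model_result (*error_detected)(enum model_severity sev);
	enum model_result (*slot_reset)(void);
};

struct model_outcome {
	bool bus_reset_done;
	enum model_result final;
};

/* Simplified recovery flow: ask the driver, reset if needed, report result. */
struct model_outcome model_recover(const struct model_driver *drv,
				   enum model_severity sev)
{
	struct model_outcome out = { false, MODEL_DISCONNECT };
	enum model_result vote;

	assert(drv != NULL && drv->error_detected != NULL);

	vote = drv->error_detected(sev);
	if (vote == MODEL_DISCONNECT)
		return out;	/* simplification: give up without a reset */

	if (sev == MODEL_FATAL || vote == MODEL_NEED_RESET) {
		/* Assumed rule: the driver's vote triggers a reset even if Non-Fatal. */
		out.bus_reset_done = true;
		vote = drv->slot_reset ? drv->slot_reset() : MODEL_DISCONNECT;
	} else {
		/* Non-Fatal and no reset requested: treat as recovered. */
		vote = MODEL_RECOVERED;
	}

	out.final = (vote == MODEL_RECOVERED) ? MODEL_RECOVERED
					      : MODEL_DISCONNECT;
	return out;
}

/* A driver that opts in to a reset on every error. */
static enum model_result optin_error_detected(enum model_severity sev)
{
	(void)sev;
	return MODEL_NEED_RESET;
}

/* A driver that only wants a reset on Fatal Errors. */
static enum model_result plain_error_detected(enum model_severity sev)
{
	return sev == MODEL_FATAL ? MODEL_NEED_RESET : MODEL_CAN_RECOVER;
}

static enum model_result common_slot_reset(void)
{
	return MODEL_RECOVERED;
}

const struct model_driver model_optin_driver = {
	optin_error_detected, common_slot_reset
};
const struct model_driver model_plain_driver = {
	plain_error_detected, common_slot_reset
};
```

Calling model_recover(&model_optin_driver, MODEL_NONFATAL) sets
bus_reset_done in this model, whereas model_plain_driver sees no reset for
the same Non-Fatal Error.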

I've gone through all drivers implementing pci_error_handlers to ascertain
that no regressions are introduced by these changes. Nevertheless, further
review and testing would be appreciated to raise confidence.

Thanks!

Lukas Wunner (5):
  PCI/AER: Allow drivers to opt in to Bus Reset on Non-Fatal Errors
  PCI/ERR: Fix uevent on failure to recover
  PCI/ERR: Notify drivers on failure to recover
  PCI/ERR: Update device error_state already after reset
  PCI/ERR: Remove remnants of .link_reset() callback

 .../ethernet/qlogic/qlcnic/qlcnic_83xx_hw.c   |  1 -
 .../net/ethernet/qlogic/qlcnic/qlcnic_main.c  |  2 -
 drivers/net/ethernet/sfc/efx_common.c         |  3 --
 drivers/net/ethernet/sfc/falcon/efx.c         |  3 --
 drivers/net/ethernet/sfc/siena/efx_common.c   |  3 --
 drivers/pci/pcie/err.c                        | 40 ++++++++++++++-----
 drivers/scsi/lpfc/lpfc_init.c                 |  2 +-
 drivers/scsi/qla2xxx/qla_os.c                 |  5 ---
 8 files changed, 32 insertions(+), 27 deletions(-)
--
2.47.2