[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID:
<CO1PR18MB46826E60C5512A113B5F8938A7F22@CO1PR18MB4682.namprd18.prod.outlook.com>
Date: Mon, 10 Feb 2025 17:54:43 +0000
From: Wilson Ding <dingwei@...vell.com>
To: Manivannan Sadhasivam <manivannan.sadhasivam@...aro.org>
CC: "cassel@...nel.org" <cassel@...nel.org>,
Bjorn Helgaas
<helgaas@...nel.org>,
"lpieralisi@...nel.org" <lpieralisi@...nel.org>,
"thomas.petazzoni@...tlin.com" <thomas.petazzoni@...tlin.com>,
"kw@...ux.com"
<kw@...ux.com>, "robh@...nel.org" <robh@...nel.org>,
"bhelgaas@...gle.com"
<bhelgaas@...gle.com>,
"linux-pci@...r.kernel.org"
<linux-pci@...r.kernel.org>,
"linux-arm-kernel@...ts.infradead.org"
<linux-arm-kernel@...ts.infradead.org>,
"linux-kernel@...r.kernel.org"
<linux-kernel@...r.kernel.org>,
Sanghoon Lee <salee@...vell.com>
Subject: Re: [PATCH 1/1] PCI: armada8k: Add link-down handle
> -----Original Message-----
> From: Manivannan Sadhasivam <manivannan.sadhasivam@...aro.org>
> Sent: Saturday, February 8, 2025 2:36 AM
> To: Wilson Ding <dingwei@...vell.com>
> Cc: cassel@...nel.org; Bjorn Helgaas <helgaas@...nel.org>;
> lpieralisi@...nel.org; thomas.petazzoni@...tlin.com; kw@...ux.com;
> robh@...nel.org; bhelgaas@...gle.com; linux-pci@...r.kernel.org; linux-
> arm-kernel@...ts.infradead.org; linux-kernel@...r.kernel.org; Sanghoon Lee
> <salee@...vell.com>
> Subject: Re: [EXTERNAL] Re: [PATCH 1/1] PCI: armada8k: Add link-down
> handle
>
> On Fri, Feb 07, 2025 at 06: 46: 22PM +0000, Wilson Ding wrote: > > > > -----
> Original Message----- > > From: Manivannan Sadhasivam
> <manivannan. sadhasivam@ linaro. org> > > Sent: Friday, February 7, 2025
> 9: 58 AM >
> On Fri, Feb 07, 2025 at 06:46:22PM +0000, Wilson Ding wrote:
> >
> >
> > > -----Original Message-----
> > > From: Manivannan Sadhasivam <manivannan.sadhasivam@...aro.org>
> > > Sent: Friday, February 7, 2025 9:58 AM
> > > To: Wilson Ding <dingwei@...vell.com>; cassel@...nel.org
> > > Cc: Bjorn Helgaas <helgaas@...nel.org>; lpieralisi@...nel.org;
> > > thomas.petazzoni@...tlin.com; kw@...ux.com; robh@...nel.org;
> > > bhelgaas@...gle.com; linux-pci@...r.kernel.org; linux-arm-
> > > kernel@...ts.infradead.org; linux-kernel@...r.kernel.org; Sanghoon
> > > Lee <salee@...vell.com>
> > > Subject: [EXTERNAL] Re: [PATCH 1/1] PCI: armada8k: Add link-down
> > > handle
> > >
> > > + Niklas (who was interested in link down handling) On Sat, Feb 01,
> > > + 2025 at 11: 05: 56PM +0000, Wilson Ding wrote: > > On November 13,
> 2024 3:
> > > + 02: 55 AM GMT+05: 30, Bjorn Helgaas > > <mailto: helgaas@ kernel.
> > > + org>
> > > + wrote: > >
> > >
> > > + Niklas (who was interested in link down handling)
> > >
> > > On Sat, Feb 01, 2025 at 11:05:56PM +0000, Wilson Ding wrote:
> > > > > On November 13, 2024 3:02:55 AM GMT+05:30, Bjorn Helgaas
> > > > > <mailto:helgaas@...nel.org> wrote:
> > > > > >In subject:
> > > > > >
> > > > > > PCI: armada8k: Add link-down handling
> > > > > >
> > > > > >On Mon, Nov 11, 2024 at 10:48:13PM -0800, Jenishkumar
> > > > > >Maheshbhai
> > > > > Patel wrote:
> > > > > >> In PCIE ISR routine caused by RST_LINK_DOWN we schedule work
> > > > > >> to handle the link-down procedure.
> > > > > >> Link-down procedure will:
> > > > > >> 1. Remove PCIe bus
> > > > > >> 2. Reset the MAC
> > > > > >> 3. Reconfigure link back up
> > > > > >> 4. Rescan PCIe bus
> > > > > >
> > > > > >s/PCIE/PCIe/
> > > > > >
> > > > > >Rewrap to fill 75 columns.
> > > > > >
> > > > > >I assume this basically removes a Root Port (and the hierarchy
> > > > > >below
> > > > > >it) if the link goes down, and then resets the MAC and tries to
> > > > > >bring up the link and enumerate the hierarchy again.
> > > > > >
> > > > > >No other drivers do this, so why does armada8k need it? Is
> > > > > >this to work around some unreliable link?
> > > > >
> > > > > Certainly Qcom IPs have this same feature and I was also looking
> > > > > to implement it. But the link down should not be handled by this
> > > > > in the
> > > controller driver.
> > > > >
> > > > > Instead, it should be tied to bus reset in the core and the
> > > > > reset should be done through a callback implemented in the
> > > > > controller drivers. This way, the reset cannot happen in the
> > > > > back of PCI core and client
> > > drivers.
> > > > >
> > > > > That said, the Link down IRQ received by this driver should also
> > > > > be propagated back to the PCI core and the core should then call
> > > > > the callback to reset the bus that I mentioned above.
> > > > >
> > > >
> > > > It's more than a work-around for the unreliable link. A few
> > > > customers may have such application - independent power supply to
> > > > the device with dedicated reset GPIO to #PRST. In this way, the
> > > > power cycle and warm reset of RC and EP won't have impact on each
> > > > other. However, it may lead into the PCI driver not aware of the
> > > > link down when an unexpected
> > > power down or reset occurs on the device.
> > > > We cannot assume the link will be recovered soon. The worse thing
> > > > is the driver may continue access to the device, which may hang the bus.
> > > > Since the device is no longer present on the bus, it's better to
> > > > remove it. Besides, in order to bring up the link, the only way is
> > > > to reset the MAC, which starts over the state machine of LTSSM.
> > > >
> > > > Well, we also noticed that there is no other driver that did this.
> > > > I agree it is not necessary if the power cycle or warm reset of
> > > > the device is done gracefully. The user can remove the device
> > > > prior to the power cycle/reset. And do the rescan after the link
> > > > is recovered. However,
> > > the unexpected power down is still possible.
> > > > Please enlighten me if there is any better approach to handle such
> > > > unexpected link down.
> > > >
> > >
> > > There is no issue in retraining the link. My concern is that, the
> > > retrain should not happen autonomously in the controller driver. PCI
> > > core should be made informed of it. More below.
> > >
> >
> > Do you mean
> > - pass the link down/up events to PCI core
> > - remove the device or hierarchy by PCI core upon link down
> > - initiate the link retraining in PCI core by calling the platform
> > retrain callbacks
> > - rescan the bus once link is recovered
> >
>
> Yeah. This is what I came up with quickly:
>
> ```
> diff --git a/drivers/pci/probe.c b/drivers/pci/probe.c index
> b6536ed599c3..561eeb464220 100644
> --- a/drivers/pci/probe.c
> +++ b/drivers/pci/probe.c
> @@ -706,6 +706,33 @@ void pci_free_host_bridge(struct pci_host_bridge
> *bridge) } EXPORT_SYMBOL(pci_free_host_bridge);
>
> +void pci_host_bridge_handle_link_down(struct pci_host_bridge *bridge) {
> + struct pci_bus *bus = bridge->bus;
> + struct pci_dev *child, *tmp;
> + int ret;
> +
> + pci_lock_rescan_remove();
> +
> + /* Knock the devices off bus since we cannot access them */
> + list_for_each_entry_safe(child, tmp, &bus->devices, bus_list)
> + pci_stop_and_remove_bus_device(child);
> +
> + /* Now retrain the link in a controller specific way to bring it back */
> + if (bus->ops->retrain_link) {
> + ret = bus->ops->retrain_link(bus);
> + if (ret) {
> + dev_err(&bridge->dev, "Failed to retrain the link!\n");
> + pci_unlock_rescan_remove();
> + return;
> + }
> + }
> +
> + pci_rescan_bus(bus);
> + pci_unlock_rescan_remove();
> +}
> +EXPORT_SYMBOL(pci_host_bridge_handle_link_down);
> +
> /* Indexed by PCI_X_SSTATUS_FREQ (secondary bus mode and frequency) */
> static const unsigned char pcix_bus_speed[] = {
> PCI_SPEED_UNKNOWN, /* 0 */
> diff --git a/include/linux/pci.h b/include/linux/pci.h index
> 47b31ad724fa..1c6f18a51bdd 100644
> --- a/include/linux/pci.h
> +++ b/include/linux/pci.h
> @@ -637,6 +637,7 @@ struct pci_host_bridge *pci_alloc_host_bridge(size_t
> priv); struct pci_host_bridge *devm_pci_alloc_host_bridge(struct device *dev,
> size_t priv); void pci_free_host_bridge(struct
> pci_host_bridge *bridge);
> +void pci_host_bridge_handle_link_down(struct pci_host_bridge *bridge);
> struct pci_host_bridge *pci_find_host_bridge(struct pci_bus *bus);
>
> void pci_set_host_bridge_release(struct pci_host_bridge *bridge, @@ -804,6
> +805,7 @@ struct pci_ops {
> void __iomem *(*map_bus)(struct pci_bus *bus, unsigned int devfn, int
> where);
> int (*read)(struct pci_bus *bus, unsigned int devfn, int where, int size, u32
> *val);
> int (*write)(struct pci_bus *bus, unsigned int devfn, int where, int size,
> u32 val);
> + int (*retrain_link)(struct pci_bus *bus);
> };
>
> /*
> ```
>
> Your controller driver has to call pci_host_bridge_handle_link_down() during
> the link down event (make it threaded if not done already). Then you should
> also populate the pci_ops::retrain_link() callback with the function that
> retrains the broken link. Finally, the bus will be rescanned to enumerate the
> devices.
>
> I do have plans to plug this retrain callback to one of the bus_reset() functions
> in the future so that we can bring the link back while doing bus level reset
> (uncorrectable AERs and such). But this will do the job for now.
>
> I will send a series on Monday with Qcom driver as a reference.
>
> - Mani
>
Sounds great. Looking forward to your patches. Thanks a lot!
> --
> மணிவண்ணன் சதாசிவம்
Powered by blists - more mailing lists