[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20100530173014.GB14556@basil.fritz.box>
Date: Sun, 30 May 2010 19:30:15 +0200
From: Andi Kleen <andi@...stfloor.org>
To: Michael Chan <mchan@...adcom.com>
Cc: 'Andi Kleen' <andi@...stfloor.org>,
"'davem@...emloft.net'" <davem@...emloft.net>,
"'netdev@...r.kernel.org'" <netdev@...r.kernel.org>,
"'linux-pci@...r.kernel.org'" <linux-pci@...r.kernel.org>
Subject: Re: [PATCH] bnx2: Fix IRQ failures during kdump.
On Sun, May 30, 2010 at 09:12:15AM -0700, Michael Chan wrote:
> Andi Kleen wrote:
>
> > "Michael Chan" <mchan@...adcom.com> writes:
> >
> > > When switching from the crashed kernel to the kdump kernel without
> > going
> > > through PCI reset, IRQs may not work if a different IRQ mode is used
> > on
> >
> > PCIe with AER actually does support per link root port reset
> > (e.g. used for AER)
>
> Do you mean the slot_reset function in the pci_error_handlers? This
Well the fallback code in the PCIE root port driver
that does the actual resets.
It could be called directly before kexec.
> needs to be called in the context of the crashed kernel, right?
It could be done on kexec, however of course you would rely
on PCI root port data structures still being intact on a crash
(I guess that's reasonable, they are not very complicated)
>
> >
> > I've been wondering for some time if kexec should not simply
> > use that to reset all the devices, instead of addings hacks
> > around this to all drivers.
> >
> > That would fix your problems too, right?
>
> If it is called in the context of the crashed kernel, it won't work.
> We would reset it and put in back into the same IRQ mode.
Who would put it back? Your driver wouldn't be called anymore.
>
> >
> > The question is just if AER is widely enough supported for this.
> >
>
> Some newer PCIe devices support Function Level Reset, and that would
> be ideal. But most existing devices including bnx2 devices don't have
> this feature.
Root port reset should be fine for this case. Even if some
innocent device on the same root port gets reset too that shouldn't matter.
Only drawback for the NIC would be that you have to renegotiate links I think.
Also there are systems without AER support.
-Andi
--
ak@...ux.intel.com -- Speaking for myself only.
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists