lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <080d7aef-0139-4da4-8f43-aedbf9bb9948@nvidia.com>
Date: Thu, 15 Jan 2026 13:55:09 -0800
From: Terje Bergstrom <tbergstrom@...dia.com>
To: Bjorn Helgaas <helgaas@...nel.org>
Cc: Johnny-CC Chang (張晋嘉)
 <Johnny-CC.Chang@...iatek.com>, "lukas@...ner.de" <lukas@...ner.de>,
 Project_Global_Digits_Upstream_Group
 <Project_Global_Digits_Upstream_Group@...iatek.com>,
 AngeloGioacchino Del Regno <angelogioacchino.delregno@...labora.com>,
 "linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
 "linux-arm-kernel@...ts.infradead.org"
 <linux-arm-kernel@...ts.infradead.org>,
 "linux-pci@...r.kernel.org" <linux-pci@...r.kernel.org>,
 "linux-mediatek@...ts.infradead.org" <linux-mediatek@...ts.infradead.org>,
 "bhelgaas@...gle.com" <bhelgaas@...gle.com>,
 "matthias.bgg@...il.com" <matthias.bgg@...il.com>,
 Jason Gunthorpe <jgg@...dia.com>, Alex Williamson <alex@...zbot.org>
Subject: Re: [PATCH] PCI: Mark Nvidia GB10 to avoid bus reset

On 1/15/26 12:53, Bjorn Helgaas wrote:
> OK, so you do SBR to some endpoint below a GB10 Root Port, and after > the SBR, the link to the endpoint retrains with a lower lane count > and config reads to the endpoint time out?

That's right. The symptoms can vary, i.e. sometimes it retrains with lower
lane count, and sometimes config reads start timing out, and very often
it works just fine.

> I see you're from NVIDIA, so if you're confirming that this is a > hardware erratum (not an issue with the GB10 PCI controller driver), > we should definitely apply this, and I'll wordsmith the commit log > and comment something like this: > > When asserting Secondary Bus Reset to downstream devices via a GB10 > Root Port, the link doesn't retrain correctly. The link may retrain > with a lower lane count, and config accesses to downstream devices > may fail.

Yes, I confirm this is a HW erratum. The problem doesn't occur every time, so
"the link may not retrain correctly" would be more correct, but that's a minor
comment.

Terje

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ