Message-ID: <39f14032374d5d60c62b283637267a96ce535861.camel@gmx.de>
Date: Thu, 28 Aug 2025 19:26:54 +0200
From: Mike Galbraith <efault@....de>
To: Sean Anderson <sean.anderson@...ux.dev>, Robert Hancock
	 <robert.hancock@...ian.com>
Cc: Breno Leitao <leitao@...ian.org>, "netdev@...r.kernel.org"
	 <netdev@...r.kernel.org>
Subject: Re: netpoll: raspberrypi [4 5] driver locking woes

On Thu, 2025-08-28 at 10:57 -0400, Sean Anderson wrote:
> Hi Mike,
> 
> On 8/27/25 12:02, Mike Galbraith wrote:
> > Unexpected addendum to done deal datapoint, so off list.
> > 
> > On Tue, 2025-08-26 at 11:49 +0200, Mike Galbraith wrote:
> > > 
> > > The pi5 gripe fix is equally trivial, but submitting that is pointless
> > > given there's something else amiss in fingered commit.  This is all of
> > > the crash info that escapes the box w/wo gripes silenced.
> > > 
> > > [   51.688868] sysrq: Trigger a crash
> > > [   51.688892] Kernel panic - not syncing: sysrq triggered crash
> > > [   51.698066] CPU: 0 UID: 0 PID: 0 Comm: swapper/0 Not tainted 6.17.0-rc2-v8-lockdep #533 PREEMPTLAZY
> > > [   51.707234] Hardware name: Raspberry Pi 5 Model B Rev 1.0 (DT)
> > > [   51.713085] Call trace:
> > > [   51.715532]  show_stack+0x20/0x38 (C)
> > > [   51.719206]  dump_stack_lvl+0x38/0xd0
> > > [   51.722878]  dump_stack+0x18/0x28
> > > 
> > > That aspect is a punt and run atm (time.. and dash of laziness:).
> > 
> > Plan was to end datapoint thread, but after booting pi5's 6.12 kernel,
> > for some reason I fired up netconsole.. and box promptly exhibited the
> > netpoll locking bug warning, indicating presence of 138badbc21a0. 
> > Instead of saying to self "nope, just walk away", I poked SysRq-C.. and
> > the bloody damn monitoring box received a 100% complete death rattle. 
> > Well bugger.
> 
> Did you get a backtrace for this?

Yes, logs for 6.12.41 and 6.17.0-rc2 attached.

Since a patch has landed in the meantime, I've also attached a log of
patched 6.17.0-rc3, which is now gripe free (yay) but still shows the
aforementioned broken output, followed by a run with the e6a532185daa
revert added to confirm that it still cures that.

> And to be clear, the steps to reproduce this are to boot a kernel with
> lockdep enabled with netconsole on macb and then hit sysrq?

Yup.

> > Two trees contain locking buglet introducing 138badbc21a0, 6.12.41 and
> > 6.17.0, but only the latter reproduces the above mess.  A quick stare at
> > git diff inspired checking e6a532185daa, and sure enough reverting only
> > it reproduced a functional netconsole, modulo warnings, ie the mess
> > above manifests only with BOTH resident.  Either revert 138badbc21a0,
> > or revert e6a532185daa and fix the 138badbc21a0 locking buglet, and the
> > result is the same, a perfectly functional warning free netconsole.

View attachment "netconsole-rpi5-6.12-eth0.log" of type "text/x-log" (18613 bytes)

View attachment "netconsole-rpi5-6.17-eth0.log" of type "text/x-log" (17135 bytes)

View attachment "netconsole-rpi5-6.17-eth0+patch-and+revert.log" of type "text/x-log" (3698 bytes)
