lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <Zbr4QsJ5JyLEkjn6@xsang-OptiPlex-9020>
Date: Thu, 1 Feb 2024 09:47:46 +0800
From: Oliver Sang <oliver.sang@...el.com>
To: Abel Wu <wuyun.abel@...edance.com>
CC: <oe-lkp@...ts.linux.dev>, <lkp@...el.com>, <linux-kernel@...r.kernel.org>,
	Peter Zijlstra <peterz@...radead.org>, <aubrey.li@...ux.intel.com>,
	<yu.c.chen@...el.com>, Tiwei Bie <tiwei.btw@...group.com>,
	<oliver.sang@...el.com>
Subject: Re: Re: [linus:master] [sched/eevdf] 2227a957e1:
 BUG:kernel_NULL_pointer_dereference,address

hi, Abel,

On Wed, Jan 31, 2024 at 08:34:09PM +0800, Abel Wu wrote:
> On 1/30/24 10:09 PM, Oliver Sang Wrote:
> > hi, Abel,
> > 
> > On Tue, Jan 30, 2024 at 06:13:32PM +0800, Abel Wu wrote:
> > > On 1/30/24 3:24 PM, kernel test robot Wrote:
> > > > 
> > > > we found this issue happens in very random way (23 out of 999 runs).
> > > > but keeps clean on parent.
> > > 
> > > Thanks for reporting, I will try to reproduce the issue. Does the 'parent'
> > > mean the same code branch without this commit?
> > 
> > 
> > it just means the parent in git log.
> > 
> > xsang@inn:/c/repo/linux$ git log --oneline --graph 2227a957e1
> > * 2227a957e1d5b sched/eevdf: Sort the rbtree by virtual deadline
> > * 84db47ca7146d sched/numa: Fix mm numa_scan_seq based unconditional scan  <-- parent
> 
> Thanks for clarifying, Oliver. And can you please check the parent's
> dmesg for the log:
> 
> 	pr_err("EEVDF scheduling fail, picking leftmost\n");
> 
> which potentially turns a kernel panic to a message.

yeah, there is. and there are 27 runs have this log from 999 runs, almost same
rate as the crash we reported for 2227a957e1 (which is 23 out of 999 runs).

attached one FYI.

[  600.569585][   T16] EEVDF scheduling fail, picking leftmost
[  600.569900][   T16]
[  600.569902][   T16] ======================================================
[  600.569903][   T16] WARNING: possible circular locking dependency detected
[  600.569905][   T16] 6.7.0-rc1-00005-g84db47ca7146 #1 Tainted: G        W        N
[  600.569907][   T16] ------------------------------------------------------
[  600.569908][   T16] rcu_preempt/16 is trying to acquire lock:
[  600.569910][   T16] c3bb4a48 (console_owner){-.-.}-{0:0}, at: console_lock_spinning_enable+0x27/0x50
[  600.569937][   T16]
[  600.569937][   T16] but task is already holding lock:
[  600.569938][   T16] e75ff0b0 (&rq->__lock){-.-.}-{2:2}, at: rq_lock+0x1f/0xe0
...


Download attachment "dmesg-parent.xz" of type "application/x-xz" (59040 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ