Message-ID: <20080801210919.GD14851@linux.vnet.ibm.com>
Date:	Fri, 1 Aug 2008 14:09:19 -0700
From:	"Paul E. McKenney" <paulmck@...ux.vnet.ibm.com>
To:	Linus Torvalds <torvalds@...ux-foundation.org>
Cc:	"Rafael J. Wysocki" <rjw@...k.pl>,
	Linux Kernel Mailing List <linux-kernel@...r.kernel.org>,
	Adrian Bunk <bunk@...nel.org>,
	Andrew Morton <akpm@...ux-foundation.org>,
	Natalie Protasevich <protasnb@...il.com>,
	Kernel Testers List <kernel-testers@...r.kernel.org>,
	Maximilian Engelhardt <maxi@...monizer.de>,
	Randy Dunlap <randy.dunlap@...cle.com>,
	James Bottomley <James.Bottomley@...senPartnership.com>,
	nickpiggin@...oo.com.au, adobriyan@...il.com
Subject: Re: 2.6.26-rc9: Reported regressions from 2.6.25

On Sun, Jul 06, 2008 at 08:46:09AM -0700, Linus Torvalds wrote:
> > Bug-Entry	: http://bugzilla.kernel.org/show_bug.cgi?id=10815
> > Subject		: 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0
> > Submitter	: Alexey Dobriyan <adobriyan@...il.com>
> > Date		: 2008-05-27 09:23 (41 days old)
> > References	: http://lkml.org/lkml/2008/5/27/9
> > 		  http://lkml.org/lkml/2008/6/14/87
> > Handled-By	: Oleg Nesterov <oleg@...sign.ru>
> > 		  Linus Torvalds <torvalds@...ux-foundation.org>
> > 		  Paul E. McKenney <paulmck@...ux.vnet.ibm.com>
> > Patch		: http://lkml.org/lkml/2008/5/28/16
> 
> This one is the same thing that is reported as unresolved, and no, I don't 
> think that existing patch was ever really tested to fix anything. Paul?

Alexey tested the above patch, and it did not fix his failure
(http://lkml.org/lkml/2008/6/15/93).  Neither did the patch
at http://lkml.org/lkml/2008/6/14/209.  I was never able to
reproduce Alexey's failure, whether by running LTP in parallel
with 170 kernel builds or by running either in parallel with
rcutorture.  Some enhancements to make rcutorture more vicious
were unable to provoke failures.

Alexey is able to provoke the failure on a maxcpus=1 configuration,
which should narrow things down quite a bit.  I dug through
assembly, and found no issues at that level.

Alexey, would you be willing to send along your vmlinux or disassembly
of the RCU functions?

In any case, I am working up additional diagnostics.

> I suspect SRCU will need to be simply marked BROKEN for now, because 
> nobody knows what the problem Alexey sees is. Apparently it's been seen by 
> a few other people too.

PREEMPT_RCU is already marked "default n" with a "Say N if you are
unsure."  Shouldn't that cover it?

I don't believe that SRCU is involved.  Please let me know if I missed
something.

Nick Piggin mentioned seeing failures similar to Alexey's, and I still
need his repeat-by.  Nick?

							Thanx, Paul