linux-kernel - Re: 2.6.26-rc8 deadlock: RAID code?

lists.openwall.net		lists / announce owl-users owl-dev john-users john-dev passwdqc-users yescrypt popa3d-users / oss-security kernel-hardening musl sabotage tlsify passwords / crypt-dev xvendor / Bugtraq Full-Disclosure linux-kernel linux-netdev linux-ext4 linux-hardening linux-cve-announce PHC
Open Source and information security mailing list archives

Hash Suite: Windows password security audit tool. GUI, reports in PDF.

[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]

Message-ID: <e9c3a7c20807050854o330279f7x3d64b1ec263f8800@mail.gmail.com>
Date:	Sat, 5 Jul 2008 08:54:31 -0700
From:	"Dan Williams" <dan.j.williams@...el.com>
To:	"George Spelvin" <linux@...izon.com>
Cc:	linux-raid@...r.kernel.org, linux-kernel@...r.kernel.org
Subject: Re: 2.6.26-rc8 deadlock: RAID code?

On Fri, Jul 4, 2008 at 5:55 AM, George Spelvin <linux@...izon.com> wrote:
> I've seen this twice before, but had to get remote logging working to
> capture the initial error; once the root file system locks up there's
> an unending stream of these messages and even syslog can't actually
> log anything.
>
> (In fact, it locked up and stopped working after capturing this here.
> I'd have to get a null modem cable and serial console to capture more.)
>
> I can do it again, but it takes a few days.
>
> Hardware: single-core Athlon 64, ECC memory (scrubbing enabled),
> 6x SATA drives on 3x SiI3132 controllers.  Root file system (where I
> believe the problem is) is ext3 over RAID-10 over all drives.  Another,
> larger file system (that I can't see why the sensors daemon would touch)
> is ext3 over RAID5 over the same drives.
>
> Kernel is 2.6.26-rc8 + EDAC patches + linuxpps support.  This problem
> was not observed in 2.6.25 kernels (with the same patches).
>
> Any ideas?  For now, I'm going to turn on frame pointers and
> CONFIG_PROVE_LOCKING to get more information.
>

Are you running with CONFIG_NUMA=n?  If so, you may be seeing the
effects of kswapd not running. See the patch at:

     http://marc.info/?l=linux-mm&m=121510360428340&w=2

--
Dan
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/