Message-ID: <20120604000059.GA14144@redhat.com>
Date: Sun, 3 Jun 2012 20:00:59 -0400
From: Dave Jones <davej@...hat.com>
To: Linus Torvalds <torvalds@...ux-foundation.org>
Cc: Al Viro <viro@...iv.linux.org.uk>,
Linux Kernel <linux-kernel@...r.kernel.org>
Subject: Re: processes hung after sys_renameat, and 'missing' processes
On Sun, Jun 03, 2012 at 04:07:35PM -0700, Linus Torvalds wrote:
> On Sun, Jun 3, 2012 at 3:36 PM, Dave Jones <davej@...hat.com> wrote:
> >
> > It's still up and in this state if you want me to gather any further info
> > before I reboot it.
>
> You seem to have lockdep enabled, since I see the lock information,
> but presumably you didn't get a lockdep splat before this?
Nothing.
> Because yes, it looks like you have a boatload of trinity processes
> stuck on an inode mutex. Looks like every single one of them is in
> 'lock_rename()'. It *shouldn't* be an ABBA deadlock, since lockdep
> should have noticed that, but who knows.
Another data point: the core dumps that were being deleted were in the
directory that trinity runs from. It's feasible that some of the processes
were using those dumps as input data for random syscalls, before they
got deleted.
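
For readers unfamiliar with the term used above: an ABBA deadlock happens when one task takes lock A then lock B while another takes B then A, and each ends up waiting for the lock the other holds. The following is only a user-space sketch of the usual avoidance technique, a single global lock order (here, lowest address first). It is not the kernel's lock_rename() implementation; lock_a, lock_b, lock_pair(), unlock_pair(), worker() and run_demo() are hypothetical names made up for illustration, and pthread mutexes merely stand in for the inode mutexes discussed in this thread.

```c
#include <assert.h>
#include <pthread.h>
#include <stdint.h>

/* Two hypothetical locks standing in for two directory inode mutexes. */
static pthread_mutex_t lock_a = PTHREAD_MUTEX_INITIALIZER;
static pthread_mutex_t lock_b = PTHREAD_MUTEX_INITIALIZER;
static long shared_counter;

/*
 * Take both locks in one global order (lowest address first), no matter
 * which order the caller names them in. Because every caller agrees on
 * the order, the "A then B" versus "B then A" pattern cannot occur.
 */
static void lock_pair(pthread_mutex_t *x, pthread_mutex_t *y)
{
    if ((uintptr_t)x > (uintptr_t)y) {
        pthread_mutex_t *tmp = x;
        x = y;
        y = tmp;
    }
    pthread_mutex_lock(x);
    pthread_mutex_lock(y);
}

/* Release order does not matter for deadlock avoidance. */
static void unlock_pair(pthread_mutex_t *x, pthread_mutex_t *y)
{
    pthread_mutex_unlock(x);
    pthread_mutex_unlock(y);
}

struct pair {
    pthread_mutex_t *first;
    pthread_mutex_t *second;
};

/* Each worker repeatedly takes both locks and bumps the shared counter. */
static void *worker(void *arg)
{
    struct pair *p = arg;

    for (int i = 0; i < 100000; i++) {
        lock_pair(p->first, p->second);
        shared_counter++;
        unlock_pair(p->first, p->second);
    }
    return NULL;
}

/* Run two workers that name the locks in opposite orders. */
long run_demo(void)
{
    struct pair ab = { &lock_a, &lock_b };
    struct pair ba = { &lock_b, &lock_a };
    pthread_t t1, t2;
    int rc;

    shared_counter = 0;
    rc = pthread_create(&t1, NULL, worker, &ab);
    assert(rc == 0);
    rc = pthread_create(&t2, NULL, worker, &ba);
    assert(rc == 0);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    return shared_counter;
}
```

Calling run_demo() from a main() (built with -pthread) should finish with both workers' increments counted. If lock_pair() instead locked its arguments in the order given, the two workers could each grab one lock and wait forever on the other, which is the ABBA pattern lockdep is designed to report.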
> It looks like the lock information is somewhat truncated. I suspect
> the dmesg buffer had filled up with all the task data.
>
> Can you do just "sysrq-d" for the lock information (and maybe
> separately "sysrq-w" for the blocked tasks)? The non-truncated lock
> data might tell us more.
sysrq-w: http://fpaste.org/WUd9/
sysrq-d: http://fpaste.org/ow9O/
Dave