[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <465C5FDF.4070401@atipa.com>
Date: Tue, 29 May 2007 12:16:15 -0500
From: Roger Heflin <rheflin@...pa.com>
To: Dave Kleikamp <shaggy@...ux.vnet.ibm.com>
CC: linux-kernel@...r.kernel.org
Subject: Re: Apparent Deadlock with nfsd/jfs on 2.6.21.1 under bonnie.
Dave Kleikamp wrote:
> On Thu, 2007-05-17 at 09:37 -0500, Roger Heflin wrote:
>> Dave Kleikamp wrote:
>>
>>> I don't have an answer to an ext3 deadlock, but this looks like a jfs
>>> problem that was recently fixed in linux-2.6.22-rc1. I had intended to
>>> send it to the stable kernel after it was picked up in mainline, but
>>> hadn't gotten to it yet.
>>>
>>> The patch is here:
>>> http://git.kernel.org/gitweb.cgi?p=linux/kernel/git/torvalds/linux-2.6.git;a=commitdiff;h=05ec9e26be1f668ccba4ca54d9a4966c6208c611
>>>
>> Dave,
>>
>> That appears to have fixed the JFS hangup.
>>
>> MTBF before was about 1 hour, under the same test I am over 20 hours
>> and things appear to still be holding together.
>
> Great. The patch is queued for the 2.6.21 stable tree now.
>
> Thanks,
> Shaggy
Dave,
Apparently there appears to be another different similar lockup,
The MTBF has risen from 1-2 hours without that patch to >100 hours,
so I am fairly sure the patch did correct the original lockup, or
at the very least make it a lot less likely.
I hit the machine across NFS for 5 days before it deadlocked, before
the patch I could only get an hour or two (2-4 different tries).
Given that pdflush is "D" it does not appear to be an NFS issue.
Included is the sysrq-t.
This is with 2.6.21.1 + the JFSIO patch.
View attachment "jdb.lockup.txt" of type "text/plain" (233701 bytes)
Powered by blists - more mailing lists