[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20101218080158.58e17c21@notabene.brown>
Date: Sat, 18 Dec 2010 08:01:58 +1100
From: Neil Brown <neilb@...e.de>
To: Mike Snitzer <snitzer@...hat.com>
Cc: linux-raid@...r.kernel.org, linux-kernel@...r.kernel.org,
dm-devel@...hat.com
Subject: Re: reproducer for DM on MD flush deadlock? (was: Re: [PULL
REQUEST] md bug fixes)
On Fri, 17 Dec 2010 13:13:47 -0500 Mike Snitzer <snitzer@...hat.com> wrote:
> On Tue, Dec 14, 2010 at 2:22 AM, Neil Brown <neilb@...e.de> wrote:
> >
> >
> > Hi Linus,
> > here are a few bug fixes for md.
> > Some of the patches are actually clean-up rather than bug-fix,
> > but I that make the bugfix simpler to review.
> >
> > Thanks,
> > NeilBrown
> >
> >
> > The following changes since commit 6313e3c21743cc88bb5bd8aa72948ee1e83937b6:
> >
> > Merge branches 'x86-fixes-for-linus', 'perf-fixes-for-linus' and 'sched-fixes-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/linux-2.6-tip (2010-12-08 06:40:59 -0800)
> >
> > are available in the git repository at:
> >
> > git://neil.brown.name/md/ for-linus
> >
> > NeilBrown (5):
> > md: remove handling of flush_pending in md_submit_flush_data
> > md: move code in to submit_flushes.
> > md: fix possible deadlock in handling flush requests.
>
> Hi Neil,
>
> Thanks for fixing this DM on MD flush issue. But my attempts to
> reproduce it have been unsuccessful.
>
> I've tried ext4 w/ barriers to a DM device above a 2 member MD RAID1.
> The DM device has a table with 2 linear targets to the same md0
> device:
>
> # dmsetup table
> multiple_targets: 0 24576 linear 9:0 2048
> multiple_targets: 24576 49152 linear 9:0 26624
>
> No amount of IO with flushes has enabled me to hit a deadlock (in
> md_flush_request, md_write_start, etc).
>
> Do you have a simple reproducer for this issue?
No. I think the issue is very sensitive to the exact placement of the border
between the two dm targets. You need to be able to produce a flush request
that crosses that border.
So to reproduce it I would:
Create an ext4 filesystem of some known size.
Impose some simple easily reproducible load and use e.g. blktrace to gets a
log of the flush requests.
Choose on such request that is larger than a sector and note it's location
Create a DM device of the same size with two targets on md devices where the
first target ends in the middle of where the flush request was
Repeat the above sequence on the dm device. That should result in a flush
request overlapping both targets and thus triggering the issue.
NeilBrown
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists