lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20101218080158.58e17c21@notabene.brown>
Date:	Sat, 18 Dec 2010 08:01:58 +1100
From:	Neil Brown <neilb@...e.de>
To:	Mike Snitzer <snitzer@...hat.com>
Cc:	linux-raid@...r.kernel.org, linux-kernel@...r.kernel.org,
	dm-devel@...hat.com
Subject: Re: reproducer for DM on MD flush deadlock? (was: Re: [PULL
 REQUEST] md bug fixes)

On Fri, 17 Dec 2010 13:13:47 -0500 Mike Snitzer <snitzer@...hat.com> wrote:

> On Tue, Dec 14, 2010 at 2:22 AM, Neil Brown <neilb@...e.de> wrote:
> >
> >
> > Hi Linus,
> >  here are a few bug fixes for md.
> >  Some of the patches are actually clean-up rather than bug-fix,
> >  but I that make the bugfix simpler to review.
> >
> > Thanks,
> > NeilBrown
> >
> >
> > The following changes since commit 6313e3c21743cc88bb5bd8aa72948ee1e83937b6:
> >
> >  Merge branches 'x86-fixes-for-linus', 'perf-fixes-for-linus' and 'sched-fixes-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/linux-2.6-tip (2010-12-08 06:40:59 -0800)
> >
> > are available in the git repository at:
> >
> >  git://neil.brown.name/md/ for-linus
> >
> > NeilBrown (5):
> >      md: remove handling of flush_pending in md_submit_flush_data
> >      md: move code in to submit_flushes.
> >      md: fix possible deadlock in handling flush requests.
> 
> Hi Neil,
> 
> Thanks for fixing this DM on MD flush issue.  But my attempts to
> reproduce it have been unsuccessful.
> 
> I've tried ext4 w/ barriers to a DM device above a 2 member MD RAID1.
> The DM device has a table with 2 linear targets to the same md0
> device:
> 
> # dmsetup table
> multiple_targets: 0 24576 linear 9:0 2048
> multiple_targets: 24576 49152 linear 9:0 26624
> 
> No amount of IO with flushes has enabled me to hit a deadlock (in
> md_flush_request,  md_write_start, etc).
> 
> Do you have a simple reproducer for this issue?

No.  I think the issue is very sensitive to the exact placement of the border
between the two dm targets.  You need to be able to produce a flush request
that crosses that border.

So to reproduce it I would:

 Create an ext4 filesystem of some known size.
 Impose some simple easily reproducible load and use e.g. blktrace to gets a
 log of the flush requests.
 Choose on such request that is larger than a sector and note it's location
 Create a DM device of the same size with two targets on md devices where the
 first target ends in the middle of where the flush request was
 Repeat the above sequence on the dm device.  That should result in a flush
 request overlapping both targets and thus triggering the issue.

NeilBrown
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ