lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <5098C4F8.9050700@kernel.dk>
Date:	Tue, 06 Nov 2012 09:06:16 +0100
From:	Jens Axboe <axboe@...nel.dk>
To:	Dave Chinner <david@...morbit.com>
CC:	Shaohua Li <shli@...nel.org>, NeilBrown <neilb@...e.de>,
	linux RAID <linux-raid@...r.kernel.org>,
	lkml <linux-kernel@...r.kernel.org>
Subject: Re: Problem with DISCARD and RAID5

On 2012-11-05 22:48, Dave Chinner wrote:
> On Fri, Nov 02, 2012 at 09:40:58AM +0800, Shaohua Li wrote:
>> On Thu, Nov 01, 2012 at 05:38:54PM +1100, NeilBrown wrote:
>>>
>>> Hi Shaohua,
>>>  I've been doing some testing and discovered a problem with your discard
>>>  support for RAID5.
>>>
>>>  The code in blkdev_issue_discard assumes that the 'granularity' is a power
>>>  of 2, and for example subtracts 1 to get a mask.
>>>
>>>  However RAID5 sets the granularity to be the stripe size which often is not
>>>  a power of two.  When this happens you can easily get into an infinite loop.
>>>
>>>  I suspect that to make this work properly, blkdev_issue_discard will need to
>>>  be changed to allow 'granularity' to be an arbitrary value.
>>>  When it is a power of two, the current masking can be used.
>>>  When it is anything else, it will need to use sector_div().
>>
>> Yep, looks we need use sector_div. And this isn't the only problem. discard
>> request can be merged, and the merge check only checks max_discard_sectors.
>> That means the split requests in blkdev_issue_discard can be merged again. The
>> split nerver works.
>>
>> I'm wondering what's purpose of discard_alignment and discard_granularity. Are
>> there devices with discard_granularity not 1 sector?
> 
> Most certainly. Thin provisioned storage often has granularity in the
> order of megabytes....

Can't really to to much about that...

>> If bio isn't discard
>> aligned, what device will do?
> 
> Up to the device.

We should not send those down, if they are violating the restrictions
set by the driver.

>> Further, why driver handles alignment/granularity
>> if device will ignore misaligned request.
> 
> When you send a series of sequential unaligned requests, the device
> may ignore them all. Hence you end up with nothing being discarded,
> even though the entire range being discarded is much, much larger
> than the discard granularity....

That's just tough luck, unfortunately. Shaohua, I'd suggest sending down
whatever discards you can, IFF they are aligned according to the
restrictions being set. If that ends up not discarding to devices that
have large alignment/size constraints, nothing we can do about that.

-- 
Jens Axboe

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ