[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <91e9bbe3-75cf-4874-9d64-0785f7ea21d9@oracle.com>
Date: Tue, 18 Jun 2024 08:46:31 +0100
From: John Garry <john.g.garry@...cle.com>
To: Christoph Hellwig <hch@....de>, Keith Busch <kbusch@...nel.org>
Cc: axboe@...nel.dk, sagi@...mberg.me, jejb@...ux.ibm.com,
martin.petersen@...cle.com, viro@...iv.linux.org.uk,
brauner@...nel.org, dchinner@...hat.com, jack@...e.cz,
djwong@...nel.org, linux-block@...r.kernel.org,
linux-kernel@...r.kernel.org, linux-nvme@...ts.infradead.org,
linux-fsdevel@...r.kernel.org, tytso@....edu, jbongio@...gle.com,
linux-scsi@...r.kernel.org, ojaswin@...ux.ibm.com, linux-aio@...ck.org,
linux-btrfs@...r.kernel.org, io-uring@...r.kernel.org,
nilay@...ux.ibm.com, ritesh.list@...il.com, willy@...radead.org,
agk@...hat.com, snitzer@...nel.org, mpatocka@...hat.com,
dm-devel@...ts.linux.dev, hare@...e.de,
Himanshu Madhani <himanshu.madhani@...cle.com>
Subject: Re: [PATCH v8 05/10] block: Add core atomic write support
On 18/06/2024 07:51, Christoph Hellwig wrote:
> On Mon, Jun 17, 2024 at 12:56:01PM -0600, Keith Busch wrote:
>> I'm not sure I follow why these two need to be the same. I can see
>> checking for 'chunk_sectors % boundary_sectors_hw == 0', but am I
>> missing something else?
For simplicity, initially I was just asking for them to be the same.
If we relax to chunk_sectors % boundary_sectors_hw == 0, then for normal
writing we could use a larger chunk size (than atomic boundary_sectors_hw).
I just don't know if this stuff exists which will have a larger
chunk_size than atomic boundary_sectors_hw and whether it is worth
trying to support them.
>>
>> The reason I ask, zone block devices redefine the "chunk_sectors" to
>> mean the zone size, and I'm pretty sure the typical zone size is much
>> larger than the any common atomic write size.
>
> Yeah. Then again atomic writes in the traditional sense don't really
> make sense for zoned devices anyway as the zoned devices never overwrite
> and require all data up to the write pointer to be valid. In theory
> they could be interpreted so that you don't get a partical write failure
> if you stick to the atomic write boundaries, but that is mostly
> pointless.
>
About NVMe, the spec says that NABSN and NOIOB may not be related to one
another (command set spec 1.0d 5.8.2.1), but I am wondering if people
really build HW which would have different NABSN/NABSPF and NOIOB. I
don't know.
Thanks,
John
Powered by blists - more mailing lists