[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <ee50fce8-d931-eb10-78eb-7157a2c9020b@huaweicloud.com>
Date: Sat, 17 Aug 2024 09:29:23 +0800
From: yangerkun <yangerkun@...weicloud.com>
To: alexjlzheng@...il.com, yangerkun@...weicloud.com,
"Darrick J. Wong" <djwong@...nel.org>
Cc: adilger.kernel@...ger.ca, alexjlzheng@...cent.com,
linux-ext4@...r.kernel.org, linux-kernel@...r.kernel.org, tytso@....edu
Subject: Re: [PATCH] ext4: disambiguate the return value of
ext4_dio_write_end_io()
在 2024/8/17 0:57, alexjlzheng@...il.com 写道:
> On Fri, 16 Aug 2024 20:21:22 +0800, yangerkun@...weicloud.com wrote:
>> 在 2024/8/15 19:27, alexjlzheng@...il.com 写道:
>>> From: Jinliang Zheng <alexjlzheng@...cent.com>
>>>
>>> The commit 91562895f803 ("ext4: properly sync file size update after O_SYNC
>>> direct IO") causes confusion about the meaning of the return value of
>>> ext4_dio_write_end_io().
>>>
>>> Specifically, when the ext4_handle_inode_extension() operation succeeds,
>>> ext4_dio_write_end_io() directly returns count instead of 0.
>>>
>>> This does not cause a bug in the current kernel, but the semantics of the
>>> return value of the ext4_dio_write_end_io() function are wrong, which is
>>> likely to introduce bugs in the future code evolution.
>>>
>>> Signed-off-by: Jinliang Zheng <alexjlzheng@...cent.com>
>>> ---
>>> fs/ext4/file.c | 5 +++--
>>> 1 file changed, 3 insertions(+), 2 deletions(-)
>>>
>>> diff --git a/fs/ext4/file.c b/fs/ext4/file.c
>>> index c89e434db6b7..6df5a92cec2b 100644
>>> --- a/fs/ext4/file.c
>>> +++ b/fs/ext4/file.c
>>> @@ -392,8 +392,9 @@ static int ext4_dio_write_end_io(struct kiocb *iocb, ssize_t size,
>>> */
>>> if (pos + size <= READ_ONCE(EXT4_I(inode)->i_disksize) &&
>>> pos + size <= i_size_read(inode))
>>> - return size;
>>> - return ext4_handle_inode_extension(inode, pos, size);
>>> + return 0;
>>> + error = ext4_handle_inode_extension(inode, pos, size);
>>> + return error < 0 ? error : 0;
>>
>> Why?
>
> Before commit 91562895f803 ("ext4: properly sync file size update after O_SYNC
> direct IO"), all filesystems' iomap_dio_ops.end_io() return 0 on success and
> negative value on failure.
>
> Moreover, this confusion of return value semantics caused data corruption when
> this above patch was merged to the stable branch. See
> https://lwn.net/Articles/954285/ for details.
Yeah, I know this problem, you should backport 936e114a245b("iomap:
update ki_pos a little later in iomap_dio_complete") too to help update
iocb->ki_pos since ext4_dio_write_end_io now return > 0.
>
>>
>> iomap_dio_complete can use the return value directly without any bug.
>> And I think the code now seems more clearly...
>>
>
> In my opinion, clean code should be clearly defined code, especially the
Agree.
> interface functions connecting various modules. So, what is the return value
> definition of iomap_dio_ops.end_io()? What is the return value definition of
> ext4_dio_write_end_io()?
I have not seen the definition of return value for
iomap_dio_ops.end_io(), so I think the code is ok now. If we give a
definition for the return value like Darrick describe, this patch looks
good to me.
>
> Thanks,
> Jinliang Zheng
>
>>> }
>>>
>>> static const struct iomap_dio_ops ext4_dio_write_ops = {
>
Powered by blists - more mailing lists