[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <x49r4ty69bw.fsf@segfault.boston.devel.redhat.com>
Date: Fri, 01 Jun 2012 16:55:31 -0400
From: Jeff Moyer <jmoyer@...hat.com>
To: Junxiao Bi <junxiao.bi@...cle.com>
Cc: ocfs2-devel@....oracle.com, linux-aio@...ck.org, mfasheh@...e.com,
jlbec@...lplan.org, bcrl@...ck.org, linux-kernel@...r.kernel.org,
stable@...r.kernel.org, joe.jin@...cle.com
Subject: Re: [PATCH 1/2] aio: make kiocb->private NUll in init_sync_kiocb()
Junxiao Bi <junxiao.bi@...cle.com> writes:
> On 05/31/2012 10:08 PM, Jeff Moyer wrote:
>> Junxiao Bi <junxiao.bi@...cle.com> writes:
>>
>>> Ocfs2 uses kiocb.*private as a flag of unsigned long size. In
>>> commit a11f7e6 ocfs2: serialize unaligned aio, the unaligned
>>> io flag is involved in it to serialize the unaligned aio. As
>>> *private is not initialized in init_sync_kiocb() of do_sync_write(),
>>> this unaligned io flag may be unexpectly set in an aligned dio.
>>> And this will cause OCFS2_I(inode)->ip_unaligned_aio decreased
>>> to -1 in ocfs2_dio_end_io(), thus the following unaligned dio
>>> will hang forever at ocfs2_aiodio_wait() in ocfs2_file_write_iter().
>>> We can't initialized this flag in ocfs2_file_write_iter() since
>>> it may be invoked several times by do_sync_write(). So we initialize
>>> it in init_sync_kiocb(), it's also useful for other similiar use of
>>> it in the future.
>> I don't see any ocfs2_file_write_iter in the upstream kernel.
>> ocfs2_file_aio_write most certainly could set ->private to 0, it
>> will only be called once for a given kiocb.
> From sys_io_submit->..->io_submit_one->aio_run_iocb->aio_rw_vect_retry,
> it seems that aio_write could be called two times. See the following
> scenario.
> 1. There is a file opened with direct io flag, in aio_rw_vect_retry,
> aio_write is called first time. If the direct io can
> not be completed, it will fall back into buffer io, see line 2329 in
> aio_write.
Huh? What's line 2329 in aio_write?
> 2. If the very buffer io is a partial write, then it will return back
> to aio_rw_vect_retry and issue the second aio_write.
For the generic case, the fallback to buffered I/O happens in
__generic_file_aio_write, without bouncing all the way back up the call
stack to aio_rw_vect_retry. I see in ocfs2, things are a bit different:
retry->aio_rw_vect_retry->ocfs2_file_aio_write->generic_file_direct_write
->ocfs2_direct_IO->__blockdev_direct_IO
That last function can return 0 if not all of the data was written via
direct I/O. At that point, you return all of the way up the chain to
aio_rw_vect_retry, which checks the return value (ret). If it was 0,
then it goes ahead and retries the complete I/O. How does that make any
progress?!
Cheers,
Jeff
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists