lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20251114170129.GI196370@frogsfrogsfrogs>
Date: Fri, 14 Nov 2025 09:01:29 -0800
From: "Darrick J. Wong" <djwong@...nel.org>
To: Jeff Layton <jlayton@...nel.org>
Cc: Christoph Hellwig <hch@....de>, Christian Brauner <brauner@...nel.org>,
	Al Viro <viro@...iv.linux.org.uk>, David Sterba <dsterba@...e.com>,
	Jan Kara <jack@...e.cz>, Mike Marshall <hubcap@...ibond.com>,
	Martin Brandenburg <martin@...ibond.com>,
	Carlos Maiolino <cem@...nel.org>, Stefan Roesch <shr@...com>,
	linux-kernel@...r.kernel.org, linux-btrfs@...r.kernel.org,
	gfs2@...ts.linux.dev, io-uring@...r.kernel.org,
	devel@...ts.orangefs.org, linux-unionfs@...r.kernel.org,
	linux-mtd@...ts.infradead.org, linux-xfs@...r.kernel.org,
	linux-nfs@...r.kernel.org
Subject: Re: re-enable IOCB_NOWAIT writes to files

On Fri, Nov 14, 2025 at 09:04:58AM -0500, Jeff Layton wrote:
> On Fri, 2025-11-14 at 07:26 +0100, Christoph Hellwig wrote:
> > Hi all,
> > 
> > commit 66fa3cedf16a ("fs: Add async write file modification handling.")
> > effectively disabled IOCB_NOWAIT writes as timestamp updates currently
> > always require blocking, and the modern timestamp resolution means we
> > always update timestamps.  This leads to a lot of context switches from
> > applications using io_uring to submit file writes, making it often worse
> > than using the legacy aio code that is not using IOCB_NOWAIT.
> > 
> > This series allows non-blocking updates for lazytime if the file system
> > supports it, and adds that support for XFS.
> > 
> > It also fixes the layering bypass in btrfs when updating timestamps on
> > device files for devices removed from btrfs usage, and FMODE_NOCMTIME
> > handling in the VFS now that nfsd started using it.  Note that I'm still
> > not sure that nfsd usage is fully correct for all file systems, as only
> > XFS explicitly supports FMODE_NOCMTIME, but at least the generic code
> > does the right thing now.
> > 
> > Diffstat:
> >  Documentation/filesystems/locking.rst |    2 
> >  Documentation/filesystems/vfs.rst     |    6 ++
> >  fs/btrfs/inode.c                      |    3 +
> >  fs/btrfs/volumes.c                    |   11 +--
> >  fs/fat/misc.c                         |    3 +
> >  fs/fs-writeback.c                     |   53 ++++++++++++++----
> >  fs/gfs2/inode.c                       |    6 +-
> >  fs/inode.c                            |  100 +++++++++++-----------------------
> >  fs/internal.h                         |    3 -
> >  fs/orangefs/inode.c                   |    7 ++
> >  fs/overlayfs/inode.c                  |    3 +
> >  fs/sync.c                             |    4 -
> >  fs/ubifs/file.c                       |    9 +--
> >  fs/utimes.c                           |    1 
> >  fs/xfs/xfs_iops.c                     |   29 ++++++++-
> >  fs/xfs/xfs_super.c                    |   29 ---------
> >  include/linux/fs.h                    |   17 +++--
> >  include/trace/events/writeback.h      |    6 --
> >  18 files changed, 152 insertions(+), 140 deletions(-)
> 
> This all looks pretty reasonable to me. There are a few changelog and
> subject line typos, but the code changes look fine. You can add:
> 
> Reviewed-by: Jeff Layton <jlayton@...nel.org>
> 
> As far as nfsd's usage of FMODE_NOCMTIME, it looks OK to me. That's
> implemented today by the check in file_modified_flags(), which is
> generic and should work across filesystems.
> 
> The main exception is xfs_exchange_range() which has some special
> handling for it, but nfsd doesn't use that functionality so that
> shouldn't be an issue.
> 
> Am I missing some subtlety?

In exchangerange specifically?

The FMODE_NOCMTIME checks in xfs_exchange_range exist to tell the
exchange-range code to update cmtime, but only if it decides to actually
go through with the mapping exchange.  Since the mapping exchange
requires a transaction anyway, it's cheap to bundle in timestamp
updates.

Also there's no way that we can do nonblocking exchangerange so a NOWAIT
flag wouldn't be much help here anyway.

(I hope that answers your question)

--D

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ