[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <20250721084412.370258-1-neil@brown.name>
Date: Mon, 21 Jul 2025 17:59:56 +1000
From: NeilBrown <neil@...wn.name>
To: Linus Torvalds <torvalds@...ux-foundation.org>,
Alexander Viro <viro@...iv.linux.org.uk>,
Christian Brauner <brauner@...nel.org>,
Jan Kara <jack@...e.cz>
Cc: linux-fsdevel@...r.kernel.org,
linux-kernel@...r.kernel.org
Subject: [PATCH 0/7 RFC] New APIs for name lookup and lock for directory operations
Hi,
these patches (against vfs.all) primarily introduce new APIs for
preparing dentries for create, remove, rename. The goal is to
centralise knowledge of how we do locking (currently by locking the
directory) so that we can eventually change the mechanism (e.g. to
locking just the dentry).
Naming is difficult and I've changed my mind several times. :-)
The basic approach is to return a dentry which can be passed to
vfs_create(), vfs_unlink() etc, and subsequently to release that
dentry. The closest analogue to this in the VFS is kern_path_create()
which is paired with done_path_create(), though there is also
kern_path_locked() which is paired with explicit inode_unlock() and
dput(). So my current approach uses "done_" for finishing up.
I have:
dentry_lookup() dentry_lookup_noperm() dentry_lookup_hashed()
dentry_lookup_killable()
paired with
done_dentry_lookup()
and also
rename_lookup() rename_lookup_noperm() rename_lookup_hashed()
paired with
done_rename_lookup()
(these take a "struct renamedata *" to which some qstrs are added.
There is also "dentry_lock_in()" which is used instead of
dentry_lookup() when you already have the dentry and want to lock it.
So you "lock" it "in" a given parent. I'm not very proud of this name,
but I don't want to use "dentry_lock" as I want to save that for
low-level locking primitives.
There is also done_dentry_lookup_return() which doesn't dput() the
dentry but returns it instread. In about 1/6 of places where I need
done_dentry_lookup() the code makes use of the dentry afterwards. Only
in half the places where done_dentry_lookup_return() is used is the
returned value immediately returned by the calling function. I could
do a dget() before done_dentry_lookup(), but that looks awkward and I
think having the _return version is justified. I'm happy to hear other
opinions.
In order for this dentry-focussed API to work we need to have the
dentry to unlock. vfs_rmdir() currently consumes the dentry on
failure, so we don't have it unless we clumsily keep a copy. So an
early patch changes vfs_rmdir() to both consume the dentry and drop the
lock on failure.
After these new APIs are refined, agreed, and applied I will have a
collection of patches to roll them out throughout the kernel. Then we
can start/continue discussing a new approach to locking which allows
directory operations to proceed in parallel.
If you want a sneak peek at some of this future work - for context
mostly - my current devel code is at https://github.com/neilbrown/linux.git
in a branch "pdirops". Be warned that a lot of the later code is under
development, is known to be wrong, and doesn't even compile. Not today
anyway. The rolling out of the new APIs is fairly mature though.
Please review and suggest better names, or tell me that my choices are adequate.
And find the bugs in the code too :-)
I haven't cc:ed the maintains of the non-VFS code that the patches
touch. I can do that once the approach and names have been approved.
Thanks,
NeilBrown
[PATCH 1/7] VFS: unify old_mnt_idmap and new_mnt_idmap in renamedata
[PATCH 2/7] VFS: introduce done_dentry_lookup()
[PATCH 3/7] VFS: Change vfs_mkdir() to unlock on failure.
[PATCH 4/7] VFS: introduce dentry_lookup() and friends
[PATCH 5/7] VFS: add dentry_lookup_killable()
[PATCH 6/7] VFS: add rename_lookup()
[PATCH 7/7] VFS: introduce dentry_lock_in()
Powered by blists - more mailing lists