[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <d4a3f3c717e786e8d0d26d044ef3dbe8b913452c.camel@themaw.net>
Date: Tue, 08 Jun 2021 09:26:25 +0800
From: Ian Kent <raven@...maw.net>
To: "Eric W. Biederman" <ebiederm@...ssion.com>
Cc: Greg Kroah-Hartman <gregkh@...uxfoundation.org>,
Tejun Heo <tj@...nel.org>, Eric Sandeen <sandeen@...deen.net>,
Fox Chen <foxhlchen@...il.com>,
Brice Goglin <brice.goglin@...il.com>,
Al Viro <viro@...IV.linux.org.uk>,
Rick Lindsley <ricklind@...ux.vnet.ibm.com>,
David Howells <dhowells@...hat.com>,
Miklos Szeredi <miklos@...redi.hu>,
Marcelo Tosatti <mtosatti@...hat.com>,
Carlos Maiolino <cmaiolino@...hat.com>,
linux-fsdevel <linux-fsdevel@...r.kernel.org>,
Kernel Mailing List <linux-kernel@...r.kernel.org>
Subject: Re: [PATCH v5 2/6] kernfs: add a revision to identify directory
node changes
On Mon, 2021-06-07 at 12:53 -0500, Eric W. Biederman wrote:
> Ian Kent <raven@...maw.net> writes:
>
> > Add a revision counter to kernfs directory nodes so it can be used
> > to detect if a directory node has changed.
> >
> > There's an assumption that sizeof(unsigned long) <= sizeof(pointer)
> > on all architectures and as far as I know that assumption holds.
> >
> > So adding a revision counter to the struct kernfs_elem_dir variant
> > of
> > the kernfs_node type union won't increase the size of the
> > kernfs_node
> > struct. This is because struct kernfs_elem_dir is at least
> > sizeof(pointer) smaller than the largest union variant. It's
> > tempting
> > to make the revision counter a u64 but that would increase the size
> > of
> > kernfs_node on archs where sizeof(pointer) is smaller than the
> > revision
> > counter.
> >
> > Signed-off-by: Ian Kent <raven@...maw.net>
> > ---
> > fs/kernfs/dir.c | 8 ++++++++
> > fs/kernfs/kernfs-internal.h | 24 ++++++++++++++++++++++++
> > include/linux/kernfs.h | 5 +++++
> > 3 files changed, 37 insertions(+)
> >
> > diff --git a/fs/kernfs/dir.c b/fs/kernfs/dir.c
> > index 33166ec90a112..b88432c48851f 100644
> > --- a/fs/kernfs/dir.c
> > +++ b/fs/kernfs/dir.c
> > @@ -372,6 +372,7 @@ static int kernfs_link_sibling(struct
> > kernfs_node *kn)
> > /* successfully added, account subdir number */
> > if (kernfs_type(kn) == KERNFS_DIR)
> > kn->parent->dir.subdirs++;
> > + kernfs_inc_rev(kn->parent);
> >
> > return 0;
> > }
> > @@ -394,6 +395,7 @@ static bool kernfs_unlink_sibling(struct
> > kernfs_node *kn)
> >
> > if (kernfs_type(kn) == KERNFS_DIR)
> > kn->parent->dir.subdirs--;
> > + kernfs_inc_rev(kn->parent);
> >
> > rb_erase(&kn->rb, &kn->parent->dir.children);
> > RB_CLEAR_NODE(&kn->rb);
> > @@ -1105,6 +1107,12 @@ static struct dentry
> > *kernfs_iop_lookup(struct inode *dir,
> >
> > /* instantiate and hash dentry */
> > ret = d_splice_alias(inode, dentry);
> > + if (!IS_ERR(ret)) {
> > + if (unlikely(ret))
> > + kernfs_set_rev(parent, ret);
> > + else
> > + kernfs_set_rev(parent, dentry);
>
> Do we care about d_time on non-NULL dentries?
Would we ever need to use it avoid a search for any other cases?
Probably not ... those export ops mean that some dentries might
not have d_time set.
Maybe it's best to put a comment in about only using it for
negative dentries and set it unconditionally in ->lookup() as
you describe.
>
> For d_splice_alias to return a different dentry implies
> that the dentry was non-NULL.
>
> I am wondering if having a guarantee that d_time never changes could
> help simplify the implementation. For never changing it would see to
> make sense to call kernfs_set_rev before d_splice_alias on dentry,
> and
> simply not worry about it after d_splice_alias.
Yes, I was tempted to do that.
>
> > + }
> > out_unlock:
> > mutex_unlock(&kernfs_mutex);
> > return ret;
> > diff --git a/fs/kernfs/kernfs-internal.h b/fs/kernfs/kernfs-
> > internal.h
> > index ccc3b44f6306f..1536002584fc4 100644
> > --- a/fs/kernfs/kernfs-internal.h
> > +++ b/fs/kernfs/kernfs-internal.h
> > @@ -81,6 +81,30 @@ static inline struct kernfs_node
> > *kernfs_dentry_node(struct dentry *dentry)
> > return d_inode(dentry)->i_private;
> > }
> >
> > +static inline void kernfs_set_rev(struct kernfs_node *kn,
> > + struct dentry *dentry)
> > +{
> > + if (kernfs_type(kn) == KERNFS_DIR)
> > + dentry->d_time = kn->dir.rev;
> > +}
> > +
> > +static inline void kernfs_inc_rev(struct kernfs_node *kn)
> > +{
> > + if (kernfs_type(kn) == KERNFS_DIR)
> > + kn->dir.rev++;
> > +}
> > +
> > +static inline bool kernfs_dir_changed(struct kernfs_node *kn,
> > + struct dentry *dentry)
> > +{
> > + if (kernfs_type(kn) == KERNFS_DIR) {
> > + /* Not really a time bit it does what's needed */
> > + if (time_after(kn->dir.rev, dentry->d_time))
> > + return true;
>
> Why not simply make this:
> if (kn->dir.rev != dentry->d_time)
> return true;
>
> I don't see what is gained by not counting as changed something in
> the
> wrong half of the values.
Yes, it was like that originally and really shouldn't make
any difference. I'll change it back.
Ian
>
> > + }
> > + return false;
> > +}
> > +
> > extern const struct super_operations kernfs_sops;
> > extern struct kmem_cache *kernfs_node_cache, *kernfs_iattrs_cache;
> >
> > diff --git a/include/linux/kernfs.h b/include/linux/kernfs.h
> > index 9e8ca8743c268..7947acb1163d7 100644
> > --- a/include/linux/kernfs.h
> > +++ b/include/linux/kernfs.h
> > @@ -98,6 +98,11 @@ struct kernfs_elem_dir {
> > * better directly in kernfs_node but is here to save
> > space.
> > */
> > struct kernfs_root *root;
> > + /*
> > + * Monotonic revision counter, used to identify if a
> > directory
> > + * node has changed during revalidation.
> > + */
> > + unsigned long rev;
> > };
> >
> > struct kernfs_elem_symlink {
>
> Eric
Powered by blists - more mailing lists