[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <d453f865-2224-ed53-a2f4-f43d574c130a@nvidia.com>
Date: Thu, 29 Aug 2019 19:21:03 -0700
From: John Hubbard <jhubbard@...dia.com>
To: Mike Marshall <hubcap@...ibond.com>
CC: <john.hubbard@...il.com>,
Andrew Morton <akpm@...ux-foundation.org>,
Christoph Hellwig <hch@...radead.org>,
Dan Williams <dan.j.williams@...el.com>,
Dave Chinner <david@...morbit.com>,
Dave Hansen <dave.hansen@...ux.intel.com>,
Ira Weiny <ira.weiny@...el.com>, Jan Kara <jack@...e.cz>,
Jason Gunthorpe <jgg@...pe.ca>,
Jérôme Glisse <jglisse@...hat.com>,
LKML <linux-kernel@...r.kernel.org>,
<amd-gfx@...ts.freedesktop.org>,
ceph-devel <ceph-devel@...r.kernel.org>,
<devel@...verdev.osuosl.org>, <devel@...ts.orangefs.org>,
<dri-devel@...ts.freedesktop.org>,
<intel-gfx@...ts.freedesktop.org>, <kvm@...r.kernel.org>,
<linux-arm-kernel@...ts.infradead.org>,
<linux-block@...r.kernel.org>, <linux-crypto@...r.kernel.org>,
<linux-fbdev@...r.kernel.org>,
linux-fsdevel <linux-fsdevel@...r.kernel.org>,
<linux-media@...r.kernel.org>, linux-mm <linux-mm@...ck.org>,
Linux NFS Mailing List <linux-nfs@...r.kernel.org>,
<linux-rdma@...r.kernel.org>,
<linux-rpi-kernel@...ts.infradead.org>,
<linux-xfs@...r.kernel.org>, <netdev@...r.kernel.org>,
<rds-devel@....oracle.com>, <sparclinux@...r.kernel.org>,
<x86@...nel.org>, <xen-devel@...ts.xenproject.org>
Subject: Re: [PATCH v3 00/39] put_user_pages(): miscellaneous call sites
On 8/29/2019 6:29 PM, Mike Marshall wrote:
> Hi John...
>
> I added this patch series on top of Linux 5.3rc6 and ran
> xfstests with no regressions...
>
> Acked-by: Mike Marshall <hubcap@...ibond.com>
>
Hi Mike (and I hope Ira and others are reading as well, because
I'm making a bunch of claims further down),
That's great news, thanks for running that test suite and for
the report and the ACK.
There is an interesting pause right now, due to the fact that
we've made some tentative decisions about gup pinning, that affect
the call sites. A key decision is that only pages that were
requested via FOLL_PIN, will require put_user_page*() to release
them. There are 4 main cases, which were first explained by Jan
Kara and Vlastimil Babka, and are now written up in my FOLL_PIN
patch [1].
So, what that means for this series is that:
1. Some call sites (mlock.c for example, and a lot of the mm/ files
in fact, and more) will not be converted: some of these patches will
get dropped, especially in mm/.
2. Call sites that do DirectIO or RDMA will need to set FOLL_PIN, and
will also need to call put_user_page().
3. Call sites that do RDMA will need to set FOLL_LONGTERM *and* FOLL_PIN,
3.a. ...and will at least in some cases need to provide a link to a
vaddr_pin object, and thus back to a struct file*...maybe. Still
under discussion.
4. It's desirable to keep FOLL_* flags (or at least FOLL_PIN) internal
to the gup() calls. That implies using a wrapper call such as Ira's
vaddr_pin_[user]_pages(), instead of gup(), and vaddr_unpin_[user]_pages()
instead of put_user_page*().
5. We don't want to churn the call sites unnecessarily.
With that in mind, I've taken another pass through all these patches
and narrowed it down to:
a) 12 call sites that I'd like to convert soon, but even those
really look cleaner with a full conversion to a wrapper call
similar to (identical to?) vaddr_pin_[user]_pages(), probably
just the FOLL_PIN only variant (not FOLL_LONGTERM). That
wrapper call is not ready yet, though.
b) Some more call sites that require both FOLL_PIN and FOLL_LONGTERM.
Definitely will wait to use the wrapper calls for these, because
they may also require hooking up to a struct file*.
c) A few more that were already applied, which is fine, because they
show where to convert, and simplify a few sites anyway. But they'll
need follow-on changes to, one way or another, set FOLL_PIN.
d) And of course a few sites whose patches get dropped, as mentioned
above.
[1] https://lore.kernel.org/r/20190821040727.19650-3-jhubbard@nvidia.com
thanks,
--
John Hubbard
NVIDIA
Powered by blists - more mailing lists