[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <Pine.LNX.4.64.0801301251160.32559@schroedinger.engr.sgi.com>
Date: Wed, 30 Jan 2008 12:55:36 -0800 (PST)
From: Christoph Lameter <clameter@....com>
To: Jack Steiner <steiner@....com>
cc: Andrea Arcangeli <andrea@...ranet.com>, Robin Holt <holt@....com>,
Avi Kivity <avi@...ranet.com>, Izik Eidus <izike@...ranet.com>,
Nick Piggin <npiggin@...e.de>, kvm-devel@...ts.sourceforge.net,
Benjamin Herrenschmidt <benh@...nel.crashing.org>,
Peter Zijlstra <a.p.zijlstra@...llo.nl>,
linux-kernel@...r.kernel.org, linux-mm@...ck.org,
daniel.blueman@...drics.com, Hugh Dickins <hugh@...itas.com>
Subject: Re: [patch 2/6] mmu_notifier: Callbacks to invalidate address ranges
On Wed, 30 Jan 2008, Jack Steiner wrote:
> > Seems that we cannot rely on the invalidate_ranges for correctness at all?
> > We need to have invalidate_page() always. invalidate_range() is only an
> > optimization.
> >
>
> I don't understand your point "an optimization". How would invalidate_range
> as currently defined be correctly used?
We are changing definitions. The original patch by Andrea calls
invalidate_page for each pte that is cleared. So strictly you would not
need an invalidate_range.
> It _looks_ like it would work only if xpmem/gru/etc takes a refcnt on
> the page & drops it when invalidate_range is called. That may work (not sure)
> for xpmem but not for the GRU.
The refcount is not necessary if we adopt Andrea's approach of a callback
on the clearing of each pte. At that point the page is still guaranteed to
exist. If we do the range_invalidate later (as in V3) then the page may
have been released (see sys_remap_file_pages() f.e.) before we zap the GRU
ptes. So there will be a time when the GRU may write to a page that has
been freed and used for another purpose.
Taking a refcount on the page defers the free until the range_invalidate
runs.
I would prefer a solution that does not require taking refcounts (pins)
for establishing an external pte and for release (like what the GRU does).
If we could effectively determine that there are no external ptes in a
range then the invalidate_page() call may return immediately. Maybe it is
then effective to do these gazillions of invalidate_page() calls when a
process terminates or an remap is performed.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists