[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <1158728779.6002.265.camel@localhost.localdomain>
Date: Wed, 20 Sep 2006 15:06:19 +1000
From: Benjamin Herrenschmidt <benh@...nel.crashing.org>
To: Andrew Morton <akpm@...l.org>
Cc: Mike Waychison <mikew@...gle.com>, linux-mm@...ck.org,
Linux Kernel list <linux-kernel@...r.kernel.org>,
Linus Torvalds <torvalds@...l.org>
Subject: Re: [RFC] page fault retry with NOPAGE_RETRY
On Tue, 2006-09-19 at 20:05 -0700, Andrew Morton wrote:
> On Wed, 20 Sep 2006 11:57:09 +1000
> Benjamin Herrenschmidt <benh@...nel.crashing.org> wrote:
>
> > > You forget that the point of this optimisation is to undo mmap_sem while
> > > waiting on the disk IO. Once we've done that we cannot go looking at ptes
> > > or vmas: another thread could have munapped the whole lot or anything.
> > > (And we always need to be afraid of use_mm()..)
> >
> > Wait wait .. .we don't need to have the mmap sem to -look- at a PTE.
>
> The pte resides in a pagetable page. Once we've dropped mmap_sem, that
> pagetable page might not be there any more: munmap() might have freed it.
> We have to retake mmap_sem, do a find_vma() and a new pagetable walk.
Ok, see my other email, we have the mmap_sem anyway so it's fine. Note
that I got a little bit mislead on that "we can wlak the page table
without mmap_sem" because we can on powerpc :) but possibly not on
x86.... we use RCU to free pagetable pages to make that possible because
our hash refill code (equivalent to our TLB miss if you want to see it
that way) has to be able to do that. I think the x86 MMU has some way to
atomically do the walking which is why you don't need that on x86 but I
don't know x86 so ...
Ben.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists