[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <Pine.LNX.4.64.0701051147090.28519@schroedinger.engr.sgi.com>
Date: Fri, 5 Jan 2007 12:02:47 -0800 (PST)
From: Christoph Lameter <clameter@....com>
To: Linus Torvalds <torvalds@...l.org>
cc: Nick Piggin <nickpiggin@...oo.com.au>,
Andrea Gelmini <gelma@...ma.net>,
Linux Kernel Mailing List <linux-kernel@...r.kernel.org>
Subject: Re: VM: Fix nasty and subtle race in shared mmap'ed page writeback
On Fri, 5 Jan 2007, Linus Torvalds wrote:
> However, a lot of the code isn't really amenable to it as it stands now.
> We very much tend to call it in critical sections, and you have to move
> them all out of the locks they are now.
It looks as if most code handling the dirty bits already uses the page
lock?
> "Artistic". Good word. That said, most of the code in there needs its own
> locks for other reasons (ie the reason __set_page_dirty_nobuffers() ends
> up taking the tree-lock is because of the radix tree bits, which can NOT
> be protected by per-page locks _anyway_).
According to the comments: Most callers of __set_page_dirty_nobuffers hold
the page lock. The only exception seems to be zap_pte_range where we
transfer the dirty information from the pte to the page. This is now much
rarer since the dirty mmap tracking patches make the fault handler deal
with it.
Still, the page dirtying in zap_pte_range remains a potential trouble spot
for the remaining cases in which we need to dirty pages there since it is
not rate limited. There is a potential cause for creating deadlocks
because too many pages were dirtied and the file system cannot allocate
enough memory for writeout.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists