[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <bbe04eb10609060810j70562e3dt6a87067d95bc883e@mail.gmail.com>
Date: Wed, 6 Sep 2006 11:10:04 -0400
From: "Kimball Murray" <kimball.murray@...il.com>
To: "Andi Kleen" <ak@...e.de>
Cc: "Nick Piggin" <nickpiggin@...oo.com.au>,
linux-kernel@...r.kernel.org, akpm@...eo.com
Subject: Re: [Feature] x86_64 page tracking for Stratus servers
On 9/6/06, Andi Kleen <ak@...e.de> wrote:
>
> > Silly question, why can't you do all this from stop_machine_run context (or
> > your SMI) that doesn't have to worry about other CPUs dirtying memory?
>
> Because that would be too slow for continuous mirroring.
>
> You can't go through 10+GB of virtual memory (or more with shared
> memory because the scan has to be virtual) in an interrupt.
>
> The only sane way is to do it continuously.
That's exactly right on all counts. We can "harvest" dirty pages in a
continuous fashion, as a background task. But when we finally decide
to dive in to machine blackout for the final copy, we need to ensure
that the blackout is as short as possible. Toward that goal, we have
built a sort of DMA engine that can move pages over the backplane to
the spare cpu/memory module without interfering (much) with the active
module's activities. We continue doing this until the number of pages
for final copy is small enough that we can be sure the final blackout
copy will take a sufficiently short time.
And you're right about the virtual scan as well. Trouble is, it's a
potentially unbounded scan. With the right machine load, we can find
ourselves in a state where it takes so long to scan all the page
tables that the pages are all dirty again before we start the next
pass. For this situation we have two choices. We can either give up
and dive into the blackout passing a dirty list that basically
includes all of memory (long blackout time!), or we can continue
scanning for a long time, waiting and hoping the machine load will
decrease to a point where are target number of dirty pages is reached.
This sort of thing will always be a problem with a software solution.
-kimball
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists