[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <593F5452.2090109@intel.com>
Date: Tue, 13 Jun 2017 10:56:18 +0800
From: Wei Wang <wei.w.wang@...el.com>
To: Dave Hansen <dave.hansen@...el.com>,
"Michael S. Tsirkin" <mst@...hat.com>
CC: linux-kernel@...r.kernel.org, qemu-devel@...gnu.org,
virtualization@...ts.linux-foundation.org, kvm@...r.kernel.org,
linux-mm@...ck.org, david@...hat.com, cornelia.huck@...ibm.com,
akpm@...ux-foundation.org, mgorman@...hsingularity.net,
aarcange@...hat.com, amit.shah@...hat.com, pbonzini@...hat.com,
liliang.opensource@...il.com
Subject: Re: [PATCH v11 4/6] mm: function to offer a page block on the free
list
On 06/13/2017 04:54 AM, Dave Hansen wrote:
> On 06/12/2017 01:34 PM, Michael S. Tsirkin wrote:
>> On Mon, Jun 12, 2017 at 09:42:36AM -0700, Dave Hansen wrote:
>>> On 06/12/2017 09:28 AM, Michael S. Tsirkin wrote:
>>>>> The hypervisor is going to throw away the contents of these pages,
>>>>> right?
>>>> It should be careful and only throw away contents that was there before
>>>> report_unused_page_block was invoked. Hypervisor is responsible for not
>>>> corrupting guest memory. But that's not something an mm patch should
>>>> worry about.
>>> That makes sense. I'm struggling to imagine how the hypervisor makes
>>> use of this information, though. Does it make the pages read-only
>>> before this, and then it knows if there has not been a write *and* it
>>> gets notified via this new mechanism that it can throw the page away?
>> Yes, and specifically, this is how it works for migration. Normally you
>> start by migrating all of memory, then send updates incrementally if
>> pages have been modified. This mechanism allows skipping some pages in
>> the 1st stage, if they get changed they will be migrated in the 2nd
>> stage.
> OK, so the migration starts and marks everything read-only. All the
> pages now have read-only valuable data, or read-only worthless data in
> the case that the page is in the free lists. In order for a page to
> become non-worthless, it has to have a write done to it, which the
> hypervisor obviously knows about.
>
> With this mechanism, the hypervisor knows it can discard pages which
> have not had a write since they were known to have worthless contents.
>
> Correct?
Right. By the way, ready-only is one of the dirty page logging
methods that a hypervisor uses to capture the pages that are
written by the VM.
>
> That also seems like pretty good information to include in the
> changelog. Otherwise, folks are going to be left wondering what good
> the mechanism is. It's pretty non-trivial to figure out. :)
If necessary, I think it's better to keep the introduction at high-level:
Examples of using this API by a hypervisor:
To live migrate a VM from one physical machine to another,
the hypervisor usually transfers all the VM's memory content.
An optimization here is to skip the transfer of memory that are not
in use by the VM, because the content of the unused memory is
worthless.
This API is the used to report the unused pages to the hypervisor.
The pages that have been reported to the hypervisor as unused
pages may be used by the VM after the report. The hypervisor
has a good mechanism (i.e. dirty page logging) to capture
the change. Therefore, if the new used pages are written into some
data, the hypervisor will still transfer them to the destination machine.
What do you guys think?
Best,
Wei
Powered by blists - more mailing lists