linux-kernel - Re: [PATCH 6/9] VMware balloon: Do not limit the amount of frees and allocations in non-sleep mode.

lists.openwall.net		lists / announce owl-users owl-dev john-users john-dev passwdqc-users yescrypt popa3d-users / oss-security kernel-hardening musl sabotage tlsify passwords / crypt-dev xvendor / Bugtraq Full-Disclosure linux-kernel linux-netdev linux-ext4 linux-hardening linux-cve-announce PHC
Open Source and information security mailing list archives

Hash Suite: Windows password security audit tool. GUI, reports in PDF.

[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]

Date:	Fri, 12 Jun 2015 19:15:13 +0300
From:	"dmitry.torokhov@...il.com" <dmitry.torokhov@...il.com>
To:	Philip Moltmann <moltmann@...are.com>
Cc:	"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
	"pv-drivers@...are.com" <pv-drivers@...are.com>,
	Xavier Deguillard <xdeguillard@...are.com>,
	"gregkh@...uxfoundation.org" <gregkh@...uxfoundation.org>,
	"akpm@...ux-foundation.org" <akpm@...ux-foundation.org>
Subject: Re: [PATCH 6/9] VMware balloon: Do not limit the amount of frees and
 allocations in non-sleep mode.

On Fri, Jun 12, 2015 at 03:40:42PM +0000, Philip Moltmann wrote:
> Hi,
> > > 
> > > Three improvements contribute to the overall faster speed:
> > > - batched operations reduce the hypervisor overhead per page
> > > - 2m instead of 4k buffer reduce the hypervisor overhead per page
> > > - removing the rate-limiting for non-sleep allocations allows the 
> > > guest
> > > operating system to reclaim memory as fast as it can instead of
> > > artificially limiting it.
> > > 
> > > Any of these improvements is great by itself and helps a lot. The
> > > combination of all three makes a rather dramatic difference.
> > > 
> > > We cause hypervisor-level swapping if the balloon driver does not
> > > reclaim fast enough. As any of these improvements increases 
> > > reclamation
> > > speed, we reduce swapping risk in any case.
> > > 
> > > Unfortunately the first two improvements rely on hypervisor 
> > > support,
> > > the last does not.
> > 
> > As far as I can understand the justification for removing the limit
> > (improvement #3) is that we have #1 and #2, at least that's how I 
> > read
> > the patch description. I am saying: what if you running on a 
> > hypervisor
> > that does not support neither #1 nor #2? What was the first release 
> > that
> > of ESXi supports batching and 2M pages? What about workstation (I 
> > don't
> > recall if it started using ballooning at some point)?
> 
> I see how caused this confusion. The rate limiting was there to not
> cause the guest OS to stall while doing nothing else than ballooning.
> With the batching the time spend ballooning is smaller, hence this is
> less of a problem when these features are available.
> 
> Independent of that the yielding in the ballooning loop should help to
> reduce stalling. Also hypervisor swapping - because we rate limited
> ballooning - causes much worse stalling than in the balloon driver.

OK, fair enough. Please update the patch description to reflect that the
rate limiting is useful on its own and does not require additional
hypervisor changes (although when they present they improve the behavior
even further).

Thanks.

-- 
Dmitry
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/