lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <1434123642.25956.64.camel@vmware.com>
Date:	Fri, 12 Jun 2015 15:40:42 +0000
From:	Philip Moltmann <moltmann@...are.com>
To:	"dmitry.torokhov@...il.com" <dmitry.torokhov@...il.com>
CC:	"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
	"pv-drivers@...are.com" <pv-drivers@...are.com>,
	Xavier Deguillard <xdeguillard@...are.com>,
	"gregkh@...uxfoundation.org" <gregkh@...uxfoundation.org>,
	"akpm@...ux-foundation.org" <akpm@...ux-foundation.org>
Subject: Re: [PATCH 6/9] VMware balloon: Do not limit the amount of frees
 and allocations in non-sleep mode.

Hi,
> > 
> > Three improvements contribute to the overall faster speed:
> > - batched operations reduce the hypervisor overhead per page
> > - 2m instead of 4k buffer reduce the hypervisor overhead per page
> > - removing the rate-limiting for non-sleep allocations allows the 
> > guest
> > operating system to reclaim memory as fast as it can instead of
> > artificially limiting it.
> > 
> > Any of these improvements is great by itself and helps a lot. The
> > combination of all three makes a rather dramatic difference.
> > 
> > We cause hypervisor-level swapping if the balloon driver does not
> > reclaim fast enough. As any of these improvements increases 
> > reclamation
> > speed, we reduce swapping risk in any case.
> > 
> > Unfortunately the first two improvements rely on hypervisor 
> > support,
> > the last does not.
> 
> As far as I can understand the justification for removing the limit
> (improvement #3) is that we have #1 and #2, at least that's how I 
> read
> the patch description. I am saying: what if you running on a 
> hypervisor
> that does not support neither #1 nor #2? What was the first release 
> that
> of ESXi supports batching and 2M pages? What about workstation (I 
> don't
> recall if it started using ballooning at some point)?

I see how caused this confusion. The rate limiting was there to not
cause the guest OS to stall while doing nothing else than ballooning.
With the batching the time spend ballooning is smaller, hence this is
less of a problem when these features are available.

Independent of that the yielding in the ballooning loop should help to
reduce stalling. Also hypervisor swapping - because we rate limited
ballooning - causes much worse stalling than in the balloon driver.

Thanks
Philip

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ