lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Date:	Fri, 12 Jun 2015 19:15:13 +0300
From:	"dmitry.torokhov@...il.com" <dmitry.torokhov@...il.com>
To:	Philip Moltmann <moltmann@...are.com>
Cc:	"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
	"pv-drivers@...are.com" <pv-drivers@...are.com>,
	Xavier Deguillard <xdeguillard@...are.com>,
	"gregkh@...uxfoundation.org" <gregkh@...uxfoundation.org>,
	"akpm@...ux-foundation.org" <akpm@...ux-foundation.org>
Subject: Re: [PATCH 6/9] VMware balloon: Do not limit the amount of frees and
 allocations in non-sleep mode.

On Fri, Jun 12, 2015 at 03:40:42PM +0000, Philip Moltmann wrote:
> Hi,
> > > 
> > > Three improvements contribute to the overall faster speed:
> > > - batched operations reduce the hypervisor overhead per page
> > > - 2m instead of 4k buffer reduce the hypervisor overhead per page
> > > - removing the rate-limiting for non-sleep allocations allows the 
> > > guest
> > > operating system to reclaim memory as fast as it can instead of
> > > artificially limiting it.
> > > 
> > > Any of these improvements is great by itself and helps a lot. The
> > > combination of all three makes a rather dramatic difference.
> > > 
> > > We cause hypervisor-level swapping if the balloon driver does not
> > > reclaim fast enough. As any of these improvements increases 
> > > reclamation
> > > speed, we reduce swapping risk in any case.
> > > 
> > > Unfortunately the first two improvements rely on hypervisor 
> > > support,
> > > the last does not.
> > 
> > As far as I can understand the justification for removing the limit
> > (improvement #3) is that we have #1 and #2, at least that's how I 
> > read
> > the patch description. I am saying: what if you running on a 
> > hypervisor
> > that does not support neither #1 nor #2? What was the first release 
> > that
> > of ESXi supports batching and 2M pages? What about workstation (I 
> > don't
> > recall if it started using ballooning at some point)?
> 
> I see how caused this confusion. The rate limiting was there to not
> cause the guest OS to stall while doing nothing else than ballooning.
> With the batching the time spend ballooning is smaller, hence this is
> less of a problem when these features are available.
> 
> Independent of that the yielding in the ballooning loop should help to
> reduce stalling. Also hypervisor swapping - because we rate limited
> ballooning - causes much worse stalling than in the balloon driver.

OK, fair enough. Please update the patch description to reflect that the
rate limiting is useful on its own and does not require additional
hypervisor changes (although when they present they improve the behavior
even further).

Thanks.

-- 
Dmitry
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ