[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <alpine.DEB.2.21.1906261211410.32342@nanos.tec.linutronix.de>
Date: Wed, 26 Jun 2019 12:33:30 +0200 (CEST)
From: Thomas Gleixner <tglx@...utronix.de>
To: Wanpeng Li <kernellwp@...il.com>
cc: Peter Zijlstra <peterz@...radead.org>,
Paolo Bonzini <pbonzini@...hat.com>,
Radim Krcmar <rkrcmar@...hat.com>,
Marcelo Tosatti <mtosatti@...hat.com>,
KarimAllah <karahmed@...zon.de>,
LKML <linux-kernel@...r.kernel.org>, kvm <kvm@...r.kernel.org>
Subject: Re: cputime takes cstate into consideration
On Wed, 26 Jun 2019, Wanpeng Li wrote:
> After exposing mwait/monitor into kvm guest, the guest can make
> physical cpu enter deeper cstate through mwait instruction, however,
> the top command on host still observe 100% cpu utilization since qemu
> process is running even though guest who has the power management
> capability executes mwait. Actually we can observe the physical cpu
> has already enter deeper cstate by powertop on host. Could we take
> cstate into consideration when accounting cputime etc?
If MWAIT can be used inside the guest then the host cannot distinguish
between execution and stuck in mwait.
It'd need to poll the power monitoring MSRs on every occasion where the
accounting happens.
This completely falls apart when you have zero exit guest. (think
NOHZ_FULL). Then you'd have to bring the guest out with an IPI to access
the per CPU MSRs.
I assume a lot of people will be happy about all that :)
Thanks,
tglx
Powered by blists - more mailing lists