Message-ID: <DB5C686A0A7EE44A895494A4E25D21FC1C91EB8A@G5W2731.americas.hpqcorp.net>
Date: Thu, 21 Aug 2014 06:49:18 +0000
From: "Zhao, Hui-Zhi (Steven, HPservers-Core-OE-PSC)" <hui-zhi.zhao@...com>
To: Radim Krčmář <rkrcmar@...hat.com>,
"kvm@...r.kernel.org" <kvm@...r.kernel.org>
CC: "linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
"Paolo Bonzini" <pbonzini@...hat.com>,
Gleb Natapov <gleb@...nel.org>,
"Raghavendra KT" <raghavendra.kt@...ux.vnet.ibm.com>,
"Vinod, Chegu" <chegu_vinod@...com>,
Christian Borntraeger <borntraeger@...ibm.com>
Subject: RE: [PATCH v2 0/6] Dynamic Pause Loop Exiting window.
This patch has been tested by Lisa and me, and it was successful.
We created 4 VM guests and rebooted them every 10 minutes for around 12 hours, and the issue is gone with this patch applied.
Please add Lisa and me to the "Tested-by:" list.
Tested-by: Mitchell, Lisa <lisa.mitchell@...com>
Tested-by: Zhao, Hui Zhi <hui-zhi.zhao@...com>
Regards,
Steven Zhao
-----Original Message-----
From: Radim Krčmář [mailto:rkrcmar@...hat.com]
Sent: Thursday, August 21, 2014 4:53 AM
To: kvm@...r.kernel.org
Cc: linux-kernel@...r.kernel.org; Paolo Bonzini; Gleb Natapov; Raghavendra KT; Vinod, Chegu; Zhao, Hui-Zhi (Steven, HPservers-Core-OE-PSC); Christian Borntraeger
Subject: [PATCH v2 0/6] Dynamic Pause Loop Exiting window.
v1 -> v2:
* squashed [v1 4/9] and [v1 5/9] (clamping)
* dropped [v1 7/9] (CPP abstractions)
* merged core of [v1 9/9] into [v1 4/9] (automatic maximum)
* reworked kernel_param_ops: closer to pure int [v2 6/6]
* introduced ple_window_actual_max & reworked clamping [v2 4/6]
* added seqlock for parameter modifications [v2 6/6] (sketched after this list)
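
To illustrate the last item: a seqlock lets the PLE-exit path read a consistent snapshot of related knobs while a runtime writer (e.g. a kernel_param_ops .set handler) updates them. This is only a minimal sketch of the pattern; the lock and variable names here are illustrative, not taken from [v2 6/6].

#include <linux/seqlock.h>

static DEFINE_SEQLOCK(ple_window_seqlock);
static unsigned int ple_window_grow = 2;
static unsigned int ple_window_actual_max = 1U << 30;

/* writer side: update both knobs atomically w.r.t. readers */
static void ple_window_update(unsigned int grow, unsigned int actual_max)
{
	write_seqlock(&ple_window_seqlock);
	ple_window_grow = grow;
	ple_window_actual_max = actual_max;
	write_sequnlock(&ple_window_seqlock);
}

/* reader side: retry if a writer raced with the snapshot */
static void ple_window_snapshot(unsigned int *grow, unsigned int *actual_max)
{
	unsigned int seq;

	do {
		seq = read_seqbegin(&ple_window_seqlock);
		*grow = ple_window_grow;
		*actual_max = ple_window_actual_max;
	} while (read_seqretry(&ple_window_seqlock, seq));
}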
---
PLE does not scale in its current form. When the VCPU count rises above 150, one can hit soft lockups because of runqueue lock contention.
(Which says a lot about performance.)
The main reason is that kvm_ple_loop cycles through all VCPUs. Replacing it with a scalable solution would be ideal, but it has already been well optimized for various workloads, so this series instead tries to alleviate a different major problem while minimizing the chance of regressions: we have too many useless PLE exits.
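
For context, the per-exit scan lives in kvm_vcpu_on_spin() (virt/kvm/kvm_main.c in mainline). The sketch below is heavily simplified, with the boost heuristics and bookkeeping omitted; it only shows why each PLE exit costs O(n) in the VCPU count:

/*
 * Simplified sketch of the candidate scan that runs on every PLE
 * exit.  With hundreds of VCPUs and frequent exits, this scan plus
 * the runqueue locking inside yield_to() becomes heavy contention.
 */
void kvm_vcpu_on_spin(struct kvm_vcpu *me)
{
	struct kvm *kvm = me->kvm;
	struct kvm_vcpu *vcpu;
	int i;

	kvm_for_each_vcpu(i, vcpu, kvm) {
		if (vcpu == me)
			continue;
		if (waitqueue_active(&vcpu->wq))	/* halted, not spinning */
			continue;
		if (kvm_vcpu_yield_to(vcpu) > 0)	/* takes runqueue locks */
			break;
	}
}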
Just increasing the PLE window would help some cases, but it still spirals out of control. By increasing the window after every PLE exit, we can limit the number of useless exits, so we don't reach the state where CPUs spend 99% of their time waiting for a lock.
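
The per-VCPU growth could look roughly like the sketch below, clamped by the ple_window_actual_max mentioned in the changelog. The defaults and the multiply-vs-add interpretation of ple_window_grow are assumptions for illustration, not necessarily what [v2 4/6] implements:

/*
 * Sketch: grow a per-VCPU PLE window on every PLE exit, clamped to
 * ple_window_actual_max.  Values below are illustrative; overflow
 * of old * ple_window_grow is ignored in this sketch.
 */
static unsigned int ple_window = 4096;		/* module default */
static unsigned int ple_window_grow = 2;
static unsigned int ple_window_actual_max = 1U << 30;

static void grow_ple_window(struct vcpu_vmx *vmx)
{
	unsigned int old = vmx->ple_window;

	if (ple_window_grow < 1)		/* growing disabled */
		vmx->ple_window = ple_window;
	else if (ple_window_grow < ple_window)	/* small value: multiplier */
		vmx->ple_window = min(old * ple_window_grow,
				      ple_window_actual_max);
	else					/* large value: addend */
		vmx->ple_window = min(old + ple_window_grow,
				      ple_window_actual_max);
}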
HP confirmed that this series prevents soft lockups and TSC sync errors on large guests.
Radim Krčmář (6):
KVM: add kvm_arch_sched_in
KVM: x86: introduce sched_in to kvm_x86_ops
KVM: VMX: make PLE window per-VCPU
KVM: VMX: dynamise PLE window
KVM: trace kvm_ple_window
KVM: VMX: runtime knobs for dynamic PLE window
arch/arm/kvm/arm.c | 4 ++
arch/mips/kvm/mips.c | 4 ++
arch/powerpc/kvm/powerpc.c | 4 ++
arch/s390/kvm/kvm-s390.c | 4 ++
arch/x86/include/asm/kvm_host.h | 2 +
arch/x86/kvm/svm.c | 6 ++
arch/x86/kvm/trace.h | 25 ++++++++
arch/x86/kvm/vmx.c | 124 ++++++++++++++++++++++++++++++++++++++--
arch/x86/kvm/x86.c | 6 ++
include/linux/kvm_host.h | 2 +
virt/kvm/kvm_main.c | 2 +
11 files changed, 179 insertions(+), 4 deletions(-)
--
2.0.4