[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <1471139439.32433.12.camel@redhat.com>
Date: Sat, 13 Aug 2016 21:50:39 -0400
From: Rik van Riel <riel@...hat.com>
To: Ingo Molnar <mingo@...nel.org>
Cc: Wanpeng Li <kernellwp@...il.com>,
Frederic Weisbecker <fweisbec@...il.com>,
LKML <linux-kernel@...r.kernel.org>,
Paolo Bonzini <pbonzini@...hat.com>,
Peter Zijlstra <peterz@...radead.org>,
Wanpeng Li <wanpeng.li@...mail.com>,
Thomas Gleixner <tglx@...utronix.de>,
Radim Krcmar <rkrcmar@...hat.com>,
Mike Galbraith <efault@....de>
Subject: Re: [PATCH] time,virt: resync steal time when guest & host lose sync
On Sat, 2016-08-13 at 10:42 +0200, Ingo Molnar wrote:
> * Rik van Riel <riel@...hat.com> wrote:
>
> > On Wed, 10 Aug 2016 07:39:08 +0800
> > Wanpeng Li <kernellwp@...il.com> wrote:
> >
> > > The regression is caused by your commit "sched,time: Count
> > > actually
> > > elapsed irq & softirq time".
> >
> > Wanpeng, does this patch fix your issue?
> >
> > Paolo, what is your opinion on this issue?
> >
> > I can think of all kinds of ways in which guest and host might lose
> > sync with steal time, from uninitialized values at boot, to guest
> > pause, followed by save to disk, and reload, to live migration,
> > to...
> >
> > ---8<---
> >
> > Subject: time,virt: resync steal time when guest & host lose sync
> >
> > When guest and host wildly disagree on steal time, a guest can
> > do several things:
> > 1) Quickly account all the steal time at once (the kernel did this
> > before
> > 57430218317e ("sched/cputime: Count actually elapsed irq &
> > softirq time"),
> > when steal_account_process_ticks got ULONG_MAX as its maximum
> > value.
> > 2) Stay out of sync for an indeterminate amount of time. This is
> > what the
> > system does today.
> > 3) Sync up the guest value to the host-provided value, without
> > accounting
> > an absurdly large value in the cpu time statistics.
> >
> > This patch makes the kernel do (3), which seems like the right
> > thing
> > to do.
> >
> > The exact value of the threshold use probably does not matter too
> > much,
> > as long as it is long enough to cover all the timer ticks that
> > passed
> > during an idle period, because (irqtime_)account_idle_ticks can
> > process
> > a large amount of time all at once.
> >
> > Signed-off-by: Rik van Riel <riel@...hat.com>
> > Reported-by: Wanpeng Li <kernellwp@...il.com>
> > ---
> > kernel/sched/cputime.c | 12 +++++++++++-
> > 1 file changed, 11 insertions(+), 1 deletion(-)
>
> fails to build on x86 allnoconfig:
>
> kernel/sched/cputime.c:524:10: error: too many arguments to
> function ‘steal_account_process_time’
Which patch did you apply? The subject and comment
of the email suggest you tried applying the one
Paolo and Frederic objected to.
The compile error suggest you applied the patch with the
subject "time,virt: do not limit steal_account_process_time"
In that case, did you apply Wanpeng's patch that adds an
additional call site for steal_account_process_time?
I do not have that patch in my tree yet, and one additional
line of change will be needed.
--
All Rights Reversed.
Download attachment "signature.asc" of type "application/pgp-signature" (474 bytes)
Powered by blists - more mailing lists