[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <56C6004F.2050105@redhat.com>
Date: Thu, 18 Feb 2016 18:33:03 +0100
From: Paolo Bonzini <pbonzini@...hat.com>
To: Radim Krčmář <rkrcmar@...hat.com>
Cc: linux-kernel@...r.kernel.org, kvm@...r.kernel.org,
Yuki Shibuya <shibuya.yk@...s.nec.co.jp>,
Rik van Riel <riel@...hat.com>
Subject: Re: [PATCH v2 01/14] KVM: x86: change PIT discard tick policy
On 18/02/2016 17:56, Radim Krčmář wrote:
> 2016-02-18 17:13+0100, Paolo Bonzini:
> > On 17/02/2016 20:14, Radim Krčmář wrote:
> > > Discard policy uses ack_notifiers to prevent injection of PIT interrupts
> > > before EOI from the last one.
> > >
> > > This patch changes the policy to always try to deliver the interrupt,
> > > which makes a difference when its vector is in ISR.
> > > Old implementation would drop the interrupt, but proposed one injects to
> > > IRR, like real hardware would.
> >
> > This seems like what libvirt calls the "merge" policy:
>
> Oops, I never looked beyond QEMU after seeing that the naming in libvirt
> doesn't even match ...
>
> I think the policy that KVM implements (which I call discard) is "delay"
> in libvirt. (https://libvirt.org/formatdomain.html#elementsTime)
Suppose the scheduled ticks are at times 0, 20, 40, 60, 80. The EOI for
time 0 is only delivered at time 42, other EOIs are timely.
The resulting injections are:
- for discard: 0, 60, 80.
- for catchup, which QEMU calls slew: 0, 42, 51, 60, 80.
- for merge: 0, 20 (in IRR, delivered at 42), 60, 80.
For delay I *think* it would be 0, 42, 62, 82, 102.
You know the i8254 code better than I do. Does this make sense to you?
(Or in other words, does the code *really* do the above?...)
> The "may be delayed" there makes me feel like the timer has to support a
> guest visible counter of missed ticks.
Yes, it depends whether the guest uses PIT to count time, or just to do
periodic stuff (and then it reads the time from e.g. the PMTimer).
In either case, only catchup ensures that time is not delayed, and it's
used for Windows which uses the RTC periodic clock to count time.
>> > where the merged tick is the one placed into IRR. Unlike discard,
>> > "merge" can starve the guest through an interrupt storm.
> Yeah, starving a VCPU with an interrupt storm is more likely with the
> changed policy. It's a pretty sad situation if all the time that VCPU
> gets isn't even enough to run a PIT handler, so I didn't care.
True. On one hand the hardware policy is clearly merge, not discard.
The i8259 has an IRR! On the other hand I'm a bit wary of changing the
policy without seeing exactly what the old OSes were doing in the PIT
handler.
> The NMI watchdog bug can also be solved without changing the policy.
> (It's a hack in any case.)
Can you send a patch for that?
Paolo
Powered by blists - more mailing lists