Message-ID: <ZxcKsmvPkKJgsp2l@yzhao56-desk.sh.intel.com>
Date: Tue, 22 Oct 2024 10:15:14 +0800
From: Yan Zhao <yan.y.zhao@...el.com>
To: Sean Christopherson <seanjc@...gle.com>
CC: Paolo Bonzini <pbonzini@...hat.com>, Marc Zyngier <maz@...nel.org>,
"Oliver Upton" <oliver.upton@...ux.dev>, Tianrui Zhao
<zhaotianrui@...ngson.cn>, "Bibo Mao" <maobibo@...ngson.cn>, Huacai Chen
<chenhuacai@...nel.org>, "Michael Ellerman" <mpe@...erman.id.au>, Anup Patel
<anup@...infault.org>, "Paul Walmsley" <paul.walmsley@...ive.com>, Palmer
Dabbelt <palmer@...belt.com>, Albert Ou <aou@...s.berkeley.edu>, Christian
Borntraeger <borntraeger@...ux.ibm.com>, Janosch Frank
<frankja@...ux.ibm.com>, "Claudio Imbrenda" <imbrenda@...ux.ibm.com>,
<kvm@...r.kernel.org>, <linux-arm-kernel@...ts.infradead.org>,
<kvmarm@...ts.linux.dev>, <loongarch@...ts.linux.dev>,
<linux-mips@...r.kernel.org>, <linuxppc-dev@...ts.ozlabs.org>,
<kvm-riscv@...ts.infradead.org>, <linux-riscv@...ts.infradead.org>,
<linux-kernel@...r.kernel.org>, Alex Bennée
<alex.bennee@...aro.org>, David Matlack <dmatlack@...gle.com>, David Stevens
<stevensd@...omium.org>, Andrew Jones <ajones@...tanamicro.com>
Subject: Re: [PATCH v13 51/85] KVM: VMX: Use __kvm_faultin_page() to get APIC
access page/pfn
On Mon, Oct 21, 2024 at 11:57:42AM -0700, Sean Christopherson wrote:
> On Mon, Oct 21, 2024, Yan Zhao wrote:
> > On Thu, Oct 10, 2024 at 11:23:53AM -0700, Sean Christopherson wrote:
> > > Use __kvm_faultin_page() to get the APIC access page so that KVM can
> > > precisely release the refcounted page, i.e. to remove yet another user
> > > of kvm_pfn_to_refcounted_page(). While the path isn't handling a guest
> > > page fault, the semantics are effectively the same; KVM just happens to
> > > be mapping the pfn into a VMCS field instead of a secondary MMU.
> > >
> > > Tested-by: Alex Bennée <alex.bennee@...aro.org>
> > > Signed-off-by: Sean Christopherson <seanjc@...gle.com>
> > > ---
>
> ...
>
> > > @@ -6838,10 +6840,13 @@ void vmx_set_apic_access_page_addr(struct kvm_vcpu *vcpu)
> > > vmcs_write64(APIC_ACCESS_ADDR, pfn_to_hpa(pfn));
> > >
> > > /*
> > > - * Do not pin apic access page in memory, the MMU notifier
> > > - * will call us again if it is migrated or swapped out.
> > > + * Do not pin the APIC access page in memory so that it can be freely
> > > + * migrated, the MMU notifier will call us again if it is migrated or
> > > + * swapped out. KVM backs the memslot with anonymous memory, the pfn
> > > + * should always point at a refcounted page (if the pfn is valid).
> > > */
> > > - kvm_release_pfn_clean(pfn);
> > > + if (!WARN_ON_ONCE(!refcounted_page))
> > > + kvm_release_page_clean(refcounted_page);
> > Why isn't it
> > 	if (!WARN_ON_ONCE(!refcounted_page)) {
> > 		if (writable)
> > 			kvm_release_page_dirty(refcounted_page);
> > 		else
> > 			kvm_release_page_clean(refcounted_page);
> > 	}
> >
> > or simply not pass "writable" to __kvm_faultin_pfn(), as we know the slot is
> > not read-only, and then set dirty?
>
> __kvm_faultin_pfn() requires a non-NULL @writable. The intent is to help ensure
Ah, right.
> the caller is actually checking whether a readable vs. writable mapping was
> acquired. For cases that explicitly pass FOLL_WRITE, it's awkward, but those
> should be few and far between.
Yes, it's a little weird, though there's nothing wrong in this case with
passing "writable" without checking its value afterwards :)
>
> > 	if (!WARN_ON_ONCE(!refcounted_page))
> > 		kvm_release_page_dirty(refcounted_page);
>
> Ya, this is probably more correct? Though I would strongly prefer to make any
> change in behavior on top of this series. The use of kvm_release_page_clean()
> was added by commit 878940b33d76 ("KVM: VMX: Retry APIC-access page reload if
> invalidation is in-progress"), and I suspect the only reason it added the
> kvm_set_page_accessed() call is because there was no "unused" variant. I.e. there
> was no conscious decision to set Accessed but not Dirty.
Thanks for the explanation.
I asked that because I noticed that the policies for setting Accessed or Dirty
for the APIC access page are not aligned between L1 (only Accessed) and L2 (Dirty).
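
To make the two release policies concrete, here is a minimal user-space sketch
of the difference being discussed. It is only a model: struct mock_page and the
mock_*() helpers are hypothetical stand-ins, not the kernel's struct page or
the kvm_release_page_*() implementations, and it assumes that "release clean"
marks the page Accessed while "release dirty" additionally marks it Dirty.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for struct page: only the bits discussed here. */
struct mock_page {
	int refcount;
	bool accessed;
	bool dirty;
};

/* Models "release clean": mark Accessed, then drop the reference. */
static void mock_release_page_clean(struct mock_page *page)
{
	assert(page->refcount > 0);
	page->accessed = true;
	page->refcount--;
}

/* Models "release dirty": mark Dirty, then release as in the clean case. */
static void mock_release_page_dirty(struct mock_page *page)
{
	page->dirty = true;
	mock_release_page_clean(page);
}

/*
 * Models the caller-side choice from the thread: release dirty if a
 * writable mapping was acquired, otherwise release clean. A NULL page
 * is silently ignored here; the patch warns instead.
 */
static void mock_release_faultin_page(struct mock_page *page, bool writable)
{
	if (!page)
		return;

	if (writable)
		mock_release_page_dirty(page);
	else
		mock_release_page_clean(page);
}
```

In this model, the L1 path described above corresponds to always calling
mock_release_page_clean() (Accessed only), while the L2 path corresponds to
mock_release_page_dirty().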