[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <X/Sz9EN0cKbyd1gQ@google.com>
Date: Tue, 5 Jan 2021 10:46:12 -0800
From: Sean Christopherson <seanjc@...gle.com>
To: Paolo Bonzini <pbonzini@...hat.com>
Cc: Ben Gardon <bgardon@...gle.com>, leohou1402 <leohou1402@...il.com>,
"maciej.szmigiero@...cle.com" <maciej.szmigiero@...cle.com>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
"kvm@...r.kernel.org" <kvm@...r.kernel.org>,
"cannonmatthews@...gle.com" <cannonmatthews@...gle.com>,
"peterx@...hat.com" <peterx@...hat.com>,
"pshier@...gle.com" <pshier@...gle.com>,
"pfeiner@...gle.com" <pfeiner@...gle.com>,
"junaids@...gle.com" <junaids@...gle.com>,
"jmattson@...gle.com" <jmattson@...gle.com>,
"yulei.kernel@...il.com" <yulei.kernel@...il.com>,
"kernellwp@...il.com" <kernellwp@...il.com>,
"vkuznets@...hat.com" <vkuznets@...hat.com>
Subject: Re: reproducible BUG() in kvm_mmu_get_root() in TDP MMU
On Tue, Jan 05, 2021, Paolo Bonzini wrote:
> On 05/01/21 18:49, Ben Gardon wrote:
> > for_each_tdp_mmu_root(kvm, root) {
> > kvm_mmu_get_root(kvm, root);
> > <Do something, yield the MMU lock>
> > kvm_mmu_put_root(kvm, root);
> > }
> >
> > In these cases the get and put root calls are there to ensure that the
> > root is not freed while the function is running, however they do this
> > too well. If the put root call reduces the root's root_count to 0, it
> > should be removed from the roots list and freed before the MMU lock is
> > released. However the above pattern never bothers to free the root.
> > The following would fix this bug:
> >
> > -kvm_mmu_put_root(kvm, root);
> > +if (kvm_mmu_put_root(kvm, root))
> > + kvm_tdp_mmu_free_root(kvm, root);
>
> Is it worth writing a more complex iterator struct, so that
> for_each_tdp_mmu_root takes care of the get and put?
Ya, and maybe with an "as_id" variant to avoid the get/put? Not sure that's a
worthwhile optimization though.
On a related topic, there are a few subtleties with respect to
for_each_tdp_mmu_root() that we should document/comment. The flows that drop
mmu_lock while iterating over the roots don't protect against the list itself
from being modified. E.g. the next entry could be deleted, or a new root
could be added. I think I've convinced myself that there are no existing bugs,
but we should document that the exact current behavior is required for
correctness.
The use of "unsafe" list_for_each_entry() in particular is unintuitive, as using
the "safe" variant would dereference a deleted entry in the "next entry is
deleted" scenario.
And regarding addomg a root, using list_add_tail() instead of list_add() in
get_tdp_mmu_vcpu_root() would cause iteration to visit a root that was added
after the iteration started (though I don't think this would ever be problematic
in practice?).
Powered by blists - more mailing lists