[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <912c7a32-b39c-494f-a29c-4865cd92aeba@agordeev.local>
Date: Thu, 10 Apr 2025 18:07:49 +0200
From: Alexander Gordeev <agordeev@...ux.ibm.com>
To: Ryan Roberts <ryan.roberts@....com>
Cc: Andrew Morton <akpm@...ux-foundation.org>,
"David S. Miller" <davem@...emloft.net>,
Andreas Larsson <andreas@...sler.com>, Juergen Gross <jgross@...e.com>,
Boris Ostrovsky <boris.ostrovsky@...cle.com>,
Thomas Gleixner <tglx@...utronix.de>, Ingo Molnar <mingo@...hat.com>,
Borislav Petkov <bp@...en8.de>,
Dave Hansen <dave.hansen@...ux.intel.com>,
"H. Peter Anvin" <hpa@...or.com>,
"Matthew Wilcox (Oracle)" <willy@...radead.org>,
Catalin Marinas <catalin.marinas@....com>, linux-mm@...ck.org,
sparclinux@...r.kernel.org, xen-devel@...ts.xenproject.org,
linux-kernel@...r.kernel.org
Subject: Re: [PATCH v2 0/5] Fix lazy mmu mode
On Mon, Mar 03, 2025 at 02:15:34PM +0000, Ryan Roberts wrote:
Hi Ryan,
> I'm planning to implement lazy mmu mode for arm64 to optimize vmalloc. As part
> of that, I will extend lazy mmu mode to cover kernel mappings in vmalloc table
> walkers. While lazy mmu mode is already used for kernel mappings in a few
> places, this will extend it's use significantly.
>
> Having reviewed the existing lazy mmu implementations in powerpc, sparc and x86,
> it looks like there are a bunch of bugs, some of which may be more likely to
> trigger once I extend the use of lazy mmu.
Do you have any idea about generic code issues as result of not adhering to
the originally stated requirement:
/*
...
* the PTE updates which happen during this window. Note that using this
* interface requires that read hazards be removed from the code. A read
* hazard could result in the direct mode hypervisor case, since the actual
* write to the page tables may not yet have taken place, so reads though
* a raw PTE pointer after it has been modified are not guaranteed to be
* up to date.
...
*/
I tried to follow few code paths and at least this one does not look so good:
copy_pte_range(..., src_pte, ...)
ret = copy_nonpresent_pte(..., src_pte, ...)
try_restore_exclusive_pte(..., src_pte, ...) // is_device_exclusive_entry(entry)
restore_exclusive_pte(..., ptep, ...)
set_pte_at(..., ptep, ...)
set_pte(ptep, pte); // save in lazy mmu mode
// ret == -ENOENT
ptent = ptep_get(src_pte); // lazy mmu save is not observed
ret = copy_present_ptes(..., ptent, ...); // wrong ptent used
I am not aware whether the effort to "read hazards be removed from the code"
has ever been made and the generic code is safe in this regard.
What is your take on this?
Thanks!
Powered by blists - more mailing lists