[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <Z-7zlcVCrsw4SkwX@arm.com>
Date: Thu, 3 Apr 2025 21:46:13 +0100
From: Catalin Marinas <catalin.marinas@....com>
To: Ryan Roberts <ryan.roberts@....com>
Cc: Will Deacon <will@...nel.org>,
Pasha Tatashin <pasha.tatashin@...een.com>,
Andrew Morton <akpm@...ux-foundation.org>,
Uladzislau Rezki <urezki@...il.com>,
Christoph Hellwig <hch@...radead.org>,
David Hildenbrand <david@...hat.com>,
"Matthew Wilcox (Oracle)" <willy@...radead.org>,
Mark Rutland <mark.rutland@....com>,
Anshuman Khandual <anshuman.khandual@....com>,
Alexandre Ghiti <alexghiti@...osinc.com>,
Kevin Brodsky <kevin.brodsky@....com>,
linux-arm-kernel@...ts.infradead.org, linux-mm@...ck.org,
linux-kernel@...r.kernel.org
Subject: Re: [PATCH v3 01/11] arm64: hugetlb: Cleanup huge_pte size discovery
mechanisms
On Tue, Mar 04, 2025 at 03:04:31PM +0000, Ryan Roberts wrote:
> Not all huge_pte helper APIs explicitly provide the size of the
> huge_pte. So the helpers have to depend on various methods to determine
> the size of the huge_pte. Some of these methods are dubious.
>
> Let's clean up the code to use preferred methods and retire the dubious
> ones. The options in order of preference:
>
> - If size is provided as parameter, use it together with
> num_contig_ptes(). This is explicit and works for both present and
> non-present ptes.
>
> - If vma is provided as a parameter, retrieve size via
> huge_page_size(hstate_vma(vma)) and use it together with
> num_contig_ptes(). This is explicit and works for both present and
> non-present ptes.
>
> - If the pte is present and contiguous, use find_num_contig() to walk
> the pgtable to find the level and infer the number of ptes from
> level. Only works for *present* ptes.
>
> - If the pte is present and not contiguous and you can infer from this
> that only 1 pte needs to be operated on. This is ok if you don't care
> about the absolute size, and just want to know the number of ptes.
>
> - NEVER rely on resolving the PFN of a present pte to a folio and
> getting the folio's size. This is fragile at best, because there is
> nothing to stop the core-mm from allocating a folio twice as big as
> the huge_pte then mapping it across 2 consecutive huge_ptes. Or just
> partially mapping it.
>
> Where we require that the pte is present, add warnings if not-present.
>
> Signed-off-by: Ryan Roberts <ryan.roberts@....com>
Reviewed-by: Catalin Marinas <catalin.marinas@....com>
Powered by blists - more mailing lists