[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <9206a7c4-bf88-4138-b8af-961625a82439@kernel.org>
Date: Wed, 4 Feb 2026 22:01:42 +0100
From: "David Hildenbrand (arm)" <david@...nel.org>
To: Ankur Arora <ankur.a.arora@...cle.com>, linux-kernel@...r.kernel.org,
linux-mm@...ck.org, x86@...nel.org
Cc: akpm@...ux-foundation.org, bp@...en8.de, dave.hansen@...ux.intel.com,
hpa@...or.com, mingo@...hat.com, mjguzik@...il.com, luto@...nel.org,
peterz@...radead.org, tglx@...utronix.de, willy@...radead.org,
raghavendra.kt@....com, chleroy@...nel.org, ioworker0@...il.com,
lizhe.67@...edance.com, boris.ostrovsky@...cle.com, konrad.wilk@...cle.com,
kernel test robot <lkp@...el.com>
Subject: Re: [PATCH v2] mm: folio_zero_user: open code range computation in
folio_zero_user()
On 1/28/26 19:59, Ankur Arora wrote:
> riscv64-gcc-linux-gnu (v8.5) reports a compile time assert in:
>
> r[2] = DEFINE_RANGE(clamp_t(s64, fault_idx - radius, pg.start, pg.end),
> clamp_t(s64, fault_idx + radius, pg.start, pg.end));
>
> where it decides that pg.start > pg.end in:
> clamp_t(s64, fault_idx + radius, pg.start, pg.end));
>
> where pg comes from:
> const struct range pg = DEFINE_RANGE(0, folio_nr_pages(folio) - 1);
>
> That does not seem like it could be true. Even for pg.start == pg.end,
> we would need folio_test_large() to evaluate to false at compile time:
>
> static inline unsigned long folio_nr_pages(const struct folio *folio)
> {
> if (!folio_test_large(folio))
> return 1;
> return folio_large_nr_pages(folio);
> }
>
> Workaround by open coding the range computation. Also, simplify the type
> declarations for the relevant variables.
>
> Reported-by: kernel test robot <lkp@...el.com>
> Closes: https://lore.kernel.org/oe-kbuild-all/202601240453.QCjgGdJa-lkp@intel.com/
> Fixes: 93552c9a3350 ("mm: folio_zero_user: cache neighbouring pages")
> Signed-off-by: Ankur Arora <ankur.a.arora@...cle.com>
> ---
>
> Hi Andrew
>
> As David pointed out, the previous open coded version makes a few
> unnecessary changes. Could you queue this one instead?
>
I'm late, maybe this is already upstream.
> Thanks
> Ankur
>
>
> mm/memory.c | 17 ++++++++---------
> 1 file changed, 8 insertions(+), 9 deletions(-)
>
> diff --git a/mm/memory.c b/mm/memory.c
> index ce933ee4a3dd..f5bfc082ab61 100644
> --- a/mm/memory.c
> +++ b/mm/memory.c
> @@ -7284,7 +7284,7 @@ void folio_zero_user(struct folio *folio, unsigned long addr_hint)
> const unsigned long base_addr = ALIGN_DOWN(addr_hint, folio_size(folio));
> const long fault_idx = (addr_hint - base_addr) / PAGE_SIZE;
> const struct range pg = DEFINE_RANGE(0, folio_nr_pages(folio) - 1);
> - const int radius = FOLIO_ZERO_LOCALITY_RADIUS;
> + const long radius = FOLIO_ZERO_LOCALITY_RADIUS;
> struct range r[3];
> int i;
>
> @@ -7292,24 +7292,23 @@ void folio_zero_user(struct folio *folio, unsigned long addr_hint)
> * Faulting page and its immediate neighbourhood. Will be cleared at the
> * end to keep its cachelines hot.
> */
> - r[2] = DEFINE_RANGE(clamp_t(s64, fault_idx - radius, pg.start, pg.end),
> - clamp_t(s64, fault_idx + radius, pg.start, pg.end));
> + r[2] = DEFINE_RANGE(fault_idx - radius < (long)pg.start ? pg.start : fault_idx - radius,
> + fault_idx + radius > (long)pg.end ? pg.end : fault_idx + radius);
> +
LGTM, although it could likely be made a bit more readable by using some temporary variables.
const long fault_idx_low = fault_idx - radius;
const long fault_idx_high = fault_idx + radius;
r[2] = DEFINE_RANGE(fault_idx_low < (long)pg.start ? pg.start : fault_idx_low,
fault_idx_high > (long)pg.end ? pg.end : fault_idx_high);
Well, still a bit unreadable, so ... :)
>
> /* Region to the left of the fault */
> - r[1] = DEFINE_RANGE(pg.start,
> - clamp_t(s64, r[2].start - 1, pg.start - 1, r[2].start));
> + r[1] = DEFINE_RANGE(pg.start, r[2].start - 1);
>
> /* Region to the right of the fault: always valid for the common fault_idx=0 case. */
> - r[0] = DEFINE_RANGE(clamp_t(s64, r[2].end + 1, r[2].end, pg.end + 1),
> - pg.end);
> + r[0] = DEFINE_RANGE(r[2].end + 1, pg.end);
TBH, without the clamp that looks much more readable here.
>
> for (i = 0; i < ARRAY_SIZE(r); i++) {
> const unsigned long addr = base_addr + r[i].start * PAGE_SIZE;
> - const unsigned int nr_pages = range_len(&r[i]);
> + const long nr_pages = (long)range_len(&r[i]);
> struct page *page = folio_page(folio, r[i].start);
>
> if (nr_pages > 0)
> - clear_contig_highpages(page, addr, nr_pages);
> + clear_contig_highpages(page, addr, (unsigned int)nr_pages);
Is that cast really required?
--
Cheers,
David
Powered by blists - more mailing lists