[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-Id: <175156948681.3519813.8652806937156134172.b4-ty@arm.com>
Date: Thu, 3 Jul 2025 20:04:57 +0100
From: Catalin Marinas <catalin.marinas@....com>
To: ryan.roberts@....com,
will@...nel.org,
dev.jain@....com,
Barry Song <baohua@...nel.org>,
Lance Yang <lance.yang@...ux.dev>,
Xavier Xia <xavier.qyxia@...il.com>
Cc: akpm@...ux-foundation.org,
david@...hat.com,
gshan@...hat.com,
linux-arm-kernel@...ts.infradead.org,
linux-kernel@...r.kernel.org,
linux-mm@...ck.org,
willy@...radead.org,
xavier_qy@....com,
ziy@...dia.com
Subject: Re: [PATCH v7] arm64/mm: Optimize loop to reduce redundant operations of contpte_ptep_get
On Tue, 24 Jun 2025 23:25:49 +0800, Xavier Xia wrote:
> This commit optimizes the contpte_ptep_get and contpte_ptep_get_lockless
> function by adding early termination logic. It checks if the dirty and
> young bits of orig_pte are already set and skips redundant bit-setting
> operations during the loop. This reduces unnecessary iterations and
> improves performance.
>
> In order to verify the optimization performance, a test function has been
> designed. The function's execution time and instruction statistics have
> been traced using perf, and the following are the operation results on a
> certain Qualcomm mobile phone chip:
>
> [...]
Applied to arm64 (for-next/misc), thanks!
[1/1] arm64/mm: Optimize loop to reduce redundant operations of contpte_ptep_get
https://git.kernel.org/arm64/c/093ae7a033cf
--
Catalin
Powered by blists - more mailing lists