[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-Id: <20220408071222.219689-1-ying.huang@intel.com>
Date: Fri, 8 Apr 2022 15:12:19 +0800
From: Huang Ying <ying.huang@...el.com>
To: Peter Zijlstra <peterz@...radead.org>,
Mel Gorman <mgorman@...hsingularity.net>,
Andrew Morton <akpm@...ux-foundation.org>
Cc: linux-mm@...ck.org, linux-kernel@...r.kernel.org,
Huang Ying <ying.huang@...el.com>,
Michal Hocko <mhocko@...e.com>,
Rik van Riel <riel@...riel.com>,
Dave Hansen <dave.hansen@...ux.intel.com>,
Yang Shi <shy828301@...il.com>, Zi Yan <ziy@...dia.com>,
Wei Xu <weixugc@...gle.com>, osalvador <osalvador@...e.de>,
Shakeel Butt <shakeelb@...gle.com>,
Zhong Jiang <zhongjiang-ali@...ux.alibaba.com>
Subject: [PATCH 0/3] memory tiering: hot page selection
To optimize page placement in a memory tiering system with NUMA
balancing, the hot pages in the slow memory node need to be
identified. Essentially, the original NUMA balancing implementation
selects and promote the mostly recently accessed (MRU) pages. But the
recently accessed pages may be cold. So in this patchset, we
implement a new hot page identification algorithm based on the latency
between NUMA balancing page table scanning and hint page fault.
And the hot page promotion can incur some overhead in the system. To
control the overhead a simple promotion rate limit mechanism is
implemented.
The hot threshold used to identify the hot pages is workload dependent
usually. So we also implemented a hot threshold automatic adjustment
algorithm. The basic idea is to increase/decrease the hot threshold
to make the number of pages that pass the hot threshold (promote
candidate) near the rate limit.
We used the pmbench memory accessing benchmark tested the patchset on
a 2-socket server system with DRAM and PMEM installed. The test
results are as follows,
pmbench score promote rate
(accesses/s) MB/s
------------- ------------
base 146887704.1 725.6
hot selection 165695601.2 544.0
rate limit 162814569.8 165.2
auto adjustment 170495294.0 136.9
>From the results above,
With hot page selection patch [1/3], the pmbench score increases about
12.8%, and promote rate (overhead) decreases about 25.0%, compared with
base kernel.
With rate limit patch [2/3], pmbench score decreases about 1.7%, and
promote rate decreases about 69.6%, compared with hot page selection
patch.
With threshold auto adjustment patch [3/3], pmbench score increases
about 4.7%, and promote rate decrease about 17.1%, compared with rate
limit patch.
Best Regards,
Huang, Ying
Powered by blists - more mailing lists