linux-kernel - Re: [RFC PATCH v1 0/2] mm: multi-gen LRU scanning for page promotion

lists.openwall.net		lists / announce owl-users owl-dev john-users john-dev passwdqc-users yescrypt popa3d-users / oss-security kernel-hardening musl sabotage tlsify passwords / crypt-dev xvendor / Bugtraq Full-Disclosure linux-kernel linux-netdev linux-ext4 linux-hardening linux-cve-announce PHC
Open Source and information security mailing list archives

Hash Suite: Windows password security audit tool. GUI, reports in PDF.

[<prev] [next>] [<thread-prev] [day] [month] [year] [list]

Message-ID: <CAJj2-QFCGZ7==azp+Vh7x7jJDtTUMMWTC5XyYTqVHWeVxETjTg@mail.gmail.com>
Date: Tue, 25 Mar 2025 14:55:52 -0700
From: Yuanchu Xie <yuanchu@...gle.com>
To: Bharata B Rao <bharata@....com>
Cc: Kinsey Ho <kinseyho@...gle.com>, linux-mm@...ck.org, linux-kernel@...r.kernel.org, 
	AneeshKumar.KizhakeVeetil@....com, Hasan.Maruf@....com, 
	Jonathan.Cameron@...wei.com, Michael.Day@....com, akpm@...ux-foundation.org, 
	dave.hansen@...el.com, david@...hat.com, feng.tang@...el.com, 
	gourry@...rry.net, hannes@...xchg.org, honggyu.kim@...com, hughd@...gle.com, 
	jhubbard@...dia.com, k.shutemov@...il.com, kbusch@...a.com, 
	kmanaouil.dev@...il.com, leesuyeon0506@...il.com, leillc@...gle.com, 
	liam.howlett@...cle.com, mgorman@...hsingularity.net, mingo@...hat.com, 
	nadav.amit@...il.com, nphamcs@...il.com, peterz@...radead.org, 
	raghavendra.kt@....com, riel@...riel.com, rientjes@...gle.com, 
	rppt@...nel.org, shivankg@....com, shy828301@...il.com, sj@...nel.org, 
	vbabka@...e.cz, weixugc@...gle.com, willy@...radead.org, 
	ying.huang@...ux.alibaba.com, ziy@...dia.com, dave@...olabs.net
Subject: Re: [RFC PATCH v1 0/2] mm: multi-gen LRU scanning for page promotion

On Tue, Mar 25, 2025 at 4:56 AM Bharata B Rao <bharata@....com> wrote:
>
> Thanks for your patchset. I haven't looked at the patches in detail yet,
> but gave it a quick try with the micro-benchmark that I have been using.
Thanks for running the numbers. Unfortunately neither of us can attend
LSF/MM in person, but we're excited about this opportunity for
collaboration.

>
> The below numbers can be compared with the base numbers that I have
> posted here
> (https://lore.kernel.org/linux-mm/20250325081832.209140-1-bharata@amd.com/).
> Test 2 in the above link is the one I tried with this patchset.
>
> kernel.numa_balancing = 0
> demotion=true
> cpufreq governor=performance
>
> Benchmark run configuration:
> Compute-node            = 1
> Memory-node             = 2
> Memory-size             = 206158430208
> Hot-region-size         = 1073741824
> Nr-hot-regions          = 192
> Access pattern          = random
> Access granularity      = 4096
> Delay b/n accesses      = 0
> Load/store ratio        = 50l50s
> THP used                = no
> Nr accesses             = 25769803776
> Nr repetitions          = 512
>
> Benchmark completed in 605983205.0 us
The benchmark does seem to complete in less time, but I'm not sure why
especially given the small number of pages promoted. I think it would
also be useful to see the usage breakdown of DRAM/CXL over time.

>
> numa_hit 63621437
> numa_miss 2721737
> numa_foreign 2721737
> numa_interleave 0
> numa_local 48243292
> numa_other 18099882
> pgpromote_success 0
> pgpromote_candidate 0
> pgdemote_kswapd 15409682
> pgdemote_direct 0
> pgdemote_khugepaged 0
> numa_pte_updates 0
> numa_huge_pte_updates 0
> numa_hint_faults 0
> numa_hint_faults_local 0
> numa_pages_migrated 19596
> pgmigrate_success 15429278
> pgmigrate_fail 256
>
> kpromoted_recorded_accesses 27647687
> kpromoted_recorded_hwhints 0
> kpromoted_recorded_pgtscans 27647687
> kpromoted_record_toptier 0
Makes sense, we skip toptier scanning

> kpromoted_record_added 17184209
> kpromoted_record_exists 10463478
> kpromoted_mig_right_node 0
> kpromoted_mig_non_lru 404308
> kpromoted_mig_cold_old 6417567
> kpromoted_mig_cold_not_accessed 10342825
> kpromoted_mig_promoted 19509
Compared to 611077 (IBS number) this is a lot lower.

> kpromoted_mig_dropped 17164700
>
> When I try to get the same benchmark numbers for kpromoted driven by
> kmmscand, kpromoted gets overwhelmed with the amount of data that
> kmmdscand provides while no such issues with the amount of accesses
> reported by this patchset.
The scan interval in this series is 4 seconds, while the kmmscand's
pause between scanning is 16ms. So there're definitely some gaps here.
The MGLRU page table walk also has a bunch of optimizations, and some
of them are more focused on reclaim, so we might need to tweak some
things there too.


Yuanchu