[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20250614100451.GI2278213@noisy.programming.kicks-ass.net>
Date: Sat, 14 Jun 2025 12:04:51 +0200
From: Peter Zijlstra <peterz@...radead.org>
To: Chris Mason <clm@...a.com>
Cc: mingo@...hat.com, juri.lelli@...hat.com, vincent.guittot@...aro.org,
dietmar.eggemann@....com, rostedt@...dmis.org, bsegall@...gle.com,
mgorman@...e.de, vschneid@...hat.com, linux-kernel@...r.kernel.org
Subject: Re: [RFC][PATCH 0/5] sched: Try and address some recent-ish
regressions
On Wed, May 28, 2025 at 09:41:33PM -0400, Chris Mason wrote:
> I'll get all of these run on the big turin machine, should have some
> numbers tomorrow.
Right... so Turin. I had a quick look through our IRC logs but I
couldn't find exactly which model you had, and unfortunately AMD uses
the Turin name for both Zen 5c and Zen 5 Epyc :-(
Anyway, the big and obvious difference between the whole Intel and AMD
machines is the L3. So far we've been looking at SKL/SPR single L3
performance, but Turin (whichever that might be) will be having many L3.
With Zen5 having 8 cores per L3 and Zen5c having 16.
Additionally, your schbench -M auto thing is doing exactly the wrong
thing for them. What you want is for those message threads to be spread
out across the L3s, not all stuck to the first (which is what -M auto
would end up doing). And then the associated worker threads would
ideally stick to their respective L3s and not scatter all over the
machine.
Anyway, most of the data we shared was about single socket SKL, might be
we missed some obvious things for the multi-L3 case.
I'll go poke at some of the things I've so far neglected because of the
single L3 focus.
Powered by blists - more mailing lists