[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <0FF6E325-3490-4A8B-BB04-D36CCBFA1D19@oracle.com>
Date: Thu, 30 Sep 2021 23:51:11 +0000
From: Alex Kogan <alex.kogan@...cle.com>
To: Barry Song <21cnbao@...il.com>
CC: Arnd Bergmann <arnd@...db.de>, Borislav Petkov <bp@...en8.de>,
Daniel Jordan <daniel.m.jordan@...cle.com>,
Dave Dice <dave.dice@...cle.com>,
Hanjun Guo <guohanjun@...wei.com>,
"hpa@...or.com" <hpa@...or.com>,
"jglauber@...vell.com" <jglauber@...vell.com>,
"linux-arch@...r.kernel.org" <linux-arch@...r.kernel.org>,
"linux-arm-kernel@...ts.infradead.org"
<linux-arm-kernel@...ts.infradead.org>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
"linux@...linux.org.uk" <linux@...linux.org.uk>,
"longman@...hat.com" <longman@...hat.com>,
"mingo@...hat.com" <mingo@...hat.com>,
"peterz@...radead.org" <peterz@...radead.org>,
Steven Sistare <steven.sistare@...cle.com>,
"tglx@...utronix.de" <tglx@...utronix.de>,
"will.deacon@....com" <will.deacon@....com>,
"x86@...nel.org" <x86@...nel.org>
Subject: Re: [PATCH v15 0/6] Add NUMA-awareness to qspinlock
> On Sep 30, 2021, at 5:44 AM, Barry Song <21cnbao@...il.com> wrote:
>
>> We have done some performance evaluation with the locktorture module
>> as well as with several benchmarks from the will-it-scale repo.
>> The following locktorture results are from an Oracle X5-4 server
>> (four Intel Xeon E7-8895 v3 @ 2.60GHz sockets with 18 hyperthreaded
>> cores each). Each number represents an average (over 25 runs) of the
>> total number of ops (x10^7) reported at the end of each run. The
>> standard deviation is also reported in (), and in general is about 3%
>> from the average. The 'stock' kernel is v5.12.0,
>
> I assume x5-4 server has the crossbar topology and its numa diameter is
> 1hop, and all tests were done on this kind of symmetrical topology. Am
> I right?
>
> ┌─┐ ┌─┐
> │ ├─────────────────┤ │
> └─┤1 1└┬┘
> │ 1 1 │
> │ 1 1 │
> │ 1 1 │
> │ 1 │
> │ 1 1 │
> │ 1 1 │
> │ 1 1 │
> ┌┼┐1 1 ├─┐
> │┼┼─────────────────┤ │
> └─┘ └─┘
>
>
> what if the hardware is using the ring topology and other topologies
For better or worse, CNA is pretty much agnostic to the physical topology.
So it is true that we might do better by transferring NUMA node preference
to a “closer” node, but it would probably require some sophisticated logic
to identify such a node. As discussed on another thread, we are opting
for a simple solution that might be refined later if needed.
Best regards,
— Alex
Powered by blists - more mailing lists