[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <87r060i3nn.fsf@DESKTOP-5N7EMDA>
Date: Sun, 22 Dec 2024 15:21:32 +0800
From: "Huang, Ying" <ying.huang@...ux.alibaba.com>
To: Hyeonggon Yoo <hyeonggon.yoo@...com>
Cc: Joshua Hahn <joshua.hahnjy@...il.com>, "gourry@...rry.net"
<gourry@...rry.net>, kernel_team@...ynix.com, 42.hyeyoo@...il.com,
"rafael@...nel.org" <rafael@...nel.org>, "lenb@...nel.org"
<lenb@...nel.org>, "gregkh@...uxfoundation.org"
<gregkh@...uxfoundation.org>, "akpm@...ux-foundation.org"
<akpm@...ux-foundation.org>, Honggyu Kim <honggyu.kim@...com>, Rakie Kim
<rakie.kim@...com>, "dan.j.williams@...el.com"
<dan.j.williams@...el.com>, "Jonathan.Cameron@...wei.com"
<Jonathan.Cameron@...wei.com>, "dave.jiang@...el.com"
<dave.jiang@...el.com>, "horen.chuang@...ux.dev"
<horen.chuang@...ux.dev>, "hannes@...xchg.org" <hannes@...xchg.org>,
"linux-mm@...ck.org" <linux-mm@...ck.org>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
"linux-acpi@...r.kernel.org" <linux-acpi@...r.kernel.org>,
"kernel-team@...a.com" <kernel-team@...a.com>
Subject: Re: [External Mail] [RFC PATCH v2] Weighted interleave auto-tuning
Hyeonggon Yoo <hyeonggon.yoo@...com> writes:
> On 2024-12-20 4:18 AM, Joshua Hahn wrote:
[snip]
>
> By the way, this might be out of scope, but let me ask for my own
> learning.
>
> We have a server with 2 sockets, each attached with local DRAM and CXL
> memory (and thus 4 NUMA nodes). When accessing remote socket's memory
> (either CXL or not), the bandwidth is limited by the interconnect's
> bandwidth.
>
> On this server, ideally weighted interleaving should be configured
> within a socket (e.g. local NUMA node + local CXL node) because
> weighted interleaving does not consider the bandwidth when accessed
> from a remote socket.
If multiple sockets are considered, what is the best behavior?
The process may be cross-socket too. So, we will need to use
set_mempolicy() to bind tasks to sockets firstly. Then, it may be
better to use per-task weights.
> So, the question is: On systems with multiple sockets (and CXL mem
> attached to each socket), do you always assume the admin must bind to
> a specific socket for optimal performance or is there any plan to
> mitigate this problem without binding tasks to a socket?
>
[snip]
---
Best Regards,
Huang, Ying
Powered by blists - more mailing lists