[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <962c1dff-b944-499d-8883-2fe760219352@huawei.com>
Date: Tue, 1 Apr 2025 17:19:44 +0800
From: Yunsheng Lin <linyunsheng@...wei.com>
To: Sean Hefty <shefty@...dia.com>, Jason Gunthorpe <jgg@...dia.com>
CC: Bernard Metzler <BMT@...ich.ibm.com>, Roland Dreier
<roland@...abrica.net>, Nikolay Aleksandrov <nikolay@...abrica.net>,
"netdev@...r.kernel.org" <netdev@...r.kernel.org>, "shrijeet@...abrica.net"
<shrijeet@...abrica.net>, "alex.badea@...sight.com"
<alex.badea@...sight.com>, "eric.davis@...adcom.com"
<eric.davis@...adcom.com>, "rip.sohan@....com" <rip.sohan@....com>,
"dsahern@...nel.org" <dsahern@...nel.org>, "winston.liu@...sight.com"
<winston.liu@...sight.com>, "dan.mihailescu@...sight.com"
<dan.mihailescu@...sight.com>, Kamal Heib <kheib@...hat.com>,
"parth.v.parikh@...sight.com" <parth.v.parikh@...sight.com>, Dave Miller
<davem@...hat.com>, "ian.ziemba@....com" <ian.ziemba@....com>,
"andrew.tauferner@...nelisnetworks.com"
<andrew.tauferner@...nelisnetworks.com>, "welch@....com" <welch@....com>,
"rakhahari.bhunia@...sight.com" <rakhahari.bhunia@...sight.com>,
"kingshuk.mandal@...sight.com" <kingshuk.mandal@...sight.com>,
"linux-rdma@...r.kernel.org" <linux-rdma@...r.kernel.org>, "kuba@...nel.org"
<kuba@...nel.org>, Paolo Abeni <pabeni@...hat.com>
Subject: Re: [RFC PATCH 00/13] Ultra Ethernet driver introduction
On 2025/4/1 3:49, Sean Hefty wrote:
>> Through reading this patchset, it seems the semantics of 'job' for UEC is about
>> how to identify a PDC(Packet Delivery Context) instance, which is specified by
>> src fep_address/pdc_id and dst fep_address/pdc_id as there seems to be more
>> than one PDC instance between two nodes, so the 'job' is really about
>> grouping processes from the same 'job' to use the same PDC instance and
>> preventing processes from different 'job' from using the same PDC instance?
>
> UEC targets HPC and AI workloads, so the concept of a job in this discussion represents a parallel application. I.e. a group of processes across multiple nodes communicating.
Ok, I guess this patchset only implement a portion of semantics for the 'job'
in UEC, the page mapping, local access, and remote access functions grouping
and separating you mentioned in other thread does not seem to be implemented
yet.
>
> - Sean
Powered by blists - more mailing lists