[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <d35924de-a560-40aa-a99d-7278009c5ff3@arm.com>
Date: Wed, 8 Jan 2025 13:59:35 -0600
From: Jeremy Linton <jeremy.linton@....com>
To: Jason Gunthorpe <jgg@...dia.com>
Cc: Jonathan Cameron <Jonathan.Cameron@...wei.com>,
Wathsala Vithanage <wathsala.vithanage@....com>, linux-acpi@...r.kernel.org,
linux-pci@...r.kernel.org, acpica-devel@...ts.linux.dev,
linux-kernel@...r.kernel.org, lenb@...nel.org, robert.moore@...el.com,
bhelgaas@...gle.com, wei.huang2@....com, honnappa.nagarahalli@....com,
dhruv.tripathi@....com, rob.herring@....com
Subject: Re: [RFC 0/2] GENL interface for ACPI _DSM methods
Hi,
On 1/7/25 11:48 AM, Jason Gunthorpe wrote:
> On Tue, Jan 07, 2025 at 11:37:01AM -0600, Jeremy Linton wrote:
>> Hi,
>>
>> On 1/6/25 12:01 PM, Jonathan Cameron wrote:
>>> On Mon, 6 Jan 2025 16:30:43 +0000
>>> Wathsala Vithanage <wathsala.vithanage@....com> wrote:
>>>
>>>> Linux v6.13-rc1 added support for PCIe TPH and direct cache injection.
>>>> As already described in the patch set[1] that introduced this feature,
>>>> the cache injection in supported hardware allows optimal utilization of
>>>> platform resources for specific requests on the PCIe bus. However, the
>>>> patch set [1] implements the functionality for usage within the kernel.
>>>> But certain user space applications, especially those whose performance
>>>> is sensitive to the latency of inbound writes as seen by a CPU core, may
>>>> benefit from using this information (E.g., the DPDK cache stashing
>>>> feature discussed in RFC [2]).
>
> There is no way for userspace to program TPH information into a PCI
> device without going through a kernel driver, and the kernel driver
> must be the exclusive owner of the steering tag configuration or chaos
> would ensue. Having a way for sysfs to override this seems very wrong
> to me, and I think you should not go in this direction.
>
> DPDK runs on VFIO or RDMA. It would natural to have an VFIO native API
> to manipulate the steering tags, and we are already discussing what
> RDMA support for steering tag would look like.
>
>>> Superficially this feels like another potential interface that could be wrapped
>>> up under appropriate fwctl. Jason, what do you think?
>
> As above, I think this very squarely belongs under the appropriate
> subsystems that are providing the kernel drivers for the device. There
> is no reasonable way to share steering tags with unrelated userspace
> through any mechanism. Basically it fails the independence test of
> fwctl.
>
>> I think this was similar to a conversation we had internally, which was
>> basically to detect the PCIe extended capability and export a 'steering'
>> entry in sysfs on each PCIe device which can take a logical cpu/cache value,
>> translate those on write to the ACPI cpu/cache id's, make the firmware call,
>> then directly update the PCIe device's capability with the result.
>
> Seems wrong, driver must do this. If the driver was already using that
> entry for something else you've just wrecked it.
Can you clarify what you mean by 'wrecked'? AFAIK a valid, if poorly
chosen, steering tag is going to be sub-optimal performance.
I'm under the impression this is a similar problem to cpu/irq/numa
affinity where the driver/subsystem should be making the choice, but the
user is provided the opportunity to override the defaults if they think
there is benefit in their environment. Again AFAIK, the whole
OS/software stashing is already well down the 'I know better than the HW
where to store this data' rabbit hole.
Thanks,
Powered by blists - more mailing lists