[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20251104133724.GC1537560@nvidia.com>
Date: Tue, 4 Nov 2025 09:37:24 -0400
From: Jason Gunthorpe <jgg@...dia.com>
To: Will Deacon <will@...nel.org>
Cc: Mostafa Saleh <smostafa@...gle.com>,
Daniel Mentz <danielmentz@...gle.com>, iommu@...ts.linux.dev,
linux-kernel@...r.kernel.org,
Pranjal Shrivastava <praan@...gle.com>,
Liviu Dudau <liviu.dudau@....com>,
Rob Clark <robin.clark@....qualcomm.com>
Subject: Re: [PATCH 1/2] iommu/io-pgtable-arm: Implement .iotlb_sync_map
callback
On Tue, Nov 04, 2025 at 01:28:35PM +0000, Will Deacon wrote:
> If that's not workable (due to Daniel's performance worries), another
> option is to bring back the ->map_sg() hook (removed by d88e61faad52
> ("iommu: Remove the ->map_sg indirection")) and implement an optimised
> version of that, preferably sharing as much code as possible with the
> existing map path.
I've been broadly thinking of doing this. The benchmarks I've run
suggest there are decent gains in avoiding the re-walks, so we can
speed up the map_sg operation by working directly on the scatterlist
without rewalking every next sgent. That cleanly solves the CMO issue
as well.
I think the page table rework will land this cycle, that would be the
right time to attempt it as we should not be trying to make the
duplicated page table code any more complicated.
Jason
Powered by blists - more mailing lists