[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <f6541f4e-648e-d9a0-eda7-b2a117978ebb@linux.intel.com>
Date: Thu, 8 Jul 2021 21:38:34 +0800
From: Lu Baolu <baolu.lu@...ux.intel.com>
To: David Stevens <stevensd@...omium.org>,
Joerg Roedel <joro@...tes.org>, Will Deacon <will@...nel.org>
Cc: baolu.lu@...ux.intel.com,
Sergey Senozhatsky <senozhatsky@...omium.org>,
iommu@...ts.linux-foundation.org,
David Stevens <stevensd@...gle.com>,
Christoph Hellwig <hch@....de>, linux-kernel@...r.kernel.org
Subject: Re: [PATCH 0/4] Add dynamic iommu backed bounce buffers
Hi David,
I like this idea. Thanks for proposing this.
On 2021/7/7 15:55, David Stevens wrote:
> Add support for per-domain dynamic pools of iommu bounce buffers to the
> dma-iommu API. This allows iommu mappings to be reused while still
> maintaining strict iommu protection. Allocating buffers dynamically
> instead of using swiotlb carveouts makes per-domain pools more amenable
> on systems with large numbers of devices or where devices are unknown.
Have you ever considered leveraging the per-device swiotlb memory pool
added by below series?
https://lore.kernel.org/linux-iommu/20210625123004.GA3170@willie-the-truck/
>
> When enabled, all non-direct streaming mappings below a configurable
> size will go through bounce buffers. Note that this means drivers which
> don't properly use the DMA API (e.g. i915) cannot use an iommu when this
> feature is enabled. However, all drivers which work with swiotlb=force
> should work.
If so, why not making it more scalable by adding a callback into vendor
iommu drivers? The vendor iommu drivers have enough information to tell
whether the bounce buffer is feasible for a specific domain.
>
> Bounce buffers serve as an optimization in situations where interactions
> with the iommu are very costly. For example, virtio-iommu operations in
The simulated IOMMU does the same thing.
It's also an optimization for bare metal in cases where the strict mode
of cache invalidation is used. CPU moving data is faster than IOMMU
cache invalidation if the buffer is small.
Best regards,
baolu
Powered by blists - more mailing lists