[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20190627090843.GB11548@lst.de>
Date: Thu, 27 Jun 2019 11:08:43 +0200
From: Christoph Hellwig <hch@....de>
To: Logan Gunthorpe <logang@...tatee.com>
Cc: Jason Gunthorpe <jgg@...pe.ca>, Christoph Hellwig <hch@....de>,
linux-kernel@...r.kernel.org, linux-block@...r.kernel.org,
linux-nvme@...ts.infradead.org, linux-pci@...r.kernel.org,
linux-rdma@...r.kernel.org, Jens Axboe <axboe@...nel.dk>,
Bjorn Helgaas <bhelgaas@...gle.com>,
Dan Williams <dan.j.williams@...el.com>,
Sagi Grimberg <sagi@...mberg.me>,
Keith Busch <kbusch@...nel.org>,
Stephen Bates <sbates@...thlin.com>
Subject: Re: [RFC PATCH 00/28] Removing struct page from P2PDMA
On Wed, Jun 26, 2019 at 02:45:38PM -0600, Logan Gunthorpe wrote:
> > The bar info would give the exporting struct device and any other info
> > we need to make the iommu mapping.
>
> Well, the IOMMU mapping is the normal thing the mapping driver will
> always do. We'd really just need the submitting driver to, when
> appropriate, inform the mapping driver that this is a pci bus address
> and not to call dma_map_xxx(). Then, for special mappings for the CMB
> like Christoph is talking about, it's simply a matter of doing a range
> compare on the PCI Bus address and converting the bus address to a BAR
> and offset.
Well, range compare on the physical address. We have a few different
options here:
(a) a range is normal RAM, DMA mapping works as usual
(b) a range is another devices BAR, in which case we need to do a
map_resource equivalent (which really just means don't bother with
cache flush on non-coherent architectures) and apply any needed
offset, fixed or iommu based
(c) a range points to a BAR on the acting device. In which case we
don't need to DMA map at all, because no dma is happening but just an
internal transfer. And depending on the device that might also require
a different addressing mode
I guess it might make sense to just have a block layer flag that (b) or
(c) might be contained in a bio. Then we always look up the data
structure, but can still fall back to (a) if nothing was found. That
even allows free mixing and matching of memory types, at least as long
as they are contained to separate bio_vec segments.
Powered by blists - more mailing lists