[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-id: <002901cd4a0a$667e1600$337a4200$%szyprowski@samsung.com>
Date: Thu, 14 Jun 2012 10:47:48 +0200
From: Marek Szyprowski <m.szyprowski@...sung.com>
To: 'Daniel Vetter' <daniel@...ll.ch>,
'Konrad Rzeszutek Wilk' <konrad.wilk@...cle.com>
Cc: linux-arch@...r.kernel.org,
'Abhinav Kochhar' <abhinav.k@...sung.com>,
'Russell King - ARM Linux' <linux@....linux.org.uk>,
'Arnd Bergmann' <arnd@...db.de>,
'Benjamin Herrenschmidt' <benh@...nel.crashing.org>,
linux-kernel@...r.kernel.org,
'Subash Patel' <subash.ramaswamy@...aro.org>,
linaro-mm-sig@...ts.linaro.org, linux-mm@...ck.org,
'Kyungmin Park' <kyungmin.park@...sung.com>,
linux-arm-kernel@...ts.infradead.org
Subject: RE: [Linaro-mm-sig] [PATCHv2 0/6] ARM: DMA-mapping: new extensions for
buffer sharing
Hello,
On Wednesday, June 13, 2012 9:02 PM Daniel Vetter wrote:
> On Wed, Jun 13, 2012 at 10:12:12AM -0400, Konrad Rzeszutek Wilk wrote:
> > On Wed, Jun 13, 2012 at 01:50:12PM +0200, Marek Szyprowski wrote:
> (snipped)
> > > The third extension solves the performance issues which we observed with
> > > some advanced buffer sharing use cases, which require creating a dma
> > > mapping for the same memory buffer for more than one device. From the
> > > DMA-mapping perspective this requires to call one of the
> > > dma_map_{page,single,sg} function for the given memory buffer a few
> > > times, for each of the devices. Each dma_map_* call performs CPU cache
> > > synchronization, what might be a time consuming operation, especially
> > > when the buffers are large. We would like to avoid any useless and time
> > > consuming operations, so that was the main reason for introducing
> > > another attribute for DMA-mapping subsystem: DMA_ATTR_SKIP_CPU_SYNC,
> > > which lets dma-mapping core to skip CPU cache synchronization in certain
> > > cases.
>
> Ah, here's the use-case I've missed ;-) I'm a bit vary of totally insane
> platforms that have additional caches only on the device side, and only
> for some devices. Well, tlbs belong to that, but the iommu needs to handle
> that anyway.
>
> I think it would be good to add a blurb to the documentation that any
> device-side flushing (of tlbs or special caches or whatever) still needs
> to happen and that this is only a performance optimization to avoid the
> costly cpu cache flushing. This way the dma-buf exporter could keep track
> of whether it's 'device-coherent' and set that flag if the cpu caches don't
> need to be flushed.
>
> Maybe also make it clear that implementing this bit is optional (like your
> doc already mentions for NO_KERNEL_MAPPING).
Ok, I can add additional comment, but support for all dma attributes is optional
(attributes are considered only as hints that might improve performance for some
use cases on some hw platforms).
Best regards
--
Marek Szyprowski
Samsung Poland R&D Center
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists