[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CAK8P3a0miz=yghtqK+1=1APGf4R1-NW64TJTtGiO5pOPBQNgKg@mail.gmail.com>
Date: Tue, 30 Jul 2019 17:22:03 +0200
From: Arnd Bergmann <arnd@...db.de>
To: Guo Ren <guoren@...nel.org>
Cc: Linux Kernel Mailing List <linux-kernel@...r.kernel.org>,
linux-arch <linux-arch@...r.kernel.org>,
linux-csky@...r.kernel.org, feng_shizhu@...uatech.com,
zhang_jian5@...uatech.com, zheng_xingjian@...uatech.com,
zhu_peng@...uatech.com, Guo Ren <ren_guo@...ky.com>
Subject: Re: [PATCH 4/4] csky: Add dma_inv_range for DMA_FROM_DEVICE
On Tue, Jul 30, 2019 at 5:11 PM Guo Ren <guoren@...nel.org> wrote:
> > > diff --git a/arch/csky/mm/dma-mapping.c b/arch/csky/mm/dma-mapping.c
> > > cache_op(paddr, size, dma_wb_range);
> > > break;
> > > case DMA_FROM_DEVICE:
> > > + cache_op(paddr, size, dma_inv_range);
> > > + break;
> > > case DMA_BIDIRECTIONAL:
> > > cache_op(paddr, size, dma_wbinv_range);
> > > break;
> >
> > When syncing 'for_cpu', you should not need to write back, because
> > there won't be any dirty cache lines.
>
> I just follow the dma_data_direction param, seems dir param and
> function are a little bit duplicated. And our cpu won't clear clean
> cache line into memory, so dma_wb_page won't cause problem.
> Seems arch_sync_dma_for_cpu with dir=DMA_TO_DEVICE is
> self-contradictory.
Right, you generally don't need to do cache management for that
combination.
There might be other things to do here though, e.g. with a strict
iommu implementation one could tear down the i/o page table
entry to prevent the device from accessing a buffer while that is
owned by the cpu.
> Do you want me modfiy like these:
> @@ -88,6 +90,8 @@ void arch_sync_dma_for_cpu(struct device *dev,
> phys_addr_t paddr,
> case DMA_TO_DEVICE:
> case DMA_FROM_DEVICE:
> case DMA_BIDIRECTIONAL:
> cache_op(paddr, size, dma_inv_range);
> break;
>
> @@ -72,6 +72,8 @@ void arch_sync_dma_for_device(struct device *dev,
> phys_addr_t paddr,
> case DMA_TO_DEVICE:
> cache_op(paddr, size, dma_wb_range);
> break;
> case DMA_FROM_DEVICE:
> case DMA_BIDIRECTIONAL:
> cache_op(paddr, size, dma_wbinv_range);
> break;
>
> >
> > If you have a CPU core that does not do speculative loads, you also don't
> > need to invalidate here, because you have already done that in the
> > _for_device() case, the only reason to invalidate the CPU cache
> > again is if a speculative load created a stale cache line that now
> > shadows the data received from the device.
> Our CPU support speculative loads :)
Ok, then you both invalidations are indeed needed.
I was guessing that CK610 had no speculation.
Arnd
Powered by blists - more mailing lists