[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <3a2561fc-65a6-4c68-fdb7-a5b670706f43@arm.com>
Date: Wed, 20 Jan 2021 12:29:44 +0000
From: Robin Murphy <robin.murphy@....com>
To: Chunyan Zhang <zhang.lyra@...il.com>
Cc: DTML <devicetree@...r.kernel.org>,
Linux Kernel Mailing List <linux-kernel@...r.kernel.org>,
Chunyan Zhang <chunyan.zhang@...soc.com>,
Sheng Xu <sheng.xu@...soc.com>,
iommu@...ts.linux-foundation.org, Rob Herring <robh+dt@...nel.org>,
Kevin Tang <kevin.tang@...soc.com>,
Baolin Wang <baolin.wang7@...il.com>,
Orson Zhai <orsonzhai@...il.com>
Subject: Re: [RFC PATCH V2 2/2] iommu: add Unisoc iommu basic driver
On 2021-01-20 11:40, Chunyan Zhang wrote:
[...]
>>> + pgt_base_iova = dom->pgt_va +
>>> + ((iova - mdata->iova_start) >> SPRD_IOMMU_PAGE_SHIFT);
>>> +
>>> + spin_lock_irqsave(&dom->pgtlock, flags);
>>> + for (i = 0; i < page_num; i++) {
>>> + pgt_base_iova[i] = pabase >> SPRD_IOMMU_PAGE_SHIFT;
>>
>> Out of curiosity, is the pagetable walker cache-coherent, or is this
>> currently managing to work by pure chance and natural cache churn?
>
> ->iotlb_sync_map() was implemented in this driver, I guess that has
> done what you say here?
No, sync_map only ensures that the previous (invalid) PTE isn't held in
the IOMMU's TLB. If pgt_va is a regular page allocation then you're
writing the new PTE to normal kernel memory, with nothing to guarantee
that write goes any further than the CPU's L1 cache. Thus either the
IOMMU has capable of snooping the CPU caches in order to see the updated
PTE value (rather than refetching the stale value from DRAM), or you're
just incredibly lucky that by the time the IOMMU *does* go to fetch the
PTE for that address, that updated cache line has already been evicted
out to DRAM naturally.
This is not an issue if you use the proper DMA allocator, since that
will ensure you get a non-cacheable buffer if you need one.
Robin.
Powered by blists - more mailing lists