[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <20210129103340.GW1551@shell.armlinux.org.uk>
Date: Fri, 29 Jan 2021 10:33:40 +0000
From: Russell King - ARM Linux admin <linux@...linux.org.uk>
To: Arnd Bergmann <arnd@...nel.org>
Cc: "Leizhen (ThunderTown)" <thunder.leizhen@...wei.com>,
Greg Kroah-Hartman <gregkh@...uxfoundation.org>,
Will Deacon <will.deacon@....com>,
Haojian Zhuang <haojian.zhuang@...il.com>,
Arnd Bergmann <arnd@...db.de>,
Rob Herring <robh+dt@...nel.org>,
Wei Xu <xuwei5@...ilicon.com>,
devicetree <devicetree@...r.kernel.org>,
linux-arm-kernel <linux-arm-kernel@...ts.infradead.org>,
linux-kernel <linux-kernel@...r.kernel.org>
Subject: Re: [PATCH v5 4/4] ARM: Add support for Hisilicon Kunpeng L3 cache
controller
On Fri, Jan 29, 2021 at 11:26:38AM +0100, Arnd Bergmann wrote:
> Another clarification, as there are actually two independent
> points here:
>
> * if you can completely remove the readl() above and just write a
> hardcoded value into the register, or perhaps read the original
> value once at boot time, that is probably a win because it
> avoids one of the barriers in the beginning. The datasheet should
> tell you if there are any bits in the register that have to be
> preserved
>
> * Regarding the _relaxed() accessors, it's a lot harder to know
> whether that is safe, as you first have to show, in particular in case
> any of the accesses stop being guarded by the spinlock in that
> case, and whether there may be a case where you have to
> serialize the memory access against accesses that are still in the
> store queue or prefetched.
>
> Whether this matters at all depends mostly on the type of devices
> you are driving on your SoC. If you have any high-speed network
> interfaces that are unable to do cache coherent DMA, any extra
> instruction here may impact the number of packets you can transfer,
> but if all your high-speed devices are connected to a coherent
> interconnect, I would just go with the obvious approach and use
> the safe MMIO accessors everywhere.
For L2 cache code, I would say the opposite, actually, because it is
all too easy to get into a deadlock otherwise.
If you implement the sync callback, that will be called from every
non-relaxed accessor, which means if you need to take some kind of
lock in the sync callback and elsewhere in the L2 cache code, you will
definitely deadlock.
It is safer to put explicit barriers where it is necessary.
Also remember that the barrier in readl() etc is _after_ the read, not
before, and the barrier in writel() is _before_ the write, not after.
The point is to ensure that DMA memory accesses are properly ordered
with the IO-accessing instructions.
So, using readl_relaxed() with a read-modify-write is entirely sensible
provided you do not access DMA memory inbetween.
--
RMK's Patch system: https://www.armlinux.org.uk/developer/patches/
FTTP is here! 40Mbps down 10Mbps up. Decent connectivity at last!
Powered by blists - more mailing lists