[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20180322201649.GC9469@ziepe.ca>
Date: Thu, 22 Mar 2018 14:16:49 -0600
From: Jason Gunthorpe <jgg@...pe.ca>
To: Casey Leedom <leedom@...lsio.com>
Cc: SWise OGC <swise@...ngridcomputing.com>,
Sinan Kaya <okaya@...eaurora.org>,
'kbuild test robot' <lkp@...el.com>,
"kbuild-all@...org" <kbuild-all@...org>,
"linux-rdma@...r.kernel.org" <linux-rdma@...r.kernel.org>,
"timur@...eaurora.org" <timur@...eaurora.org>,
"sulrich@...eaurora.org" <sulrich@...eaurora.org>,
"linux-arm-msm@...r.kernel.org" <linux-arm-msm@...r.kernel.org>,
"linux-arm-kernel@...ts.infradead.org"
<linux-arm-kernel@...ts.infradead.org>,
Steve Wise <swise@...lsio.com>,
'Doug Ledford' <dledford@...hat.com>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
Michael Werner <werner@...lsio.com>
Subject: Re: [PATCH v4 4/6] infiniband: cxgb4: Eliminate duplicate barriers
on weakly-ordered archs
On Thu, Mar 22, 2018 at 07:44:51PM +0000, Casey Leedom wrote:
> | From: Steve Wise <swise@...ngridcomputing.com>
> | Sent: Thursday, March 22, 2018 9:28 AM
> |
> | | From: Sinan Kaya <okaya@...eaurora.org>
> | | Date: Thursday, March 22, 2018 7:52 AM
> | |
> | | Isn't this a PowerPC problem? Why penalize other architectures?
> |
> | I worry it breaks PPC.
>
> And all other architectures. Aparraently there isn't a formal API
> description for writel_relaxed() and Co., nor __raw_writel(), etc.
We have this:
Documentation/memory-barriers.txt lines 2600-2677/3136 85%
(*) readX_relaxed(), writeX_relaxed()
These are similar to readX() and writeX(), but provide weaker memory
ordering guarantees. Specifically, they do not guarantee ordering with
respect to normal memory accesses (e.g. DMA buffers) nor do they guarantee
ordering with respect to LOCK or UNLOCK operations. If the latter is
required, an mmiowb() barrier can be used. Note that relaxed accesses to
the same peripheral are guaranteed to be ordered with respect to each
other.
Which basically says they are the same as writel() except they are not
required to be contained by a spinlock, which is the expensive thing
ARM and PPC are doing with the barriers in writel()
Jason
Powered by blists - more mailing lists