[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20190531082528.GJ2623@hirez.programming.kicks-ass.net>
Date: Fri, 31 May 2019 10:25:28 +0200
From: Peter Zijlstra <peterz@...radead.org>
To: "Paul E. McKenney" <paulmck@...ux.ibm.com>
Cc: Vineet Gupta <Vineet.Gupta1@...opsys.com>,
Will Deacon <Will.Deacon@....com>,
arcml <linux-snps-arc@...ts.infradead.org>,
lkml <linux-kernel@...r.kernel.org>,
"linux-arch@...r.kernel.org" <linux-arch@...r.kernel.org>
Subject: Re: single copy atomicity for double load/stores on 32-bit systems
On Thu, May 30, 2019 at 11:53:58AM -0700, Paul E. McKenney wrote:
> On Thu, May 30, 2019 at 11:22:42AM -0700, Vineet Gupta wrote:
> > Hi Peter,
> >
> > Had an interesting lunch time discussion with our hardware architects pertinent to
> > "minimal guarantees expected of a CPU" section of memory-barriers.txt
> >
> >
> > | (*) These guarantees apply only to properly aligned and sized scalar
> > | variables. "Properly sized" currently means variables that are
> > | the same size as "char", "short", "int" and "long". "Properly
> > | aligned" means the natural alignment, thus no constraints for
> > | "char", two-byte alignment for "short", four-byte alignment for
> > | "int", and either four-byte or eight-byte alignment for "long",
> > | on 32-bit and 64-bit systems, respectively.
> >
> >
> > I'm not sure how to interpret "natural alignment" for the case of double
> > load/stores on 32-bit systems where the hardware and ABI allow for 4 byte
> > alignment (ARCv2 LDD/STD, ARM LDRD/STRD ....)
> >
> > I presume (and the question) that lkmm doesn't expect such 8 byte load/stores to
> > be atomic unless 8-byte aligned
>
> I would not expect 8-byte accesses to be atomic on 32-bit systems unless
> some special instruction was in use. But that usually means special
> intrinsics or assembly code.
If the GCC of said platform defaults to the double-word instructions for
long long, then I would very much expect natural alignment on it too.
If the feature is only available through inline asm or intrinsics, then
we can be a little more lenient perhaps.
Powered by blists - more mailing lists