[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20171102172624.qiz2w5szz77sqg7j@hirez.programming.kicks-ass.net>
Date: Thu, 2 Nov 2017 18:26:24 +0100
From: Peter Zijlstra <peterz@...radead.org>
To: Will Deacon <will.deacon@....com>
Cc: Alan Stern <stern@...land.harvard.edu>,
"Reshetova, Elena" <elena.reshetova@...el.com>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
"gregkh@...uxfoundation.org" <gregkh@...uxfoundation.org>,
"keescook@...omium.org" <keescook@...omium.org>,
"tglx@...utronix.de" <tglx@...utronix.de>,
"mingo@...hat.com" <mingo@...hat.com>,
"ishkamiel@...il.com" <ishkamiel@...il.com>,
Paul McKenney <paulmck@...ux.vnet.ibm.com>,
parri.andrea@...il.com, boqun.feng@...il.com, dhowells@...hat.com,
david@...morbit.com
Subject: Re: [PATCH] refcount: provide same memory ordering guarantees as in
atomic_t
On Thu, Nov 02, 2017 at 05:16:44PM +0000, Will Deacon wrote:
> On Thu, Nov 02, 2017 at 01:08:52PM -0400, Alan Stern wrote:
> > Right. To address your point: release + acquire isn't the same as a
> > full barrier either. The SB pattern illustrates the difference:
> >
> > P0 P1
> > Write x=1 Write y=1
> > Release a smp_mb
> > Acquire b Read x=0
> > Read y=0
> >
> > This would not be allowed if the release + acquire sequence was
> > replaced by smp_mb. But as it stands, this is allowed because nothing
> > prevents the CPU from interchanging the order of the release and the
> > acquire -- and then you're back to the acquire + release case.
> >
> > However, there is one circumstance where this interchange isn't
> > allowed: when the release and acquire access the same memory
> > location. Thus:
> >
> > P0(int *x, int *y, int *a)
> > {
> > int r0;
> >
> > WRITE_ONCE(*x, 1);
> > smp_store_release(a, 1);
> > smp_load_acquire(a);
> > r0 = READ_ONCE(*y);
> > }
> >
> > P1(int *x, int *y)
> > {
> > int r1;
> >
> > WRITE_ONCE(*y, 1);
> > smp_mb();
> > r1 = READ_ONCE(*x);
> > }
> >
> > exists (0:r0=0 /\ 1:r1=0)
> >
> > This is forbidden. It would remain forbidden even if the smp_mb in P1
> > were replaced by a similar release/acquire pair for the same memory
> > location.
>
> Isn't this allowed on x86 mapping smp_mb() to mfence, store-release to plain
> store and load-acquire to plain load? All we're saying is that you can forward
> from a release to an acquire, which is fine for RCpc semantics.
Yeah, as it happens I talked to Will about that exact case while writing
that email :-), this is why he has that thing handy.
Powered by blists - more mailing lists