[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20140617200329.GA27400@laptop.dumpdata.com>
Date: Tue, 17 Jun 2014 16:03:29 -0400
From: Konrad Rzeszutek Wilk <konrad.wilk@...cle.com>
To: Peter Zijlstra <a.p.zijlstra@...llo.nl>, Waiman.Long@...com
Cc: Waiman.Long@...com, tglx@...utronix.de, mingo@...nel.org,
linux-arch@...r.kernel.org, linux-kernel@...r.kernel.org,
virtualization@...ts.linux-foundation.org,
xen-devel@...ts.xenproject.org, kvm@...r.kernel.org,
paolo.bonzini@...il.com, boris.ostrovsky@...cle.com,
paulmck@...ux.vnet.ibm.com, riel@...hat.com,
torvalds@...ux-foundation.org, raghavendra.kt@...ux.vnet.ibm.com,
david.vrabel@...rix.com, oleg@...hat.com, gleb@...hat.com,
scott.norton@...com, chegu_vinod@...com,
Peter Zijlstra <peterz@...radead.org>
Subject: Re: [PATCH 01/11] qspinlock: A simple generic 4-byte queue spinlock
> > + new = tail | (val & _Q_LOCKED_MASK);
> > +
> > + old = atomic_cmpxchg(&lock->val, val, new);
> > + if (old == val)
> > + break;
> > +
> > + val = old;
> > + }
> > +
> > + /*
> > + * we won the trylock; forget about queueing.
> > + */
> > + if (new == _Q_LOCKED_VAL)
> > + goto release;
> > +
> > + /*
> > + * if there was a previous node; link it and wait.
> > + */
> > + if (old & ~_Q_LOCKED_MASK) {
> > + prev = decode_tail(old);
> > + ACCESS_ONCE(prev->next) = node;
> > +
> > + arch_mcs_spin_lock_contended(&node->locked);
Could you add a comment here:
/* We are spinning forever until the previous node updates locked - which
it does once the it has updated lock->val with our tail number. */
> > + }
> > +
> > + /*
> > + * we're at the head of the waitqueue, wait for the owner to go away.
> > + *
> > + * *,x -> *,0
> > + */
> > + while ((val = atomic_read(&lock->val)) & _Q_LOCKED_MASK)
> > + cpu_relax();
> > +
> > + /*
> > + * claim the lock:
> > + *
> > + * n,0 -> 0,1 : lock, uncontended
> > + * *,0 -> *,1 : lock, contended
> > + */
> > + for (;;) {
> > + new = _Q_LOCKED_VAL;
> > + if (val != tail)
> > + new |= val;
>
..snip..
>
> Could you help a bit in explaining it in English please?
After looking at the assembler code I finally figured out how
we can get here. And the 'contended' part threw me off. Somehow
I imagined there are two more more CPUs stampeding here and
trying to update the lock->val. But in reality the other CPUs
are stuck in the arch_mcs_spin_lock_contended spinning on their
local value.
Perhaps you could add this comment.
/* Once queue_spin_unlock is called (which _subtracts_ _Q_LOCKED_VAL from
the lock->val and still preserving the tail data), the winner gets to
claim the ticket. Since we still need the other CPUs to continue and
preserve the strict ordering in which they setup node->next, we:
1) update lock->val to the tail value (so tail CPU and its index) with
_Q_LOCKED_VAL.
2). Once we are done, we poke the other CPU (the one that linked to
us) by writting to node->locked (below) so they can make progress and
loop on lock->val changing from _Q_LOCKED_MASK to zero).
*/
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists