[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <1389041805.30730.47.camel@dvhart-mobl4.amr.corp.intel.com>
Date: Mon, 06 Jan 2014 12:56:45 -0800
From: Darren Hart <dvhart@...ux.intel.com>
To: Davidlohr Bueso <davidlohr@...com>
Cc: Linus Torvalds <torvalds@...ux-foundation.org>,
Linux Kernel Mailing List <linux-kernel@...r.kernel.org>,
Ingo Molnar <mingo@...nel.org>,
Peter Zijlstra <peterz@...radead.org>,
Thomas Gleixner <tglx@...utronix.de>,
Paul McKenney <paulmck@...ux.vnet.ibm.com>,
Mike Galbraith <efault@....de>, Jeff Mahoney <jeffm@...e.com>,
Jason Low <jason.low2@...com>,
Waiman Long <Waiman.Long@...com>, Tom Vaden <tom.vaden@...com>,
"Norton, Scott J" <scott.norton@...com>,
"Chandramouleeswaran, Aswin" <aswin@...com>
Subject: Re: [PATCH v5 4/4] futex: Avoid taking hb lock if nothing to wakeup
On Thu, 2014-01-02 at 12:59 -0800, Davidlohr Bueso wrote:
> On Thu, 2014-01-02 at 11:23 -0800, Linus Torvalds wrote:
> > On Thu, Jan 2, 2014 at 7:05 AM, Davidlohr Bueso <davidlohr@...com> wrote:
> > >
> > > In futex_wake() there is clearly no point in taking the hb->lock if we know
> > > beforehand that there are no tasks to be woken.
> >
> > Btw, I think we could optimize this a bit further for the wakeup case.
> >
> > wake_futex() does a get_task_struct(p)/put_task_struct(p) around its
> > actual waking logic, and I don't think that's necessary. The task
> > structures are RCU-delayed, and the task cannot go away until the
> > "q->lock_ptr = NULL" afaik, so you could replace that atomic inc/dec
> > with just a RCU read region.
>
> I had originally explored making the whole plist thing more rcu aware
> but never got to anything worth sharing. What you say does make a lot of
> sense, however, I haven't been able to see any actual improvements. It
> doesn't hurt however, so I'd have no problem adding such patch to the
> lot.
>
> >
> > Maybe it's not a big deal ("wake_up_state()" ends up getting the task
> > struct pi_lock anyway, so it's not like we can avoid toucing the task
> > structure), but I'm getting the feeling that we're doing a lot of
> > unnecessary work here.
>
> I passed this idea through my wakeup measuring program and didn't notice
> hardly any difference, just noise, even for large amounts of futexes.
> I believe that peterz's idea of lockless batch wakeups is the next step
> worth looking into for futexes -- even though the spurious wakeup
> problem can become a real pain.
>
> Thanks,
> Davidlohr
>
>
While I love to see significant performance improvements to the futex
hot paths, I am wary of the sort of implicit improvements we've been
exploring here. At the risk of being a wimp here, this code is
incredibly complex already, so I would prefer anything along these lines
have very strong empirical justification first - as Davidlohr's changes
here have provided.
Does anyone see any reason to hold off getting them in at this point?
I've made a couple points on comments and docs to the 4/5 patch, but
otherwise, I think it's time to get them in and more broadly tested.
--
Darren Hart
Intel Open Source Technology Center
Yocto Project - Linux Kernel
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists