[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <20070126112837.059502fc.akpm@osdl.org>
Date: Fri, 26 Jan 2007 11:28:37 -0800
From: Andrew Morton <akpm@...l.org>
To: dipankar@...ibm.com
Cc: "Paul E. McKenney" <paulmck@...ux.vnet.ibm.com>,
Gautham Shenoy <ego@...ux.vnet.ibm.com>,
linux-kernel@...r.kernel.org, Ingo Molnar <mingo@...e.hu>
Subject: Re: Fw: Re: [mm PATCH 4/6] RCU: (now) CPU hotplug
On Sat, 27 Jan 2007 00:41:13 +0530
Dipankar Sarma <dipankar@...ibm.com> wrote:
> On Thu, Jan 25, 2007 at 02:36:45AM +0530, Dipankar Sarma wrote:
> > On Wed, Jan 24, 2007 at 08:15:59AM -0800, Paul E. McKenney wrote:
> >
> > It should be relatively easy. Setting the offlined cpu's flags
> > to neutral state should do the trick in most cases.
> > I will send out the patches tomorrow after reviewing the code
> > some more.
>
> Famous last words.
>
> It turns out that I have been bitten by the ugly cpu hotplug
> locking mess while trying to get preemptible RCU code working
> with CPU hotplug. The per-subsystem locking thing isn't really
> user-friendly. Here is the dependency -
>
> In cpu hotplug path (after CPU_LOCK_ACQUIRE) -
>
> CPU_DOWN_PREPARE:sched domains -> detach_destroy_domains() ->
> synchronize_sched() -> sched_setaffinity()
>
> sched_setaffinity() tries to acquire the scheduler cpu hotplug
> mutex and deadlocks.
>
> I see no easy way of getting around this - doing "cpu hotplug locked"
> version of all those APIs would add lots of code which is bad.
> We could try Gautham's idea of letting each subsystem maintain
> its own online cpu mask, but I bet implementing sched_setaffinity()
> would not be very easy despite this.
Suggest you just ignore cpu hotplug locking, if that helps.
> What is the status on this now ?
Stalled, apparently.
> Is this a good example to
> show why per-subsystem locks might be unmaintainable ?
Maybe. It might also be a good example of confused design.
> Can we go back
"back" assumes it was once present. It wasn't.
> to a simple
"simple" hasn't been demonstrated. New lock types and their use are never
simple, especially magic ones.
> scalable refcount model
> for CPU hotplug now ?
The plan is, I hope, to rip it all out and do freeze_processes() on the
hotplug side, so nobody else needs to worry about cpu hotplug any more.
But at present everyone seems to be in hiding.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists