[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20091130082744.GN17484@wotan.suse.de>
Date: Mon, 30 Nov 2009 09:27:44 +0100
From: Nick Piggin <npiggin@...e.de>
To: Ingo Molnar <mingo@...e.hu>
Cc: Mike Galbraith <efault@....de>,
Peter Zijlstra <peterz@...radead.org>,
Linux Kernel Mailing List <linux-kernel@...r.kernel.org>
Subject: Re: newidle balancing in NUMA domain?
On Tue, Nov 24, 2009 at 10:11:24AM +0100, Ingo Molnar wrote:
>
> * Mike Galbraith <efault@....de> wrote:
>
> > On Tue, 2009-11-24 at 09:40 +0100, Mike Galbraith wrote:
> > > On Tue, 2009-11-24 at 07:53 +0100, Nick Piggin wrote:
> > > > On Mon, Nov 23, 2009 at 04:53:37PM +0100, Mike Galbraith wrote:
> > > > > On Mon, 2009-11-23 at 16:29 +0100, Nick Piggin wrote:
> > > > >
> > > > > > So basically about the least well performing or scalable possible
> > > > > > software architecture. This is exactly the wrong thing to optimise
> > > > > > for, guys.
> > > > >
> > > > > Hm. Isn't fork/exec our daily bread?
> > > >
> > > > No. Not for handing out tiny chunks of work and attempting to do
> > > > them in parallel. There is this thing called Amdahl's law, and if
> > > > you write a parallel program that wantonly uses the heaviest
> > > > possible primitives in its serial sections, then it doesn't deserve
> > > > to go fast.
> > >
> > > OK by me. A bit if idle time for kbuild is easily cured with telling
> > > make to emit more jobs, so there's enough little jobs to go around.
> > >
> > > If x264 is declared dainbramaged, that's fine with me too.
> >
> > (P.S. I don't want to have to explain to users of any such thread
> > happy applications why they suck rocks under Linux though)
>
> The 68% of speedup is pretty valid for your change and the workload isnt
> particularly odd beyond the fast creation rate of threads - but i'd not
> blame the app for that, Linux creates/destroys threads very fast. Your
> followup load-balancing fix got queued up in the scheduler tree for
> v2.6.33 ~two weeks ago:
>
> 1b9508f: sched: Rate-limit newidle
>
> certainly handles some of the NUMA pain. If not, numbers telling us the
> other story (and patches to extend 'perf bench sched' with matching
> testcases) would be helpful, i'm not seeing problems on my NUMA testbox.
Yay for more complexity. And it doesn't fix all problems introduced
because it doesn't improve the now inherently far weaker NUMA affinity.
> 1b9508f was certainly too late for 2.6.32, but since it appears to be
> fine it can be forwarded to stable@...nel.org so it makes it into
> 2.6.32.1.
I totally disagree with this development / release process of yours.
In my opinion it is *not* ok to leave a wake of known regressions with
features that help some (non-regression) case.
If it was agreed that rate limiting newidle is the right fix for the
regression, and it was agreed that it is too late for the next release,
then IMO the correct way to approach the release is to revert the
offending patch until the new, combined, approach is tested and
validated.
Doing it your way, you release a 2.6.32 kernel with a known regression
and also a case that gets a big speedup. This makes it impossible now
to revert that patch without causing a regression for someone. And
then you think you have a way to fix it, but not enough testing so it
could be the case that it causes yet other regressions or does not fix
it well enough.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists