[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20080228104637.GA9129@elte.hu>
Date: Thu, 28 Feb 2008 11:46:38 +0100
From: Ingo Molnar <mingo@...e.hu>
To: Paul Jackson <pj@....com>
Cc: a.p.zijlstra@...llo.nl, tglx@...utronix.de, oleg@...sign.ru,
rostedt@...dmis.org, maxk@...lcomm.com,
linux-kernel@...r.kernel.org
Subject: Re: [RFC/PATCH 0/4] CPUSET driven CPU isolation
* Paul Jackson <pj@....com> wrote:
> But your words sound alot like what we at SGI call a 'boot' cpuset.
>
> Our big honkin NUMA customers, who are managing most of the system
> either for a few dedicated, very-important jobs, and/or under a batch
> scheduler, need to leave a few nodes to run the classic Unix load such
> as init, cron, assorted daemons and the admins login shell.
>
> So we provide them some init script mechanisms that make it easy to
> set this up, which includes moving every task (not many at the low
> numbered init script time this runs) that isn't pinned (doesn't have a
> restricted Cpus_allowed) into the boot cpuset, conventionally named
> /dev/cpuset/boot.
yes. Ideally Peter's patchset should turn into something equivalent and
i very much agree with Peter's arguments. There was never any design
level problem with cpusets, and the parallel cpu_isolated_map approach
was misdirected IMO.
There was indeed a problem with the _manageability_ of cpusets in
certain (rather new) usecases like real-time or virtualization, and how
they are connected to other system resources like IRQs and how easy it
is to manage these resources. IRQs should probably be tied to specific
cpusets and should migrate together with them, were the span of that
cpuset be changed. (by default they'd be tied to the boot cpuset)
IMO Peter's patchset is a good first step in that it removes the
cpu_isolated_map API hack, and i think we should try to go the whole way
and just offer a /dev/cpuset/boot/ default set that can then be
restricted to isolate the default workloads away from other CPUs.
( an initscripts approach, while i'm sure it works, would always be a
bit fragile in that it requires precise knowledge about which task is
what. I think we should make this a turn-key in-kernel solution that
both the big-honking NUMA-box guys and the real-time guys would be
happy with. )
Ingo
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists