[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <alpine.DEB.0.9999.0710282352270.23978@chino.kir.corp.google.com>
Date: Mon, 29 Oct 2007 00:00:25 -0700 (PDT)
From: David Rientjes <rientjes@...gle.com>
To: Paul Jackson <pj@....com>
cc: clameter@....com, Lee.Schermerhorn@...com,
akpm@...ux-foundation.org, ak@...e.de, linux-kernel@...r.kernel.org
Subject: Re: [patch 2/2] cpusets: add interleave_over_allowed option
On Sun, 28 Oct 2007, Paul Jackson wrote:
> > If we can't identify any applications that would be broken by this, what's
> > the difference in simply implementing Choice B and then, if we hear
> > complaints, add your hack to revert back to Choice A behavior based on the
> > get_mempolicy() call you specified is always part of libnuma?
>
> I'll probably reply to other parts of your message later, but this
> one catches my eye right now.
>
> "if we hear complaints, add your hack ... back" -- this doesn't seem
> like a good idea to me. Maybe inside Google you don't see it, but
> for those of us shipping computer systems using major distributions
> such as SUSE or Red Hat, there can be a year lag between when I send a
> feature patch to Andrew, and when my customers send their first
> feedback to me resulting from using that new feature.
>
Let's add a Choice C:
Any nodemask that is passed to set_mempolicy() is saved as the
intent of the application in struct mempolicy. All policies
are effected on a contextualized per-allocation basis.
Policies such as MPOL_INTERLEAVE always get AND'd with
pol->cpuset_mems_allowed. If that yields numa_no_nodes,
MPOL_DEFAULT is used instead.
Policies such as MPOL_PREFERRED are respected if the node is set
in pol->cpuset_mems_allowed, otherwise MPOL_DEFAULT is used.
If an application attempts to setup a memory policy for an
MPOL_PREFERRED node that it doesn't have access to or an
MPOL_INTERLEAVE nodemask that is empty when AND'd with
pol->cpuset_mems_allowed, -EINVAL is returned and no new policy
is effected.
If an application gains nodes in pol->cpuset_mems_allowed that
now include the nodes from MPOL_INTERLEAVE or MPOL_PREFERRED,
that policy is then effected once again. Otherwise,
MPOL_DEFAULT is still used.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists