[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <5bb65c0e4c6828b1331d33745f34d9ee.squirrel@webmail-b.css.fujitsu.com>
Date: Sat, 25 Jul 2009 11:35:54 +0900 (JST)
From: "KAMEZAWA Hiroyuki" <kamezawa.hiroyu@...fujitsu.com>
To: "KAMEZAWA Hiroyuki" <kamezawa.hiroyu@...fujitsu.com>
Cc: "Andrew Morton" <akpm@...ux-foundation.org>,
"David Rientjes" <rientjes@...gle.com>, lee.schermerhorn@...com,
kosaki.motohiro@...fujitsu.com, miaox@...fujitsu.com,
mingo@...e.hu, a.p.zijlstra@...llo.nl, cl@...ux-foundation.org,
menage@...gle.com, nickpiggin@...oo.com.au, y-goto@...fujitsu.com,
penberg@...helsinki.fi, linux-mm@...ck.org,
linux-kernel@...r.kernel.org
Subject: Re: [BUG] set_mempolicy(MPOL_INTERLEAV) cause kernel panic
KAMEZAWA Hiroyuki wrote:
> Andrew Morton wrote:
>> On Fri, 24 Jul 2009 15:51:51 -0700 (PDT)
>> David Rientjes <rientjes@...gle.com> wrote:
>
>> afaik we don't have a final patch for this. I asked Motohiro-san about
>> this and he's proposing that we revert the offending change (which one
>> was it?) if nothing gets fixed soon - the original author is on a
>> lengthy vacation.
>>
>>
>> If we _do_ have a patch then can we start again? Someone send out the
>> patch
>> and let's take a look at it.
> Hmm, like this ? (cleaned up David's one because we shouldn't have
> extra nodemask_t on stack.)
>
> Problems are
> - rebind() is maybe broken but no good idea.
> (but it seems to be broken in old kernels
> - Who can test this is only a user who has possible node on SRAT.
>
> + /* should we call is_valid_nodemask() here ?*/
> if (pol->flags & MPOL_F_RELATIVE_NODES)
> mpol_relative_nodemask(&cpuset_context_nmask, nodes,
> - &cpuset_current_mems_allowed);
> + &cpuset_context_nmask);
Sorry this part is buggy.
But to fix this, we use extra nodemask here and this patch will
allocate 3 nodemasks on stack!
Then, here is a much easier fix. for trusting cpuset more.
==
From: KAMEZAWA Hiroyuki <kamezawa.hiroyu@...fujitsu.com>
task->mems_allowed is guaranteed to be only includes valid nodes when
it is set under cpuset. but, at init, all possible nodes are included.
fix it.
And at cpuset-rebind, caluculated result can be a invaild one.
In that case, trust cpuset's one
Signed-off-by: KAMEZAWA Hiroyuki <kamezawa.hiroyu@...fujitsu.com>
---
init/main.c | 4 ++--
mm/mempolicy.c | 7 ++++++-
2 files changed, 8 insertions(+), 3 deletions(-)
Index: mmotm-2.6.31-Jul16/init/main.c
===================================================================
--- mmotm-2.6.31-Jul16.orig/init/main.c
+++ mmotm-2.6.31-Jul16/init/main.c
@@ -855,9 +855,9 @@ static int __init kernel_init(void * unu
lock_kernel();
/*
- * init can allocate pages on any node
+ * init can allocate pages on any online node
*/
- set_mems_allowed(node_possible_map);
+ set_mems_allowed(node_state[N_HIGH_MEMORY]);
/*
* init can run on any cpu.
*/
Index: mmotm-2.6.31-Jul16/mm/mempolicy.c
===================================================================
--- mmotm-2.6.31-Jul16.orig/mm/mempolicy.c
+++ mmotm-2.6.31-Jul16/mm/mempolicy.c
@@ -290,7 +290,12 @@ static void mpol_rebind_nodemask(struct
*nodes);
pol->w.cpuset_mems_allowed = *nodes;
}
-
+ /*
+ * tmp can be invalid ...just use cpuset's one in that case.
+ */
+ if (nodes_empty(tmp) ||
+ ((pol->mode == MPOL_BIND) && !is_valid_nodemask(&tmp)))
+ tmp = *nodes;
pol->v.nodes = tmp;
if (!node_isset(current->il_next, tmp)) {
current->il_next = next_node(current->il_next, tmp);
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists