[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <200702172324.44892.rjw@sisk.pl>
Date: Sat, 17 Feb 2007 23:24:43 +0100
From: "Rafael J. Wysocki" <rjw@...k.pl>
To: Oleg Nesterov <oleg@...sign.ru>
Cc: ego@...ibm.com, akpm@...l.org, paulmck@...ibm.com, mingo@...e.hu,
vatsa@...ibm.com, dipankar@...ibm.com,
venkatesh.pallipadi@...el.com, linux-kernel@...r.kernel.org,
Pavel Machek <pavel@....cz>
Subject: Re: [RFC PATCH(Experimental) 0/4] Freezer based Cpu-hotplug
On Saturday, 17 February 2007 22:34, Oleg Nesterov wrote:
> Rafael, I am trying to understand try_to_freeze_tasks(), and I have a
> couple of questions.
>
> static inline int is_user_space(struct task_struct *p)
> {
> return p->mm && !(p->flags & PF_BORROWED_MM);
> }
>
> This doesn't look right. First, an exiting task has ->mm == NULL after
> do_exit()->exit_mm(). Probably not a problem. However, PF_BORROWED_MM
> check is racy without task_lock(), so we can have a false positive as
> well. Is it ok? We can freeze aio_wq prematurely.
Right now aio_wq is not freezeable (PF_NOFREEZE).
>
>
> try_to_freeze_tasks:
>
> do_each_thread(g, p) {
>
> if (p->state == TASK_TRACED && frozen(p->parent)) {
>
> Why we are doing this check outside of "if (is_user_space(p))" ?
> Not a bug of course, but looks strange.
For no particular reason.
>
> cancel_freezing(p);
> continue;
>
> Is it right? Shouldn't we increment "todo" counter?
No. It would be wrong to do that, because TASK_TRACED tasks with frozen
parents cannot be frozen any further.
>
> }
> if (is_user_space(p)) {
> if (!freeze_user_space)
> continue;
>
> /* Freeze the task unless there is a vfork
> * completion pending
> */
> if (!p->vfork_done)
> freeze_process(p);
>
>
> Racy. do_fork(CLONE_VFORK) first does copy_process() which puts 'p' on
> the task list and unlocks tasklist_lock. This means that 'p' is visible
> to try_to_freeze_tasks(), and p->vfork_done == NULL. try_to_freeze_tasks()
> sets TIF_FREEZE.
>
> Now, do_fork() continues, sets ->vfork_done, p goes to user space, notices
> the fake signal and goes to refrigerator while its parent is blocked on
> "struct completion vfork". Freezing failed.
You are right, but this has never happened, AFAICS.
> So, shouldn't we do
>
> if (p->vfork_done)
> cancel_freezing(p);
>
> instead?
I don't think so. If p hasn't got TIF_FREEZE set yet or it has already been
frozen, cancel_freezing(p) is a noop.
Alternatively, we can move the check into refrigerator(), like this:
---
kernel/power/process.c | 21 +++++++--------------
1 file changed, 7 insertions(+), 14 deletions(-)
Index: linux-2.6.20-git13/kernel/power/process.c
===================================================================
--- linux-2.6.20-git13.orig/kernel/power/process.c
+++ linux-2.6.20-git13/kernel/power/process.c
@@ -39,6 +39,11 @@ void refrigerator(void)
/* Hmm, should we be allowed to suspend when there are realtime
processes around? */
long save;
+
+ /* Freeze the task unless there is a vfork completion pending */
+ if (current->vfork_done)
+ return;
+
save = current->state;
pr_debug("%s entered refrigerator\n", current->comm);
@@ -112,22 +117,10 @@ static unsigned int try_to_freeze_tasks(
cancel_freezing(p);
continue;
}
- if (is_user_space(p)) {
- if (!freeze_user_space)
- continue;
-
- /* Freeze the task unless there is a vfork
- * completion pending
- */
- if (!p->vfork_done)
- freeze_process(p);
- } else {
- if (freeze_user_space)
- continue;
-
+ if (is_user_space(p) == !!freeze_user_space) {
freeze_process(p);
+ todo++;
}
- todo++;
} while_each_thread(g, p);
read_unlock(&tasklist_lock);
yield(); /* Yield is okay here */
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists