[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20140316162512.GA9467@redhat.com>
Date: Sun, 16 Mar 2014 17:25:12 +0100
From: Oleg Nesterov <oleg@...hat.com>
To: Tetsuo Handa <penguin-kernel@...ove.SAKURA.ne.jp>
Cc: rientjes@...gle.com, akpm@...ux-foundation.org,
joseph.salisbury@...onical.com, torvalds@...ux-foundation.org,
tj@...nel.org, tglx@...utronix.de, linux-kernel@...r.kernel.org,
kernel-team@...ts.ubuntu.com
Subject: Re: [v3.13][v3.14][Regression] kthread: make kthread_create()
killable
On 03/17, Tetsuo Handa wrote:
>
> Therefore, I'd like to propose this patch for 3.14-final
> and 3.13-stable.
Well, I disagree. To me, the patch tries to fix the problem in the wrong
place,
> Commit 786235ee "kthread: make kthread_create() killable" changed to
> leave kthread_create() as soon as receiving SIGKILL. But this change
> caused boot failures if systemd-udevd received SIGKILL (probably due
> to timeout) while loading SCSI controller drivers using
> finit_module() [1].
Shouldn't we fix the caller instead? It should handle the error from
kthread_create() correctly.
And could you tell who is the caller which doesn't do this? If it can't
be fixed, then, say, it can use workqueue to create a kernel thread.
> @@ -292,6 +292,17 @@ struct task_struct *kthread_create_on_node(int (*threadfn)(void *data),
> * new kernel thread.
> */
> if (unlikely(wait_for_completion_killable(&done))) {
> + int i = 0;
> +
> + /*
> + * I got SIGKILL, but wait for 10 more seconds for completion
> + * unless chosen by the OOM killer. This delay is there as a
> + * workaround for boot failure caused by SIGKILL upon device
> + * driver initialization timeout.
> + */
> + while (i++ < 10 && !test_tsk_thread_flag(current, TIF_MEMDIE))
> + if (wait_for_completion_timeout(&done, HZ))
> + goto ready;
Personally I really dislike this hack. And btw, why we return -ENOMEM if
SIGKILL'ed? Why not EINTR ?
If nothing else we can change the caller to do
for (;;) {
kthread = kthread_create(...);
if (!IS_ERR(kthread) || PTR_ERR(kthread) != -EINTR)
break;
// FIXME, I am stupid and can't handle SIGKILL properly
clear_thread_flag(TIF_SIGPENDING);
}
recalc_sigpending();
Oleg.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists