[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20120429165846.GA19054@redhat.com>
Date: Sun, 29 Apr 2012 18:58:46 +0200
From: Oleg Nesterov <oleg@...hat.com>
To: "Eric W. Biederman" <ebiederm@...ssion.com>
Cc: Mike Galbraith <efault@....de>,
LKML <linux-kernel@...r.kernel.org>,
Pavel Emelyanov <xemul@...allels.com>,
Cyrill Gorcunov <gorcunov@...nvz.org>,
Louis Rilling <louis.rilling@...labs.com>
Subject: Re: [RFC PATCH] namespaces: fix leak on fork() failure
On 04/29, Eric W. Biederman wrote:
>
> Oleg Nesterov <oleg@...hat.com> writes:
>
> > Heh. Please look at http://marc.info/?l=linux-kernel&m=127687751003902
> > and the whole thread, there are a lot more problems here.
>
> I don't remember seeing a leak in that conversation.
It was discussed many times ;) in particular, from the link above:
Note: afaics we have another problem. What if copy_process(CLONE_NEWPID)
fails after pid_ns_prepare_proc() ? Who will do mntput() ?
But we all forgot about this (relatively minor) problem.
> > But this particular one looks simple iirc.
> >
> >> @@ -216,6 +216,14 @@ void switch_task_namespaces(struct task_struct *p, struct nsproxy *new)
> >> rcu_assign_pointer(p->nsproxy, new);
> >>
> >> if (ns && atomic_dec_and_test(&ns->count)) {
> >> + /* Handle fork() failure, unmount proc before proceeding */
> >> + if (unlikely(!new && !((p->flags & PF_EXITING)))) {
> >> + struct pid_namespace *pid_ns = ns->pid_ns;
> >> +
> >> + if (pid_ns && pid_ns != &init_pid_ns)
> >> + pid_ns_release_proc(pid_ns);
> >> + }
> >> +
> >> /*
> >> * wait for others to get what they want from this nsproxy.
> >> *
> >
> > At first glance this looks correct. But the PF_EXITING check doesn't
> > look very nice imho. It is needed to detect the case when the caller
> > is copy_process()->bad_fork_cleanup_namespaces and p is not current.
>
> Mike's proposed change to switch_task_namespace is most definitely not
> correct. This will potentially get called on unshare
Yes, but please note that this change also checks "new == NULL", so I
still think the patch is correct.
But,
> > bad_fork_cleanup_namespaces:
> > + if (unlikely(clone_flags & CLONE_NEWPID))
> > + pid_ns_release_proc(...);
> > exit_task_namespaces(p);
> >
> >
> > code into this error path in copy_process?
>
> For now Oleg your minimal patch looks good.
Good.
Mike, could you please re-send the patch to akpm? Feel free to add my ack.
I guess Eric will ack this fix too.
> Part of me would like to call proc_flush_task instead,
Yes, I thought about this too, it checks upid->nr == 1. But
> pid_ns_release_proc but we have no assurance task_pid and task_tgid are
> valid when we get here so proc_flush_task is out.
Yes.
> There are crazy code paths like daemonize()
Forget. It has no callers anymore, should be killed. A user-space process
should never use kernel_thread() and thus daemonize() is not needed.
Oleg.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists