lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20100831111413.GB12853@elte.hu>
Date:	Tue, 31 Aug 2010 13:14:13 +0200
From:	Ingo Molnar <mingo@...e.hu>
To:	Eric Dumazet <eric.dumazet@...il.com>
Cc:	netdev@...r.kernel.org, linux-kernel@...r.kernel.org,
	"Michael S. Tsirkin" <mst@...hat.com>,
	"David S. Miller" <davem@...emloft.net>, Tejun Heo <tj@...nel.org>
Subject: Re: vhost_dev_cleanup() crash: BUG: unable to handle kernel NULL
 pointer dereference


* Eric Dumazet <eric.dumazet@...il.com> wrote:

> Le mardi 31 août 2010 à 09:57 +0200, Ingo Molnar a écrit :
> > FYI, there's a new crash in the vnet driver that occasionally triggers 
> > on ordinary host bootups as well, when (non-virtualized) networking 
> > initializes:
> > 
> >  [   86.563889]  [<ffffffff81b05655>] page_fault+0x25/0x30
> >  [   86.569065]  [<ffffffff8186d899>] ? vhost_poll_flush+0x11a/0x156
> >  [   86.575119]  [<ffffffff8105f511>] ? kthread_stop+0xa/0x57
> >  [   86.580544]  [<ffffffff8186e535>] vhost_dev_cleanup+0x269/0x271
> >  [   86.586528]  [<ffffffff8186e974>] vhost_net_release+0x48/0x7f
> >  [   86.592359]  [<ffffffff810c5419>] fput+0x120/0x1d4
> >  [   86.597185]  [<ffffffff810c2a1d>] filp_close+0x63/0x6d
> >  [   86.602353]  [<ffffffff810c2acf>] sys_close+0xa8/0xe2
> >  [   86.607429]  [<ffffffff81008602>] system_call_fastpath+0x16/0x1b
> > 
> > See the full crashlog below. Config attached.
> > 
> > AFAICT this bug probably went upstream during the merge window.
> > 
> > Thanks,
> > 
> > 	Ingo
> > 
> > [   86.262123] BUG: unable to handle kernel NULL pointer dereference at 0000000000000010
> > [   86.265200] IP:ntry for device- [<ffffffff8105f511>] kthread_stop+0xa/0x57
> > [   86.265200] PGD 3ad75067 PUD 3b352067 PMD 0 
> > [   86.265200] Oops: 0002 [#1] SMP 
> > [   86.265200] last sysfs file: /sys/devices/pnp0/00:0d/id
> > [   86.265200] CPU 0 mapper found
> > Is
> > [   86.265200] Pid: 1254, comm: multipath.stati Not tainted 2.6.36-rc3-tip+ #31158 A8N-E/System Product Name
> >  device-mapper d[   86.265200] RIP: 0010:[<ffffffff8105f511>]  [<ffffffff8105f511>] kthread_stop+0xa/0x57
> > river missing fr[   86.265200] RSP: 0018:ffff88003ae83e58  EFLAGS: 00010246
> > [   86.265200] RAX: ffff88003d1dc170 RBX: 0000000000000000 RCX: 0000000000000000
> > [   86.265200] RDX: ffff88003aa82030 RSI: 0000000000000001 RDI: 0000000000000000
> > om kernel?
> > devi[   86.265200] RBP: ffff88003ae83e68 R08: ffff88003ae83e68 R09: 0000000000000001
> > [   86.265200] R10: ffffffff8186d899 R11: 0000000000000246 R12: ffff88003d1dc8f0
> > [   86.265200] R13: 0000000000000002 R14: ffff88003b2a1000 R15: ffff88003aa82030
> > [   86.265200] FS:  0000000001fc5880(0063) GS:ffff88003fc00000(0000) knlGS:0000000000000000
> > ce-mapper: versi[   86.265200] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> > [   86.265200] CR2: 0000000000000010 CR3: 000000003b377000 CR4: 00000000000006f0
> > on ioctl failed:[   86.265200] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> > [   86.265200] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> > [   86.265200] Process multipath.stati (pid: 1254, threadinfo ffff88003ae82000, task ffff88003b3e8000)
> > [   86.265200] Stack:
> > [   86.265200]  ffff88003d1dc090 ffff88003d1dc8f0 ffff88003ae83e98 ffffffff8186e535
> > [   86.265200] <0> ffff88003ae83e98 ffff88003d1dc090 0000000000000000 0000000000000000
> > [   86.265200] <0> ffff88003ae83ec8 ffffffff8186e974 0000000000000008 ffff88003b3a6180
> > [   86.265200] Call Trace:
> > [   86.265200]  [<ffffffff8186e535>] vhost_dev_cleanup+0x269/0x271
> >  Operation not p[   86.265200]  [<ffffffff8186e974>] vhost_net_release+0x48/0x7f
> > [   86.265200]  [<ffffffff810c5419>] fput+0x120/0x1d4
> > ermitted
> > [   86.265200]  [<ffffffff810c2a1d>] filp_close+0x63/0x6d
> > [   86.265200]  [<ffffffff810c2acf>] sys_close+0xa8/0xe2
> > [   86.265200]  [<ffffffff81008602>] system_call_fastpath+0x16/0x1b
> > [   86.265200] Code: 4c 8b 25 83 b4 16 01 49 81 fc 70 a9 1c 82 75 94 48 c7 c7 80 a9 1c 82 e8 99 5b aa 00 e9 50 ff ff ff 55 48 89 e5 41 54 53 48 89 fb <f0> ff 47 10 4c 8b a7 00 05 00 00 48 83 bf 00 05 00 00 00 74 16 
> > [   86.265200] RIP  [<ffffffff8105f511>] kthread_stop+0xa/0x57
> > [   86.265200]  RSP <ffff88003ae83e58>
> > [   86.265200] CR2: 0000000000000010
> > [   86.499743] ---[ end trace 433623c38ffeb225 ]---
> > [   86.504397] Kernel panic - not syncing: Fatal exception
> > [   86.509633] Pid: 1254, comm: multipath.stati Tainted: G      D     2.6.36-rc3-tip+ #31158
> > [   86.517858] Call Trace:
> > [   86.520343]  [<ffffffff81b01c87>] panic+0x8c/0x196
> > [   86.525181]  [<ffffffff81048405>] ? kmsg_dump+0x126/0x140
> > [   86.530606]  [<ffffffff8100c43d>] oops_end+0x8f/0x9c
> > [   86.535611]  [<ffffffff8102d93d>] no_context+0x1f7/0x206
> > [   86.540948]  [<ffffffff8102dacb>] __bad_area_nosemaphore+0x17f/0x1a2
> > [   86.547334]  [<ffffffff8102db40>] bad_area+0x42/0x49
> > [   86.552329]  [<ffffffff8102de84>] do_page_fault+0x1fe/0x363
> > [   86.557925]  [<ffffffff81352dff>] ? do_raw_spin_lock+0x6b/0x122
> > [   86.563889]  [<ffffffff81b05655>] page_fault+0x25/0x30
> > [   86.569065]  [<ffffffff8186d899>] ? vhost_poll_flush+0x11a/0x156
> > [   86.575119]  [<ffffffff8105f511>] ? kthread_stop+0xa/0x57
> > [   86.580544]  [<ffffffff8186e535>] vhost_dev_cleanup+0x269/0x271
> > [   86.586528]  [<ffffffff8186e974>] vhost_net_release+0x48/0x7f
> > [   86.592359]  [<ffffffff810c5419>] fput+0x120/0x1d4
> > [   86.597185]  [<ffffffff810c2a1d>] filp_close+0x63/0x6d
> > [   86.602353]  [<ffffffff810c2acf>] sys_close+0xa8/0xe2
> > [   86.607429]  [<ffffffff81008602>] system_call_fastpath+0x16/0x1b
> > [   86.613516] Rebooting in 1 seconds..Press any key to enter the menu
> 
> Hi Ingo
> 
> Seems to be commit c23f3445e68e1
> (vhost: replace vhost_workqueue with per-vhost kthread)
> 
> following patch should cure it ?
> 
> Thanks
> 
> [PATCH] vhost: stop worker only if created
> 
> Its illegal to call kthread_stop(NULL)
> 
> Reported-by: Ingo Molnar <mingo@...e.hu>
> Signed-off-by: Eric Dumazet <eric.dumazet@...il.com>
> ---
> diff --git a/drivers/vhost/vhost.c b/drivers/vhost/vhost.c
> index e05557d..0a00121 100644
> --- a/drivers/vhost/vhost.c
> +++ b/drivers/vhost/vhost.c
> @@ -323,7 +323,8 @@ void vhost_dev_cleanup(struct vhost_dev *dev)
>  	dev->mm = NULL;
>  
>  	WARN_ON(!list_empty(&dev->work_list));
> -	kthread_stop(dev->worker);
> +	if (dev->worker)
> +		kthread_stop(dev->worker);

Btw., i think this check should be pushed into kthread_stop() instead - 
just like kfree(NULL) is possible as well - it simplifies cleanup 
sequences.

Thanks,

	Ingo
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ