[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <4F8EA64B.2050208@parallels.com>
Date: Wed, 18 Apr 2012 15:32:27 +0400
From: Stanislav Kinsbursky <skinsbursky@...allels.com>
To: Simon Kirby <sim@...tway.ca>
CC: Eric Dumazet <eric.dumazet@...il.com>,
"netdev@...r.kernel.org" <netdev@...r.kernel.org>
Subject: Re: 3.3.0, 3.4-rc1 reproducible tun Oops
17.04.2012 22:35, Simon Kirby пишет:
> On Tue, Apr 17, 2012 at 04:18:53PM +0400, Stanislav Kinsbursky wrote:
>
>> 17.04.2012 06:08, Simon Kirby ??????????:
>>> On Thu, Apr 05, 2012 at 04:41:04AM +0200, Eric Dumazet wrote:
>>>
>>>> Hmm, is it happening if you remove the nvidia module ?
>>>>
>>>> If yes, please try to add slub_debug=FZPU
>>>
>>> Finally got annoyed enough at this to bisect it. It doesn't happen every
>>> time and I got a bit confused, but I finally tracked it down to:
>>>
>>> 1ab5ecb90cb6a3df1476e052f76a6e8f6511cb3d is the first bad commit
>>> commit 1ab5ecb90cb6a3df1476e052f76a6e8f6511cb3d
>>> Author: Stanislav Kinsbursky<skinsbursky@...allels.com>
>>> Date: Mon Mar 12 02:59:41 2012 +0000
>>>
>>> tun: don't hold network namespace by tun sockets
>>>
>>> v3: added previously removed sock_put() to the tun_release() callback, because
>>> sk_release_kernel() doesn't drop the socket reference.
>>>
>>> v2: sk_release_kernel() used for socket release. Dummy tun_release() is
>>> required for sk_release_kernel() ---> sock_release() ---> sock->ops->release()
>>> call.
>>>
>>> TUN was designed to destroy it's socket on network namesapce shutdown. But this
>>> will never happen for persistent device, because it's socket holds network
>>> namespace.
>>> This patch removes of holding network namespace by TUN socket and replaces it
>>> by creating socket in init_net and then changing it's net it to desired one. On
>>> shutdown socket is moved back to init_net prior to final put.
>>>
>>> Signed-off-by: Stanislav Kinsbursky<skinsbursky@...allels.com>
>>> Signed-off-by: David S. Miller<davem@...emloft.net>
>>>
>>> ...With this reverted on top of 3.4-rc3, I no longer see crashes when I
>>> keep making and breaking the SSH tunnel while running "vmstat 1" in an
>>> SSH session over a socket that is running through that tunnel.
>>>
>>> Simon-
>>
>> Hi, Simon.
>> Could you please try to apply the patch below on top of your the
>> tree (with 1ab5ecb90cb6a3df1476e052f76a6e8f6511cb3d applied) and
>> check does it fix the problem:
>>
>> diff --git a/drivers/net/tun.c b/drivers/net/tun.c
>> index bb8c72c..1fc4622 100644
>> --- a/drivers/net/tun.c
>> +++ b/drivers/net/tun.c
>> @@ -1540,13 +1540,10 @@ static int tun_chr_close(struct inode
>> *inode, struct file *file)
>> if (dev->reg_state == NETREG_REGISTERED)
>> unregister_netdevice(dev);
>> rtnl_unlock();
>> - }
>> + } else
>> + sock_put(tun->socket.sk);
>> }
>>
>> - tun = tfile->tun;
>> - if (tun)
>> - sock_put(tun->socket.sk);
>> -
>> put_net(tfile->net);
>> kfree(tfile);
>
> (Whitespace-damaged patch, applied manually)
>
> Yes, I no longer see crashes with this applied. I haven't tried with
> kmemleak or similar, but it seems to work.
>
> Thanks,
>
This bug looks like double free, but I can't understand how does this can happen...
Simon, would be really great, if you'll describe in details some simple way, how
to reproduce the bug.
--
Best regards,
Stanislav Kinsbursky
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists