Message-ID: <1319030740.8416.14.camel@edumazet-laptop>
Date: Wed, 19 Oct 2011 15:25:40 +0200
From: Eric Dumazet <eric.dumazet@...il.com>
To: Johannes Berg <johannes@...solutions.net>
Cc: Richard Cochran <richardcochran@...il.com>,
David Miller <davem@...emloft.net>, netdev@...r.kernel.org
Subject: Re: [PATCH 0/3] net: time stamping fixes
On Wednesday, 19 October 2011 at 15:09 +0200, Johannes Berg wrote:
> On Wed, 2011-10-19 at 14:58 +0200, Johannes Berg wrote:
>
> > Not disputing this either. But you said sk_refcnt can be 0, so why can't
> > the following happen:
> >
> > /* skb; skb->sk = sk; skb->destructor = sock_wfree; */
> >
> > /* skb is on qdisc, some time passes */
> >
> > sk_free(sk); /* user closed socket,
> > sk->sk_refcnt reaches 0,
> > sk->sk_wmem_alloc == skb->truesize,
> > __sk_free not called, socket still lives,
> > but no more +1 in sk_wmem_alloc */
> >
> > /* some more time passes */
> >
> > /* ethernet hard_start_xmit calls skb_clone_tx_timestamp() */
> > skb2 = skb_clone(skb);
> > skb2->sk = skb->sk;
> > sock_hold(skb->sk);
> >
> > /* ethernet TX completion calls skb_free(skb) */
> > skb_free(skb):
> > sock_wfree(skb); /* sk_wmem_alloc reaches 0,
> > __sk_free called DESPITE sk_refcnt > 0 */
> >
> > /* later, in skb_complete_tx_timestamp() */
> > sock_put(sk); /* KABOOM */
>
>
> Given the complexity of all this, I'm not sure we shouldn't do something
> like this, but I have no idea what the cost would be:
>
> --- wireless-testing.orig/include/net/sock.h 2011-10-18 22:28:41.000000000 +0200
> +++ wireless-testing/include/net/sock.h 2011-10-19 15:08:45.000000000 +0200
> @@ -434,7 +434,10 @@ static __inline__ int __sk_del_node_init
>
> static inline void sock_hold(struct sock *sk)
> {
> - atomic_inc(&sk->sk_refcnt);
> + if (atomic_inc_return(&sk->sk_refcnt) == 1) {
> + /* was zero -- we must've gotten an sk_wmem_alloc reference */
> + atomic_inc(&sk->sk_wmem_alloc);
> + }
> }
>
Hmm, it will be difficult to handle two atomics without adding races,
and this adds a quite expensive atomic_inc_return() on some arches.

I would just change the skb tx cloning to take a normal reference on
sk_wmem_alloc:

	atomic_add(skb->truesize, &sk->sk_wmem_alloc);

instead of

	sock_hold(sk);
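
To make the difference concrete, here is a rough userspace sketch of the
sequence quoted above. It is a toy model, not kernel code: the toy_* names
are made up for illustration, plain ints stand in for the atomics, and it
assumes the clone's charge is later dropped through a sock_wfree()-style
path.

```c
#include <assert.h>
#include <stdbool.h>

/* Toy stand-ins for struct sock / struct sk_buff (hypothetical names). */
struct toy_sock {
	int sk_refcnt;		/* ordinary references */
	int sk_wmem_alloc;	/* TX bytes in flight, +1 while the socket is open */
	bool freed;		/* set when the model "frees" the socket */
};

struct toy_skb {
	struct toy_sock *sk;
	int truesize;
};

/* Models __sk_free(): the socket is gone after this. */
static void toy_sk_destroy(struct toy_sock *sk)
{
	assert(!sk->freed);	/* catch a double free in the model */
	sk->freed = true;
}

/* Models sk_free(): drop the +1 bias held in sk_wmem_alloc. */
static void toy_sk_free(struct toy_sock *sk)
{
	if (--sk->sk_wmem_alloc == 0)
		toy_sk_destroy(sk);
}

/* Models sock_put(): the last ordinary reference calls sk_free(). */
static void toy_sock_put(struct toy_sock *sk)
{
	if (--sk->sk_refcnt == 0)
		toy_sk_free(sk);
}

/* Models sock_wfree(): uncharge the skb, free the socket at zero. */
static void toy_sock_wfree(struct toy_skb *skb)
{
	skb->sk->sk_wmem_alloc -= skb->truesize;
	if (skb->sk->sk_wmem_alloc == 0)
		toy_sk_destroy(skb->sk);
}

/*
 * Replays the quoted sequence. Returns true if the timestamp completion
 * would touch a socket that has already been freed.
 */
static bool toy_simulate(bool clone_charges_wmem)
{
	struct toy_sock sk = { 1, 1, false };
	struct toy_skb skb = { &sk, 256 };
	struct toy_skb clone = { &sk, 256 };
	bool use_after_free;

	sk.sk_wmem_alloc += skb.truesize;	/* skb queued on the qdisc */
	toy_sock_put(&sk);			/* user closes the socket */

	if (clone_charges_wmem)
		sk.sk_wmem_alloc += clone.truesize;	/* suggested change */
	else
		sk.sk_refcnt++;				/* sock_hold() */

	toy_sock_wfree(&skb);			/* TX completion frees the skb */

	use_after_free = sk.freed;		/* timestamp completion */
	if (clone_charges_wmem)
		toy_sock_wfree(&clone);
	else
		toy_sock_put(&sk);

	return use_after_free;
}
```

In this model toy_simulate(false) reports the use-after-free from the
sequence above, while toy_simulate(true) keeps the socket alive until the
clone itself is released.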