[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <6858f130-e6b4-1ba7-ed6f-58c00152be69@virtuozzo.com>
Date: Mon, 23 Aug 2021 08:59:58 +0300
From: Vasily Averin <vvs@...tuozzo.com>
To: Christoph Paasch <christoph.paasch@...il.com>
Cc: "David S. Miller" <davem@...emloft.net>,
Hideaki YOSHIFUJI <yoshfuji@...ux-ipv6.org>,
David Ahern <dsahern@...nel.org>,
Jakub Kicinski <kuba@...nel.org>,
Eric Dumazet <eric.dumazet@...il.com>,
netdev <netdev@...r.kernel.org>,
LKML <linux-kernel@...r.kernel.org>, kernel@...nvz.org,
Julian Wiedmann <jwi@...ux.ibm.com>
Subject: Re: [PATCH NET v4 3/7] ipv6: use skb_expand_head in ip6_xmit
On 8/23/21 8:44 AM, Vasily Averin wrote:
> On 8/22/21 8:13 PM, Christoph Paasch wrote:
>> On Sun, Aug 22, 2021 at 10:04 AM Christoph Paasch
>> <christoph.paasch@...il.com> wrote:
>>>
>>> Hello Vasily,
>>>
>>> On Fri, Aug 20, 2021 at 11:21 PM Vasily Averin <vvs@...tuozzo.com> wrote:
>>>>
>>>> On 8/21/21 1:44 AM, Christoph Paasch wrote:
>>>>> (resend without html - thanks gmail web-interface...)
>>>>> On Fri, Aug 20, 2021 at 3:41 PM Christoph Paasch
>>>>>> AFAICS, this is because pskb_expand_head (called from
>>>>>> skb_expand_head) is not adjusting skb->truesize when skb->sk is set
>>>>>> (which I guess is the case in this particular scenario). I'm not
>>>>>> sure what the proper fix would be though...
>>>>
>>>> Could you please elaborate?
>>>> it seems to me skb_realloc_headroom used before my patch called pskb_expand_head() too
>>>> and did not adjusted skb->truesize too. Am I missed something perhaps?
>>>>
>>>> The only difference in my patch is that skb_clone can be not called,
>>>> though I do not understand how this can affect skb->truesize.
>>>
>>> I *believe* that the difference is that after skb_clone() skb->sk is
>>> NULL and thus truesize will be adjusted.
>>>
>>> I will try to confirm that with some more debugging.
>>
>> Yes indeed.
>>
>> Before your patch:
>> [ 19.154039] ip6_xmit before realloc truesize 4864 sk? 000000002ccd6868
>> [ 19.155230] ip6_xmit after realloc truesize 5376 sk? 0000000000000000
>>
>> skb->sk is not set and thus truesize will be adjusted.
>
> This looks strange for me. skb should not lost sk reference.
>
> Could you please clarify where exactly you cheked it?
> sk on newly allocated skb is set on line 291
>
> net/ipv6/ip6_output.c::ip6_xmit()
> 282 if (unlikely(skb_headroom(skb) < head_room)) {
> 283 struct sk_buff *skb2 = skb_realloc_headroom(skb, head_room);
> 284 if (!skb2) {
> 285 IP6_INC_STATS(net, ip6_dst_idev(skb_dst(skb)),
> 286 IPSTATS_MIB_OUTDISCARDS);
> 287 kfree_skb(skb);
> 288 return -ENOBUFS;
> 289 }
> 290 if (skb->sk)
> 291 skb_set_owner_w(skb2, skb->sk); <<<<< here
> 292 consume_skb(skb);
> 293 skb = skb2;
> 294 }
>
>> With your change:
>> [ 15.092933] ip6_xmit before realloc truesize 4864 sk? 00000000072930fd
>> [ 15.094131] ip6_xmit after realloc truesize 4864 sk? 00000000072930fd
>>
>> skb->sk is set and thus truesize is not adjusted.
>
> In this case skb_set_owner_w() is called inside skb_expand_head()
>
> net/ipv6/ip6_output.c::ip6_xmit()
> 265 if (unlikely(head_room > skb_headroom(skb))) {
> 266 skb = skb_expand_head(skb, head_room);
> 267 if (!skb) {
> 268 IP6_INC_STATS(net, idev, IPSTATS_MIB_OUTDISCARDS);
> 269 return -ENOBUFS;
> 270 }
> 271 }
>
> net/core/skbuff.c::skb_expand_head()
> 1813 if (skb_shared(skb)) {
> 1814 struct sk_buff *nskb = skb_clone(skb, GFP_ATOMIC);
> 1815
> 1816 if (likely(nskb)) {
> 1817 if (skb->sk)
> 1818 skb_set_owner_w(nskb, skb->sk); <<<< here
> 1819 consume_skb(skb);
> 1820 } else {
> 1821 kfree_skb(skb);
> 1822 }
> 1823 skb = nskb;
> 1824 }
>
> So I do not understand how this can happen.
> With my patch:
> a) if skb is not shared -- it should keep original skb->sk
> b) if skb is shared -- new skb should set sk if it was set on original skb.
>
> Your results can be explained if you looked and skb->sk and truesize right after skb_realloc_headroom() call
> but before following skb_set_owner_w(). Could you please check it?
It seems I've found the reason:
before my change pskb_expand_head() is called for newly cloned skb where sk was not set.
after my change skb->sk is set before following pskb_expand_head() call
On own turn pskb_expand_head() adjust truesize:
net/core/skbuff.c::pskb_expand_head()
1751 /* It is not generally safe to change skb->truesize.
1752 * For the moment, we really care of rx path, or
1753 * when skb is orphaned (not attached to a socket).
1754 */
1755 if (!skb->sk || skb->destructor == sock_edemux)
1756 skb->truesize += size - osize;
1757
1758 return 0;
Could you please confirm it?
Thank you,
Vasily Averin
Powered by blists - more mailing lists