[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-Id: <20190104.125414.2212636411296708605.davem@davemloft.net>
Date: Fri, 04 Jan 2019 12:54:14 -0800 (PST)
From: David Miller <davem@...emloft.net>
To: rientjes@...gle.com
Cc: edumazet@...gle.com, akpm@...ux-foundation.org, willemb@...gle.com,
mhocko@...e.com, vbabka@...e.cz, netdev@...r.kernel.org,
linux-kernel@...r.kernel.org
Subject: Re: [patch] net, skbuff: do not prefer skb allocation fails early
From: David Rientjes <rientjes@...gle.com>
Date: Wed, 2 Jan 2019 13:01:43 -0800 (PST)
> Commit dcda9b04713c ("mm, tree wide: replace __GFP_REPEAT by
> __GFP_RETRY_MAYFAIL with more useful semantic") replaced __GFP_REPEAT in
> alloc_skb_with_frags() with __GFP_RETRY_MAYFAIL when the allocation may
> directly reclaim.
>
> The previous behavior would require reclaim up to 1 << order pages for
> skb aligned header_len of order > PAGE_ALLOC_COSTLY_ORDER before failing,
> otherwise the allocations in alloc_skb() would loop in the page allocator
> looking for memory. __GFP_RETRY_MAYFAIL makes both allocations failable
> under memory pressure, including for the HEAD allocation.
>
> This can cause, among many other things, write() to fail with ENOTCONN
> during RPC when under memory pressure.
>
> These allocations should succeed as they did previous to dcda9b04713c
> even if it requires calling the oom killer and additional looping in the
> page allocator to find memory. There is no way to specify the previous
> behavior of __GFP_REPEAT, but it's unlikely to be necessary since the
> previous behavior only guaranteed that 1 << order pages would be reclaimed
> before failing for order > PAGE_ALLOC_COSTLY_ORDER. That reclaim is not
> guaranteed to be contiguous memory, so repeating for such large orders is
> usually not beneficial.
>
> Removing the setting of __GFP_RETRY_MAYFAIL to restore the previous
> behavior, specifically not allowing alloc_skb() to fail for small orders
> and oom kill if necessary rather than allowing RPCs to fail.
>
> Fixes: dcda9b04713c ("mm, tree wide: replace __GFP_REPEAT by
> __GFP_RETRY_MAYFAIL with more useful semantic")
> Signed-off-by: David Rientjes <rientjes@...gle.com>
Applied and queued up for -stable.
Powered by blists - more mailing lists