[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <c1a673ab-0382-445e-aa45-2b8fe2f6bc40@nvidia.com>
Date: Tue, 28 Oct 2025 21:36:17 +0800
From: Jianbo Liu <jianbol@...dia.com>
To: Sabrina Dubroca <sd@...asysnail.net>
CC: <netdev@...r.kernel.org>, <davem@...emloft.net>, <kuba@...nel.org>,
<steffen.klassert@...unet.com>, Cosmin Ratiu <cratiu@...dia.com>, Herbert Xu
<herbert@...dor.apana.org.au>, Eric Dumazet <edumazet@...gle.com>, "Paolo
Abeni" <pabeni@...hat.com>, Simon Horman <horms@...nel.org>, David Ahern
<dsahern@...nel.org>
Subject: Re: [PATCH ipsec v3 2/2] xfrm: Determine inner GSO type from packet
inner protocol
On 10/28/2025 7:03 PM, Sabrina Dubroca wrote:
> 2025-10-28, 04:22:48 +0200, Jianbo Liu wrote:
>> The GSO segmentation functions for ESP tunnel mode
>> (xfrm4_tunnel_gso_segment and xfrm6_tunnel_gso_segment) were
>> determining the inner packet's L2 protocol type by checking the static
>> x->inner_mode.family field from the xfrm state.
>>
>> This is unreliable. In tunnel mode, the state's actual inner family
>> could be defined by x->inner_mode.family or by
>> x->inner_mode_iaf.family. Checking only the former can lead to a
>> mismatch with the actual packet being processed, causing GSO to create
>> segments with the wrong L2 header type.
>>
>> This patch fixes the bug by deriving the inner mode directly from the
>> packet's inner protocol stored in XFRM_MODE_SKB_CB(skb)->protocol.
>>
>> Instead of replicating the code, this patch modifies the
>> xfrm_ip2inner_mode helper function. It now correctly returns
>> &x->inner_mode if the selector family (x->sel.family) is already
>> specified, thereby handling both specific and AF_UNSPEC cases
>> appropriately.
>
> (nit: I think this paragraph goes a bit too much into describing the
> changes between versions)
>
>> With this change, ESP GSO can use xfrm_ip2inner_mode to get the
>> correct inner mode. It doesn't affect existing callers, as the updated
>> logic now mirrors the checks they were already performing externally.
>
> Sorry, maybe I wasn't clear, but I meant that the callers should also
> be updated to not do the AF_UNSPEC check anymore (note: this will
> cause merge conflicts with your "NULL inner_mode" cleanup patch [1]).
>
> And I think it would be nicer to split the refactoring into a separate
> patch. So this series would be:
>
> patch 1: fix xfrm_dev_offload_ok and xfrm_get_inner_ipproto (same as now)
> patch 2: modify xfrm_ip2inner_mode and remove the AF_UNSPEC check and
> setting inner_mode = &x->inner_mode from all callers
> [no behavior change, just a refactoring to prepare for patch 3]
> patch 3: use xfrm_ip2inner_mode for GSO (same as your v2 patch 2/2)
>
> Does that seem ok to you?
>
>
> And to avoid the merge conflict with [1], maybe it also makes more
> sense to integrate that clean up in patch 2 from the list above, so
> for ip_vti we'd have:
>
> diff --git a/net/ipv4/ip_vti.c b/net/ipv4/ip_vti.c
> index 95b6bb78fcd2..89784976c65e 100644
> --- a/net/ipv4/ip_vti.c
> +++ b/net/ipv4/ip_vti.c
> @@ -118,16 +118,7 @@ static int vti_rcv_cb(struct sk_buff *skb, int err)
>
> x = xfrm_input_state(skb);
>
> - inner_mode = &x->inner_mode;
> -
> - if (x->sel.family == AF_UNSPEC) {
> - inner_mode = xfrm_ip2inner_mode(x, XFRM_MODE_SKB_CB(skb)->protocol);
> - if (inner_mode == NULL) {
> - XFRM_INC_STATS(dev_net(skb->dev),
> - LINUX_MIB_XFRMINSTATEMODEERROR);
> - return -EINVAL;
> - }
> - }
> + inner_mode = xfrm_ip2inner_mode(x, XFRM_MODE_SKB_CB(skb)->protocol);
>
> family = inner_mode->family;
>
>
>
> Does that sound reasonable?
I have a concern regarding backporting.
Patches 1 and 3 in your proposed structure are bug fixes that should
ideally go into the ipsec tree and be suitable for stable backports.
Patch 2 should be targeted to ipsec-next as refactoring often does.
If so, patch 3 becomes dependent on a change that won't exist in older
kernels, making it difficult to backport cleanly.
To maintain backportability for the GSO fix, I'd prefer to keep the
modification to xfrm_ip2inner_mode within the same patch that fixes the
GSO code (which is currently my v3 patch 2/2).
My proposed plan is:
Send the patch 1 and patch 3 (including the xfrm_ip2inner_mode change)
together to the ipsec tree. They are self-contained fixes.
Separately, after those are accepted, I can modify and re-submit that
patch [1] to ipsec-next that removes the now-redundant checks from the
other callers (VTI, etc.), leveraging the updated helper function.
This way, the critical fixes are self-contained and backportable, while
the cleanup of other callers happens later in the development cycle.
>
> [1] https://lore.kernel.org/netdev/20251027023818.46446-1-jianbol@nvidia.com/
>
Powered by blists - more mailing lists