[<prev] [next>] [day] [month] [year] [list]
Message-ID: <b25cb419-3205-993c-08b1-2ff21cfec9b0@gmail.com>
Date: Sat, 6 Oct 2018 21:28:57 -0600
From: David Ahern <dsahern@...il.com>
To: Preethi Ramachandra <preethir@...iper.net>
Cc: Reji Thomas <rejithomas@...iper.net>,
Yogesh Ankolekar <ayogesh@...iper.net>,
"netdev@...r.kernel.org" <netdev@...r.kernel.org>
Subject: Re: PMTU discovery broken in Linux for UDP/raw application if the
socket is not bound to a device
The correct mailing list is netdev@...r.kernel.org (added)
non-text emails will be rejected.
On 10/3/18 10:15 PM, Preethi Ramachandra wrote:
> Hi,
>
>
>
> While testing the PMTU discovery for UDP/raw applications, Linux is not
> doing PMTU discovery if the UDP server socket is not bound to a device.
> In the scenario we are testing there could be multiple VRF devices
> created and an application like UDP/RAW can use a common socket for all
> vrf devices. While sending packet IP_PKTINFO socket option can be used
> to specify the vrf interface through which packet will be sent out. In
> this case, when packet too big icmp6 error message comes back to Linux
> on a vrf device, a route lookup is done on default routing-table(0) for
> src/dst address which case, the route will not be found and packet is
> dropped. If the route lookup happened with proper VRF device (packet’s
> incoming index), the route lookup succeeds, PMTU discovery is successful.
>
>
>
> This might need a fix, please take a look.
>
>
>
> *Linux version *
>
>
>
> Linux 4.8.24
>
>
>
> *Code flow *
>
>
>
> Linux code where it expects socket’s bound device in order for PMTU
> discovery to happen.
>
> *void ip6_sk_update_pmtu*(struct sk_buff *skb, struct sock *sk, __be32 mtu)
>
> {
>
> struct dst_entry *dst;
>
>
>
> ip6_update_pmtu(skb, sock_net(sk), mtu,
>
>
> sk->sk_bound_dev_if, sk->sk_mark, sk->sk_uid);*<<<<< This is the point
> where it expects socket’s sk_bound_dev_if to be set. In our testing this
> is actually 0, since the socket is not really bound to a vrf device.*
Try this based on top of tree for 4.19-next (whitespace damaged on paste
so you'll need to manually apply and handle differences with 4.8):
diff --git a/net/ipv6/route.c b/net/ipv6/route.c
index 6c1d817151ca..50b95b48b911 100644
--- a/net/ipv6/route.c
+++ b/net/ipv6/route.c
@@ -2360,10 +2360,13 @@ EXPORT_SYMBOL_GPL(ip6_update_pmtu);
void ip6_sk_update_pmtu(struct sk_buff *skb, struct sock *sk, __be32 mtu)
{
+ int oif = sk->sk_bound_dev_if;
struct dst_entry *dst;
- ip6_update_pmtu(skb, sock_net(sk), mtu,
- sk->sk_bound_dev_if, sk->sk_mark, sk->sk_uid);
+ if (!oif && skb->dev)
+ oif = l3mdev_master_ifindex(skb->dev);
+
+ ip6_update_pmtu(skb, sock_net(sk), mtu, oif, sk->sk_mark,
sk->sk_uid);
dst = __sk_dst_get(sk);
if (!dst || !dst->obsolete ||
Powered by blists - more mailing lists