[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <7fb36754-9e7b-73a7-3c3b-0a8141e4a85f@kernel.org>
Date: Tue, 25 Jul 2023 16:37:15 -0600
From: David Ahern <dsahern@...nel.org>
To: Ido Schimmel <idosch@...sch.org>
Cc: Hangbin Liu <liuhangbin@...il.com>, netdev@...r.kernel.org,
"David S . Miller" <davem@...emloft.net>, Eric Dumazet
<edumazet@...gle.com>, Jakub Kicinski <kuba@...nel.org>,
Paolo Abeni <pabeni@...hat.com>, Thomas Haller <thaller@...hat.com>,
Donald Sharp <sharpd@...dia.com>
Subject: Re: [PATCHv3 net] ipv6: do not match device when remove source route
On 7/25/23 4:06 AM, Ido Schimmel wrote:
> On Sun, Jul 23, 2023 at 12:12:00PM -0600, David Ahern wrote:
>> On 7/23/23 2:13 AM, Ido Schimmel wrote:
>>>
>>> I don't know, but when I checked the code and tested it I noticed that
>>> the kernel doesn't care on which interface the address is configured.
>>> Therefore, in order for deletion to be consistent with addition and with
>>> IPv4, the preferred source address shouldn't be removed from routes in
>>> the VRF table as long as the address is configured on one of the
>>> interfaces in the VRF.
>>>
>>
>> Deleting routes associated with device 2 when an address is deleted from
>> device 1 is going to introduce as many problems as it solves. The VRF
>> use case is one example.
>
> This already happens in IPv4:
>
> # ip link add name dummy1 up type dummy
> # ip link add name dummy2 up type dummy
> # ip address add 192.0.2.1/24 dev dummy1
> # ip route add 198.51.100.0/24 dev dummy2 src 192.0.2.1
> # ip -4 r s
> 192.0.2.0/24 dev dummy1 proto kernel scope link src 192.0.2.1
> 198.51.100.0/24 dev dummy2 scope link src 192.0.2.1
> # ip address del 192.0.2.1/24 dev dummy1
> # ip -4 r s
>
> IPv6 only removes the preferred source address from routes, but doesn't
> delete them. The patch doesn't change that.
>
> Another difference from IPv4 is that IPv6 only removes the preferred
> source address from routes whose first nexthop device matches the device
> from which the address was removed:
>
> # ip link add name dummy1 up type dummy
> # ip link add name dummy2 up type dummy
> # ip address add 2001:db8:1::1/64 dev dummy1
> # ip route add 2001:db8:2::/64 dev dummy2 src 2001:db8:1::1
> # ip -6 r s
> 2001:db8:1::/64 dev dummy1 proto kernel metric 256 pref medium
> 2001:db8:2::/64 dev dummy2 src 2001:db8:1::1 metric 1024 pref medium
> fe80::/64 dev dummy1 proto kernel metric 256 pref medium
> fe80::/64 dev dummy2 proto kernel metric 256 pref medium
> # ip address del 2001:db8:1::1/64 dev dummy1
> # ip -6 r s
> 2001:db8:2::/64 dev dummy2 src 2001:db8:1::1 metric 1024 pref medium
> fe80::/64 dev dummy1 proto kernel metric 256 pref medium
> fe80::/64 dev dummy2 proto kernel metric 256 pref medium
>
> And this is despite the fact that the kernel only allowed the route to
> be programmed because the preferred source address was present on
> another interface in the same L3 domain / VRF:
>
> # ip link add name dummy1 up type dummy
> # ip link add name dummy2 up type dummy
> # ip route add 2001:db8:2::/64 dev dummy2 src 2001:db8:1::1
> Error: Invalid source address.
>
> The intent of the patch (at least with the changes I proposed) is to
> remove the preferred source address from routes in a VRF when the
> address is no longer configured on any interface in the VRF.
>
> Note that the above is true for addresses with a global scope. The
> removal of a link-local address from a device should not affect other
> devices. This restriction also applies when a route is added:
>
> # ip link add name dummy1 up type dummy
> # ip link add name dummy2 up type dummy
> # ip -6 address add fe80::1/64 dev dummy1
> # ip -6 route add 2001:db8:2::/64 dev dummy2 src fe80::1
> Error: Invalid source address.
> # ip -6 address add fe80::1/64 dev dummy2
> # ip -6 route add 2001:db8:2::/64 dev dummy2 src fe80::1
Lot of permutations. It would be good to get these in a test script
along with other variations - e.g.,
# 2 devices with the same source address
ip link add name dummy1 up type dummy
ip link add name dummy2 up type dummy
ip link add name dummy3 up type dummy
ip address add 192.0.2.1/24 dev dummy1
ip address add 192.0.2.1/24 dev dummy3
ip route add 198.51.100.0/24 dev dummy2 src 192.0.2.1
ip address del 192.0.2.1/24 dev dummy1
--> src route should stay
# VRF with single device using src address
ip li add name red up type vrf table 123
ip link add name dummy4 up type dummy vrf red
ip link add name dummy5 up type dummy vrf red
ip address add 192.0.2.1/24 dev dummy4
ip route add 198.51.100.0/24 dev dummy5 src 192.0.2.1
ip address del 192.0.2.1/24 dev dummy4
ip ro ls vrf red
# VRF with two devices using src address
ip li add name red up type vrf table 123
ip link add name dummy4 up vrf red type dummy
ip link add name dummy5 up vrf red type dummy
ip link add name dummy6 up vrf red type dummy
ip address add 192.0.2.1/24 dev dummy4
ip address add 192.0.2.1/24 dev dummy6
ip route add 198.51.100.0/24 dev dummy5 src 192.0.2.1 vrf red
ip address del 192.0.2.1/24 dev dummy4
I can not find my notes but I recall Donald raised a ticket at Cumulus
when FRR tripped over a scenario like this or a related one (something
about routes and address delete). CC'ed Donald in case he recalls the
details
Powered by blists - more mailing lists