[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <1386233691.3594.49.camel@ubuntu-vm-makita>
Date: Thu, 05 Dec 2013 17:54:51 +0900
From: Toshiaki Makita <makita.toshiaki@....ntt.co.jp>
To: vyasevic@...hat.com
Cc: "David S . Miller" <davem@...emloft.net>,
Stephen Hemminger <stephen@...workplumber.org>,
netdev@...r.kernel.org
Subject: Re: [PATCH net 4/7] bridge: Fix the way checking if a local fdb
entry can be deleted
On Wed, 2013-12-04 at 11:40 -0500, Vlad Yasevich wrote:
> On 12/04/2013 03:29 AM, Toshiaki Makita wrote:
> > On Tue, 2013-12-03 at 10:41 -0500, Vlad Yasevich wrote:
> >> On 12/03/2013 07:45 AM, Toshiaki Makita wrote:
> >>> On Mon, 2013-12-02 at 12:07 -0500, Vlad Yasevich wrote:
> >>>> On 12/02/2013 01:40 AM, Toshiaki Makita wrote:
> >>>>> We should take into account the followings when deleting a local fdb entry.
> >>>>>
> >>>>> - nbp_vlan_find() can be used only when vid != 0 to check if an entry is
> >>>>> deletable, because a fdb entry with vid 0 can exist at any time but
> >>>>> nbp_vlan_find() always return false with vid 0.
> >>>>>
> >>>>> Example of problematic case:
> >>>>> ip link set eth0 address 12:34:56:78:90:ab
> >>>>> ip link set eth1 address 12:34:56:78:90:ab
> >>>>> brctl addif br0 eth0
> >>>>> brctl addif br0 eth1
> >>>>> ip link set eth0 address aa:bb:cc:dd:ee:ff
> >>>>> Then, the fdb entry 12:34:56:78:90:ab will be deleted even though the
> >>>>> bridge port eth1 still has that address.
> >>>>>
> >>>>> - The port to which the bridge device is attached might needs a local entry
> >>>>> if its mac address is set manually.
> >>>>>
> >>>>> Example of problematic case:
> >>>>> ip link set eth0 address 12:34:56:78:90:ab
> >>>>> brctl addif br0 eth0
> >>>>> ip link set br0 address 12:34:56:78:90:ab
> >>>>> ip link set eth0 address aa:bb:cc:dd:ee:ff
> >>>>> Then, the fdb still must have the entry 12:34:56:78:90:ab, but it will be
> >>>>> deleted.
> >>>>>
> >>>>> We can use br->dev->addr_assign_type to check if the address is manually
> >>>>> set or not, but I propose another approach.
> >>>>>
> >>>>> Since we delete and insert local entries whenever changing mac address
> >>>>> of the bridge device, we can change dst of the entry to NULL regardless of
> >>>>> addr_assign_type when deleting an entry associated with a certain port,
> >>>>> and if it is found to be unnecessary later, then delete it.
> >>>>> That is, if changing mac address of a port, the entry might be changed
> >>>>> to its dst being NULL first, but is eventually deleted when recalculating
> >>>>> and changing bridge id.
> >>>>>
> >>>>> This approach is useful when we want to share the code with deleting
> >>>>> vlan in which the bridge device might want such an entry regardless of
> >>>>> addr_assign_type, and makes things easy because we don't have to consider
> >>>>> if mac address of the bridge device will be changed or not at
> >>>>> fdb_delete_local().
> >>>>
> >>>> This is a nifty approach, but it does have one side-effect that I am not
> >>>> sure is correct. In the case where bridge mac address is not manually
> >>>> set, a fdb entry for the removed address survives past the
> >>>> synchronize_net() call. This would result in a behavioral change where
> >>>> packets that always used to flood, would now sometimes be delivered
> >>>> only to the bridge.
> >>>
> >>> I think no unnecessary entry will survive in any case.
> >>> It will survive only if the bridge device remains to have the same mac
> >>> address, because we check if the entry is necessary or not whenever the
> >>> address of the bridge device is changed (assuming patch 3/7 is applied).
> >>>
> >>> Further analysis:
> >>> The function fdb_delete_local() (and __fdb_delete_local()) will called
> >>> by br_fdb_changeaddr(), br_fdb_change_mac_address(),
> >>> br_fdb_delete_by_port() (with do_all==1), br_vlan_delete(), and
> >>> nbp_vlan_delete() after all of this patch series are applied.
> >>> So, we have to consider only these 5 functions.
> >>>
> >>> br_fdb_changeaddr():
> >>> It is called by br_device_event().
> >>> br_device_event() calls br_stp_recalculate_bridge_id() right after
> >>> calling br_fdb_changeaddr(), so if the address of bridge device should
> >>> be changed, the fdb entry will immediately reflect it and be deleted.
> >>
> >> This doesn't happen until later. Currently nbp_del() will remove all
> >> fdb entries for a given port. With this patch, the local fdb entry for
> >> the port will survive the removal process and the fdb->dst will be set
> >> to NULL.
> >> The port is now removed from the list, rx_handler is unregistered and we
> >> push call synchronize_net() trying flush all packets currently in rcu
> >> section. Once this completes, the port and all the fdbs for it should
> >> be removed, but now they are not. We have to wait for br_del_if() to
> >> call the notifier call chain to remove the the fdb entry. Any packet
> >> arriving at the bridge for the mac address of the port that just got
> >> removed will now be handed over to the bridge, instead of being flooded.
> >> This is a change in behavior.
> >
> > I see. Indeed this is a change in behavior.
> >
> > But the bridge device still has the mac address after synchronize_net(),
> > and it will be actually changed in br_stp_recalculate_bridge_id().
> > Though that port has gone, as the bridge still uses the address, I'm
> > thinking the corresponding entry should exist.
>
> You are right in that the address is still assigned to bridge. This is
> the same exact state the we have when adding a port to the bridge.
> >From br_add_if():
> err = netdev_rx_handler_register(dev, br_handle_frame, p);
> ...
> list_add_rcu(&p->list, &br->port_list);
> ....
> changed_addr = br_stp_recalculate_bridge_id(br);
> ....
> if (br_fdb_insert(br, p, dev->dev_addr, 0))
>
> So, we add the the port to the list, set the bridge mac address and then
> add the local fdb. So, we have a short window where a mac address
> assigned to the bridge does not have an associated fdb.
>
> As I stated before, I am not sure if the side-effect you are introducing
> is OK or not. It doesn't appear to be a problem from the fdb inspection
> side of things since we are holding the rtnl here.
> It might cause an issue if the a host behind a port might start using
> the address while the fdb is still alive.
>
> Now the way I see it, you have 2 options:
> 1) Add some text to the commit log to explain the new state
> 2) Add a check for addr_type to the if statement below
>
> > + /* Maybe bridge device has same hw addr? */
> > + if (p && ether_addr_equal(br->dev->dev_addr, addr) &&
> > + (!vid || br_vlan_find(br, vid))) {
> > + f->dst = NULL;
> > + return;
> > + }
> > +
I want to go for the former unless the change is absolutely not
accepted.
Introducing a check for addr_assign_type seems to be risky to me because
it might considerably make conditions complicated.
The check is not a simple check but a part of prediction of address
changing of the bridge device. We have to know how the bridge id is
calculated by other codes to implement fdb_delete_local().
For now, if fdb_delete_local() is called by br_fdb_delete_by_port(), we
will be able to expect the address change.
But once we try to use the function from *_vlan_delete(), we have to
assume that the address will not change. We will need to use another
function or introduce another argument for the function.
For the future, the logic of address changing might be modified. Fdb
code will have to reflect it properly as well.
I want to make the logic based on just current states. The last port
among those having the same address and vlan is simply responsible to
delete the entry.
I believe simplification of the logic will help us reduce unhandled
corner cases.
Anyway, thank you for your kind analysis and advice.
I will try to make proper changelog.
Thanks,
Toshiaki Makita
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists