lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Date:	Fri, 13 Dec 2013 14:20:10 -0800
From:	Jay Vosburgh <fubar@...ibm.com>
To:	Ding Tianhong <dingtianhong@...wei.com>
cc:	Andy Gospodarek <andy@...yhouse.net>,
	"David S. Miller" <davem@...emloft.net>,
	Nikolay Aleksandrov <nikolay@...hat.com>,
	Veaceslav Falico <vfalico@...hat.com>,
	Netdev <netdev@...r.kernel.org>
Subject: Re: [PATCH net-next v6 0/11] bonding: rebuild the lock use for bond monitor

Ding Tianhong <dingtianhong@...wei.com> wrote:

>Now the bond slave list is not protected by bond lock, only by RTNL,
>but the monitor still use the bond lock to protect the slave list,
>it is useless, according to the Veaceslav's opinion, there were
>three way to fix the protect problem:
>
>1. add bond_master_upper_dev_link() and bond_upper_dev_unlink()
>   in bond->lock, but it is unsafe to call call_netdevice_notifiers()
>   in write lock.
>2. remove unused bond->lock for monitor function, only use the exist
>   rtnl lock(), it will take performance loss in fast path.
>3. use RCU to protect the slave list, of course, performance is better,
>   but in slow path, it is ignored.
>
>obviously the solution 1 is not fit here, I will consider the 2 and 3
>solution. My principle is simple, if in fast path, RCU is better,
>otherwise in slow path, both is well, but according to the Jay Vosburgh's
>opinion, the monitor will loss performace if use RTNL to protect the all
>slave list, so remove the bond lock and replace with RCU.
>
>The second problem is the curr_slave_lock for bond, it is too old and
>unwanted in many place, because the curr_active_slave would only be
>changed in 3 place:
>
>1. enslave slave.
>2. release slave.
>3. change active slave.
>
>all above were already holding bond lock, RTNL and curr_slave_lock
>together, it is tedious and no need to add so mach lock, when change
>the curr_active_slave, you have to hold the RTNL and curr_slave_lock
>together, and when you read the curr_active_slave, RTNL or curr_slave_lock,
>any one of them is no problem.
>
>for the stability, I did not change the logic for the monitor,
>all change is clear and simple, I have test the patch set for lockdep,
>it work well and stability.
>
>v2. accept the Jay Vosburgh's opinion, remove the RTNL and replace with RCU,
>    also add some rcu function for bond use, so the patch set reach 10.
>
>v3. accept the Nikolay Aleksandrov's opinion, remove no needed bond_has_slave_rcu(),
>    add protection for several 3ad mode handler functions and current_arp_slave.
>    rebuild the bond_first_slave_rcu(), make it more clear.
>
>v4. because the struct netdev_adjacent should not be exist in netdevice.h, so I have
>    to make a new function to support micro bond_first_slave_rcu().
>    also add a new patch to simplify the bond_resend_igmp_join_requests_delayed().
>
>v5. according the Jay Vosburgh's opinion, in patch 2 and 6, the calling of notify
>    peer is hardly to happen with the bond_xxx_commit() when the monitoring is running,
>    so the performance impact about make two round trips to one trip on RTNL is minimal,
>    no need to do that,the reason is very clear, so modify the patch 2 and 6, recover
>    the notify peer in RTNL alone.
>
>v6. Jay Vosburgh catch a problem that if I remove the bond lock for bond_3ad_state_machine,
>    these are nothing to mutex changes to aggregator->lag_ports between
>    bond_3ad_state_machine_handler and bond_3ad_unbind_slave, and the bond lock is the simplest
>    way to protect aggregator->lag_ports, So I recover the bond lock in bond_3ad_state_machine.
>    As the bond_3ad_unbind_slave will always be called after the slave is detached from list,
>    So modify the old commit and some cleanups.

	I'm out of the office for the holidays, so I didn't compile and
test this last version, but from inspection it looks good to me.

	For the whole series:

Signed-off-by: Jay Vosburgh <fubar@...ibm.com>

	-J

---
	-Jay Vosburgh, IBM Linux Technology Center, fubar@...ibm.com


>Best Regards
>Ding Tianhong
>
>Ding Tianhong (11):
>  bonding: remove the no effect lock for bond_select_active_slave()
>  bonding: rebuild the lock use for bond_mii_monitor()
>  bonding: rebuild the lock use for bond_alb_monitor()
>  bonding: rebuild the lock use for bond_loadbalance_arp_mon()
>  bonding: create bond_first_slave_rcu()
>  bonding: rebuild the lock use for bond_activebackup_arp_mon()
>  bonding: remove unwanted lock for bond enslave and release
>  bonding: add RCU for bond_3ad_state_machine_handler()
>  bonding: remove unwanted lock for bond_option_active_slave_set()
>  bonding: remove unwanted lock for bond_store_primaryxxx()
>  bonding: rebuild the bond_resend_igmp_join_requests_delayed()
>
> drivers/net/bonding/bond_3ad.c     |  54 ++++++++-----
> drivers/net/bonding/bond_alb.c     |  34 +++-----
> drivers/net/bonding/bond_main.c    | 157 ++++++++++++++-----------------------
> drivers/net/bonding/bond_options.c |   2 -
> drivers/net/bonding/bond_sysfs.c   |   4 -
> drivers/net/bonding/bonding.h      |   4 +
> include/linux/netdevice.h          |   1 +
> net/core/dev.c                     |  21 +++++
> 8 files changed, 130 insertions(+), 147 deletions(-)
>
>-- 
>1.8.0
>
>
>

--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ