[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-Id: <1371663286-12518-6-git-send-email-vfalico@redhat.com>
Date: Wed, 19 Jun 2013 19:34:45 +0200
From: Veaceslav Falico <vfalico@...hat.com>
To: netdev@...r.kernel.org
Cc: vfalico@...hat.com, fubar@...ibm.com, andy@...yhouse.net,
davem@...emloft.net, linux@...2.net, nicolas.2p.debian@...e.fr,
rick.jones2@...com
Subject: [PATCH net-next 5/6] bonding: don't swap arp's ips on validation for backup slave
Currently, if we catch an arp on a backup slave, we swap the ips before
checking it. It's done so because backup slaves usually don't receive arp
replies and only receive arp requests from the active slave to
arp_ip_targets, thus it's needed to swap the ips in order for the
validation to succeed.
This breaks the scenario when the backup slave actually receives arp
replies from the target.
It's useless when the active and the backup slaves are not in the same arp
broadcast domain (i.e. when it doesn't receive those requests originating
from the active slave).
And it actually breaks the whole logic of arp_validate - it marks the
backup slave UP even if it can't access the arp_ip_target and is in the
same arp broadcast domain. This means that we can see an endless up/down
loop if the arp_ip_target becomes inaccessible:
2 slaves, 1 bond:
1) active slave is up, backup slave is up because it receives arp requests
from the active slave.
2) after delay, arp_ip_target check fails on active slave.
3) failover to the backup slave occurs (which is up), active swaps with
backup
4) goto 1
So, don't swap the sip/tip for backup slaves, and verify them as an active
slave. Also, update the documentation to reflect the changes (and remove
some random spaces there).
Signed-off-by: Veaceslav Falico <vfalico@...hat.com>
---
Documentation/networking/bonding.txt | 24 +++++++++---------------
drivers/net/bonding/bond_main.c | 13 +------------
2 files changed, 10 insertions(+), 27 deletions(-)
diff --git a/Documentation/networking/bonding.txt b/Documentation/networking/bonding.txt
index e7454fc..84f16c8 100644
--- a/Documentation/networking/bonding.txt
+++ b/Documentation/networking/bonding.txt
@@ -23,7 +23,7 @@ multiple network interfaces into a single logical "bonded" interface.
The behavior of the bonded interfaces depends upon the mode; generally
speaking, modes provide either hot standby or load balancing services.
Additionally, link integrity monitoring may be performed.
-
+
The bonding driver originally came from Donald Becker's
beowulf patches for kernel 2.0. It has changed quite a bit since, and
the original tools from extreme-linux and beowulf sites will not work
@@ -293,15 +293,9 @@ arp_validate
Validation is performed for all slaves.
- For the active slave, the validation checks ARP replies to
- confirm that they were generated by an arp_ip_target. Since
- backup slaves do not typically receive these replies, the
- validation performed for backup slaves is on the ARP request
- sent out via the active slave. It is possible that some
- switch or network configurations may result in situations
- wherein the backup slaves do not receive the ARP requests; in
- such a situation, validation of backup slaves must be
- disabled.
+ For any eligible slave (active, backup or both) the validation
+ checks ARP replies to confirm that they were generated by an
+ arp_ip_target.
This option is useful in network configurations in which
multiple bonding hosts are concurrently issuing ARPs to one or
@@ -2238,7 +2232,7 @@ when they are configured in parallel as part of an isolated network
between two or more systems, for example:
+-----------+
- | Host A |
+ | Host A |
+-+---+---+-+
| | |
+--------+ | +---------+
@@ -2250,7 +2244,7 @@ between two or more systems, for example:
+--------+ | +---------+
| | |
+-+---+---+-+
- | Host B |
+ | Host B |
+-----------+
In this configuration, the switches are isolated from one
@@ -2478,7 +2472,7 @@ bonding driver.
(either the internal Ethernet Switch Module, or an external switch) to
avoid fail-over delay issues when using bonding.
-
+
15. Frequently Asked Questions
==============================
@@ -2515,7 +2509,7 @@ monitored, and should it recover, it will rejoin the bond (in whatever
manner is appropriate for the mode). See the sections on High
Availability and the documentation for each mode for additional
information.
-
+
Link monitoring can be enabled via either the miimon or
arp_interval parameters (described in the module parameters section,
above). In general, miimon monitors the carrier state as sensed by
@@ -2618,7 +2612,7 @@ be found at:
http://vger.kernel.org/vger-lists.html#netdev
Donald Becker's Ethernet Drivers and diag programs may be found at :
- - http://web.archive.org/web/*/http://www.scyld.com/network/
+ - http://web.archive.org/web/*/http://www.scyld.com/network/
You will also find a lot of information regarding Ethernet, NWay, MII,
etc. at www.scyld.com.
diff --git a/drivers/net/bonding/bond_main.c b/drivers/net/bonding/bond_main.c
index 2cfbb2e..3f64607 100644
--- a/drivers/net/bonding/bond_main.c
+++ b/drivers/net/bonding/bond_main.c
@@ -2660,18 +2660,7 @@ static int bond_arp_rcv(const struct sk_buff *skb, struct bonding *bond,
bond->params.arp_validate, slave_do_arp_validate(bond, slave),
&sip, &tip);
- /*
- * Backup slaves won't see the ARP reply, but do come through
- * here for each ARP probe (so we swap the sip/tip to validate
- * the probe). In a "redundant switch, common router" type of
- * configuration, the ARP probe will (hopefully) travel from
- * the active, through one switch, the router, then the other
- * switch before reaching the backup.
- */
- if (bond_is_active_slave(slave))
- bond_validate_arp(bond, slave, sip, tip);
- else
- bond_validate_arp(bond, slave, tip, sip);
+ bond_validate_arp(bond, slave, sip, tip);
out_unlock:
read_unlock(&bond->lock);
--
1.7.1
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists