[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20150429150841.55477595@urahara>
Date: Wed, 29 Apr 2015 15:08:41 -0700
From: Stephen Hemminger <stephen@...workplumber.org>
To: anuradhak@...ulusnetworks.com
Cc: davem@...emloft.net, sfeldma@...il.com, netdev@...r.kernel.org,
roopa@...ulusnetworks.com, gospo@...ulusnetworks.com,
wkok@...ulusnetworks.com
Subject: Re: [RFC PATCH net-next v3 0/4] net: Introduce IFF_PROTO_DOWN flag.
On Mon, 27 Apr 2015 10:38:20 -0700
anuradhak@...ulusnetworks.com wrote:
> From: Anuradha Karuppiah <anuradhak@...ulusnetworks.com>
>
> User space daemons can detect errors in the network that need to be
> notified to the switch device drivers.
>
> Drivers can react to this error state by doing a phy-down on the
> switch-port which would result in a carrier-off locally and on the
> directly connected switch. Doing that would prevent loops and
> black-holes in the network.
>
> One such use case is the multi-chassis LAG application -
> 1. The MLAG application runs on peer switches (say Switch0 and Switch1)
> synchronizing states, forwarding entries etc. between the two
> switches over the peer-link (this is a link directly connecting the
> two switches).
> 2. An MLAG election process designates one of the switches as a primary
> (for e.g. Switch0 is primary and Switch1 is secondary).
> 3. The peer link plays a critical role in allowing Switch0-Switch1 to
> function as a single LAG partner to the downstream dual-connected
> servers. When the peer-link between the switches goes down we have a
> split-brain situation. Switch0 and Switch1 are no longer in sync and
> are acting independently. This can result in traffic loops and
> traffic black-holing in the network.
> 4. To prevent these problems the MLAG application on the secondary
> switch phy-downs the MLAG ports on detecting the peer-link down.
> This will be seen as a carrier down on servers that are
> dual-connected to Switch0 and Switch1.
> 5. Specifically a dual-connected server will see a carrier-down on the
> port connected to the MLAG secondary, Switch1, and will stop using
> that port for traffic TX. So traffic black holing is prevented.
>
> v2 to v3:
> In response to Dave’s comments I have tried to make IFF_PROTODOWN
> more easily consumable by providing switchdev APIs to control the
> phy state of the switch port. The use case is relevant primarily to
> switch drivers at this point. That is the reason for making the
> change in rocker (commonly used switch driver example).
>
> One other change that could be done is to bring back the net-core
> change to hold the oper state down in response to IFF_PROTO_DOWN.
> This would be a driver agnostic change and the phy-down could be done
> in addition by interested switch drivers.
>
> v1 to v2:
> Based on Dave's suggestion I have moved out aggregating of error bits
> across applications to a user space framework. This patch now simply
> notifies an aggregated error bit to drivers enabling them to handle
> the error gracefully.
>
>
> Anuradha Karuppiah (4):
> net core: Add IFF_PROTO_DOWN support.
> switchdev: APIs for setting physical state of the switch port.
> rocker: Handle IFF_PROTODOWN by doing a PHYS-DOWN on the switch port.
> ip link: Config and display IFF_PROTO_DOWN flag.
>
> Signed-off-by: Anuradha Karuppiah <anuradhak@...ulusnetworks.com>
> Signed-off-by: Andy Gospodarek <gospo@...ulusnetworks.com>
> Signed-off-by: Roopa Prabhu <roopa@...ulusnetworks.com>
> Signed-off-by: Wilson Kok <wkok@...ulusnetworks.com>
>
> drivers/net/ethernet/rocker/rocker.c | 16 +++++++++++++++-
> include/net/switchdev.h | 12 ++++++++++++
> include/uapi/linux/if.h | 4 ++++
> net/8021q/vlan_dev.c | 3 ++-
> net/core/dev.c | 8 +++++++-
> net/switchdev/switchdev.c | 23 +++++++++++++++++++++++
> 6 files changed, 63 insertions(+), 3 deletions(-)
>
How does this interact with operstate?
It seems RFC2863 operstate (Documentation/network/operstates.txt) already
has concept of LOWERLAYERDOWN
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists