[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <4FC53AAA.3060203@ddn.com>
Date: Tue, 29 May 2012 17:07:54 -0400
From: Karandeep Chahal <kchahal@....com>
To: linux-rdma@...r.kernel.org, linux-kernel@...r.kernel.org
CC: dillowda@...l.gov, roland@...nel.org, sean.hefty@...el.com
Subject: [PATCH 1/1] ib_srp: Infiniband srp fast failover patch.
Subject: [PATCH] Infiniband srp fast failover patch. Currently ib_srp does
not do anything on receiving a DREQ from the target, it
only sends a response back. Further it also does not
monitor port (down) events. I have patched srp to remove
scsi devices when a port down event is received or if the
target sends a DREQ. Currently even though the target
notifies the initiator of its intentions of going away, the
initiator ignores that information. Later the initiator
gets upset when the devices "suddenly" disappear resulting
in srp initiating an error recovery process which takes a
long time. This caused high failover latencies as compared
to fibre channel. In my experiments with RHEL 6.0 and 6.2 I
encountered failover time that exceeded 2 minutes and 20
seconds (despite tweaking /etc/multipath.conf and
/sys/block/<>/timeout). With this patch the failover takes
30 seconds. I have tested this patch with and without a
switch.
Yours, etc.
Karan
View attachment "0001-Infiniband-srp-fast-failover-patch.-Currently-ib_srp.patch" of type "text/x-patch" (4768 bytes)
Powered by blists - more mailing lists