[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CH2PR15MB35759E1A42897371181250289A790@CH2PR15MB3575.namprd15.prod.outlook.com>
Date: Wed, 6 Nov 2019 21:07:20 +0000
From: Jon Maloy <jon.maloy@...csson.com>
To: Hoang Huu Le <hoang.h.le@...tech.com.au>,
"maloy@...jonn.com" <maloy@...jonn.com>,
"netdev@...r.kernel.org" <netdev@...r.kernel.org>,
"tipc-discussion@...ts.sourceforge.net"
<tipc-discussion@...ts.sourceforge.net>
Subject: RE: [net-next 2/2] tipc: reduce sensitive to retransmit failures
Acked-by: Jon
> -----Original Message-----
> From: Hoang Le <hoang.h.le@...tech.com.au>
> Sent: 6-Nov-19 01:26
> To: Jon Maloy <jon.maloy@...csson.com>; maloy@...jonn.com; netdev@...r.kernel.org; tipc-
> discussion@...ts.sourceforge.net
> Subject: [net-next 2/2] tipc: reduce sensitive to retransmit failures
>
> With huge cluster (e.g >200nodes), the amount of that flow:
> gap -> retransmit packet -> acked will take time in case of STATE_MSG
> dropped/delayed because a lot of traffic. This lead to 1.5 sec tolerance
> value criteria made link easy failure around 2nd, 3rd of failed
> retransmission attempts.
>
> Instead of re-introduced criteria of 99 faled retransmissions to fix the
> issue, we increase failure detection timer to ten times tolerance value.
>
> Fixes: 77cf8edbc0e7 ("tipc: simplify stale link failure criteria")
> Acked-by: Jon Maloy <jon.maloy@...csson.com>
> Signed-off-by: Hoang Le <hoang.h.le@...tech.com.au>
> ---
> net/tipc/link.c | 2 +-
> 1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/net/tipc/link.c b/net/tipc/link.c
> index 038861bad72b..2aed7a958a8c 100644
> --- a/net/tipc/link.c
> +++ b/net/tipc/link.c
> @@ -1087,7 +1087,7 @@ static bool link_retransmit_failure(struct tipc_link *l, struct tipc_link *r,
> return false;
>
> if (!time_after(jiffies, TIPC_SKB_CB(skb)->retr_stamp +
> - msecs_to_jiffies(r->tolerance)))
> + msecs_to_jiffies(r->tolerance * 10)))
> return false;
>
> hdr = buf_msg(skb);
> --
> 2.20.1
Powered by blists - more mailing lists