[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <174438318485.31035.4351288521914711713.b4-ty@oracle.com>
Date: Fri, 11 Apr 2025 10:53:15 -0400
From: cel@...nel.org
To: Trond Myklebust <trondmy@...nel.org>,
Anna Schumaker <anna@...nel.org>,
Neil Brown <neilb@...e.de>,
Olga Kornievskaia <okorniev@...hat.com>,
Dai Ngo <Dai.Ngo@...cle.com>,
Tom Talpey <tom@...pey.com>,
Jeff Layton <jlayton@...nel.org>
Cc: Chuck Lever <chuck.lever@...cle.com>,
linux-nfs@...r.kernel.org,
linux-kernel@...r.kernel.org
Subject: Re: [PATCH] sunrpc: add info about xprt queue times to svc_xprt_dequeue tracepoint
From: Chuck Lever <chuck.lever@...cle.com>
On Fri, 11 Apr 2025 10:22:14 -0400, Jeff Layton wrote:
> I've been looking at a problem where we see increased RPC timeouts in
> clients when the nfs_layout_flexfiles dataserver_timeo value is tuned
> very low (6s). This is necessary to ensure quick failover to a different
> mirror if a server goes down, but it causes a lot more major RPC timeouts.
>
> Ultimately, the problem is server-side however. It's sometimes doesn't
> respond to connection attempts. My theory is that the interrupt handler
> runs when a connection comes in, the xprt ends up being enqueued, but it
> takes a significant amount of time for the nfsd thread to pick it up.
>
> [...]
Applied to nfsd-testing, thanks!
[1/1] sunrpc: add info about xprt queue times to svc_xprt_dequeue tracepoint
commit: b7a6405d13bb2b5cbf89decb111f84408d121dc9
--
Chuck Lever
Powered by blists - more mailing lists