lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite for Android: free password hash cracker in your pocket
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <1219605422.14389.2.camel@localhost>
Date:	Sun, 24 Aug 2008 15:17:02 -0400
From:	Trond Myklebust <trond.myklebust@....uio.no>
To:	Ian Campbell <ijc@...lion.org.uk>
Cc:	John Ronciak <john.ronciak@...il.com>,
	Grant Coady <gcoady.lk@...il.com>,
	linux-kernel@...r.kernel.org, neilb@...e.de, bfields@...ldses.org,
	linux-nfs@...r.kernel.org,
	Jeff Kirsher <jeffrey.t.kirsher@...el.com>,
	Jesse Brandeburg <jesse.brandeburg@...el.com>,
	Bruce Allan <bruce.w.allan@...el.com>,
	PJ Waskiewicz <peter.p.waskiewicz.jr@...el.com>,
	John Ronciak <john.ronciak@...el.com>,
	e1000-devel@...ts.sourceforge.net
Subject: Re: NFS regression? Odd delays and lockups accessing an NFS export.

On Sun, 2008-08-24 at 19:53 +0100, Ian Campbell wrote:
> On Fri, 2008-08-22 at 14:56 -0700, Trond Myklebust wrote:
> > On Fri, 2008-08-22 at 22:37 +0100, Ian Campbell wrote:
> > > I can ssh to the server fine. The same server also serves my NFS home
> > > directory to the box I'm writing this from and I've not seen any trouble
> > > with this box at all, it's a 2.6.18-xen box.
> > 
> > OK... Are you able to reproduce the problem reliably?
> > 
> > If so, can you provide me with a binary tcpdump or wireshark dump? If
> > using tcpdump, then please use something like
> > 
> >    tcpdump -w /tmp/dump.out -s 90000 host myserver.foo.bar and port 2049
> > 
> > Please also try to provide a netstat dump of the current TCP connections
> > as soon as the hang occurs:
> > 
> >   netstat -t
> 
> Aug 24 18:08:59 iranon kernel: [168839.556017] nfs: server hopkins not responding, still trying
> but I wasn't around until 19:38 to spot it.
> 
> netstat when I got to it was:
> 
> Proto Recv-Q Send-Q Local Address           Foreign Address         State      
> tcp        0      0 localhost.localdo:50891 localhost.localdom:6543 ESTABLISHED
> tcp        1      0 iranon.hellion.org.:ssh azathoth.hellion.:52682 CLOSE_WAIT 
> tcp        0      0 localhost.localdom:6543 localhost.localdo:50893 ESTABLISHED
> tcp        0      0 iranon.hellion.org.:837 hopkins.hellion.org:nfs FIN_WAIT2  
> tcp        0      0 localhost.localdom:6543 localhost.localdo:41831 ESTABLISHED
> tcp        0      0 localhost.localdo:13666 localhost.localdo:59482 ESTABLISHED
> tcp        0      0 localhost.localdo:34288 localhost.localdom:6545 ESTABLISHED
> tcp        0      0 iranon.hellion.org.:ssh azathoth.hellion.:48977 ESTABLISHED
> tcp        0      0 iranon.hellion.org.:ssh azathoth.hellion.:52683 ESTABLISHED
> tcp        0      0 localhost.localdom:6545 localhost.localdo:34288 ESTABLISHED
> tcp        0      0 localhost.localdom:6543 localhost.localdo:50891 ESTABLISHED
> tcp        0      0 localhost.localdo:50893 localhost.localdom:6543 ESTABLISHED
> tcp        0      0 localhost.localdo:41831 localhost.localdom:6543 ESTABLISHED
> tcp        0     87 localhost.localdo:59482 localhost.localdo:13666 ESTABLISHED
> tcp        1      0 localhost.localdom:6543 localhost.localdo:41830 CLOSE_WAIT 
> 
> (iranon is the problematic host .4, azathoth is my desktop machine .5, hopkins is the NFS server .6)
> 
> tcpdumps are pretty big. I've attached the last 100 packets captured. If
> you need more I can put the full file up somewhere.
> 
> -rw-r--r-- 1 root   root   1.3G Aug 24 17:57 dump.out0
> -rw-r--r-- 1 root   root   536M Aug 24 19:38 dump.out1
> 
> Ian.

>>From the tcpdump, it looks as if the NFS server is failing to close the
socket, when the client closes its side. You therefore end up getting
stuck in the FIN_WAIT2 state (as netstat clearly shows above).

Is the server keeping the client in this state for a very long period?

Cheers
  Trond

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ