lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Date:	Wed, 17 Feb 2010 14:53:34 -0800
From:	Jesse Brandeburg <jesse.brandeburg@...il.com>
To:	Jesper Krogh <jesper@...gh.cc>
Cc:	linux-nfs@...r.kernel.org, netdev@...r.kernel.org,
	Jesse Brandeburg <jesse.brandeburg@...el.com>
Subject: Re: ixgbe_clean_tx_irq: tx hang 1 detected, resetting adapter 
	(2.6.32.8)

On Sun, Feb 14, 2010 at 8:29 AM, Jesper Krogh <jesper@...gh.cc> wrote:
> Hi List.
>
> I have tried to get a dual bond of 2 x 10G NICs using the
> Intel Corporation 82598EB 10-Gigabit AT2 Server Adapter (rev 01)
> going. As first it looked like it "just worked" but when tried to fill
> the links with data one of the NIC's (eth7) hang and did a reset of
> itself, so all data was pushed through the other NIC in the bond (eth8)
>
> Full dmesg below, but I think the important part is this:
>
> [ 2162.745354] ixgbe: eth7: ixgbe_check_tx_hang: Detected Tx Unit Hang
> [ 2162.745356]   Tx Queue             <4>
> [ 2162.745356]   TDH, TDT             <e1>, <cc>
> [ 2162.745357]   next_to_use          <cc>
> [ 2162.745358]   next_to_clean        <e1>
> [ 2162.745359] tx_buffer_info[next_to_clean]
> [ 2162.745359]   time_stamp           <1000713d3>
> [ 2162.745360]   jiffies              <10007152e>
> [ 2163.162478] ixgbe: eth7: ixgbe_clean_tx_irq: tx hang 1 detected,
> resetting adapter
> [ 2163.357333] bonding: bond0: link status definitely down for interface
> eth7, disabling it
> [ 2168.670342] ixgbe: eth7 NIC Link is Up 10 Gbps, Flow Control: None

Hi Jesper, my first thought was flow control, but I can see you have it off.

Can we get some more details on the hardware and bios version?  What
about some dmidecode output.  I'm checking here if we have any
hardware like this.

are you running ubuntu 9.10 or something else?

Wow, thats a monster machine, 8 nodes, 128GB ram.  Can we get a full
lspci -vvv output, as well as ethtool -e eth7 and eth8

32 has ixgbe with a known issue of multiple mappings on transmit
possibly causing some problems, could it be that you're running into
this?  can you apply commit e5a43549f7a58509a91b299a51337d386697b92c
and see if it fixes your issue?
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ