[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <200809202111.01256.denys@visp.net.lb>
Date: Sat, 20 Sep 2008 21:11:01 +0300
From: Denys Fedoryshchenko <denys@...p.net.lb>
To: Badalian Vyacheslav <slavon@...telecom.ru>
Cc: netdev@...r.kernel.org
Subject: Re: NetDev! Please help!
On Saturday 20 September 2008, Badalian Vyacheslav wrote:
> Hello all.
>
> We buy 10 Intel servers and paste it to shape traffic. After 5-15 hours
> all PC is was freeze! Kernel not see TCO watchdog at this platform and
> can't reboot it!. Soft Watchdog not reboot pc in this situation. =(
>
> At screen we see messages like this (when it freeze and i was near
> monitor):
Maybe try nmi_watchdog=1 ?
Also try http://www.nongnu.org/dmidecode/ - to check if it has IPMI.
Mine for example:
....
Handle 0x003F, DMI type 38, 16 bytes
IPMI Device Information
Interface Type: KCS (Keyboard Control Style)
Specification Version: 2.0
I2C Slave Address: 0x10
NV Storage Device Address: 0
Base Address: 0x0000000000000CA2 (I/O)
Also important to change it to newer 2.6.26.5 , because for example 2.6.26.4
have fix:
http://www.kernel.org/pub/linux/kernel/v2.6/ChangeLog-2.6.26.4
....
commit 685f605a498b73759cbcbc816089e804710fcc48
Author: David S. Miller <davem@...emloft.net>
Date: Wed Aug 27 22:35:56 2008 -0700
pkt_sched: Fix return value corruption in HTB and TBF.
[ Upstream commit 69747650c814a8a79fef412c7416adf823293a3e ]
Based upon a bug report by Josip Rodin.
Packet schedulers should only return NET_XMIT_DROP iff
the packet really was dropped. If the packet does reach
the device after we return NET_XMIT_DROP then TCP can
crash because it depends upon the enqueue path return
values being accurate.
Signed-off-by: David S. Miller <davem@...emloft.net>
Signed-off-by: Greg Kroah-Hartman <gregkh@...e.de>
....
P.S. For netdev, i have one more friend - who is complaining that shapers is
crashing on Intel machines (who uses TSC, he have two different "Core" based
servers, and both is crashing). With HPET i dont have any problem on high
performance shapers (except, that it is CPU expensive). It happens on latest
2.6.26.5 too. Machine getting hard lockup, and nothing than hardware watchdog
able to recover it. They dont have experience to get actual reason of this
issue and they dont know english well to report this issue.
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists