Message-ID: <20080725223121.GD3107@ami.dom.local>
Date: Sat, 26 Jul 2008 00:31:22 +0200
From: Jarek Poplawski <jarkao2@...il.com>
To: denys@...p.net.lb
Cc: Thomas Gleixner <tglx@...utronix.de>, netdev@...r.kernel.org,
linux-kernel@...r.kernel.org
Subject: hrtimers lockups Re: NMI lockup, 2.6.26 release
Hi,
This netdev thread describes lockups occurring in the hrtimers code:
http://marc.info/?l=linux-netdev&m=121675217927170&w=2
Very similar reports from Denys Fedoryshchenko can be found in the
netdev archives from a few kernel versions back.
It looks like replacing hrtimers with timers in the sch_htb code removes
the problems. I hope Thomas or somebody from linux-kernel can give
some clue on this.
Thanks,
Jarek P.
Denys, read below:
On Sat, Jul 26, 2008 at 12:09:52AM +0300, denys@...p.net.lb wrote:
> I will try to explain all details, maybe anything matter
>
> around 150-300 megs passing
> Core 2 Duo E6750
> 3 ifb's
> 29 htb classes (summary)
> 26 qdiscs (sfq and bfifo)
> NAT is running (465-700K connections)
> maximum bfifo qdisc size is 600Kbyte
> mostly all filters u32 (one is police mtu)
> quantum is 1514, one is 1515
> Load is low (below 30-35)% by mpstat
>
> The only error i have in dmesg (a LOT of this messages, different ip port, )
> [162014.265116] UDP: short packet: From 200.122.35.205:64599 8409/1480 to
> 213.254.233.9:6073
> [162014.373110] UDP: short packet: From 200.122.35.205:52015 10698/1480 to
> 213.254.233.9:4855
>
> [162088.232099] UDP: bad checksum. From 96.234.33.9:1077 to
> 213.254.233.9:49520 ulen 111
>
>
> I run time-warp-test from Ingo Molnar - nothing, no warps.
>
> If required - i can send all rules to private e-mail.
>
> I will apply patch after 30-60 minutes (off peak time). Thanks for help a lot!
You are very helpful too! But I think we will need some help from the
hrtimers/hardware gurus. IMHO, since it works with timers, the bug
doesn't seem to belong to "netdev". I can't see any obvious way of
"abusing" hrtimers with your config, e.g. by creating too many of them
(1 hrtimer per qdisc). So I'm not very optimistic about this new patch,
but even if it works, it looks like something else is wrong. That's why
I added some CCs to this.
Jarek P.