[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Date: Thu, 9 Sep 2010 13:45:13 +0300 (EEST)
From: "Ilpo Järvinen" <ilpo.jarvinen@...sinki.fi>
To: Tejun Heo <tj@...nel.org>
cc: Lennart Schulte <lennart.schulte@...s.rwth-aachen.de>,
Eric Dumazet <eric.dumazet@...il.com>,
"David S. Miller" <davem@...emloft.net>,
"netdev@...r.kernel.org" <netdev@...r.kernel.org>,
"Fehrmann, Henning" <henning.fehrmann@....mpg.de>,
Carsten Aulbert <carsten.aulbert@....mpg.de>
Subject: Re: oops in tcp_xmit_retransmit_queue() w/ v2.6.32.15
On Thu, 9 Sep 2010, Tejun Heo wrote:
> On 09/08/2010 12:34 PM, Ilpo Järvinen wrote:
> >> Unfortunately, we haven't been able to reproduce the problem anymore.
> >
> > With my debug patch or not at all?
>
> With the ugly merged patch I posted previously in this thread which
> contained debug messages if any of the worked around condition
> triggers.
>
> >> It could be (but not likely given that none of the debugging messages
> >> is triggering) that I was mistaken and the previously posted fixed the
> >> issue. The network used by the cluster went through some changes at
> >> the time and there have been issues with packet losses. Given that
> >> the problem needs packet losses to trigger, it's likely that packet
> >> loss pattern here changed such that the patterns of packet losses
> >> which trigger the problem aren't happening anymore. (Carsten,
> >> Henning, please feel free to fill in if I'm missing something).
> >
> > That might well be true, however, you're already a second guy who
> > cannot reproduce it with the debug patch so I would not rule out other
> > possibilities unless you've tried without debug patch too since the
> > changes?
>
> Unfortunately, I can't really tell one way or the other at this point.
> Carsten will be back in a few days. I'll ask him for more details.
Once you get the info, if not yet done, I'd recommend you try without the
debug patch (assuming a possible crash isn't too devasting for the actual
stuff you're doing with the machines :-)). ...If it crashes without, then
it's time to start looking into compiler versions, etc.
--
i.
Powered by blists - more mailing lists