Message-ID: <4656CE39.8050800@netunix.com>
Date: Fri, 25 May 2007 12:53:29 +0100
From: Chris Newport <crn@...unix.com>
To: Ingo Molnar <mingo@...e.hu>
CC: Linus Torvalds <torvalds@...ux-foundation.org>,
Christoph Lameter <clameter@....com>,
Michal Piotrowski <michal.k.k.piotrowski@...il.com>,
Andrew Morton <akpm@...ux-foundation.org>,
LKML <linux-kernel@...r.kernel.org>,
"Cherwin R. Nooitmeer" <cherwin@...il.com>,
linux-pcmcia@...ts.infradead.org,
Robert de Rooy <robert.de.rooy@...il.com>,
Alan Cox <alan@...hat.com>, Tejun Heo <htejun@...il.com>,
sparclinux@...r.kernel.org, David Miller <davem@...emloft.net>,
Mikael Pettersson <mikpe@...uu.se>,
linux1394-devel@...ts.sourceforge.net,
Stefan Richter <stefanr@...6.in-berlin.de>,
Kristian Høgsberg <krh@...planet.net>,
linux-pm@...ts.linux-foundation.org,
"Rafael J. Wysocki" <rjw@...k.pl>, Pavel Machek <pavel@....cz>,
Marcus Better <marcus@...ter.se>,
Andrey Borzenkov <arvidjaar@...l.ru>,
linux-usb-devel@...ts.sourceforge.net,
Greg Kroah-Hartman <gregkh@...e.de>
Subject: Re: [2/3] 2.6.22-rc2: known regressions v2
Ingo Molnar wrote:
>A BUG_ON() has a (much) lower likelihood of being reported back - for
>most users it is a "X just hung hard, there was nothing in the syslog, i
>had to switch back to the older kernel" experience, and they do not have
>a serial console to hook up (newer hardware often doesn't even have a
>serial port). With the WARN_ON()s we have a _chance_ that despite the
>seriousness of the bug, the message makes it to the syslog, until the
>system comes to a screeching halt due to side-effects of the bug.
>
>in that sense i am part of the problem: i was adding WARN_ON()s that
>weren't true 'warnings' but 'bugs'. So i'd very much like to fix that
>problem, but i'd also like to solve the (very serious and existing)
>problem of BUG_ON()s making it less likely to get bugs reported back.
>
There is a fundamental problem in getting a decent log to debug a
crashed kernel. Maybe we should take a hint from Solaris.
If the kernel crashes, Solaris dumps core to swap and sets a flag.
At the next boot this image is copied to /var/adm/crashdump, where
it is preserved for future debugging. Obviously swap needs to be
larger than core, but this is usually the case.
On Sun machines this is fairly easy because the dump can be
performed by the OBP. On other architectures it may be more
difficult to still have enough working kernel to achieve the dump
after a kernel panic.
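
To make the idea concrete, here is a rough user-space sketch in C of
the "dump to swap, set a flag, save at next boot" flow. It is only an
illustration: the header layout, DUMP_MAGIC, and both function names
are made up for this example and are not Solaris or Linux interfaces,
and "swap" is just a byte buffer standing in for the swap device.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical marker meaning "a valid crash dump is present". */
#define DUMP_MAGIC 0x44554d50u

/* Hypothetical header stored at the start of the swap area. */
struct dump_header {
    uint32_t magic;   /* DUMP_MAGIC when a dump is waiting to be saved */
    uint32_t length;  /* size of the memory image in bytes */
};

/*
 * Panic path: copy the memory image into swap, then set the flag.
 * The header is written last so that an interrupted dump is never
 * mistaken for a complete one. Returns 0 on success, -1 if the swap
 * area is too small to hold the image.
 */
int crash_dump_to_swap(uint8_t *swap, size_t swap_size,
                       const uint8_t *core, size_t core_size)
{
    struct dump_header hdr;

    assert(swap != NULL && core != NULL);
    if (swap_size < sizeof hdr ||
        core_size > swap_size - sizeof hdr ||
        core_size > UINT32_MAX)
        return -1;

    memcpy(swap + sizeof hdr, core, core_size);
    hdr.magic = DUMP_MAGIC;
    hdr.length = (uint32_t)core_size;
    memcpy(swap, &hdr, sizeof hdr);
    return 0;
}

/*
 * Boot path: if the flag is set, copy the image out of swap and clear
 * the flag so the same dump is not saved twice. Returns the number of
 * bytes saved, 0 if no dump was pending, or -1 if the header is
 * inconsistent or the output buffer is too small.
 */
long save_crash_dump(uint8_t *swap, size_t swap_size,
                     uint8_t *out, size_t out_size)
{
    struct dump_header hdr;

    assert(swap != NULL && out != NULL);
    if (swap_size < sizeof hdr)
        return 0;
    memcpy(&hdr, swap, sizeof hdr);
    if (hdr.magic != DUMP_MAGIC)
        return 0;
    if (hdr.length > swap_size - sizeof hdr || hdr.length > out_size)
        return -1;

    memcpy(out, swap + sizeof hdr, hdr.length);
    hdr.magic = 0;
    memcpy(swap, &hdr, sizeof hdr);
    return (long)hdr.length;
}
```

The hard part in a real kernel is of course the first function: after
a panic there may not be enough working kernel left to drive the disk,
which is why having the firmware do the write makes it easier.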
Just a thought ...