Date:	Sun, 11 Oct 2009 15:55:46 +0100
From:	Bernd Schmidt <bernds_cb1@...nline.de>
To:	netdev@...r.kernel.org
CC:	Stephen Hemminger <shemminger@...l.org>,
	shemminger@...ux-foundation.org
Subject: Occasional crashes with sky2

For a few months now, I've been seeing occasional kernel panics, roughly
one every few weeks.  I'm not sure exactly when they started, but I
definitely see them in 2.6.29 and 2.6.30, and never saw them in 2.6.25
and earlier.  They happen with both 32-bit and 64-bit kernels.

Today I managed to capture an oops with netconsole; it's attached.  The
call trace seems to point towards the sky2 driver.

Some hardware information:
03:00.0 Ethernet controller: Marvell Technology Group Ltd. 88E8053 PCI-E Gigabit Ethernet Controller (rev 20)
	Subsystem: ASUSTeK Computer Inc. Marvell 88E8053 Gigabit Ethernet controller PCIe (Asus)
	Flags: bus master, fast devsel, latency 0, IRQ 16
	Memory at f88fc000 (64-bit, non-prefetchable) [size=16K]
	I/O ports at a800 [size=256]
	Expansion ROM at f88c0000 [disabled] [size=128K]
	Capabilities: [48] Power Management version 2
	Capabilities: [50] Vital Product Data <?>
	Capabilities: [5c] Message Signalled Interrupts: Mask- 64bit+ Count=1/2 Enable-
	Capabilities: [e0] Express Legacy Endpoint, MSI 00
	Capabilities: [100] Advanced Error Reporting
		UESta:	DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSVoil-
		UEMsk:	DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSVoil-
		UESvrt:	DLP+ SDES- TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt- RxOF+ MalfTLP+ ECRC- UnsupReq- ACSVoil-
		CESta:	RxErr+ BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr-
		CESta:	RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr-
		AERCap:	First Error Pointer: 1f, GenCap- CGenEn- ChkCap- ChkEn-
	Kernel driver in use: sky2
	Kernel modules: sky2

04:00.0 Ethernet controller: Marvell Technology Group Ltd. 88E8053 PCI-E Gigabit Ethernet Controller (rev 20)
	Subsystem: ASUSTeK Computer Inc. Marvell 88E8053 Gigabit Ethernet controller PCIe (Asus)
	Flags: bus master, fast devsel, latency 0, IRQ 19
	Memory at f89fc000 (64-bit, non-prefetchable) [size=16K]
	I/O ports at b800 [size=256]
	Expansion ROM at f89c0000 [disabled] [size=128K]
	Capabilities: [48] Power Management version 2
	Capabilities: [50] Vital Product Data <?>
	Capabilities: [5c] Message Signalled Interrupts: Mask- 64bit+ Count=1/2 Enable-
	Capabilities: [e0] Express Legacy Endpoint, MSI 00
	Capabilities: [100] Advanced Error Reporting
		UESta:	DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSVoil-
		UEMsk:	DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSVoil-
		UESvrt:	DLP+ SDES- TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt- RxOF+ MalfTLP+ ECRC- UnsupReq- ACSVoil-
		CESta:	RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr-
		CESta:	RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr-
		AERCap:	First Error Pointer: 1f, GenCap- CGenEn- ChkCap- ChkEn-
	Kernel driver in use: sky2
	Kernel modules: sky2


Both of these share their IRQ with other devices:
beaker ~ # dir /proc/irq/16/
total 0
dr-xr-xr-x 2 root root 0 Oct 11 15:46 eth1
dr-xr-xr-x 2 root root 0 Oct 11 15:46 nvidia
dr-xr-xr-x 2 root root 0 Oct 11 15:46 pata_jmicron
-rw------- 1 root root 0 Oct 11 15:46 smp_affinity
-r--r--r-- 1 root root 0 Oct 11 15:46 spurious
beaker ~ # dir /proc/irq/19/
total 0
dr-xr-xr-x 2 root root 0 Oct 11 15:46 HDA Intel
dr-xr-xr-x 2 root root 0 Oct 11 15:46 eth0
-rw------- 1 root root 0 Oct 11 15:46 smp_affinity
-r--r--r-- 1 root root 0 Oct 11 15:46 spurious
dr-xr-xr-x 2 root root 0 Oct 11 15:46 uhci_hcd:usb5
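
For anyone who wants to collect the same information on another machine,
the listings above can be reproduced with a small script.  This is only a
sketch: the function name irq_sharers and the proc_root parameter are made
up for illustration, and it assumes that each handler registered on an IRQ
shows up as a subdirectory of /proc/irq/<n>/, as in the output above.

```python
import os


def irq_sharers(irq, proc_root="/proc/irq"):
    """Return the sorted names of the handlers registered on one IRQ.

    irq_sharers and proc_root are hypothetical names used for this
    sketch; they are not part of any kernel or library interface.
    """
    irq_dir = os.path.join(proc_root, str(irq))
    # Handlers appear as subdirectories (eth1, nvidia, ...).  Plain files
    # such as smp_affinity and spurious are per-IRQ attributes, so they
    # are filtered out by the isdir() check.
    return sorted(
        entry
        for entry in os.listdir(irq_dir)
        if os.path.isdir(os.path.join(irq_dir, entry))
    )
```

Calling irq_sharers(16) and irq_sharers(19) on this box should list the
same handler names as the two directory listings above; an IRQ with more
than one entry is shared.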

azx_interrupt, which occurs in the call trace, belongs to HDA Intel,
which shares IRQ 19 with eth0.  Not sure if that's a clue.

Please let me know if you need additional information.


Bernd

View attachment "oops" of type "text/plain" (3945 bytes)
