Message-Id: <200908241146.07639.rene.mayrhofer@gibraltar.at>
Date: Mon, 24 Aug 2009 11:46:07 +0200
From: Rene Mayrhofer <rene.mayrhofer@...raltar.at>
To: Mike McCormack <mikem@...g3k.org>
Cc: Stephen Hemminger <shemminger@...ux-foundation.org>,
netdev@...r.kernel.org, Richard Leitner <leitner@...s.at>
Subject: Re: Kernel oops on setting sky2 interfaces down
On Friday 21 August 2009 01:03:26 pm Mike McCormack wrote:
> 2009/8/21 Rene Mayrhofer <rene.mayrhofer@...raltar.at>:
> > I've also tried the net-next-2.6 version of sky2.[ch] as of yesterday
> > without Mike's "bandaid" patches. With that version (the last one in
> > branch gibraltar-3.0 at
> > https://www.gibraltar.at/git/linux-2.6-gibraltar.git), I managed to
> > successfully do a networking restart (with "light" traffic on one
> > interface), leaving the interfaces functional after the restart. This
> > worked even twice in a row, so maybe we are onto something here. This is
> > certainly an improvement over the version with Mike's last patch (from
> > yesterday) applied, which left the interfaces broken after a restart (and
> > with the quoted errors on a rmmod/modprobe sky2).
> >
> > However, after doing a ping -f on one of the (GBit) interfaces to another
> > host on the same switch for a few seconds and then executing networking
> > restart, the result was an immediate reboot of the box without any oops
> > being printed to the (serial) console beforehand.
>
> How about trying to remove the skge module, then running tests on the
> sky2 interfaces only? This way you might be able to isolate the
> remaining problems to sky2 or skge...?
Tried that. With skge removed, I generated some traffic on the lan interface
with ping -f and then ran a networking restart:
[~]# /etc/init.d/networking restart
Reconfiguring network interfaces...
[ 231.816044] sky2 0000:03:00.0: error interrupt status=0xffffffff
[ 231.822339] sky2 0000:03:00.0: PCI hardware error (0xffff)
[ 231.828205] sky2 0000:03:00.0: PCI Express error (0xffffffff)
[ 231.834301] sky2 dmz: ram data read parity error
[ 231.839226] sky2 dmz: ram data write parity error
[ 231.844142] sky2 dmz: MAC parity error
[ 231.848077] sky2 dmz: RX parity error
[ 231.851937] sky2 dmz: TCP segmentation error
[ 231.856499] BUG: unable to handle kernel NULL pointer dereference at 0000038d
[ 231.860446] IP: [<f8068cb5>] sky2_mac_intr+0x30/0xc1 [sky2]
[ 231.860446] *pde = 00000000
[ 231.860446] Oops: 0000 [#1] PREEMPT SMP
[ 231.860446] last sysfs file:
/sys/devices/system/cpu/cpu0/cpufreq/scaling_setspeed
[ 231.860446] Modules linked in: xt_multiport cpufreq_userspace ip6t_REJECT
xt_DSCP xt_length xt_mark xt_dscp xt_MARK xt_IMQ xt_CONNMARK xt_comment
xt_policy ip6t_LOG xt_tcpudp ip6table_mangle iptable_mangle ip6table_filter
ip6_tables sit tunnel4 8021q garp stp llc ipt_LOG xt_limit xt_state
iptable_nat iptable_filter ip_tables x_tables dm_mod p4_clockmod speedstep_lib
freq_table tun imq nf_nat_ftp nf_nat nf_conntrack_ftp nf_conntrack_ipv6
nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 ipv6 iTCO_wdt parport_pc
rng_core i2c_i801 serio_raw evdev i2c_core pcspkr parport button intel_agp
processor squashfs loop aufs exportfs nls_utf8 nls_cp437 ide_generic sd_mod
ata_generic pata_acpi ata_piix ide_pci_generic ide_core sky2 thermal fan
thermal_sys [last unloaded: skge]
[ 231.901463]
[ 231.901463] Pid: 10223, comm: postfix Not tainted (2.6.30.5 #14)
[ 231.901463] EIP: 0060:[<f8068cb5>] EFLAGS: 00010286 CPU: 0
[ 231.901463] EIP is at sky2_mac_intr+0x30/0xc1 [sky2]
[ 231.901463] EAX: f8090f88 EBX: 00000001 ECX: 00000008 EDX: 000000ff
[ 231.901463] ESI: 00000000 EDI: f6b03e80 EBP: f7119eb4 ESP: f7119e9c
[ 231.901463] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
[ 231.901463] Process postfix (pid: 10223, ti=f7118000 task=f46c63b0
task.ti=f7118000)
[ 231.901463] Stack:
[ 231.901463] 00000080 ffb03e80 aa9c6992 f70fb4c0 ffffffff ffffffff f7119f28
f806c3cd
[ 231.901463] c181d7a0 aa9c6992 00000040 f6b03e88 c0243708 00000000
f6b03e80 ffffffff
[ 231.901463] 00000001 f70e6400 c181d260 ffffbd75 00000001 00000000
aa9c6992 c181aa20
[ 231.901463] Call Trace:
[ 231.901463] [<f806c3cd>] ? sky2_poll+0x1d2/0xa29 [sky2]
[ 231.901463] [<c0243708>] ? insert_work+0xa5/0xbf
[ 231.901463] [<c0244012>] ? delayed_work_timer_fn+0x3c/0x4f
[ 231.901463] [<c04eb081>] ? _spin_unlock_irq+0x2f/0x42
[ 231.901463] [<c0462eea>] ? net_rx_action+0x9e/0x1ae
[ 231.901463] [<c0237ce2>] ? __do_softirq+0xb2/0x188
[ 231.901463] [<c0237df7>] ? do_softirq+0x3f/0x5c
[ 231.901463] [<c0237f81>] ? irq_exit+0x37/0x80
[ 231.901463] [<c0213cfd>] ? smp_apic_timer_interrupt+0x7c/0x9b
[ 231.901463] [<c02037dd>] ? apic_timer_interrupt+0x31/0x38
[ 231.901463] Code: c7 56 53 89 d3 83 ec 0c 65 a1 14 00 00 00 89 45 f0 31 c0
8b 74 97 3c c1 e2 07 89 d0 05 08 0f 00 00 89 55 e8 03 07 8a 10 88 55 ef <f6>
86 8d 03 00 00 02 74 12 0f b6 c2 50 56 68 50 e4 06 f8 e8 74
[ 231.901463] EIP: [<f8068cb5>] sky2_mac_intr+0x30/0xc1 [sky2] SS:ESP
0068:f7119e9c
[ 231.901463] CR2: 000000000000038d
[ 232.123563] ---[ end trace 8bdf0c8b944f23cc ]---
[ 232.128549] Kernel panic - not syncing: Fatal exception in interrupt
[ 232.136608] Pid: 10223, comm: postfix Tainted: G D 2.6.30.5 #14
[ 232.145004] Call Trace:
[ 232.148967] [<c04e7e5e>] ? printk+0x1d/0x30
[ 232.154878] [<c04e7d9c>] panic+0x53/0xf8
[ 232.160490] [<c0206368>] oops_end+0x9f/0xbf
[ 232.166380] [<c021ceb4>] no_context+0x11a/0x135
[ 232.172653] [<c021d005>] __bad_area_nosemaphore+0x136/0x14f
[ 232.180117] [<c0371e1c>] ? vsnprintf+0x91/0x332
[ 232.186394] [<c04eb0c5>] ? _spin_unlock_irqrestore+0x31/0x44
[ 232.193846] [<c04eb0c5>] ? _spin_unlock_irqrestore+0x31/0x44
[ 232.201305] [<c02330c3>] ? release_console_sem+0x18b/0x1c9
[ 232.208618] [<c021d03b>] bad_area_nosemaphore+0x1d/0x34
[ 232.215633] [<c021d30b>] do_page_fault+0x110/0x21b
[ 232.222208] [<c021d1fb>] ? do_page_fault+0x0/0x21b
[ 232.228739] [<c04eb53a>] error_code+0x7a/0x80
[ 232.234852] [<c036007b>] ? req_bio_endio+0xad/0xb4
[ 232.241383] [<f8068cb5>] ? sky2_mac_intr+0x30/0xc1 [sky2]
[ 232.248534] [<f806c3cd>] sky2_poll+0x1d2/0xa29 [sky2]
[ 232.255298] [<c0243708>] ? insert_work+0xa5/0xbf
[ 232.261648] [<c0244012>] ? delayed_work_timer_fn+0x3c/0x4f
[ 232.268906] [<c04eb081>] ? _spin_unlock_irq+0x2f/0x42
[ 232.275707] [<c0462eea>] net_rx_action+0x9e/0x1ae
[ 232.282160] [<c0237ce2>] __do_softirq+0xb2/0x188
[ 232.288547] [<c0237df7>] do_softirq+0x3f/0x5c
[ 232.294597] [<c0237f81>] irq_exit+0x37/0x80
[ 232.300533] [<c0213cfd>] smp_apic_timer_interrupt+0x7c/0x9b
[ 232.307864] [<c02037dd>] apic_timer_interrupt+0x31/0x38
[ 232.314826] Rebooting in 30 seconds..
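
In case it helps with reading the oops: the faulting instruction can be
decoded by hand from the "Code:" line. The byte marked <f6> and the six bytes
after it look like a "TEST r/m8, imm8" with a [reg + disp32] operand. The
sketch below only handles that one encoding; the helper name
decode_test_rm8_imm8 is made up for illustration, and the register value is
copied from the ESI field of the dump above.

```python
import struct

# Bytes starting at EIP, copied from the "Code:" line (<f6> marks EIP).
code_at_eip = bytes.fromhex("f6 86 8d 03 00 00 02")
# ESI as shown in the register dump of the oops.
esi = 0x00000000

REG32 = ["eax", "ecx", "edx", "ebx", "esp", "ebp", "esi", "edi"]


def decode_test_rm8_imm8(insn, regs):
    """Decode 'F6 /0 ib' (TEST r/m8, imm8) for a mod=10, non-SIB ModRM.

    Hypothetical helper: it covers just the encoding seen in this oops
    and is not a general x86 decoder.
    """
    opcode, modrm = insn[0], insn[1]
    mod, reg, rm = modrm >> 6, (modrm >> 3) & 7, modrm & 7
    assert opcode == 0xF6 and reg == 0, "not TEST r/m8, imm8"
    assert mod == 0b10 and rm != 0b100, "only [reg + disp32] is handled"
    (disp,) = struct.unpack_from("<i", insn, 2)  # little-endian disp32
    imm = insn[6]                                # trailing imm8
    base = REG32[rm]
    addr = (regs[base] + disp) & 0xFFFFFFFF      # effective address
    return base, disp, imm, addr


base, disp, imm, addr = decode_test_rm8_imm8(code_at_eip, {"esi": esi})
print(f"testb ${imm:#x}, {disp:#x}(%{base}) -> fault address {addr:#x}")
```

With ESI = 0 this gives a fault address of 0x38d, which matches CR2. The
bytes "8b 74 97 3c" earlier in the same Code: line decode to a load of ESI
from 0x3c(%edi,%edx,4), so a NULL entry in a per-port pointer array is one
possible reading, but that is only a guess without the matching source and
object file.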
Is there anything else I can try?
best regards,
Rene
--
-------------------------------------------------
Gibraltar firewall http://www.gibraltar.at/