Message-Id: <200908241146.07639.rene.mayrhofer@gibraltar.at>
Date:	Mon, 24 Aug 2009 11:46:07 +0200
From:	Rene Mayrhofer <rene.mayrhofer@...raltar.at>
To:	Mike McCormack <mikem@...g3k.org>
Cc:	Stephen Hemminger <shemminger@...ux-foundation.org>,
	netdev@...r.kernel.org, Richard Leitner <leitner@...s.at>
Subject: Re: Kernel oops on setting sky2 interfaces down

On Friday 21 August 2009 01:03:26 pm Mike McCormack wrote:
> 2009/8/21 Rene Mayrhofer <rene.mayrhofer@...raltar.at>:
> > I've also tried the net-next-2.6 version of sky2.[ch] as of yesterday
> > without Mike's "bandaid" patches. With that version (the last one in
> > branch gibraltar-3.0 at
> > https://www.gibraltar.at/git/linux-2.6-gibraltar.git), I managed to
> > successfully do a networking restart (with "light" traffic on one
> > interface), leaving the interfaces functional after the restart. This
> > worked even twice in a row, so maybe we are onto something here. This is
> > certainly an improvement over the version with Mike's last patch (from
> > yesterday) applied, which left the interfaces broken after a restart (and
> > with the quoted errors on a rmmod/modprobe sky2).
> >
> > However, after doing a ping -f on one of the (GBit) interfaces to another
> > host on the same switch for a few seconds and then executing networking
> > restart, the result was an immediate reboot of the box without any oops
> > being printed to the (serial) console beforehand.
>
> How about trying to remove the skge module, then running tests on the
> sky2 interfaces only?  This way you might be able to isolate the
> remaining problems to sky2 or skge...?

Tried that. With skge removed, I generated some traffic on the lan interface
with ping -f and then ran a networking restart:

[~]# /etc/init.d/networking restart
Reconfiguring network interfaces...[  231.816044] sky2 0000:03:00.0: error interrupt status=0xffffffff
[  231.822339] sky2 0000:03:00.0: PCI hardware error (0xffff)
[  231.828205] sky2 0000:03:00.0: PCI Express error (0xffffffff)
[  231.834301] sky2 dmz: ram data read parity error
[  231.839226] sky2 dmz: ram data write parity error
[  231.844142] sky2 dmz: MAC parity error
[  231.848077] sky2 dmz: RX parity error
[  231.851937] sky2 dmz: TCP segmentation error
[  231.856499] BUG: unable to handle kernel NULL pointer dereference at 0000038d
[  231.860446] IP: [<f8068cb5>] sky2_mac_intr+0x30/0xc1 [sky2]
[  231.860446] *pde = 00000000
[  231.860446] Oops: 0000 [#1] PREEMPT SMP
[  231.860446] last sysfs file: /sys/devices/system/cpu/cpu0/cpufreq/scaling_setspeed
[  231.860446] Modules linked in: xt_multiport cpufreq_userspace ip6t_REJECT xt_DSCP xt_length xt_mark xt_dscp xt_MARK xt_IMQ xt_CONNMARK xt_comment xt_policy ip6t_LOG xt_tcpudp ip6table_mangle iptable_mangle ip6table_filter ip6_tables sit tunnel4 8021q garp stp llc ipt_LOG xt_limit xt_state iptable_nat iptable_filter ip_tables x_tables dm_mod p4_clockmod speedstep_lib freq_table tun imq nf_nat_ftp nf_nat nf_conntrack_ftp nf_conntrack_ipv6 nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 ipv6 iTCO_wdt parport_pc rng_core i2c_i801 serio_raw evdev i2c_core pcspkr parport button intel_agp processor squashfs loop aufs exportfs nls_utf8 nls_cp437 ide_generic sd_mod ata_generic pata_acpi ata_piix ide_pci_generic ide_core sky2 thermal fan thermal_sys [last unloaded: skge]
[  231.901463]
[  231.901463] Pid: 10223, comm: postfix Not tainted (2.6.30.5 #14)
[  231.901463] EIP: 0060:[<f8068cb5>] EFLAGS: 00010286 CPU: 0
[  231.901463] EIP is at sky2_mac_intr+0x30/0xc1 [sky2]
[  231.901463] EAX: f8090f88 EBX: 00000001 ECX: 00000008 EDX: 000000ff
[  231.901463] ESI: 00000000 EDI: f6b03e80 EBP: f7119eb4 ESP: f7119e9c
[  231.901463]  DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
[  231.901463] Process postfix (pid: 10223, ti=f7118000 task=f46c63b0 task.ti=f7118000)
[  231.901463] Stack:
[  231.901463]  00000080 ffb03e80 aa9c6992 f70fb4c0 ffffffff ffffffff f7119f28 f806c3cd
[  231.901463]  c181d7a0 aa9c6992 00000040 f6b03e88 c0243708 00000000 f6b03e80 ffffffff
[  231.901463]  00000001 f70e6400 c181d260 ffffbd75 00000001 00000000 aa9c6992 c181aa20
[  231.901463] Call Trace:
[  231.901463]  [<f806c3cd>] ? sky2_poll+0x1d2/0xa29 [sky2]
[  231.901463]  [<c0243708>] ? insert_work+0xa5/0xbf
[  231.901463]  [<c0244012>] ? delayed_work_timer_fn+0x3c/0x4f
[  231.901463]  [<c04eb081>] ? _spin_unlock_irq+0x2f/0x42
[  231.901463]  [<c0462eea>] ? net_rx_action+0x9e/0x1ae
[  231.901463]  [<c0237ce2>] ? __do_softirq+0xb2/0x188
[  231.901463]  [<c0237df7>] ? do_softirq+0x3f/0x5c
[  231.901463]  [<c0237f81>] ? irq_exit+0x37/0x80
[  231.901463]  [<c0213cfd>] ? smp_apic_timer_interrupt+0x7c/0x9b
[  231.901463]  [<c02037dd>] ? apic_timer_interrupt+0x31/0x38
[  231.901463] Code: c7 56 53 89 d3 83 ec 0c 65 a1 14 00 00 00 89 45 f0 31 c0 8b 74 97 3c c1 e2 07 89 d0 05 08 0f 00 00 89 55 e8 03 07 8a 10 88 55 ef <f6> 86 8d 03 00 00 02 74 12 0f b6 c2 50 56 68 50 e4 06 f8 e8 74
[  231.901463] EIP: [<f8068cb5>] sky2_mac_intr+0x30/0xc1 [sky2] SS:ESP 0068:f7119e9c
[  231.901463] CR2: 000000000000038d
[  232.123563] ---[ end trace 8bdf0c8b944f23cc ]---

Message from[  232.128549] Kernel panic - not syncing: Fatal exception in 
interrupt
 syslogd@...ralt[  232.136608] Pid: 10223, comm: postfix Tainted: G      D    
2.6.30.5 #14
ar3-esys-master [  232.145004] Call Trace:                                                
at Aug 24 11:44:[  232.148967]  [<c04e7e5e>] ? printk+0x1d/0x30                           
50 ...                                                                                    
 kernel[  232.154878]  [<c04e7d9c>] panic+0x53/0xf8
:[  231.860446] [  232.160490]  [<c0206368>] oops_end+0x9f/0xbf
Oops: 0000 [#1] [  232.166380]  [<c021ceb4>] no_context+0x11a/0x135
PREEMPT SMP
[  232.172653]  [<c021d005>] __bad_area_nosemaphore+0x136/0x14f

Message from [  232.180117]  [<c0371e1c>] ? vsnprintf+0x91/0x332
syslogd@...ralta[  232.186394]  [<c04eb0c5>] ? 
_spin_unlock_irqrestore+0x31/0x44
r3-esys-master a[  232.193846]  [<c04eb0c5>] ? 
_spin_unlock_irqrestore+0x31/0x44
t Aug 24 11:44:5[  232.201305]  [<c02330c3>] ? 
release_console_sem+0x18b/0x1c9
0 ...
 kernel:[  232.208618]  [<c021d03b>] bad_area_nosemaphore+0x1d/0x34
[  231.860446] l[  232.215633]  [<c021d30b>] do_page_fault+0x110/0x21b
ast sysfs file: [  232.222208]  [<c021d1fb>] ? do_page_fault+0x0/0x21b
/sys/devices/sys[  232.228739]  [<c04eb53a>] error_code+0x7a/0x80
tem/cpu/cpu0/cpu[  232.234852]  [<c036007b>] ? req_bio_endio+0xad/0xb4
freq/scaling_set[  232.241383]  [<f8068cb5>] ? sky2_mac_intr+0x30/0xc1 [sky2]
speed

Mess[  232.248534]  [<f806c3cd>] sky2_poll+0x1d2/0xa29 [sky2]
age from syslogd[  232.255298]  [<c0243708>] ? insert_work+0xa5/0xbf
@gibraltar3-esys[  232.261648]  [<c0244012>] ? delayed_work_timer_fn+0x3c/0x4f
-master at Aug 2[  232.268906]  [<c04eb081>] ? _spin_unlock_irq+0x2f/0x42
[  232.275707]  [<c0462eea>] net_rx_action+0x9e/0x1ae

 kernel:[  231.[  232.282160]  [<c0237ce2>] __do_softirq+0xb2/0x188
901463] Process [  232.288547]  [<c0237df7>] do_softirq+0x3f/0x5c
postfix (pid: 10[  232.294597]  [<c0237f81>] irq_exit+0x37/0x80
223, ti=f7118000[  232.300533]  [<c0213cfd>] 
smp_apic_timer_interrupt+0x7c/0x9b
 task=f46c63b0 t[  232.307864]  [<c02037dd>] apic_timer_interrupt+0x31/0x38
ask.ti=f7118000)[  232.314826] Rebooting in 30 seconds..
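
For reference, a minimal sketch of how the fault address in the oops can be
cross-checked against the register dump. It assumes that the bytes at the
faulting EIP (marked <f6> in the Code: line) decode as
testb $0x2,0x38d(%esi); that decoding is a manual reading of the opcode
bytes, not disassembler output, and the helper names are hypothetical.

```python
import struct

# Opcode bytes copied from the "Code:" line of the oops; the byte in
# angle brackets is the one at the faulting EIP.
CODE = (
    "c7 56 53 89 d3 83 ec 0c 65 a1 14 00 00 00 89 45 f0 31 c0 "
    "8b 74 97 3c c1 e2 07 89 d0 05 08 0f 00 00 89 55 e8 03 07 8a 10 88 55 ef <f6> "
    "86 8d 03 00 00 02 74 12 0f b6 c2 50 56 68 50 e4 06 f8 e8 74"
)
ESI = 0x00000000  # ESI from the register dump
CR2 = 0x0000038D  # faulting address reported by the kernel


def faulting_bytes(code):
    """Return the opcode bytes starting at the <xx> marker."""
    tokens = code.split()
    start = next(i for i, tok in enumerate(tokens) if tok.startswith("<"))
    return bytes(int(tok.strip("<>"), 16) for tok in tokens[start:])


def decode_testb_esi(insn):
    """Decode 'testb $imm8, disp32(%esi)'.

    Assumption: opcode F6 /0 ib with ModRM byte 0x86
    (mod=10, reg=000, rm=110, i.e. ESI plus a 32-bit displacement).
    Anything else is rejected rather than guessed at.
    """
    if insn[0] != 0xF6 or insn[1] != 0x86:
        raise ValueError("not the instruction form this sketch handles")
    disp = struct.unpack_from("<I", insn, 2)[0]  # little-endian disp32
    imm = insn[6]
    return disp, imm


disp, imm = decode_testb_esi(faulting_bytes(CODE))
fault_addr = (ESI + disp) & 0xFFFFFFFF
print(hex(disp), hex(imm), hex(fault_addr), fault_addr == CR2)
```

Under that assumption the computed address equals CR2, which is consistent
with a NULL base pointer in ESI being dereferenced at offset 0x38d.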

Is there anything else I can try?

best regards,
Rene

-- 
-------------------------------------------------
Gibraltar firewall       http://www.gibraltar.at/
