Date:	Tue, 08 May 2012 11:26:15 +0300
From:	vaLentin chernoZemski <valentin@...eground.com>
To:	linux-kernel@...r.kernel.org
Subject: PROBLEM: 3.2.14 - Sporadic HARD lockups/__slab_free/intel_pmu_handle_irq->perf_event_overflow

Please note that the kernel is patched with grsec, but grsec is not
enabled in the kernel configuration.

[1.] One line summary of the problem:

Sporadic hard lockups cause the server to freeze.

[2.] Full description of the problem/report:

The server freezes with no output logged to /var/log/messages.

The kernel messages are, however, exported over the network and
captured via the netconsole module.
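
For readers who want to capture such messages themselves, the receiving
side can be sketched as follows. This is only a minimal illustration,
not the setup used for this report: the function names are made up for
the example, and port 6666 is assumed because it is netconsole's
documented default target port. netconsole sends each kernel message as
a plain UDP datagram, so a small listener is enough.

```python
import socket


def open_listener(host="0.0.0.0", port=6666):
    """Bind a UDP socket on the port netconsole is assumed to target."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    sock.bind((host, port))
    return sock


def read_messages(sock, count, timeout=5.0):
    """Receive `count` datagrams and return (source_ip, text) pairs."""
    sock.settimeout(timeout)
    messages = []
    for _ in range(count):
        data, (src_ip, _src_port) = sock.recvfrom(65535)
        # Kernel messages are text; replace undecodable bytes instead of failing.
        text = data.decode("utf-8", errors="replace").rstrip("\n")
        messages.append((src_ip, text))
    return messages
```

A long-running collector would call open_listener() once and then loop
over read_messages(sock, 1, timeout=None), appending each line to a
file together with a timestamp.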

The issue cannot be reproduced on demand. It occurs sporadically, and
the hard lockups appear, at least to me, to be caused by different parts
of the kernel. I am not sure which part triggers them.
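
As background for reading the traces in section [6.]: as far as I
understand kernel/watchdog.c, the hard lockup detector works by letting
a periodic perf NMI check whether a per-CPU timer interrupt counter has
advanced since the previous check. That would explain why
intel_pmu_handle_irq and perf_event_overflow show up in every trace:
they are the detector's own path, not necessarily the cause. The toy
model below only sketches that idea; the class and method names are
invented for illustration and are not kernel code.

```python
class HardLockupDetector:
    """Toy model of the per-CPU check behind 'Watchdog detected hard LOCKUP'."""

    def __init__(self):
        self.hrtimer_interrupts = 0  # bumped by the per-CPU timer interrupt
        self.saved = 0               # counter value seen at the previous NMI

    def timer_tick(self):
        """Timer interrupt ran, so interrupts are still being serviced."""
        self.hrtimer_interrupts += 1

    def nmi_check(self):
        """Called from the NMI; True means no tick since the last check."""
        locked_up = self.hrtimer_interrupts == self.saved
        self.saved = self.hrtimer_interrupts
        return locked_up


cpu = HardLockupDetector()
cpu.timer_tick()
print(cpu.nmi_check())  # False: a tick happened since the last check
print(cpu.nmi_check())  # True: no tick in between, e.g. IRQs stayed disabled
```

In this model a CPU that spins with interrupts disabled never calls
timer_tick(), so the next NMI reports a lockup regardless of which code
was running at that moment.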

[3.] Keywords (i.e., modules, networking, kernel):

intel_pmu_handle_irq
perf_event_overflow
hard lockup
__slab_free
aacraid

[4.] Kernel information
[4.1.] Kernel version (from /proc/version):

Linux version 3.2.14-grsec-clean-sg1 (root@...tbed) (gcc version 4.1.2 
20080704 (Red Hat 4.1.2-52)) #2 SMP Wed Apr 11 03:57:50 CDT 2012

Note that the kernel is patched with grsec, but grsec is not enabled in
the kernel configuration.

[4.2.] Kernel .config file:

Attached

[5.] Most recent kernel version which did not have the bug:

2.6.28 :-)

[6.] Output of Oops.. message (if applicable) with symbolic information
      resolved (see Documentation/oops-tracing.txt)

Case 1:

------------[ cut here ]------------
WARNING: at kernel/watchdog.c:241 watchdog_overflow_callback+0x9a/0xa4()
Hardware name: X8DTL
Watchdog detected hard LOCKUP on cpu 1
Modules linked in: netconsole configfs ipv6 nf_nat_ftp nf_conntrack_ftp 
xt_length xt_pkttype xt_dscp xt_multiport xt_owner ipt_REDIRECT 
iptable_nat nf_nat iptable_mangle iptable_raw autofs4 lockd sunrpc 
nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack ip6_tables 
iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi dm_mirror 
dm_multipath acpi_pad acpi_ipmi ipmi_msghandler e1000e iTCO_wdt 
iTCO_vendor_support ioatdma dca i2c_i801 i7core_edac dm_region_hash 
dm_log usb_storage ata_piix aacraid uhci_hcd ohci_hcd ehci_hcd raid1 
md_mod [last unloaded: microcode]
Pid: 0, comm: swapper/1 Not tainted 3.2.14-grsec-clean-sg1 #2
Call Trace:
  [<c102f100>] ? vprintk+0x248/0x32b
  [<c1061ea7>] ? watchdog_overflow_callback+0x9a/0xa4
  [<c102e3ce>] warn_slowpath_common+0x75/0x8a
  [<c1061ea7>] ? watchdog_overflow_callback+0x9a/0xa4
  [<c1061e0d>] ? __touch_watchdog+0x16/0x16
  [<c102e45f>] warn_slowpath_fmt+0x2e/0x30
  [<c1061ea7>] watchdog_overflow_callback+0x9a/0xa4
  [<c106edd8>] __perf_event_overflow+0x131/0x1a4
  [<c100c0b5>] ? x86_perf_event_set_period+0x1e7/0x1f2
  [<c106ef6b>] perf_event_overflow+0x15/0x17
  [<c100fa78>] intel_pmu_handle_irq+0x1d1/0x221
  [<c1076ade>] ? __free_pages+0x1e/0x29
  [<c109cf0f>] ? __free_slab+0xd8/0xe0
  [<c109e1ed>] ? __slab_free+0x162/0x253
  [<c100bb9f>] perf_event_nmi_handler+0x16/0x1c
  [<c1004e3a>] nmi_handle+0x2e/0x49
  [<c1005083>] do_nmi+0x72/0x2af
  [<c1269650>] ? free_iova_mem+0xf/0x11
  [<c13025e9>] nmi_stack_correct+0x28/0x2d
  [<c13017ed>] ? _raw_spin_lock_irqsave+0x1b/0x24
  [<c126adac>] add_unmap+0x14/0x91
  [<c126cc27>] intel_unmap_sg+0xf6/0xfe
  [<c11fcfa7>] ? scsi_done+0xb/0xd
  [<c122ab02>] ? ata_scsi_qc_complete+0x2e7/0x2ef
  [<c126cb31>] ? intel_map_sg+0x1f0/0x1f0
  [<c122559a>] ata_sg_clean+0x6e/0x81
  [<c12255f6>] __ata_qc_complete+0x49/0xb4
  [<c12264ac>] ata_qc_complete+0x11e/0x131
  [<c12339a7>] ata_hsm_qc_complete+0xb5/0xbb
  [<c1234013>] ata_sff_hsm_move+0x666/0x6bd
  [<c1004e3a>] ? nmi_handle+0x2e/0x49
  [<c1005299>] ? do_nmi+0x288/0x2af
  [<c130007b>] ? __schedule+0x33e/0x836
  [<c1234107>] __ata_sff_port_intr+0x9d/0xa9
  [<c1234776>] ata_bmdma_port_intr+0x6d/0xce
  [<c1232957>] ata_bmdma_interrupt+0x71/0x14f
  [<c1062870>] handle_irq_event_percpu+0x25/0x110
  [<c106297f>] handle_irq_event+0x24/0x3b
  [<c1064908>] ? handle_simple_irq+0x4f/0x4f
  [<c106496e>] handle_fasteoi_irq+0x66/0x8c
  [<c1003c43>] handle_irq+0x6a/0x8c

Case 2:

------------[ cut here ]------------
WARNING: at kernel/watchdog.c:241 watchdog_overflow_callback+0x9a/0xa4()
Hardware name: X8DTL
Watchdog detected hard LOCKUP on cpu 5
Modules linked in: netconsole configfs nf_nat_ftp nf_conntrack_ftp 
xt_length xt_state xt_pkttype xt_dscp xt_multiport xt_owner ipt_REDIRECT 
iptable_nat nf_nat nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 
iptable_mangle iptable_raw ip6t_REJECT ip6table_filter ip6_tables ipv6 
iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi dm_mirror 
dm_multipath acpi_pad acpi_ipmi ipmi_msghandler e1000e ioatdma iTCO_wdt 
iTCO_vendor_support dca i7core_edac i2c_i801 dm_region_hash dm_log 
usb_storage ata_piix aacraid uhci_hcd ohci_hcd ehci_hcd raid1 md_mod
Pid: 0, comm: swapper/5 Not tainted 3.2.14-grsec-clean-sg1 #2
Call Trace:
  [<c102f100>] ? vprintk+0x248/0x32b
  [<c1061ea7>] ? watchdog_overflow_callback+0x9a/0xa4
  [<c102e3ce>] warn_slowpath_common+0x75/0x8a
  [<c1061ea7>] ? watchdog_overflow_callback+0x9a/0xa4
  [<c1061e0d>] ? __touch_watchdog+0x16/0x16
  [<c102e45f>] warn_slowpath_fmt+0x2e/0x30
  [<c1061ea7>] watchdog_overflow_callback+0x9a/0xa4
  [<c106edd8>] __perf_event_overflow+0x131/0x1a4
  [<c100c0b5>] ? x86_perf_event_set_period+0x1e7/0x1f2
  [<c106ef6b>] perf_event_overflow+0x15/0x17
  [<c100fa78>] intel_pmu_handle_irq+0x1d1/0x221
  [<c100fabb>] ? intel_pmu_handle_irq+0x214/0x221
  [<c109e1ed>] ? __slab_free+0x162/0x253
  [<c100bb9f>] perf_event_nmi_handler+0x16/0x1c
  [<c1004e3a>] nmi_handle+0x2e/0x49
  [<c1005083>] do_nmi+0x72/0x2af
  [<c1269650>] ? free_iova_mem+0xf/0x11
  [<c13025e9>] nmi_stack_correct+0x28/0x2d
  [<c13017ef>] ? _raw_spin_lock_irqsave+0x1d/0x24
  [<c126adac>] add_unmap+0x14/0x91
  [<c126cc27>] intel_unmap_sg+0xf6/0xfe
  [<c10558e3>] ? __smp_call_function_single+0x7d/0x83
  [<c126cb31>] ? intel_map_sg+0x1f0/0x1f0
  [<c12038cc>] scsi_dma_unmap+0x48/0x4f
  [<f82d7a15>] io_callback+0x62/0x145 [aacraid]
  [<f82dcc6b>] aac_intr_normal+0x1d3/0x25f [aacraid]
  [<f82ddd3a>] aac_rx_intr_message+0x60/0x98 [aacraid]
  [<c1062870>] handle_irq_event_percpu+0x25/0x110
  [<c130007b>] ? __schedule+0x33e/0x836
  [<c106297f>] handle_irq_event+0x24/0x3b
  [<c1064908>] ? handle_simple_irq+0x4f/0x4f
  [<c106496e>] handle_fasteoi_irq+0x66/0x8c
  [<c1003c43>] handle_irq+0x6a/0x8c
  <IRQ>  [<c1032d9c>] ? _local_bh_enable+0xd/0xf
  [<c1003541>] do_IRQ+0x36/0x9c
  [<c1302d29>] common_interrupt+0x29/0x30
  [<c104007b>] ? flush_workqueue_prep_cwqs+0x14b/0x14b
  [<c11cbbfa>] ? acpi_idle_enter_bm+0x237/0x26e
  [<c1260f9b>] cpuidle_idle_call+0x57/0x9f
  [<c100183c>] cpu_idle+0x47/0x6a
  [<c1521271>] start_secondary+0x1a5/0x1ab
---[ end trace 3aa9068f1323980e ]---
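
To compare traces like the two above, it can help to drop the frames
marked with '?', which the kernel prints when it found an address on
the stack but could not confirm it as part of the call chain. The
helper below is a small sketch for doing that; the regular expression
and the function name are my own and assume the 32-bit x86 trace format
shown in this report.

```python
import re

# Matches one call-trace frame, for example:
#   [<c126adac>] add_unmap+0x14/0x91
#   [<c1269650>] ? free_iova_mem+0xf/0x11
#   [<f82d7a15>] io_callback+0x62/0x145 [aacraid]
FRAME_RE = re.compile(
    r"\[<(?P<addr>[0-9a-f]+)>\]\s+(?P<guess>\?\s+)?"
    r"(?P<symbol>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?:\s+\[(?P<module>\w+)\])?"
)


def reliable_frames(trace_text):
    """Return the symbols of frames not marked '?', innermost first."""
    frames = []
    for match in FRAME_RE.finditer(trace_text):
        if match.group("guess") is None:
            frames.append(match.group("symbol"))
    return frames


sample = """
  [<c1005083>] do_nmi+0x72/0x2af
  [<c1269650>] ? free_iova_mem+0xf/0x11
  [<c13025e9>] nmi_stack_correct+0x28/0x2d
  [<c13017ef>] ? _raw_spin_lock_irqsave+0x1d/0x24
  [<c126adac>] add_unmap+0x14/0x91
  [<c126cc27>] intel_unmap_sg+0xf6/0xfe
  [<c12038cc>] scsi_dma_unmap+0x48/0x4f
  [<f82d7a15>] io_callback+0x62/0x145 [aacraid]
"""
print(reliable_frames(sample))
# ['do_nmi', 'nmi_stack_correct', 'add_unmap', 'intel_unmap_sg',
#  'scsi_dma_unmap', 'io_callback']
```

Applied to Case 1 and Case 2, the first unmarked frames below the NMI
entry are add_unmap and intel_unmap_sg in both, reached once from the
ata_piix interrupt path and once from the aacraid one, with
_raw_spin_lock_irqsave as the '?' entry directly above them. Whether
that points at the actual cause is not established here.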


[7.] A small shell script or example program which triggers the
      problem (if possible)

Unfortunately unavailable

[8.] Environment

[8.1.] Software (add the output of the ver_linux script here)

-----------------
Linux servername.com 3.2.14-grsec-clean-sg1 #2 SMP Wed Apr 11 03:57:50 
CDT 2012 i686 i686 i386 GNU/Linux

Gnu C                  4.1.2
Gnu make               3.81
binutils               2.17.50.0.6
util-linux             2.13-pre7
mount                  2.13-pre7
module-init-tools      3.3-pre2
e2fsprogs              1.39
pcmciautils            014
quota-tools            3.13.
PPP                    2.4.4
isdn4k-utils           3.9
Linux C Library        2.5
Dynamic linker (ldd)   2.5
Procps                 3.2.7
Net-tools              1.60
Kbd                    1.12
Sh-utils               5.97
udev                   095
Modules Loaded         hcpdriver nf_nat_ftp nf_conntrack_ftp xt_length 
xt_state xt_pkttype xt_dscp xt_multiport xt_owner ipt_REDIRECT 
iptable_nat nf_nat nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 
iptable_mangle iptable_raw ip6t_REJECT ip6table_filter ip6_tables ipv6 
iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi dm_mirror 
dm_region_hash dm_log dm_multipath acpi_pad acpi_ipmi ipmi_msghandler 
igb iTCO_wdt iTCO_vendor_support ioatdma i2c_i801 dca i7core_edac 
ata_piix aacraid uhci_hcd ohci_hcd ehci_hcd raid1 md_mod
-----------------

[8.2.] Processor information (from /proc/cpuinfo):

processor       : 0
vendor_id       : GenuineIntel
cpu family      : 6
model           : 44
model name      : Intel(R) Xeon(R) CPU           E5620  @ 2.40GHz
stepping        : 2
cpu MHz         : 2400.324
cache size      : 12288 KB
physical id     : 0
siblings        : 8
core id         : 0
cpu cores       : 4
apicid          : 0
fdiv_bug        : no
hlt_bug         : no
f00f_bug        : no
coma_bug        : no
fpu             : yes
fpu_exception   : yes
cpuid level     : 11
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge 
mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx 
pdpe1gb rdtscp lm constant_tsc ida nonstop_tsc arat pni monitor ds_cpl 
vmx smx est tm2 ssse3 cx16 xtpr sse4_1 sse4_2 popcnt lahf_lm [8]
bogomips        : 4800.64

[8.3.] Module information (from /proc/modules):

netconsole 5653 0 - Live 0xf8305000 0xf82dc000
configfs 18130 2 netconsole, Live 0xf84c4000 0xf8220000
nf_nat_ftp 1314 0 - Live 0xf8757000 0xf8755000
nf_conntrack_ftp 4724 1 nf_nat_ftp, Live 0xf874e000 0xf874c000
xt_length 914 1 - Live 0xf8740000 0xf873e000
xt_state 948 51 - Live 0xf8738000 0xf8736000
xt_pkttype 798 4 - Live 0xf8730000 0xf872e000
xt_dscp 1203 1 - Live 0xf86b8000 0xf86b6000
xt_multiport 1331 2 - Live 0xf86b0000 0xf86ae000
xt_owner 864 4 - Live 0xf86a8000 0xf86a6000
ipt_REDIRECT 883 1 - Live 0xf869a000 0xf8698000
iptable_nat 2983 1 - Live 0xf8690000 0xf868e000
nf_nat 13254 3 nf_nat_ftp,ipt_REDIRECT,iptable_nat, Live 0xf8683000 
0xf8681000
nf_conntrack_ipv4 9626 54 iptable_nat,nf_nat, Live 0xf8673000 0xf8671000
nf_conntrack 56362 6 
nf_nat_ftp,nf_conntrack_ftp,xt_state,iptable_nat,nf_nat,nf_conntrack_ipv4, 
Live 0xf8654000 0xf8652000
nf_defrag_ipv4 1123 1 nf_conntrack_ipv4, Live 0xf8638000 0xf8636000
iptable_mangle 1269 0 - Live 0xf862a000 0xf8628000
iptable_raw 1060 0 - Live 0xf861b000 0xf8619000
ip6t_REJECT 2091 1 - Live 0xf85ee000 0xf85ec000
ip6table_filter 1245 1 - Live 0xf85b3000 0xf85b1000
ip6_tables 12454 1 ip6table_filter, Live 0xf85a7000 0xf85a5000
ipv6 242902 45 ip6t_REJECT, Live 0xf8554000 0xf8550000
iscsi_tcp 7435 0 - Live 0xf84f3000 0xf84f1000
libiscsi_tcp 12208 1 iscsi_tcp, Live 0xf84e4000 0xf84e2000
libiscsi 33681 2 iscsi_tcp,libiscsi_tcp, Live 0xf84cc000 0xf84ca000
scsi_transport_iscsi 33760 4 iscsi_tcp,libiscsi, Live 0xf84ad000 0xf84ab000
dm_mirror 10932 0 - Live 0xf83f7000 0xf83f5000
dm_region_hash 7873 1 dm_mirror, Live 0xf83e8000 0xf83e6000
dm_log 7385 2 dm_mirror,dm_region_hash, Live 0xf83b9000 0xf83b7000
dm_multipath 12486 0 - Live 0xf839c000 0xf8350000
acpi_pad 4593 0 - Live 0xf8334000 0xf8332000
acpi_ipmi 2615 0 - Live 0xf8314000 0xf82cd000
ipmi_msghandler 28300 1 acpi_ipmi, Live 0xf831b000 0xf82c5000
e1000e 114447 0 - Live 0xf837f000 0xf829d000
ioatdma 31331 24 - Live 0xf8223000 0xf8210000
dca 5184 1 ioatdma, Live 0xf8356000 0xf8354000
i7core_edac 14595 0 - Live 0xf8348000 0xf8346000
i2c_i801 6703 0 - Live 0xf8339000 0xf8337000
iTCO_wdt 10354 0 - Live 0xf8328000 0xf8326000
iTCO_vendor_support 2309 1 iTCO_wdt, Live 0xf8319000 0xf8317000
raid1 21370 0 - Live 0xf830a000 0xf8308000
md_mod 90902 1 raid1, Live 0xf82e0000 0xf82de000
ata_piix 12086 1 - Live 0xf82b3000 0xf82b1000
aacraid 64508 3 - Live 0xf827e000 0xf827c000
uhci_hcd 15568 0 - Live 0xf825a000 0xf8258000
ohci_hcd 14656 0 - Live 0xf8248000 0xf8246000
ehci_hcd 28982 0 - Live 0xf822e000 0xf822c000

[8.4.] Loaded driver and hardware information (/proc/ioports, /proc/iomem)

0000-03af : PCI Bus 0000:00
   0000-001f : dma1
   0020-0021 : pic1
   0040-0043 : timer0
   0050-0053 : timer1
   0060-0060 : keyboard
   0064-0064 : keyboard
   0070-0071 : rtc0
   0080-008f : dma page reg
   00a0-00a1 : pic2
   00c0-00df : dma2
   00f0-00ff : fpu
   02f8-02ff : serial
03b0-03bb : PCI Bus 0000:00
03c0-03df : PCI Bus 0000:00
   03c0-03df : vga+
03e0-0cf7 : PCI Bus 0000:00
   03f8-03ff : serial
   0400-041f : 0000:00:1f.3
     0400-041f : i801_smbus
   04d0-04d1 : pnp 00:09
   0500-057f : pnp 00:09
   0800-087f : pnp 00:09
     0800-0803 : ACPI PM1a_EVT_BLK
     0804-0805 : ACPI PM1a_CNT_BLK
     0808-080b : ACPI PM_TMR
     0810-0815 : ACPI CPU throttle
     0820-082f : ACPI GPE0_BLK
     0830-0833 : iTCO_wdt
     0850-0850 : ACPI PM2_CNT_BLK
     0860-087f : iTCO_wdt
   0a10-0a1f : pnp 00:06
   0ca2-0ca3 : pnp 00:09
0cf8-0cff : PCI conf1
0d00-efff : PCI Bus 0000:00
   1000-1fff : PCI Bus 0000:05
   a400-a40f : 0000:00:1f.5
     a400-a40f : ata_piix
   a480-a48f : 0000:00:1f.5
     a480-a48f : ata_piix
   a800-a803 : 0000:00:1f.5
     a800-a803 : ata_piix
   a880-a887 : 0000:00:1f.5
     a880-a887 : ata_piix
   ac00-ac03 : 0000:00:1f.5
     ac00-ac03 : ata_piix
   b000-b007 : 0000:00:1f.5
     b000-b007 : ata_piix
   b400-b40f : 0000:00:1f.2
     b400-b40f : ata_piix
   b480-b48f : 0000:00:1f.2
     b480-b48f : ata_piix
   b800-b803 : 0000:00:1f.2
     b800-b803 : ata_piix
   b880-b887 : 0000:00:1f.2
     b880-b887 : ata_piix
   bc00-bc03 : 0000:00:1f.2
     bc00-bc03 : ata_piix
   c000-c007 : 0000:00:1f.2
     c000-c007 : ata_piix
   c080-c09f : 0000:00:1d.2
     c080-c09f : uhci_hcd
   c400-c41f : 0000:00:1d.1
     c400-c41f : uhci_hcd
   c480-c49f : 0000:00:1d.0
     c480-c49f : uhci_hcd
   c800-c81f : 0000:00:1a.2
     c800-c81f : uhci_hcd
   c880-c89f : 0000:00:1a.1
     c880-c89f : uhci_hcd
   cc00-cc1f : 0000:00:1a.0
     cc00-cc1f : uhci_hcd
   d000-dfff : PCI Bus 0000:06
     dc00-dc1f : 0000:06:00.0
   e000-efff : PCI Bus 0000:07
     ec00-ec1f : 0000:07:00.0
f000-ffff : PCI Bus 0000:00

[8.5.] PCI information ('lspci -vvv' as root)
root@...new:~# cat /proc/iomem
00000000-0000ffff : reserved
00010000-00095bff : System RAM
00095c00-0009ffff : reserved
000a0000-000bffff : PCI Bus 0000:00
   000a0000-000bffff : Video RAM area
000c0000-000c7fff : Video ROM
000c8000-000c8fff : Adapter ROM
000c9000-000cf7ff : Adapter ROM
000d0000-000dffff : PCI Bus 0000:00
000e4000-000fffff : reserved
   000f0000-000fffff : System ROM
00100000-bf77ffff : System RAM
   01000000-01306fff : Kernel code
   01481000-014eecbf : Kernel data
   0154b000-01598fff : Kernel bss
bf780000-bf78dfff : RAM buffer
bf78e000-bf78ffff : reserved
bf790000-bf79dfff : ACPI Tables
bf79e000-bf7cffff : ACPI Non-volatile Storage
bf7d0000-bf7dffff : reserved
bf7e0000-bf7ebfff : RAM buffer
bf7ec000-bfffffff : reserved
c0000000-dfffffff : PCI Bus 0000:00
   c0000000-c01fffff : PCI Bus 0000:07
   c0200000-c03fffff : PCI Bus 0000:06
   c0400000-c05fffff : PCI Bus 0000:05
   c0600000-c07fffff : PCI Bus 0000:05
e0000000-efffffff : PCI MMCONFIG 0000 [bus 00-ff]
   e0000000-efffffff : reserved
     e0000000-efffffff : pnp 00:0c
f0000000-fed8ffff : PCI Bus 0000:00
   f9000000-f9ffffff : PCI Bus 0000:08
     f9000000-f9ffffff : 0000:08:01.0
   fab00000-fadfffff : PCI Bus 0000:01
     fab80000-fabfffff : 0000:01:00.0
     fac00000-fadfffff : 0000:01:00.0
   faf00000-fb7fffff : PCI Bus 0000:08
     faffc000-faffffff : 0000:08:01.0
     fb000000-fb7fffff : 0000:08:01.0
   fbc00000-fbcfffff : PCI Bus 0000:06
     fbcdc000-fbcdffff : 0000:06:00.0
       fbcdc000-fbcdffff : e1000e
     fbce0000-fbcfffff : 0000:06:00.0
       fbce0000-fbcfffff : e1000e
   fbd00000-fbdfffff : PCI Bus 0000:07
     fbddc000-fbddffff : 0000:07:00.0
       fbddc000-fbddffff : e1000e
     fbde0000-fbdfffff : 0000:07:00.0
       fbde0000-fbdfffff : e1000e
   fbed2000-fbed20ff : 0000:00:1f.3
   fbed4000-fbed7fff : 0000:00:1b.0
   fbed8000-fbed83ff : 0000:00:1d.7
     fbed8000-fbed83ff : ehci_hcd
   fbeda000-fbeda3ff : 0000:00:1a.7
     fbeda000-fbeda3ff : ehci_hcd
   fbedc000-fbedffff : 0000:00:16.7
     fbedc000-fbedffff : ioatdma
   fbee0000-fbee3fff : 0000:00:16.6
     fbee0000-fbee3fff : ioatdma
   fbee4000-fbee7fff : 0000:00:16.5
     fbee4000-fbee7fff : ioatdma
   fbee8000-fbeebfff : 0000:00:16.4
     fbee8000-fbeebfff : ioatdma
   fbeec000-fbeeffff : 0000:00:16.3
     fbeec000-fbeeffff : ioatdma
   fbef0000-fbef3fff : 0000:00:16.2
     fbef0000-fbef3fff : ioatdma
   fbef4000-fbef7fff : 0000:00:16.1
     fbef4000-fbef7fff : ioatdma
   fbef8000-fbefbfff : 0000:00:16.0
     fbef8000-fbefbfff : ioatdma
   fec00000-fec003ff : IOAPIC 0
   fec8a000-fec8afff : 0000:00:13.0
     fec8a000-fec8a3ff : IOAPIC 1
   fed00000-fed003ff : HPET 0
   fed1c000-fed1ffff : pnp 00:01
     fed1c000-fed1ffff : pnp 00:09
   fed20000-fed3ffff : pnp 00:09
   fed40000-fed8ffff : pnp 00:09
fee00000-fee00fff : Local APIC
   fee00000-fee00fff : reserved
     fee00000-fee00fff : pnp 00:0b
ffc00000-ffffffff : reserved
100000000-43fffffff : System RAM

[8.6.] SCSI information (from /proc/scsi/scsi)

Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
   Vendor: Adaptec  Model: 5405Z RAID10     Rev: V1.0
   Type:   Direct-Access                    ANSI  SCSI revision: 02
Host: scsi0 Channel: 01 Id: 00 Lun: 00
   Vendor: WDC      Model: WD1003FBYX-0     Rev: 01.0
   Type:   Direct-Access                    ANSI  SCSI revision: 05
Host: scsi0 Channel: 01 Id: 01 Lun: 00
   Vendor: WDC      Model: WD1003FBYX-0     Rev: 01.0
   Type:   Direct-Access                    ANSI  SCSI revision: 05
Host: scsi0 Channel: 01 Id: 02 Lun: 00
   Vendor: WDC      Model: WD1003FBYX-0     Rev: 01.0
   Type:   Direct-Access                    ANSI  SCSI revision: 05
Host: scsi0 Channel: 01 Id: 03 Lun: 00
   Vendor: WDC      Model: WD1002FBYS-0     Rev: 03.0
   Type:   Direct-Access                    ANSI  SCSI revision: 05
Host: scsi1 Channel: 00 Id: 00 Lun: 00
   Vendor: ATA      Model: INTEL SSDSA2CW12 Rev: 4PC1
   Type:   Direct-Access                    ANSI  SCSI revision: 05

[8.7.] Other information that might be relevant to the problem
        (please look in /proc and include all information that you
        think to be relevant):

None

[X.] Other notes, patches, fixes, workarounds:

Freezes like the ones described above have been ongoing for us since
2.6.28. Every kernel version we have tried _after_ 2.6.28 causes
sporadic machine restarts and hard lockups, and in 99% of the cases
_NO_ output is logged to /var/log/messages. This applies to 2.6.32,
2.6.38.6, 3.2.5 and 3.2.9. Sadly, we are still struggling to find a
stable kernel for our servers after 2.6.28.

Not sure if this is going to help, but before we tested whether 3.2.14
would work correctly for us, we used 3.2.5 and experienced similar
lockups on a similar hardware configuration.

------------[ cut here ]------------
WARNING: at kernel/watchdog.c:241 watchdog_overflow_callback+0x97/0xa3()
Watchdog detected hard LOCKUP on cpu 1
Modules linked in: netconsole cryptoloop configfs nf_nat_ftp 
nf_conntrack_ftp xt_length ipt_REJECT xt_state xt_pkttype xt_dscp 
xt_multiport xt_owner iptable_filter ipt_REDIRECT iptable_nat nf_nat 
nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 iptable_mangle iptable_raw 
ip_tables iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi xt_set 
ip_set nfnetlink dm_mirror dm_region_hash dm_log dm_multipath dm_mod 
thermal pci_slot hed acpi_pad sg usbhid evdev button processor e1000e 
ioatdma iTCO_wdt iTCO_vendor_support i7core_edac dca edac_core ata_piix 
libata 3w_sas sd_mod scsi_mod uhci_hcd ohci_hcd ehci_hcd [last unloaded: 
netconsole]
Pid: 0, comm: swapper/1 Not tainted 3.2.5-grsec-sg6 #1
Call Trace:
  [<0005c011>] ? watchdog_overflow_callback+0x97/0xa3
  [<0002ac89>] ? warn_slowpath_common+0x5d/0x72
  [<0005c011>] ? watchdog_overflow_callback+0x97/0xa3
  [<0005bf7a>] ? __touch_watchdog+0x12/0x12
  [<0002ad26>] ? warn_slowpath_fmt+0x33/0x37
  [<0005c011>] ? watchdog_overflow_callback+0x97/0xa3
  [<00068454>] ? __perf_event_overflow+0x141/0x1bb
  [<0000ffff>] ? intel_pmu_enable_event+0x142/0x1d0
  [<000686c4>] ? perf_event_overflow+0x12/0x14
  [<0001079c>] ? intel_pmu_handle_irq+0x1c3/0x213
  [<0000cfaa>] ? perf_event_nmi_handler+0x13/0x18
  [<00005f83>] ? nmi_handle+0x2c/0x48
  [<000061db>] ? do_nmi+0x9d/0x2de
  [<0021b15c>] ? dma_pfn_level_pte+0x7b/0x93
  [<0021c4ca>] ? dma_pte_free_pagetable+0x74/0x1d6
  [<0029a625>] ? nmi_stack_correct+0x34/0x3e
  [<00004642>] ? arch_show_interrupts+0xab/0x5a7
  [<00210068>] ? write_page+0x1e8/0x3a1
  [<00299533>] ? _raw_spin_lock_irqsave+0x18/0x20
  [<0021cad1>] ? add_unmap+0x11/0x8c
  [<0021e531>] ? intel_map_sg+0x1de/0x1de
  [<002bead5>] ? scsi_dma_unmap+0x45/0x4c [scsi_mod]
  [<00750001>] ? 0x750000
  [<002efef7>] ? twl_interrupt+0x524/0x54d [3w_sas]
  [<000063f7>] ? do_nmi+0x2b9/0x2de
  [<00092724>] ? compaction_alloc+0xa5/0x219
  [<000713ea>] ? show_free_areas+0x5db/0x6e4
  [<0005c9c3>] ? handle_irq_event_percpu+0x24/0x110
  [<00006161>] ? do_nmi+0x23/0x2de
  [<0005e940>] ? handle_edge_irq+0xa1/0xa1
  [<0005cad0>] ? handle_irq_event+0x21/0x37
  [<0005e940>] ? handle_edge_irq+0xa1/0xa1
  [<0005e9a3>] ? handle_fasteoi_irq+0x63/0x7b
  [<00004c97>] ? handle_irq+0x6a/0x98
  <IRQ>  [<00004bbf>] ? do_IRQ+0x31/0x8a
  [<000143e4>] ? smp_apic_timer_interrupt+0x61/0x6d
  [<0029ae35>] ? common_interrupt+0x35/0x40
  [<00013791>] ? native_machine_shutdown+0x3b/0x72
  [<0000488b>] ? arch_show_interrupts+0x2f4/0x5a7
  [<0000488b>] ? arch_show_interrupts+0x2f4/0x5a7
  [<000100d8>] ? intel_pmu_enable_all+0x4b/0xd1
  [<002123bf>] ? need_resched+0xc/0x10
  [<0021252c>] ? poll_idle+0x1e/0x6b
  [<002124c6>] ? cpuidle_idle_call+0x53/0x9b
  [<00002a8c>] ? cpu_idle+0x44/0x6e
---[ end trace 7c5429b10ec8a603 ]---
------------[ cut here ]------------
WARNING: at kernel/watchdog.c:241 watchdog_overflow_callback+0x97/0xa3()

Not sure if the above information will help you track this issue down,
but any assistance will be greatly appreciated. If there is any other
information we can provide, let us know. Unfortunately, the machines
where these issues occur are in production, so enabling too many
debugging options and frequent reboots are not a good solution :(

Best regards,

vaLentin

View attachment "config-3.2.14-grsec-clean-sg1" of type "text/plain" (66544 bytes)
