lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Date:	Tue, 22 Apr 2014 14:02:06 +0200
From:	Raphaël Bauduin <rblists@...il.com>
To:	linux-kernel@...r.kernel.org
Subject: Problem:  Access to out of range physical memory when handling paging
 request (kernel 2.6.32.61)

Hi,

I'm getting regular kernel paging request errors on a server running a vanilla kernel 2.6.32.61. Since 7 march, there have been at least 11 such 
errors. We have not been able to identify exact circumstances in which this bug happens. I hope this is a right place to report these kind of 
problems (in addition to https://bugzilla.kernel.org/show_bug.cgi?id=73991).

Here is an example:

BUG: unable to handle kernel paging request at ffff8804c001fb39
IP: [<ffffffff810bdbf3>] page_evictable+0x25/0x81
PGD 1002063 PUD 0
Oops: 0000 [#1] SMP
last sysfs file: /sys/devices/system/cpu/cpu15/topology/thread_siblings
CPU 6
Modules linked in: ipt_MASQUERADE iptable_nat nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT xt_tcpudp kvm_amd kvm 
ip6table_filter ip6_tables iptable_filter ip_tables x_tables tun nfsd exportfs nfs lockd fscache nfs_acl auth_rpcgss sunrpc bridge stp bonding 
dm_round_robin dm_multipath scsi_dh loop snd_pcm snd_timer snd soundcore snd_page_alloc serio_raw evdev tpm_tis tpm tpm_bios psmouse pcspkr 
button amd64_edac_mod edac_core edac_mce_amd container i2c_piix4 shpchp pci_hotplug i2c_core processor ext3 jbd mbcache dm_mirror dm_region_hash 
dm_log dm_snapshot dm_mod sd_mod crc_t10dif lpfc mptsas scsi_transport_fc mptscsih mptbase scsi_tgt ehci_hcd scsi_transport_sas tg3 ohci_hcd 
libphy scsi_mod usbcore nls_base thermal fan thermal_sys [last unloaded: scsi_wait_scan]
Pid: 185, comm: kswapd0 Not tainted 2.6.32.61vanilla #1 PRIMERGY BX630 S2
RIP: 0010:[<ffffffff810bdbf3>]  [<ffffffff810bdbf3>] page_evictable+0x25/0x81
RSP: 0018:ffff880416a3ba80  EFLAGS: 00010282
RAX: ffff8804c001fad6 RBX: ffffea000ba17088 RCX: 0000000000000020
RDX: 020000000002004c RSI: 0000000000000000 RDI: ffffea000ba17088
RBP: ffff880000015c80 R08: ffff880426458f00 R09: ffff880000015c80
R10: 0000000000000002 R11: ffff8800451e9850 R12: ffffea000ba170b0
R13: 0000000000000000 R14: 0000000000000001 R15: 0000000000000020
FS:  00007f9e11b07820(0000) GS:ffff88000fcc0000(0000) knlGS:0000000000000000
CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
CR2: ffff8804c001fb39 CR3: 00000001ac27d000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process kswapd0 (pid: 185, threadinfo ffff880416a3a000, task ffff8804150f4600)
Stack:
  0000000000000020 ffffffff810bec01 0000000000000000 ffffea0000000001
<0> ffffea0000000001 ffffea0005cdd6f8 0000000000000009 ffff880416a3bb00
<0> ffff880416a3be10 ffff8800000170c0 0000001500000000 ffff8800000170c0
Call Trace:
  [<ffffffff810bec01>] ? shrink_active_list+0x19e/0x2d9
  [<ffffffff810bedfb>] ? shrink_list+0xbf/0x767
  [<ffffffff810bb3c9>] ? determine_dirtyable_memory+0xd/0x1d
  [<ffffffff810bb441>] ? get_dirty_limits+0x1d/0x259
  [<ffffffff810bf723>] ? shrink_zone+0x280/0x342
  [<ffffffff810c0148>] ? kswapd+0x4b9/0x683
  [<ffffffff810bd7df>] ? isolate_pages_global+0x0/0x20f
  [<ffffffff810651de>] ? autoremove_wake_function+0x0/0x2e
  [<ffffffff810bfc8f>] ? kswapd+0x0/0x683
  [<ffffffff81064f11>] ? kthread+0x79/0x81
  [<ffffffff81011baa>] ? child_rip+0xa/0x20
  [<ffffffff81064e98>] ? kthread+0x0/0x81
  [<ffffffff81011ba0>] ? child_rip+0x0/0x20
Code: 81 e9 83 2a 16 00 48 83 ec 08 48 8b 17 48 8b 47 18 f7 c2 00 00 01 00 74 09 48 c7 c0 80 9f 46 81 eb 09 a8 01 75 0b 48 85 c0 74 06 <f6> 40 
63 02 75 4b f7 c2 00 00 40 00 75 43 48 85 f6 74 42 48 8b
RIP  [<ffffffff810bdbf3>] page_evictable+0x25/0x81
  RSP <ffff880416a3ba80>
CR2: ffff8804c001fb39
---[ end trace 41d830aa2ac6f872 ]---

The reported address
   ffff8804c001fb39
is according to https://www.kernel.org/doc/Documentation/x86/x86_64/mm.txt
a direct mapping to physical memory, but it is located at about 19.5Gb, which is over the total physical memory.

Addresses reported in these 10 occurences of the error are:
00000000c001fade
00000000c001fb06
ffff8804c001fade
ffff8804c001fb2e
ffff8804c001fb39

The server has 16Gb of RAM, 32Gb of swap. The server is mainly used to run KVM guest instances (10) and the total memory configured for KVM 
instances is 13.5Gb.

Below are further details (and also here: https://bugzilla.kernel.org/show_bug.cgi?id=73991 where other logs are provided )

Let me know if I should provide more info

Raphaël


~# cat /proc/version
Linux version 2.6.32.61vanilla (root@...ave01) (gcc version 4.3.2 (Debian 4.3.2-1.1) ) #1 SMP Fri Mar 7 13:29:11 CET 2014

# ./ver_linux
If some fields are empty or look unusual you may have an old version.
Compare to the current minimal requirements in Documentation/Changes.

Linux sMaster01 2.6.32.61vanilla #1 SMP Fri Mar 7 13:29:11 CET 2014 x86_64 GNU/Linux

Gnu C                  4.3.2
Gnu make               3.81
binutils               2.18.0.20080103
util-linux             2.13.1.1
mount                  2.13.1.1
module-init-tools      3.4
e2fsprogs              1.41.3
Linux C Library        2.7
Dynamic linker (ldd)   2.7
Procps                 3.2.7
Net-tools              1.60
Console-tools          0.2.3
Sh-utils               6.10
udev                   125
Modules Loaded         ipt_MASQUERADE iptable_nat nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT xt_tcpudp kvm_amd 
kvm ip6table_filter ip6_tables iptable_filter ip_tables x_tables tun nfsd exportfs nfs lockd fscache nfs_acl auth_rpcgss sunrpc bridge stp 
bonding dm_round_robin dm_multipath scsi_dh loop snd_pcm snd_timer snd soundcore serio_raw snd_page_alloc tpm_tis psmouse tpm tpm_bios pcspkr 
evdev amd64_edac_mod edac_core edac_mce_amd button container i2c_piix4 processor shpchp pci_hotplug i2c_core ext3 jbd mbcache dm_mirror 
dm_region_hash dm_log dm_snapshot dm_mod sd_mod crc_t10dif mptsas lpfc mptscsih scsi_transport_fc ehci_hcd mptbase scsi_tgt scsi_transport_sas 
tg3 ohci_hcd libphy scsi_mod usbcore nls_base thermal fan thermal_sys

There are 16 cores, here's an extract:
# cat /proc/cpuinfo
processor       : 0
vendor_id       : AuthenticAMD
cpu family      : 16
model           : 4
model name      : Quad-Core AMD Opteron(tm) Processor 8380
stepping        : 2
cpu MHz         : 2500.037
cache size      : 512 KB
physical id     : 0
siblings        : 4
core id         : 0
cpu cores       : 4
apicid          : 4
initial apicid  : 0
fpu             : yes
fpu_exception   : yes
cpuid level     : 5
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext 
fxsr_opt pdpe1gb rdtscp lm 3dnowext 3dnow constant_tsc rep_good nonstop_tsc extd_apicid pni monitor cx16 popcnt lahf_lm cmp_legacy svm extapic 
cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt
bogomips        : 5000.06
TLB size        : 1024 4K pages
clflush size    : 64
cache_alignment : 64
address sizes   : 48 bits physical, 48 bits virtual
power management: ts ttp tm stc 100mhzsteps hwpstate

# cat /proc/modules
ipt_MASQUERADE 1554 0 - Live 0xffffffffa01d2000
iptable_nat 4203 0 - Live 0xffffffffa0185000
nf_nat 13100 2 ipt_MASQUERADE,iptable_nat, Live 0xffffffffa01b3000
nf_conntrack_ipv4 9849 3 iptable_nat,nf_nat, Live 0xffffffffa0171000
nf_defrag_ipv4 1139 1 nf_conntrack_ipv4, Live 0xffffffffa0160000
xt_conntrack 2407 0 - Live 0xffffffffa0095000
nf_conntrack 45959 5 ipt_MASQUERADE,iptable_nat,nf_nat,nf_conntrack_ipv4,xt_conntrack, Live 0xffffffffa029e000
ipt_REJECT 1953 0 - Live 0xffffffffa007b000
xt_tcpudp 2319 0 - Live 0xffffffffa001c000
kvm_amd 31620 36 - Live 0xffffffffa014f000
kvm 214143 1 kvm_amd, Live 0xffffffffa045a000
ip6table_filter 2384 0 - Live 0xffffffffa0457000
ip6_tables 15091 1 ip6table_filter, Live 0xffffffffa044d000
iptable_filter 2258 0 - Live 0xffffffffa0447000
ip_tables 13915 2 iptable_nat,iptable_filter, Live 0xffffffffa043d000
x_tables 12845 7 ipt_MASQUERADE,iptable_nat,xt_conntrack,ipt_REJECT,xt_tcpudp,ip6_tables,ip_tables, Live 0xffffffffa0431000
tun 10748 22 - Live 0xffffffffa0428000
nfsd 252542 13 - Live 0xffffffffa03db000
exportfs 3074 1 nfsd, Live 0xffffffffa03d5000
nfs 239354 0 - Live 0xffffffffa0386000
lockd 57267 2 nfsd,nfs, Live 0xffffffffa036f000
fscache 29162 1 nfs, Live 0xffffffffa035d000
nfs_acl 2031 2 nfsd,nfs, Live 0xffffffffa0357000
auth_rpcgss 33316 2 nfsd,nfs, Live 0xffffffffa0346000
sunrpc 160476 21 nfsd,nfs,lockd,nfs_acl,auth_rpcgss, Live 0xffffffffa0307000
bridge 39278 0 - Live 0xffffffffa02f4000
stp 1440 1 bridge, Live 0xffffffffa02ee000
bonding 73895 0 - Live 0xffffffffa02d1000
dm_round_robin 2228 12 - Live 0xffffffffa02cb000
dm_multipath 13088 7 dm_round_robin, Live 0xffffffffa02c1000
scsi_dh 4640 1 dm_multipath, Live 0xffffffffa02b9000
loop 11703 0 - Live 0xffffffffa02b0000
snd_pcm 60487 0 - Live 0xffffffffa028d000
snd_timer 15534 1 snd_pcm, Live 0xffffffffa0282000
snd 46430 2 snd_pcm,snd_timer, Live 0xffffffffa026a000
soundcore 4598 1 snd, Live 0xffffffffa0262000
serio_raw 3752 0 - Live 0xffffffffa025c000
snd_page_alloc 6089 1 snd_pcm, Live 0xffffffffa0254000
tpm_tis 7448 0 - Live 0xffffffffa024c000
psmouse 49681 0 - Live 0xffffffffa0237000
tpm 10023 1 tpm_tis, Live 0xffffffffa022d000
tpm_bios 4521 1 tpm, Live 0xffffffffa0226000
pcspkr 1699 0 - Live 0xffffffffa0220000
evdev 7352 2 - Live 0xffffffffa0218000
amd64_edac_mod 13710 0 - Live 0xffffffffa020d000
edac_core 29261 6 amd64_edac_mod, Live 0xffffffffa01fb000
edac_mce_amd 6337 1 amd64_edac_mod, Live 0xffffffffa01f4000
button 4650 0 - Live 0xffffffffa01ec000
container 2389 0 - Live 0xffffffffa0080000
i2c_piix4 8328 0 - Live 0xffffffffa01e4000
processor 29727 0 - Live 0xffffffffa01d4000
shpchp 26248 0 - Live 0xffffffffa01c6000
pci_hotplug 21587 1 shpchp, Live 0xffffffffa01b9000
i2c_core 15739 1 i2c_piix4, Live 0xffffffffa01ad000
ext3 105686 3 - Live 0xffffffffa0188000
jbd 36877 1 ext3, Live 0xffffffffa0175000
mbcache 5050 1 ext3, Live 0xffffffffa016d000
dm_mirror 10827 0 - Live 0xffffffffa0164000
dm_region_hash 6584 1 dm_mirror, Live 0xffffffffa015c000
dm_log 7381 2 dm_mirror,dm_region_hash, Live 0xffffffffa00cb000
dm_snapshot 18289 0 - Live 0xffffffffa003f000
dm_mod 53338 35 dm_multipath,dm_mirror,dm_log,dm_snapshot, Live 0xffffffffa012b000
sd_mod 29657 15 - Live 0xffffffffa0145000
crc_t10dif 1276 1 sd_mod, Live 0xffffffffa000c000
mptsas 28916 2 - Live 0xffffffffa013b000
lpfc 360864 24 - Live 0xffffffffa00d0000
mptscsih 16360 1 mptsas, Live 0xffffffffa0088000
scsi_transport_fc 35131 1 lpfc, Live 0xffffffffa0070000
ehci_hcd 31457 0 - Live 0xffffffffa00c1000
mptbase 48190 2 mptsas,mptscsih, Live 0xffffffffa00b3000
scsi_tgt 8466 1 scsi_transport_fc, Live 0xffffffffa0005000
scsi_transport_sas 19673 1 mptsas, Live 0xffffffffa0046000
tg3 95311 0 - Live 0xffffffffa0099000
ohci_hcd 19071 0 - Live 0xffffffffa008e000
libphy 13446 1 tg3, Live 0xffffffffa0082000
scsi_mod 126117 8 scsi_dh,sd_mod,mptsas,lpfc,mptscsih,scsi_transport_fc,scsi_tgt,scsi_transport_sas, Live 0xffffffffa004f000
usbcore 122727 3 ehci_hcd,ohci_hcd, Live 0xffffffffa001f000
nls_base 6457 1 usbcore, Live 0xffffffffa0018000
thermal 11674 0 - Live 0xffffffffa000f000
fan 3346 0 - Live 0xffffffffa0009000
thermal_sys 11942 3 processor,thermal,fan, Live 0xffffffffa0000000

# cat /proc/ioports
0000-3fff : PCI Bus #00
   0000-001f : dma1
   0020-0021 : pic1
   0040-0043 : timer0
   0050-0053 : timer1
   0060-0060 : keyboard
   0064-0064 : keyboard
   0070-0071 : rtc0
   0080-008f : dma page reg
   00a0-00a1 : pic2
   00c0-00df : dma2
   00f0-00ff : fpu
   0100-0101 : pnp 00:09
   0200-020f : pnp 00:09
   02f8-02ff : serial
   03c0-03df : vga+
   03f8-03ff : serial
   040b-040b : pnp 00:09
   04d0-04d1 : pnp 00:09
   04d6-04d6 : pnp 00:09
   0500-0560 : pnp 00:09
     0500-0503 : ACPI PM1a_EVT_BLK
     0504-0505 : ACPI PM1a_CNT_BLK
     0508-050b : ACPI PM_TMR
     050c-0511 : ACPI CPU throttle
     0514-051b : ACPI GPE1_BLK
     0540-0543 : ACPI PM1b_EVT_BLK
     0544-0545 : ACPI PM1b_CNT_BLK
     0550-0557 : ACPI GPE0_BLK
     0558-055b : pnp 00:09
   0580-058f : pnp 00:09
     0580-0587 : piix4_smbus
   0590-0593 : pnp 00:09
   0600-061f : pnp 00:09
   0620-0623 : pnp 00:09
   0624-067f : pnp 00:09
   06c0-06c0 : pnp 00:09
   0700-0703 : pnp 00:09
   0c00-0c01 : pnp 00:09
   0c06-0c08 : pnp 00:09
   0c14-0c14 : pnp 00:09
   0c49-0c4a : pnp 00:09
   0c50-0c53 : pnp 00:09
   0c6c-0c6c : pnp 00:09
   0c6f-0c6f : pnp 00:09
   0ca4-0ca5 : pnp 00:09
   0cd6-0cd7 : pnp 00:09
   0cf8-0cff : PCI conf1
   0e00-0e7f : pnp 00:09
   0f00-0f7f : pnp 00:09
     0f50-0f58 : pnp 00:09
   1000-10ff : 0000:00:03.0
   1400-14ff : 0000:00:03.1
   1800-18ff : 0000:00:03.2
   2000-2fff : PCI Bus 0000:03
     2000-20ff : 0000:03:00.0
   3000-3fff : PCI Bus 0000:04
     3000-30ff : 0000:04:00.0
     3400-34ff : 0000:04:00.1
4000-5fff : PCI Bus #10
   4000-4fff : PCI Bus 0000:11
     4000-40ff : 0000:11:00.0
   5000-5fff : PCI Bus 0000:12
     5000-50ff : 0000:12:00.0
     5400-54ff : 0000:12:00.1
6000-ffff : PCI Bus #00

# cat /proc/iomem
00000000-0000ffff : reserved
00010000-0009c7ff : System RAM
0009c800-0009ffff : reserved
000a0000-000bffff : PCI Bus #00
000d0000-000fffff : reserved
00100000-d7f3ffff : System RAM
   01000000-01303b04 : Kernel code
   01303b05-014d0d8f : Kernel data
   0156f000-01680403 : Kernel bss
   20000000-23ffffff : GART
d7f40000-d7f53fff : ACPI Tables
d7f54000-d7f7ffff : ACPI Non-volatile Storage
d7f80000-d7ffffff : reserved
d8000000-df0fffff : PCI Bus #00
   d8000000-d81fffff : PCI Bus 0000:03
     d8000000-d81fffff : 0000:03:00.0
   d8200000-d82fffff : PCI Bus 0000:04
     d8200000-d823ffff : 0000:04:00.0
     d8240000-d827ffff : 0000:04:00.1
   dd000000-ddffffff : 0000:00:05.0
   de000000-de7fffff : 0000:00:05.0
   dea00000-decfffff : PCI Bus 0000:03
     dec00000-dec0ffff : 0000:03:00.0
       dec00000-dec0ffff : mpt
     dec10000-dec13fff : 0000:03:00.0
       dec10000-dec13fff : mpt
   ded00000-dedfffff : PCI Bus 0000:04
     ded80000-ded80fff : 0000:04:00.0
       ded80000-ded80fff : lpfc
     ded81000-ded81fff : 0000:04:00.1
       ded81000-ded81fff : lpfc
     ded82000-ded820ff : 0000:04:00.0
       ded82000-ded820ff : lpfc
     ded82400-ded824ff : 0000:04:00.1
       ded82400-ded824ff : lpfc
   dee00000-deefffff : PCI Bus 0000:08
     dee00000-deefffff : PCI Bus 0000:09
       dee00000-dee0ffff : 0000:09:04.0
         dee00000-dee0ffff : tg3
       dee10000-dee1ffff : 0000:09:04.0
         dee10000-dee1ffff : tg3
       dee20000-dee2ffff : 0000:09:04.1
         dee20000-dee2ffff : tg3
       dee30000-dee3ffff : 0000:09:04.1
         dee30000-dee3ffff : tg3
   def00000-deffffff : PCI Bus 0000:0a
     def00000-deffffff : PCI Bus 0000:0b
       def00000-def0ffff : 0000:0b:04.0
         def00000-def0ffff : tg3
       def10000-def1ffff : 0000:0b:04.0
         def10000-def1ffff : tg3
       def20000-def2ffff : 0000:0b:04.1
         def20000-def2ffff : tg3
       def30000-def3ffff : 0000:0b:04.1
         def30000-def3ffff : tg3
   df000000-df003fff : 0000:00:05.0
   df005000-df005fff : 0000:00:03.0
     df005000-df005fff : ohci_hcd
   df006000-df006fff : 0000:00:03.1
     df006000-df006fff : ohci_hcd
   df007000-df007fff : 0000:00:03.2
     df007000-df007fff : ehci_hcd
df100000-df8fffff : PCI Bus #10
   df100000-df1fffff : PCI Bus 0000:12
     df100000-df13ffff : 0000:12:00.0
     df140000-df17ffff : 0000:12:00.1
     df180000-df180fff : 0000:12:00.0
       df180000-df180fff : lpfc
     df181000-df181fff : 0000:12:00.1
       df181000-df181fff : lpfc
     df182000-df1820ff : 0000:12:00.0
       df182000-df1820ff : lpfc
     df182400-df1824ff : 0000:12:00.1
       df182400-df1824ff : lpfc
   df200000-df3fffff : PCI Bus 0000:11
     df200000-df3fffff : 0000:11:00.0
   df400000-df6fffff : PCI Bus 0000:11
     df600000-df60ffff : 0000:11:00.0
       df600000-df60ffff : mpt
     df610000-df613fff : 0000:11:00.0
       df610000-df613fff : mpt
   df700000-df7fffff : PCI Bus 0000:16
     df700000-df7fffff : PCI Bus 0000:17
       df700000-df70ffff : 0000:17:04.0
         df700000-df70ffff : tg3
       df710000-df71ffff : 0000:17:04.0
         df710000-df71ffff : tg3
       df720000-df72ffff : 0000:17:04.1
         df720000-df72ffff : tg3
       df730000-df73ffff : 0000:17:04.1
         df730000-df73ffff : tg3
   df800000-df8fffff : PCI Bus 0000:18
     df800000-df8fffff : PCI Bus 0000:19
       df800000-df80ffff : 0000:19:04.0
         df800000-df80ffff : tg3
       df810000-df81ffff : 0000:19:04.0
         df810000-df81ffff : tg3
       df820000-df82ffff : 0000:19:04.1
         df820000-df82ffff : tg3
       df830000-df83ffff : 0000:19:04.1
         df830000-df83ffff : tg3
df900000-dfffffff : PCI Bus #00
e0000000-efffffff : reserved
   e0000000-efffffff : pnp 00:09
     e0000000-e18fffff : PCI MMCONFIG 0 [00-18]
f0000000-fecfffff : PCI Bus #00
   fec00000-fec01fff : reserved
     fec00000-fec00fff : IOAPIC 0
     fec01000-fec01fff : IOAPIC 1
fed00000-ffffffff : PCI Bus #00
   fed00000-fed003ff : HPET 0
     fed00000-fed003ff : reserved
   fee00000-fee00fff : Local APIC
     fee00000-fee00fff : reserved
   fff80000-ffffffff : reserved
100000000-427ffffff : System RAM
428000000-fcffffffff : PCI Bus #00

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ