linux-kernel - howto troubleshoot 'kernel paging request' oops

lists.openwall.net		lists / announce owl-users owl-dev john-users john-dev passwdqc-users yescrypt popa3d-users / oss-security kernel-hardening musl sabotage tlsify passwords / crypt-dev xvendor / Bugtraq Full-Disclosure linux-kernel linux-netdev linux-ext4 linux-hardening linux-cve-announce PHC
Open Source and information security mailing list archives
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Message-ID: <CAF_e0cbZTD1iRX1=ZPpkH5pwDSMGOyWC=_swDgLu=N+hY=g5wg@mail.gmail.com>
Date:	Wed, 20 Jun 2012 20:42:17 -0500
From:	Richard Langly <richard.ringo.langly@...il.com>
To:	linux-kernel@...r.kernel.org
Subject: howto troubleshoot 'kernel paging request' oops

I've got a problem that I have been unable to trouble-shoot in my
kernel module for going on a few weeks now and running out of ideas.
I'm developing a networking KM and testing on two VM guests. The KM's
on both guests seem to run fine as I transfer a simple messages from
one node to the other.

So after sending from one guest, and receiving it on the second guest,
the VM'd kernel instances just idle, I leave them running as I
continue my work. I have /var/log/messages being tailed via netconsole
to my host OS. And here's where the problem begins ... usually about 2
minutes after I send a msg and the VM just idles, the following output
appears in /var/log/messages. I'm having difficulty tracing any of
this as the traceback doesn't "seem" to originate w/ my KM (though I
know my KM's must be at fault as it would never happen unless I load
and run them). So this oops only appears several minutes after the run
on both the sending and the receiving machine using my KM's.

Any advice/suggestions much appreciated.


---
[  217.952082] BUG: unable to handle kernel paging request at 000000011e0fc6c0
[  217.953026] IP: [<ffffffff814e0d73>] nf_nat_cleanup_conntrack+0x4a/0x71
[  217.953026] PGD 1e564067 PUD 0
[  217.953026] Oops: 0002 [#1] SMP
[  217.953026] CPU 0
[  217.953026] Modules linked in: testkm1(O) testkm2(O)
[  217.953026]
[  217.953026] Pid: 0, comm: swapper/0 Tainted: G           O
3.2.1-gentoo-r2 #2 Bochs Bochs
[  217.953026] RIP: 0010:[<ffffffff814e0d73>]  [<ffffffff814e0d73>]
nf_nat_cleanup_conntrack+0x4a/0x71
[  217.953026] RSP: 0018:ffff88001fa03d70  EFLAGS: 00010246
[  217.953026] RAX: 0000000000000000 RBX: ffff88001e1367f8 RCX: ffffffff81053f1f
[  217.953026] RDX: 000000011e0fc6c0 RSI: 0000000000000006 RDI: ffffffff81c79bd8
[  217.953026] RBP: ffff88001fa03d80 R08: ffff88001fa0d980 R09: 0000000000000001
[  217.953026] R10: ffff88001fa03f08 R11: ffff88001fa0d900 R12: ffff88001e1367e1
[  217.953026] R13: ffff88001e082138 R14: ffff88001fa03e90 R15: ffffffff81a01fd8
[  217.953026] FS:  0000000000000000(0000) GS:ffff88001fa00000(0000)
knlGS:0000000000000000
[  217.953026] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[  217.953026] CR2: 000000011e0fc6c0 CR3: 000000001f748000 CR4: 00000000000006f0
[  217.953026] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  217.953026] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[  217.953026] Process swapper/0 (pid: 0, threadinfo ffffffff81a00000,
task ffffffff81a0d020)
[  217.953026] Stack:
[  217.953026]  ffff88001fa12bc0 ffffffff81c77dc8 ffff88001fa03db0
ffffffff81498e3f
[  217.953026]  7fffffffffffffff ffff88001e082138 ffffffff81c76280
0000000000000100
[  217.953026]  ffff88001fa03dd0 ffffffff814945db ffff88001e082138
ffffffff81c76280
[  217.953026] Call Trace:
[  217.953026]  <IRQ>
[  217.953026]  [<ffffffff81498e3f>] __nf_ct_ext_destroy+0x3b/0x53
[  217.953026]  [<ffffffff814945db>] nf_conntrack_free+0x20/0x4f
[  217.953026]  [<ffffffff814946b2>] destroy_conntrack+0xa8/0xad
[  217.953026]  [<ffffffff8149122c>] nf_conntrack_destroy+0x16/0x18
[  217.953026]  [<ffffffff81493a5a>] nf_ct_put+0x18/0x1a
[  217.953026]  [<ffffffff81494a64>] death_by_timeout+0x22/0x26
[  217.953026]  [<ffffffff8107aba4>] run_timer_softirq+0x1c6/0x295
[  217.953026]  [<ffffffff81494a42>] ? nf_ct_delete_from_lists+0x89/0x89
[  217.953026]  [<ffffffff81090013>] ? ktime_get+0x59/0x93
[  217.953026]  [<ffffffff81073b6f>] __do_softirq+0xc8/0x1a4
[  217.953026]  [<ffffffff8108c3e1>] ? hrtimer_interrupt+0x10d/0x19f
[  217.953026]  [<ffffffff815b6b2c>] call_softirq+0x1c/0x30
[  217.953026]  [<ffffffff81035a99>] do_softirq+0x41/0x7e
[  217.953026]  [<ffffffff81073932>] irq_exit+0x44/0xb4
[  217.953026]  [<ffffffff8104c4b9>] smp_apic_timer_interrupt+0x86/0x94
[  217.953026]  [<ffffffff815b539e>] apic_timer_interrupt+0x6e/0x80
[  217.953026]  <EOI>
[  217.953026]  [<ffffffff810535c4>] ? native_safe_halt+0x6/0x8
[  217.953026]  [<ffffffff8103b460>] default_idle+0x4b/0x85
[  217.953026]  [<ffffffff81033dd4>] cpu_idle+0x6e/0xa5
[  217.953026]  [<ffffffff8158edd9>] rest_init+0x6d/0x6f
[  217.953026]  [<ffffffff81aa9bcc>] start_kernel+0x350/0x35b
[  217.953026]  [<ffffffff81aa92b1>] x86_64_start_reservations+0xb8/0xbc
[  217.953026]  [<ffffffff81aa93b6>] x86_64_start_kernel+0x101/0x110
[  217.953026] Code: c2 85 d2 74 49 0f b6 58 11 48 01 c3 74 40 48 83
7b 20 00 74 39 48 c7 c7 d8 9b c7 81 e8 0c d6 0c 00 48 8b 03 48 8b 53
08 48 85 c0 <48> 89 02 74 04 48 89 50 08 48 bf 00 02 20 00 00 00 ad de
48 89
[  217.953026] RIP  [<ffffffff814e0d73>] nf_nat_cleanup_conntrack+0x4a/0x71
[  217.953026]  RSP <ffff88001fa03d70>
[  217.953026] CR2: 000000011e0fc6c0
[  218.039169] ---[ end trace c8420f05dc384e8a ]---
[  218.040439] Kernel panic - not syncing: Fatal exception in interrupt
[  218.042159] Pid: 0, comm: swapper/0 Tainted: G      D    O 3.2.1-gentoo-r2 #2
[  218.044068] Call Trace:
[  218.044717]  <IRQ>  [<ffffffff815ac249>] panic+0x8c/0x19e
[  218.046245]  [<ffffffff815af0a4>] oops_end+0xb1/0xc1
[  218.047590]  [<ffffffff81057b76>] no_context+0x202/0x211
[  218.049023]  [<ffffffff81053f1f>] ? pvclock_clocksource_read+0x4b/0xb4
[  218.050765]  [<ffffffff81057d3e>] __bad_area_nosemaphore+0x1b9/0x1d9
[  218.052471]  [<ffffffff81053592>] ? kvm_clock_read+0x19/0x1b
[  218.053992]  [<ffffffff81057d6c>] bad_area_nosemaphore+0xe/0x10
[  218.055580]  [<ffffffff815b1205>] do_page_fault+0x1c1/0x389
[  218.057077]  [<ffffffff81053f1f>] ? pvclock_clocksource_read+0x4b/0xb4
[  218.058822]  [<ffffffff81053592>] ? kvm_clock_read+0x19/0x1b
[  218.060383]  [<ffffffff81069c68>] ? enqueue_task_fair+0x2ab/0x414
[  218.061992]  [<ffffffff815b0c49>] do_async_page_fault+0x49/0x6b
[  218.063607]  [<ffffffff815ae7b5>] async_page_fault+0x25/0x30
[  218.065132]  [<ffffffff81053f1f>] ? pvclock_clocksource_read+0x4b/0xb4
[  218.066874]  [<ffffffff814e0d73>] ? nf_nat_cleanup_conntrack+0x4a/0x71
[  218.068696]  [<ffffffff81498e3f>] __nf_ct_ext_destroy+0x3b/0x53
[  218.070290]  [<ffffffff814945db>] nf_conntrack_free+0x20/0x4f
[  218.071831]  [<ffffffff814946b2>] destroy_conntrack+0xa8/0xad
[  218.073378]  [<ffffffff8149122c>] nf_conntrack_destroy+0x16/0x18
[  218.074995]  [<ffffffff81493a5a>] nf_ct_put+0x18/0x1a
[  218.076357]  [<ffffffff81494a64>] death_by_timeout+0x22/0x26
[  218.077889]  [<ffffffff8107aba4>] run_timer_softirq+0x1c6/0x295
[  218.079475]  [<ffffffff81494a42>] ? nf_ct_delete_from_lists+0x89/0x89
[  218.081202]  [<ffffffff81090013>] ? ktime_get+0x59/0x93
[  218.082611]  [<ffffffff81073b6f>] __do_softirq+0xc8/0x1a4
[  218.084060]  [<ffffffff8108c3e1>] ? hrtimer_interrupt+0x10d/0x19f
[  218.085698]  [<ffffffff815b6b2c>] call_softirq+0x1c/0x30
[  218.087121]  [<ffffffff81035a99>] do_softirq+0x41/0x7e
[  218.088503]  [<ffffffff81073932>] irq_exit+0x44/0xb4
[  218.089879]  [<ffffffff8104c4b9>] smp_apic_timer_interrupt+0x86/0x94
[  218.091591]  [<ffffffff815b539e>] apic_timer_interrupt+0x6e/0x80
[  218.093218]  <EOI>  [<ffffffff810535c4>] ? native_safe_halt+0x6/0x8
[  218.094945]  [<ffffffff8103b460>] default_idle+0x4b/0x85
[  218.096378]  [<ffffffff81033dd4>] cpu_idle+0x6e/0xa5
[  218.097719]  [<ffffffff8158edd9>] rest_init+0x6d/0x6f
[  218.099099]  [<ffffffff81aa9bcc>] start_kernel+0x350/0x35b
[  218.100575]  [<ffffffff81aa92b1>] x86_64_start_reservations+0xb8/0xbc
[  218.102320]  [<ffffffff81aa93b6>] x86_64_start_kernel+0x101/0x110
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/