[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <53708067.7050700@intel.com>
Date: Mon, 12 May 2014 16:03:51 +0800
From: Jet Chen <jet.chen@...el.com>
To: Vincent Guittot <vincent.guittot@...aro.org>
CC: Ingo Molnar <mingo@...nel.org>,
Fengguang Wu <fengguang.wu@...el.com>,
linux-ia64@...r.kernel.org, linux-s390@...r.kernel.org,
LKML <linux-kernel@...r.kernel.org>
Subject: [sched] BUG: unable to handle kernel paging request at 093cd001
Hi Vincent,
I got the below dmesg and the first bad commit is
git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip.git sched/core
commit 143e1e28cb40bed836b0a06567208bd7347c9672
Author: Vincent Guittot <vincent.guittot@...aro.org>
AuthorDate: Fri Apr 11 11:44:37 2014 +0200
Commit: Ingo Molnar <mingo@...nel.org>
CommitDate: Wed May 7 13:33:49 2014 +0200
sched: Rework sched_domain topology definition
We replace the old way to configure the scheduler topology with a new method
which enables a platform to declare additionnal level (if needed).
We still have a default topology table definition that can be used by platform
that don't want more level than the SMT, MC, CPU and NUMA ones. This table can
be overwritten by an arch which either wants to add new level where a load
balance make sense like BOOK or powergating level or wants to change the flags
configuration of some levels.
For each level, we need a function pointer that returns cpumask for each cpu,
a function pointer that returns the flags for the level and a name. Only flags
that describe topology, can be set by an architecture. The current topology
flags are:
SD_SHARE_CPUPOWER
SD_SHARE_PKG_RESOURCES
SD_NUMA
SD_ASYM_PACKING
Then, each level must be a subset on the next one. The build sequence of the
sched_domain will take care of removing useless levels like those with 1 CPU
and those with the same CPU span and no more relevant information for
load balancing than its children.
Signed-off-by: Vincent Guittot <vincent.guittot@...aro.org>
Tested-by: Dietmar Eggemann <dietmar.eggemann@....com>
Reviewed-by: Preeti U Murthy <preeti@...ux.vnet.ibm.com>
Reviewed-by: Dietmar Eggemann <dietmar.eggemann@....com>
Signed-off-by: Peter Zijlstra <peterz@...radead.org>
Cc: Andrew Morton <akpm@...ux-foundation.org>
Cc: Benjamin Herrenschmidt <benh@...nel.crashing.org>
Cc: Bjorn Helgaas <bhelgaas@...gle.com>
Cc: Chris Metcalf <cmetcalf@...era.com>
Cc: Christoph Lameter <cl@...ux.com>
Cc: David S. Miller <davem@...emloft.net>
Cc: Fenghua Yu <fenghua.yu@...el.com>
Cc: Greg Kroah-Hartman <gregkh@...uxfoundation.org>
Cc: Hanjun Guo <hanjun.guo@...aro.org>
Cc: Heiko Carstens <heiko.carstens@...ibm.com>
Cc: Jason Low <jason.low2@...com>
Cc: Linus Torvalds <torvalds@...ux-foundation.org>
Cc: Martin Schwidefsky <schwidefsky@...ibm.com>
Cc: Tony Luck <tony.luck@...el.com>
Cc: linux390@...ibm.com
Cc: linux-ia64@...r.kernel.org
Cc: linux-s390@...r.kernel.org
Link: http://lkml.kernel.org/r/1397209481-28542-2-git-send-email-vincent.guittot@linaro.org
Signed-off-by: Ingo Molnar <mingo@...nel.org>
+--------------------------------------------+------------+------------+
| | 107437febd | 143e1e28cb |
+--------------------------------------------+------------+------------+
| boot_successes | 137 | 0 |
| boot_failures | 3 | 20 |
| BUG:kernel_boot_crashed | 2 | |
| BUG:kernel_boot_hang | 1 | |
| BUG:unable_to_handle_kernel_paging_request | 0 | 20 |
| Oops | 0 | 20 |
| EIP_is_at_build_sched_domains | 0 | 20 |
| Kernel_panic-not_syncing:Fatal_exception | 0 | 20 |
| backtrace:sched_init_smp | 0 | 20 |
| backtrace:kernel_init_freeable | 0 | 20 |
+--------------------------------------------+------------+------------+
[ 0.568094] KVM setup async PF for cpu 31
[ 0.568094] kvm-stealtime: cpu 31, msr e3df980
[ 0.569822] smpboot: Total of 32 processors activated (172393.34 BogoMIPS)
[ 0.572544] BUG: unable to handle kernel paging request at 093cd001
[ 0.573405] IP: [<c226eba3>] build_sched_domains+0x252/0x1545
[ 0.574193] *pdpt = 0000000000000000 *pde = f000ff53f000ff53 [ 0.575009] Oops: 0000 [#1] SMP [ 0.575492] Modules linked in:
[ 0.575924] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 3.15.0-rc2-00065-g143e1e2 #1
[ 0.576000] Hardware name: Bochs Bochs, BIOS Bochs 01/01/2011
[ 0.576000] task: cb450000 ti: cb458000 task.ti: cb458000
[ 0.576000] EIP: 0060:[<c226eba3>] EFLAGS: 00010202 CPU: 0
[ 0.576000] EIP is at build_sched_domains+0x252/0x1545
[ 0.576000] EAX: 00000001 EBX: cb403210 ECX: cb409c64 EDX: 093cd000
[ 0.576000] ESI: 00000000 EDI: cb403210 EBP: cb459f44 ESP: cb459ee0
[ 0.576000] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
[ 0.576000] CR0: 8005003b CR2: 093cd001 CR3: 028ed000 CR4: 000006f0
[ 0.576000] Stack:
[ 0.576000] 00000002 00000000 00000000 c226fd5f 000000d0 cb409c68 00000020 c23b9d70
[ 0.576000] cb4031c0 cb403900 cb409c58 cb409358 00000000 00000000 cb4031f0 cb409c60
[ 0.576000] cb4031e0 00000000 00000000 cb403900 c28e6ef4 cb441000 cb409c58 00000008
[ 0.576000] Call Trace:
[ 0.576000] [<c226fd5f>] ? build_sched_domains+0x140e/0x1545
[ 0.576000] [<c23b9d70>] ? alloc_cpumask_var_node+0x1f/0x77
[ 0.576000] [<c274ec97>] sched_init_smp+0x34d/0x3c6
[ 0.576000] [<c2737b85>] kernel_init_freeable+0x6e/0x187
[ 0.576000] [<c2266d9d>] ? finish_task_switch+0x3e/0xfa
[ 0.576000] [<c24fda4a>] kernel_init+0xb/0xcc
[ 0.576000] [<c25120c1>] ret_from_kernel_thread+0x21/0x30
[ 0.576000] [<c24fda3f>] ? rest_init+0xbf/0xbf
[ 0.576000] Code: 00 31 73 c2 8b 0c 11 85 c9 74 0a f6 41 3d 20 0f 85 b9 00 00 00 8b 04 02 e8 80 5f 0b 00 8b 43 04 85 c0 74 0f 8b 14 b5 00 31 73 c2 <8b> 04 10 e8 6a 5f 0b 00 8b 43 08 85 c0 74 0f 8b 14 b5 00 31 73
[ 0.576000] EIP: [<c226eba3>] build_sched_domains+0x252/0x1545 SS:ESP 0068:cb459ee0
[ 0.576000] CR2: 00000000093cd001
[ 0.576000] ---[ end trace 36408322af01ed21 ]---
[ 0.576000] Kernel panic - not syncing: Fatal exception
Attached dmesg for the parent commit, too, to help confirm whether it is a noise error.
git bisect start 0368ef8acc92bb28093a70450d07b30f5d7fcbe3 d6d211db37e75de2ddc3a4f979038c40df7cc79c --
git bisect good e8db95ef4c15f74e67b32c56508a2a5f87787f57 # 20:32 20+ 20 Merge 'cgroup/master' into devel-hourly-2014051009
git bisect bad 62a39464ff6599489982d4faf6057cf963939966 # 20:50 0- 20 Merge 'asoc/topic/intel' into devel-hourly-2014051009
git bisect good 13fe6e7f5579d510c4f256f0dd3ba683040bf14f # 21:05 20+ 20 Merge 'rockchip/v3.16-next/dt' into devel-hourly-2014051009
git bisect bad 0ee69d44c7d3cb475ed1a3f92fa4afb4192705e3 # 21:08 0- 11 Merge 'tip/sched/core' into devel-hourly-2014051009
git bisect good 2f330408d01207d82907a5ff5fc7256cbd4c486a # 21:14 20+ 20 Merge 'xhci/for-usb-linus' into devel-hourly-2014051009
git bisect good 792568ec6a31ca560ca4d528782cbc6cd2cea8b0 # 21:29 20+ 2 sched/numa: Count pages on active node as local
git bisect bad 39a4d9ca77a31503c6317e49742341d0859d5cb2 # 21:34 0- 20 sched/fair: Stop searching for tasks in newidle balance if there are runnable tasks
git bisect bad 143e1e28cb40bed836b0a06567208bd7347c9672 # 21:34 0- 20 sched: Rework sched_domain topology definition
git bisect good 68d1b02a58f5d9f584c1fb2923ed60ec68cbbd9b # 21:49 20+ 2 sched/numa: Do not set preferred_node on migration to a second choice node
git bisect good 107437febd495a50e2cd09c81bbaa84d30e57b07 # 21:59 20+ 2 mm/numa: Remove BUG_ON() in __handle_mm_fault()
# first bad commit: [143e1e28cb40bed836b0a06567208bd7347c9672] sched: Rework sched_domain topology definition
git bisect good 107437febd495a50e2cd09c81bbaa84d30e57b07 # 22:02 60+ 3 mm/numa: Remove BUG_ON() in __handle_mm_fault()
git bisect bad 0368ef8acc92bb28093a70450d07b30f5d7fcbe3 # 22:02 0- 14 0day head guard for 'devel-hourly-2014051009'
git bisect good 7e338c9991ecee9c2ac7a4cee2c2e11ecb563d02 # 00:58 60+ 2 Merge branch 'for-3.15' of git://linux-nfs.org/~bfields/linux
git bisect bad 9f072b9386bc35ca2ed534c0392f5036902ce41b # 01:15 0- 33 Add linux-next specific files for 20140509
This script may reproduce the error.
-----------------------------------------------------------------------------
#!/bin/bash
kernel=$1
kvm=(
qemu-system-x86_64 -cpu kvm64 -enable-kvm -kernel $kernel
-smp 2
-m 256M
-net nic,vlan=0,macaddr=00:00:00:00:00:00,model=virtio
-net user,vlan=0
-net nic,vlan=1,model=e1000
-net user,vlan=1
-boot order=nc
-no-reboot
-watchdog i6300esb
-serial stdio
-display none
-monitor null
)
append=(
debug
sched_debug
apic=debug
ignore_loglevel
sysrq_always_enabled
panic=10
prompt_ramdisk=0
earlyprintk=ttyS0,115200
console=ttyS0,115200
console=tty0
vga=normal
root=/dev/ram0
rw
)
"${kvm[@]}" --append "${append[*]}"
-----------------------------------------------------------------------------
Thanks,
Fengguang
View attachment "dmesg-quantal-f4-75:20140510211957:i386-randconfig-c0-05101824:3.15.0-rc2-00065-g143e1e2:1" of type "text/plain" (35942 bytes)
View attachment "dmesg-quantal-f1-93:20140511215938:i386-randconfig-c0-05101824:3.15.0-rc2-00064-g107437f:1" of type "text/plain" (22994 bytes)
Download attachment "i386-randconfig-c0-05101824-0368ef8acc92bb28093a70450d07b30f5d7fcbe3-BUG:-unable-to-handle-kernel-paging-request-10579.log" of type "application/octet-stream" (51986 bytes)
View attachment "config-3.15.0-rc2-00065-g143e1e2" of type "text/plain" (61325 bytes)
View attachment "Attached Message Part" of type "text/plain" (87 bytes)
Powered by blists - more mailing lists