lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite for Android: free password hash cracker in your pocket
[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Date:	Mon, 12 May 2014 16:03:51 +0800
From:	Jet Chen <jet.chen@...el.com>
To:	Vincent Guittot <vincent.guittot@...aro.org>
CC:	Ingo Molnar <mingo@...nel.org>,
	Fengguang Wu <fengguang.wu@...el.com>,
	linux-ia64@...r.kernel.org, linux-s390@...r.kernel.org,
	LKML <linux-kernel@...r.kernel.org>
Subject: [sched] BUG: unable to handle kernel paging request at 093cd001

Hi Vincent,

I got the below dmesg and the first bad commit is

git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip.git sched/core
commit 143e1e28cb40bed836b0a06567208bd7347c9672
Author:     Vincent Guittot <vincent.guittot@...aro.org>
AuthorDate: Fri Apr 11 11:44:37 2014 +0200
Commit:     Ingo Molnar <mingo@...nel.org>
CommitDate: Wed May 7 13:33:49 2014 +0200

      sched: Rework sched_domain topology definition
          We replace the old way to configure the scheduler topology with a new method
      which enables a platform to declare additionnal level (if needed).
          We still have a default topology table definition that can be used by platform
      that don't want more level than the SMT, MC, CPU and NUMA ones. This table can
      be overwritten by an arch which either wants to add new level where a load
      balance make sense like BOOK or powergating level or wants to change the flags
      configuration of some levels.
          For each level, we need a function pointer that returns cpumask for each cpu,
      a function pointer that returns the flags for the level and a name. Only flags
      that describe topology, can be set by an architecture. The current topology
      flags are:
           SD_SHARE_CPUPOWER
       SD_SHARE_PKG_RESOURCES
       SD_NUMA
       SD_ASYM_PACKING
          Then, each level must be a subset on the next one. The build sequence of the
      sched_domain will take care of removing useless levels like those with 1 CPU
      and those with the same CPU span and no more relevant information for
      load balancing than its children.
          Signed-off-by: Vincent Guittot <vincent.guittot@...aro.org>
      Tested-by: Dietmar Eggemann <dietmar.eggemann@....com>
      Reviewed-by: Preeti U Murthy <preeti@...ux.vnet.ibm.com>
      Reviewed-by: Dietmar Eggemann <dietmar.eggemann@....com>
      Signed-off-by: Peter Zijlstra <peterz@...radead.org>
      Cc: Andrew Morton <akpm@...ux-foundation.org>
      Cc: Benjamin Herrenschmidt <benh@...nel.crashing.org>
      Cc: Bjorn Helgaas <bhelgaas@...gle.com>
      Cc: Chris Metcalf <cmetcalf@...era.com>
      Cc: Christoph Lameter <cl@...ux.com>
      Cc: David S. Miller <davem@...emloft.net>
      Cc: Fenghua Yu <fenghua.yu@...el.com>
      Cc: Greg Kroah-Hartman <gregkh@...uxfoundation.org>
      Cc: Hanjun Guo <hanjun.guo@...aro.org>
      Cc: Heiko Carstens <heiko.carstens@...ibm.com>
      Cc: Jason Low <jason.low2@...com>
      Cc: Linus Torvalds <torvalds@...ux-foundation.org>
      Cc: Martin Schwidefsky <schwidefsky@...ibm.com>
      Cc: Tony Luck <tony.luck@...el.com>
      Cc: linux390@...ibm.com
      Cc: linux-ia64@...r.kernel.org
      Cc: linux-s390@...r.kernel.org
      Link: http://lkml.kernel.org/r/1397209481-28542-2-git-send-email-vincent.guittot@linaro.org
      Signed-off-by: Ingo Molnar <mingo@...nel.org>

+--------------------------------------------+------------+------------+
|                                            | 107437febd | 143e1e28cb |
+--------------------------------------------+------------+------------+
| boot_successes                             | 137        | 0          |
| boot_failures                              | 3          | 20         |
| BUG:kernel_boot_crashed                    | 2          |            |
| BUG:kernel_boot_hang                       | 1          |            |
| BUG:unable_to_handle_kernel_paging_request | 0          | 20         |
| Oops                                       | 0          | 20         |
| EIP_is_at_build_sched_domains              | 0          | 20         |
| Kernel_panic-not_syncing:Fatal_exception   | 0          | 20         |
| backtrace:sched_init_smp                   | 0          | 20         |
| backtrace:kernel_init_freeable             | 0          | 20         |
+--------------------------------------------+------------+------------+

[    0.568094] KVM setup async PF for cpu 31
[    0.568094] kvm-stealtime: cpu 31, msr e3df980
[    0.569822] smpboot: Total of 32 processors activated (172393.34 BogoMIPS)
[    0.572544] BUG: unable to handle kernel paging request at 093cd001
[    0.573405] IP: [<c226eba3>] build_sched_domains+0x252/0x1545
[    0.574193] *pdpt = 0000000000000000 *pde = f000ff53f000ff53 [    0.575009] Oops: 0000 [#1] SMP [    0.575492] Modules linked in:
[    0.575924] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 3.15.0-rc2-00065-g143e1e2 #1
[    0.576000] Hardware name: Bochs Bochs, BIOS Bochs 01/01/2011
[    0.576000] task: cb450000 ti: cb458000 task.ti: cb458000
[    0.576000] EIP: 0060:[<c226eba3>] EFLAGS: 00010202 CPU: 0
[    0.576000] EIP is at build_sched_domains+0x252/0x1545
[    0.576000] EAX: 00000001 EBX: cb403210 ECX: cb409c64 EDX: 093cd000
[    0.576000] ESI: 00000000 EDI: cb403210 EBP: cb459f44 ESP: cb459ee0
[    0.576000]  DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
[    0.576000] CR0: 8005003b CR2: 093cd001 CR3: 028ed000 CR4: 000006f0
[    0.576000] Stack:
[    0.576000]  00000002 00000000 00000000 c226fd5f 000000d0 cb409c68 00000020 c23b9d70
[    0.576000]  cb4031c0 cb403900 cb409c58 cb409358 00000000 00000000 cb4031f0 cb409c60
[    0.576000]  cb4031e0 00000000 00000000 cb403900 c28e6ef4 cb441000 cb409c58 00000008
[    0.576000] Call Trace:
[    0.576000]  [<c226fd5f>] ? build_sched_domains+0x140e/0x1545
[    0.576000]  [<c23b9d70>] ? alloc_cpumask_var_node+0x1f/0x77
[    0.576000]  [<c274ec97>] sched_init_smp+0x34d/0x3c6
[    0.576000]  [<c2737b85>] kernel_init_freeable+0x6e/0x187
[    0.576000]  [<c2266d9d>] ? finish_task_switch+0x3e/0xfa
[    0.576000]  [<c24fda4a>] kernel_init+0xb/0xcc
[    0.576000]  [<c25120c1>] ret_from_kernel_thread+0x21/0x30
[    0.576000]  [<c24fda3f>] ? rest_init+0xbf/0xbf
[    0.576000] Code: 00 31 73 c2 8b 0c 11 85 c9 74 0a f6 41 3d 20 0f 85 b9 00 00 00 8b 04 02 e8 80 5f 0b 00 8b 43 04 85 c0 74 0f 8b 14 b5 00 31 73 c2 <8b> 04 10 e8 6a 5f 0b 00 8b 43 08 85 c0 74 0f 8b 14 b5 00 31 73
[    0.576000] EIP: [<c226eba3>] build_sched_domains+0x252/0x1545 SS:ESP 0068:cb459ee0
[    0.576000] CR2: 00000000093cd001
[    0.576000] ---[ end trace 36408322af01ed21 ]---
[    0.576000] Kernel panic - not syncing: Fatal exception

Attached dmesg for the parent commit, too, to help confirm whether it is a noise error.

git bisect start 0368ef8acc92bb28093a70450d07b30f5d7fcbe3 d6d211db37e75de2ddc3a4f979038c40df7cc79c --
git bisect good e8db95ef4c15f74e67b32c56508a2a5f87787f57  # 20:32     20+     20  Merge 'cgroup/master' into devel-hourly-2014051009
git bisect  bad 62a39464ff6599489982d4faf6057cf963939966  # 20:50      0-     20  Merge 'asoc/topic/intel' into devel-hourly-2014051009
git bisect good 13fe6e7f5579d510c4f256f0dd3ba683040bf14f  # 21:05     20+     20  Merge 'rockchip/v3.16-next/dt' into devel-hourly-2014051009
git bisect  bad 0ee69d44c7d3cb475ed1a3f92fa4afb4192705e3  # 21:08      0-     11  Merge 'tip/sched/core' into devel-hourly-2014051009
git bisect good 2f330408d01207d82907a5ff5fc7256cbd4c486a  # 21:14     20+     20  Merge 'xhci/for-usb-linus' into devel-hourly-2014051009
git bisect good 792568ec6a31ca560ca4d528782cbc6cd2cea8b0  # 21:29     20+      2  sched/numa: Count pages on active node as local
git bisect  bad 39a4d9ca77a31503c6317e49742341d0859d5cb2  # 21:34      0-     20  sched/fair: Stop searching for tasks in newidle balance if there are runnable tasks
git bisect  bad 143e1e28cb40bed836b0a06567208bd7347c9672  # 21:34      0-     20  sched: Rework sched_domain topology definition
git bisect good 68d1b02a58f5d9f584c1fb2923ed60ec68cbbd9b  # 21:49     20+      2  sched/numa: Do not set preferred_node on migration to a second choice node
git bisect good 107437febd495a50e2cd09c81bbaa84d30e57b07  # 21:59     20+      2  mm/numa: Remove BUG_ON() in __handle_mm_fault()
# first bad commit: [143e1e28cb40bed836b0a06567208bd7347c9672] sched: Rework sched_domain topology definition
git bisect good 107437febd495a50e2cd09c81bbaa84d30e57b07  # 22:02     60+      3  mm/numa: Remove BUG_ON() in __handle_mm_fault()
git bisect  bad 0368ef8acc92bb28093a70450d07b30f5d7fcbe3  # 22:02      0-     14  0day head guard for 'devel-hourly-2014051009'
git bisect good 7e338c9991ecee9c2ac7a4cee2c2e11ecb563d02  # 00:58     60+      2  Merge branch 'for-3.15' of git://linux-nfs.org/~bfields/linux
git bisect  bad 9f072b9386bc35ca2ed534c0392f5036902ce41b  # 01:15      0-     33  Add linux-next specific files for 20140509


This script may reproduce the error.

-----------------------------------------------------------------------------
#!/bin/bash

kernel=$1

kvm=(
	qemu-system-x86_64 -cpu kvm64 -enable-kvm 	-kernel $kernel
	-smp 2
	-m 256M
	-net nic,vlan=0,macaddr=00:00:00:00:00:00,model=virtio
	-net user,vlan=0
	-net nic,vlan=1,model=e1000
	-net user,vlan=1
	-boot order=nc
	-no-reboot
	-watchdog i6300esb
	-serial stdio
	-display none
	-monitor null
)

append=(
	debug
	sched_debug
	apic=debug
	ignore_loglevel
	sysrq_always_enabled
	panic=10
	prompt_ramdisk=0
	earlyprintk=ttyS0,115200
	console=ttyS0,115200
	console=tty0
	vga=normal
	root=/dev/ram0
	rw
)

"${kvm[@]}" --append "${append[*]}"
-----------------------------------------------------------------------------

Thanks,
Fengguang



View attachment "dmesg-quantal-f4-75:20140510211957:i386-randconfig-c0-05101824:3.15.0-rc2-00065-g143e1e2:1" of type "text/plain" (35942 bytes)

View attachment "dmesg-quantal-f1-93:20140511215938:i386-randconfig-c0-05101824:3.15.0-rc2-00064-g107437f:1" of type "text/plain" (22994 bytes)

Download attachment "i386-randconfig-c0-05101824-0368ef8acc92bb28093a70450d07b30f5d7fcbe3-BUG:-unable-to-handle-kernel-paging-request-10579.log" of type "application/octet-stream" (51986 bytes)

View attachment "config-3.15.0-rc2-00065-g143e1e2" of type "text/plain" (61325 bytes)

View attachment "Attached Message Part" of type "text/plain" (87 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ