lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Date:	Mon, 4 Aug 2014 22:57:51 +0800
From:	Fengguang Wu <fengguang.wu@...el.com>
To:	Artem Fetishev <Artem_Fetishev@...m.com>
Cc:	Jet Chen <jet.chen@...el.com>, Su Tao <tao.su@...el.com>,
	Yuanhan Liu <yuanhan.liu@...el.com>, LKP <lkp@...org>,
	"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>
Subject: Re: [x86] BUG: unable to handle kernel paging request at
 ffff880012770000

Hi Artem,

On Mon, Aug 04, 2014 at 12:20:45PM +0000, Artem Fetishev wrote:
> Are we talking about error in rapl_pmu_init()? I ran your script on v3.14 and the error (in rapl_pmu_init) did not appear. Once again, for 3.14-rc8 the error does show up. My patch was merged between v3.14-rc8 and v3.14. The purpose of the patch was fixing that GPF in rapl_pmu_init. So I doubt the bisect is correct.

> +-----------------------------------------------------------+------------+------------+---------------+
> |                                                           | d9060742fb | 825600c0f2 | next-20140723 |
> +-----------------------------------------------------------+------------+------------+---------------+
> | boot_successes                                            | 167        | 48         | 9             |
> | boot_failures                                             | 97         | 18         | 2             |
> | BUG:unable_to_handle_kernel_NULL_pointer_dereference      | 97         | 14         | 2             |
> | Oops                                                      | 97         | 3          | 2             |

> | RIP:rapl_pmu_init                                         | 97         |            |               |
> | Kernel_panic-not_syncing:Attempted_to_kill_init_exitcode= | 97         |            |               |
> | backtrace:rapl_pmu_init                                   | 97         |            |               |

The rapl_pmu_init bug is confirmed to be fixed judging from the above
3 lines.

> | backtrace:kernel_init_freeable                            | 97         | 3          |               |

However the below BUGs are new in commit 825600c0f2:

> | BUG:unable_to_handle_kernel_paging_request                | 0          | 17         | 2             |
> | BUG:unable_to_handle_kernel_NULL_pointer                  | 0          | 1          |               |
> | RIP:slob_page_alloc                                       | 0          | 3          |               |
> | backtrace:proc_create_data                                | 0          | 3          |               |
> | backtrace:init_timer_list_procfs                          | 0          | 3          |               |
> | BUG:unable_to_handle                                      | 0          | 1          |               |
> | BUG:unable_to_handl                                       | 0          | 1          |               |
> | BUG:unable_to_handle_ker                                  | 0          | 1          |               |
> | BUG:kernel_boot_hang                                      | 0          | 1          |               |
> | RIP:memset                                                | 0          | 0          | 2             |
> | RIP:__rmqueue                                             | 0          | 0          | 2             |
> | backtrace:cryptomgr_test                                  | 0          | 0          | 2             |
> | backtrace:do_fork                                         | 0          | 0          | 2             |
> +-----------------------------------------------------------+------------+------------+---------------+

BUT I find cryptomgr_test showing up in the dmesg, which is very good
at producing all kinds of bug messages. So please ignore this bug
report -- it has too strong noises and may not be correct.

> [    1.026545] cryptomgr_test (20) used greatest stack depth: 5632 bytes left
> [    1.030491] AVX version of gcm_enc/dec engaged.
> [    1.033812] alg: No test for __gcm-aes-aesni (__driver-gcm-aes-aesni)
> [    1.038806] BUG: unable to handle kernel paging request at ffff880012770000

Thanks,
Fengguang

> [    1.040058] IP: [<ffffffff8161d18f>] memset+0x1f/0xb0
> [    1.040058] PGD 4170067 PUD 4171067 PMD 13bee067 PTE 8000000012770060
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] BUG: unable to handle kernel paging request at ffff880012776440
> [    1.040058] IP: [<ffffffff810fb846>] match_held_lock+0x16/0x2f0
> [    1.040058] PGD 4170067 PUD 4171067 PMD 13bee067 PTE 8000000012776060
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 0000000000000072
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 000000000000006a
> 
> git bisect start v3.14 v3.13 --
> git bisect good 494479038d97f1b9f76fc633a360a681acdf035c  # 20:32     66+     28  Merge tag 'pinctrl-v3.14-2' of git://git.kernel.org/pub/scm/linux/kernel/git/linusw/linux-pinctrl
> git bisect good 751a03c3728ed393287374078b98c3094a0b3bd2  # 20:36     66+     67  Merge tag 'pm+acpi-3.14-rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm
> git bisect good 18f2af2d68815e1c4d5c275ebd030e27ef627582  # 20:43     66+     55  Merge tag 'for-linus' of git://git.kernel.org/pub/scm/virt/kvm/kvm
> git bisect good 708f04d2abf4e90abee61d9ffb1f165038017ecf  # 20:46     66+     58  block: free q->flush_rq in blk_init_allocated_queue error paths
> git bisect good 75c5a52da3fc2a06abb6c6192bdf5d680e56d37d  # 20:48     66+     31  vfs: Allocate anon_inode_inode in anon_inode_init()
> git bisect  bad 0f2776e6151a123552fd06b666fe755fa780a967  # 20:48      0-     18  Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client
> git bisect good 1fac1fa965a9fbdb38d8a4b945db3521f801c946  # 20:57     66+     26  Merge branch 'drm-fixes' of git://people.freedesktop.org/~airlied/linux
> git bisect  bad bc53267e97e4942e7f22c31cd8cf87840796accb  # 21:01      1-      5  Merge branch 'akpm' (patches from Andrew Morton)
> git bisect good 2946369ee277fa9fcc3372aabddc9c15dfabf744  # 21:11     66+      0  Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/dtor/input
> git bisect  bad 825600c0f20e595daaa7a6dd8970f84fa2a2ee57  # 21:30      0-      7  x86: fix boot on uniprocessor systems
> git bisect good d9060742fbf630fe31951dfc10b798deb2813f01  # 22:39     66+     29  ocfs2: check if cluster name exists before deref
> # first bad commit: [825600c0f20e595daaa7a6dd8970f84fa2a2ee57] x86: fix boot on uniprocessor systems
> git bisect good d9060742fbf630fe31951dfc10b798deb2813f01  # 22:49    198+     97  ocfs2: check if cluster name exists before deref
> git bisect  bad 5eb00b037d9bb650b18b8f331bb9fb7a66559b5f  # 22:51      0-      2  Add linux-next specific files for 20140723
> git bisect  bad 82e13c71bc655b6dc7110da4e164079dadb44892  # 23:12      0-     77  Merge branch 'for-3.16' of git://linux-nfs.org/~bfields/linux
> git bisect  bad 1a58d9909611972fd1c081bb04a9f7dc2571e612  # 23:23      2-      1  Add linux-next specific files for 20140724
> 
> 
> This script may reproduce the error.
> 
> ----------------------------------------------------------------------------
> #!/bin/bash
> 
> kernel=$1
> initrd=yocto-minimal-x86_64.cgz
> 
> wget --no-clobber https://github.com/fengguang/reproduce-kernel-bug/blob/master/initrd/$initrd
> 
> kvm=(
> 	qemu-system-x86_64
> 	-enable-kvm
> 	-cpu Haswell,+smep,+smap
> 	-kernel $kernel
> 	-initrd $initrd
> 	-m 320
> 	-smp 1
> 	-net nic,vlan=1,model=e1000
> 	-net user,vlan=1
> 	-boot order=nc
> 	-no-reboot
> 	-watchdog i6300esb
> 	-rtc base=localtime
> 	-serial stdio
> 	-display none
> 	-monitor null 
> )
> 
> append=(
> 	hung_task_panic=1
> 	earlyprintk=ttyS0,115200
> 	debug
> 	apic=debug
> 	sysrq_always_enabled
> 	rcupdate.rcu_cpu_stall_timeout=100
> 	panic=10
> 	softlockup_panic=1
> 	nmi_watchdog=panic
> 	prompt_ramdisk=0
> 	console=ttyS0,115200
> 	console=tty0
> 	vga=normal
> 	root=/dev/ram0
> 	rw
> 	drbd.minor_count=8
> )
> 
> "${kvm[@]}" --append "${append[*]}"
> ----------------------------------------------------------------------------
> 
> Thanks,
> Fengguang
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ