[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20140730153315.GA29233@localhost>
Date:	Wed, 30 Jul 2014 23:33:15 +0800
From:	Fengguang Wu <fengguang.wu@...el.com>
To:	Andy Lutomirski <luto@...capital.net>
Cc:	Jet Chen <jet.chen@...el.com>, Su Tao <tao.su@...el.com>,
	Yuanhan Liu <yuanhan.liu@...el.com>, LKP <lkp@...org>,
	"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
	xen-devel@...ts.xenproject.org
Subject: Re: [x86_64,vsyscall] Kernel panic - not syncing: Attempted to kill
 init! exitcode=0x0000000b
On Wed, Jul 30, 2014 at 07:58:13AM -0700, Andy Lutomirski wrote:
> On Wed, Jul 30, 2014 at 7:29 AM, Fengguang Wu <fengguang.wu@...el.com> wrote:
> > Greetings,
> >
> > 0day kernel testing robot got the below dmesg and the first bad commit is
> >
> > git://git.kernel.org/pub/scm/linux/kernel/git/luto/linux.git x86/vsyscall
> > commit 442aba0c6131f0c41dfc5edb6bfb88335556523f
> > Author:     Andy Lutomirski <luto@...capital.net>
> > AuthorDate: Mon Jun 16 18:50:12 2014 -0700
> > Commit:     Andy Lutomirski <luto@...capital.net>
> > CommitDate: Mon Jun 30 14:32:44 2014 -0700
> 
> Was this a merge?
It's not a merge commit.
> Is there an easy way to see exactly what was tested? 
This script may reproduce the error. Note that it's not 100% reproducible.
----------------------------------------------------------------------------
#!/bin/bash
kernel=$1
initrd=yocto-minimal-x86_64.cgz
wget --no-clobber https://github.com/fengguang/reproduce-kernel-bug/blob/master/initrd/$initrd
kvm=(
        qemu-system-x86_64
        -cpu kvm64
        -enable-kvm
        -kernel $kernel
        -initrd $initrd
        -m 320
        -smp 1
        -net nic,vlan=1,model=e1000
        -net user,vlan=1
        -boot order=nc
        -no-reboot
        -watchdog i6300esb
        -rtc base=localtime
        -serial stdio
        -display none
        -monitor null
)
append=(
        hung_task_panic=1
        earlyprintk=ttyS0,115200
        debug
        apic=debug
        sysrq_always_enabled
        rcupdate.rcu_cpu_stall_timeout=100
        panic=10
        softlockup_panic=1
        nmi_watchdog=panic
        prompt_ramdisk=0
        console=ttyS0,115200
        console=tty0
        vga=normal
        root=/dev/ram0
        rw
        drbd.minor_count=8
)
"${kvm[@]}" --append "${append[*]}"
----------------------------------------------------------------------------
> I had a buggy
> commit called "x86: Split syscall_trace_enter
> into two phases" that could have caused this problem.
> 3f649f5658a163645e3ce15156176c325283762e was bad, but
> 714cf438762d342673b3b131d5c90bc69ca921a9 (the newer version of that
> commit) should be okay.  Neither is an ancestor of the commit that the
> bisect identified, though.
Yeah that patch lies in another branch "luto/x86/seccomp-fastpath",
so is not involved in this bug.
Thanks,
Fengguang
> >     x86_64,vsyscall: Make vsyscall emulation configurable
> >
> >     This adds CONFIG_X86_VSYSCALL_EMULATION, guarded by CONFIG_EXPERT.
> >     Turning it off completely disables vsyscall emulation, saving ~3.5k
> >     for vsyscall_64.c, 4k for vsyscall_emu_64.S (the fake vsyscall
> >     page), some tiny amount of core mm code that supports a gate area,
> >     and possibly 4k for a wasted pagetable.  The latter is because the
> >     vsyscall addresses are misaligned and fit poorly in the fixmap.
> >
> >     Signed-off-by: Andy Lutomirski <luto@...capital.net>
> >
> > ===================================================
> > PARENT COMMIT NOT CLEAN. LOOK OUT FOR WRONG BISECT!
> > ===================================================
> > Attached dmesg for the parent commit, too, to help confirm whether it is a noise error.
> >
> > +-----------------------------------------------------------+------------+------------+------------------+
> > |                                                           | e1656ab2ad | 442aba0c61 | v3.16-rc4_071018 |
> > +-----------------------------------------------------------+------------+------------+------------------+
> > | boot_successes                                            | 1160       | 99         | 3                |
> > | boot_failures                                             | 160        | 231        | 8                |
> > | BUG:kernel_boot_hang                                      | 160        | 51         | 2                |
> > | Kernel_panic-not_syncing:Attempted_to_kill_init_exitcode= | 0          | 180        | 6                |
> > | INFO:suspicious_RCU_usage                                 | 0          | 180        | 6                |
> > +-----------------------------------------------------------+------------+------------+------------------+
> >
> > mount: can't read '/proc/mounts': No such file or directory
> > [   33.736413] init[1]: segfault at ffffffffff600400 ip ffffffffff600400 sp 00007fff2894a8a8 error 15
> > [   33.737608] init[1]: segfault at ffffffffff600400 ip ffffffffff600400 sp 00007fff28949eb8 error 15
> > [   33.739046] Kernel panic - not syncing: Attempted to kill init! exitcode=0x0000000b
> > [   33.739046]
> > [   33.740015] CPU: 0 PID: 1 Comm: init Not tainted 3.16.0-rc3-00010-g442aba0 #4
> > [   33.740015]  0000000000000000 ffff880000033cc0 ffffffff81ff485f ffff880000033d38
> > [   33.740015]  ffffffff81ff1342 ffff880000000010 ffff880000033d48 ffff880000033ce8
> > [   33.740015]  ffffffff82c440c0 000000000000000b 8c6318c6318c6320 00000007db00a678
> > [   33.740015] Call Trace:
> > [   33.740015]  [<ffffffff81ff485f>] dump_stack+0x19/0x1b
> > [   33.740015]  [<ffffffff81ff1342>] panic+0xcb/0x1fb
> > [   33.740015]  [<ffffffff81093b2f>] do_exit+0x3dd/0x80f
> > [   33.740015]  [<ffffffff810b071d>] ? local_clock+0x14/0x1d
> > [   33.740015]  [<ffffffff81094002>] do_group_exit+0x75/0xb4
> > [   33.740015]  [<ffffffff8109c7e7>] get_signal_to_deliver+0x48a/0x4aa
> > [   33.740015]  [<ffffffff8100231a>] do_signal+0x43/0x5ba
> > [   33.740015]  [<ffffffff810b4b79>] ? lock_release_holdtime+0x6c/0x77
> > [   33.740015]  [<ffffffff810b83b5>] ? lock_release_non_nested+0xd0/0x21e
> > [   33.740015]  [<ffffffff810b0646>] ? sched_clock_cpu+0x4e/0x62
> > [   33.740015]  [<ffffffff810fd465>] ? might_fault+0x4f/0x9c
> > [   33.740015]  [<ffffffff810b6163>] ? trace_hardirqs_off_caller+0x36/0xa5
> > [   33.740015]  [<ffffffff82004298>] ? retint_signal+0x11/0x99
> > [   33.740015]  [<ffffffff810028b5>] do_notify_resume+0x24/0x53
> > [   33.740015]  [<ffffffff820042d4>] retint_signal+0x4d/0x99
> > [   33.740015] Kernel Offset: 0x0 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffff9fffffff)
> > [   33.740015] drm_kms_helper: panic occurred, switching back to text console
> > [   33.740015]
> > [   33.740015] ===============================
> > [   33.740015] [ INFO: suspicious RCU usage. ]
> > [   33.740015] 3.16.0-rc3-00010-g442aba0 #4 Not tainted
> > [   33.740015] -------------------------------
> > [   33.740015] include/linux/rcupdate.h:539 Illegal context switch in RCU read-side critical section!
> > [   33.740015]
> > [   33.740015] other info that might help us debug this:
> > [   33.740015]
> > [   33.740015]
> > [   33.740015] rcu_scheduler_active = 1, debug_locks = 0
> > [   33.740015] 3 locks held by init/1:
> > [   33.740015]  #0:  (panic_lock){....+.}, at: [<ffffffff81ff12ba>] panic+0x43/0x1fb
> > [   33.740015]  #1:  (rcu_read_lock){......}, at: [<ffffffff810ab879>] rcu_lock_acquire+0x0/0x23
> > [   33.740015]  #2:  (&dev->mode_config.mutex){+.+.+.}, at: [<ffffffff814a74d7>] drm_fb_helper_panic+0x5d/0xab
> > [   33.740015]
> > [   33.740015] stack backtrace:
> > [   33.740015] CPU: 0 PID: 1 Comm: init Not tainted 3.16.0-rc3-00010-g442aba0 #4
> > [   33.740015]  0000000000000000 ffff8800000339d0 ffffffff81ff485f ffff880000033a00
> > [   33.740015]  ffffffff810b8824 ffffffff82836248 000000000000024a 0000000000000000
> > [   33.740015]  ffff88001012e008 ffff880000033a10 ffffffff810adce3 ffff880000033a38
> > [   33.740015] Call Trace:
> > [   33.740015]  [<ffffffff81ff485f>] dump_stack+0x19/0x1b
> > [   33.740015]  [<ffffffff810b8824>] lockdep_rcu_suspicious+0xf6/0xff
> > [   33.740015]  [<ffffffff810adce3>] rcu_preempt_sleep_check+0x45/0x47
> > [   33.740015]  [<ffffffff810afedf>] __might_sleep+0x17/0x19a
> > [   33.740015]  [<ffffffff8200019e>] mutex_lock_nested+0x2e/0x369
> > [   33.740015]  [<ffffffff810b8657>] ? lock_release+0x154/0x185
> > [   33.740015]  [<ffffffff810b61df>] ? trace_hardirqs_off+0xd/0xf
> > [   33.740015]  [<ffffffff814b4ad3>] _object_find+0x25/0x6c
> > [   33.740015]  [<ffffffff814b5283>] drm_mode_object_find+0x38/0x53
> > [   33.740015]  [<ffffffff81593f6e>] cirrus_connector_best_encoder+0x21/0x2f
> > [   33.740015]  [<ffffffff814a5382>] drm_crtc_helper_set_config+0x38c/0x83c
> > [   33.740015]  [<ffffffff814b6c44>] drm_mode_set_config_internal+0x53/0xca
> > [   33.740015]  [<ffffffff814a731f>] restore_fbdev_mode+0x91/0xad
> > [   33.740015]  [<ffffffff814a74e3>] drm_fb_helper_panic+0x69/0xab
> > [   33.740015]  [<ffffffff810ab92c>] notifier_call_chain+0x61/0x8b
> > [   33.740015]  [<ffffffff810aba4f>] __atomic_notifier_call_chain+0x7e/0xe6
> > [   33.740015]  [<ffffffff810abac6>] atomic_notifier_call_chain+0xf/0x11
> > [   33.740015]  [<ffffffff81ff1367>] panic+0xf0/0x1fb
> > [   33.740015]  [<ffffffff81093b2f>] do_exit+0x3dd/0x80f
> > [   33.740015]  [<ffffffff810b071d>] ? local_clock+0x14/0x1d
> > [   33.740015]  [<ffffffff81094002>] do_group_exit+0x75/0xb4
> > [   33.740015]  [<ffffffff8109c7e7>] get_signal_to_deliver+0x48a/0x4aa
> > [   33.740015]  [<ffffffff8100231a>] do_signal+0x43/0x5ba
> > [   33.740015]  [<ffffffff810b4b79>] ? lock_release_holdtime+0x6c/0x77
> > [   33.740015]  [<ffffffff810b83b5>] ? lock_release_non_nested+0xd0/0x21e
> > [   33.740015]  [<ffffffff810b0646>] ? sched_clock_cpu+0x4e/0x62
> > [   33.740015]  [<ffffffff810fd465>] ? might_fault+0x4f/0x9c
> > [   33.740015]  [<ffffffff810b6163>] ? trace_hardirqs_off_caller+0x36/0xa5
> > [   33.740015]  [<ffffffff82004298>] ? retint_signal+0x11/0x99
> > [   33.740015]  [<ffffffff810028b5>] do_notify_resume+0x24/0x53
> > [   33.740015]  [<ffffffff820042d4>] retint_signal+0x4d/0x99
> > [   33.740015] Rebooting in 10 seconds..
> > Elapsed time: 40
> > qemu-system-x86_64 -cpu kvm64 -enable-kvm -kernel /kernel/x86_64-randconfig-hsxa0-07110255/442aba0c6131f0c41dfc5edb6bfb88335556523f/vmlinuz-3.16.0-rc3-00010-g442aba0 -append 'hung_task_panic=1 earlyprintk=ttyS0,115200 debug apic=debug sysrq_always_enabled rcupdate.rcu_cpu_stall_timeout=100 panic=10 softlockup_panic=1 nmi_watchdog=panic  prompt_ramdisk=0 console=ttyS0,115200 console=tty0 vga=normal  root=/dev/ram0 rw link=/kbuild-tests/run-queue/kvm/x86_64-randconfig-hsxa0-07110255/linux-devel:devel-hourly-2014071018:442aba0c6131f0c41dfc5edb6bfb88335556523f:bisect-linux9/.vmlinuz-442aba0c6131f0c41dfc5edb6bfb88335556523f-20140711073043-10-ivb41 branch=linux-devel/devel-hourly-2014071018 BOOT_IMAGE=/kernel/x86_64-randconfig-hsxa0-07110255/442aba0c6131f0c41dfc5edb6bfb88335556523f/vmlinuz-3.16.0-rc3-00010-g442aba0 drbd.minor_count=8'  -initrd /kernel-tests/initrd/yocto-minimal-x86_64.cgz -m 320 -smp 1 -net nic,vlan=1,model=e1000 -net user,vlan=1 -boot order=nc -no-reboot -watchdog i6300esb -rtc base=localtime -pidfile /dev/shm/kboot/pid-yocto-ivb41-17 -serial file:/dev/shm/kboot/serial-yocto-ivb41-17 -daemonize -display none -monitor null
> >
> > git bisect start c80be3ae11770011071103d3e920864c275472a8 cd3de83f147601356395b57a8673e9c5ff1e59d1 --
> > git bisect  bad 6e36d433610a3ebfdef000f1fb283e3f218a8a32  # 20:54      0-     19  Merge 'omap/omap-for-v3.16/fixes' into devel-hourly-2014071018
> > git bisect  bad 14604ab36faba88a89cb2c9611509f5a1c1cac21  # 20:54      0-    222  Merge 'ulf.hansson-mmc/next' into devel-hourly-2014071018
> > git bisect good 9141a68d71aa193f78aac5306fc728fba8fb59f4  # 21:50    330+     94  Merge 'm68k/for-linus' into devel-hourly-2014071018
> > git bisect  bad 13987d1746951b727146fef187406b7be00a3fd0  # 22:12      0-      7  Merge 'luto/x86/vsyscall' into devel-hourly-2014071018
> > git bisect good 7104a2e08de8bddb52d4714fad63d8a7977ea7f2  # 23:19    330+     22  x86_64: Move getcpu code from vsyscall_64.c to vdso/vma.c
> > git bisect good e1656ab2adfd1891f62610abe3e85ad992ee0cbf  # 23:26    330+    113  arm64,ia64,ppc,s390,sh,tile,um,x86,mm: Remove default gate area
> > git bisect  bad 465c34985bb9823bb4536eb6751197f2d295ca32  # 23:29     54-     91  x86,vdso: Set VM_MAYREAD for the vvar vma
> > git bisect  bad 442aba0c6131f0c41dfc5edb6bfb88335556523f  # 23:31      0-     37  x86_64,vsyscall: Make vsyscall emulation configurable
> > # first bad commit: [442aba0c6131f0c41dfc5edb6bfb88335556523f] x86_64,vsyscall: Make vsyscall emulation configurable
> > git bisect good e1656ab2adfd1891f62610abe3e85ad992ee0cbf  # 12:09    990+    160  arm64,ia64,ppc,s390,sh,tile,um,x86,mm: Remove default gate area
> > git bisect  bad c80be3ae11770011071103d3e920864c275472a8  # 12:10      0-      8  0day head guard for 'devel-hourly-2014071018'
> > git bisect good 85d90faed31ec74fb28a450fbc368d982a785924  # 13:11    990+    518  Merge branch 'drm-fixes' of git://people.freedesktop.org/~airlied/linux
> > git bisect good 47cf0ce945c8310228ff2d4bd756e5313f4659c1  # 13:21    990+    418  Add linux-next specific files for 20140710
> >
> >
> >
> > Thanks,
> > Fengguang
> >
> > _______________________________________________
> > LKP mailing list
> > LKP@...ux.intel.com
> >
> 
> 
> 
> -- 
> Andy Lutomirski
> AMA Capital Management, LLC
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/
Powered by blists - more mailing lists
 
