lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <20180418133831.uef7d77ejdyjtxgh@wfg-t540p.sh.intel.com>
Date:   Wed, 18 Apr 2018 21:38:31 +0800
From:   Fengguang Wu <fengguang.wu@...el.com>
To:     Oleg Drokin <oleg.drokin@...el.com>,
        Andreas Dilger <andreas.dilger@...el.com>,
        James Simmons <jsimmons@...radead.org>
Cc:     Greg Kroah-Hartman <gregkh@...uxfoundation.org>,
        Linus Torvalds <torvalds@...ux-foundation.org>,
        Denis Petrovic <denis.petrovic@....ece.fr>,
        lustre-devel@...ts.lustre.org, devel@...verdev.osuosl.org,
        linux-kernel@...r.kernel.org, lkp@...org
Subject: [cfs_trace_lock_tcd] BUG: unable to handle kernel NULL pointer
 dereference at 00000050

Hello,

FYI this happens in mainline kernel 4.17.0-rc1.
It looks like a new regression.

It occurs in 5 out of 5 boots.

[    6.524361] ledtrig-cpu: registered to indicate activity on CPUs
[    6.527658] NET: Registered protocol family 4
[    6.528191] comedi: version 0.7.76 - http://www.comedi.org
[    6.528851] LNetError: 1:0:(module.c:546:libcfs_init()) misc_register: error -16
[    7.220272] input: ImExPS/2 Generic Explorer Mouse as /devices/platform/i8042/serio1/input/input3
[    7.586283] BUG: unable to handle kernel NULL pointer dereference at 00000050
[    7.586962] *pdpt = 0000000000000000 *pde = f000ff53f000ff53
[    7.587002] Oops: 0000 [#1] PREEMPT
[    7.587002] CPU: 0 PID: 1 Comm: swapper Not tainted 4.17.0-rc1 #1
[    7.587002] EIP: cfs_trace_lock_tcd+0xb/0xa0:
						cfs_trace_lock_tcd at drivers/staging/lustre/lnet/libcfs/linux/linux-tracefile.c:149
[    7.587002] EFLAGS: 00210246 CPU: 0
[    7.587002] EAX: 00000000 EBX: 00000000 ECX: 81fcb588 EDX: 00000000
[    7.587002] ESI: 00001800 EDI: 8f5d1e08 EBP: 8f5d1d7c ESP: 8f5d1d70
[    7.587002]  DS: 007b ES: 007b FS: 0000 GS: 00e0 SS: 0068
[    7.587002] CR0: 80050033 CR2: 00000050 CR3: 022f0000 CR4: 000006b0
[    7.587002] Call Trace:
[    7.587002]  libcfs_debug_vmsg2+0x8f/0x82f:
						libcfs_debug_vmsg2 at drivers/staging/lustre/lnet/libcfs/tracefile.c:317
[    7.587002]  ? trace_irq_enable_rcuidle+0x25/0x62:
						static_key_false at include/linux/jump_label.h:206
						 (inlined by) trace_irq_enable_rcuidle at include/trace/events/preemptirq.h:40
[    7.587002]  ? slob_free+0x249/0x251:
						slob_free at mm/slob.c:421
[    7.587002]  libcfs_debug_msg+0x19/0x1b:
						libcfs_debug_msg at drivers/staging/lustre/lnet/libcfs/tracefile.c:287
[    7.587002]  ksocknal_startup+0xe77/0x12b2:
						ksocknal_startup at drivers/staging/lustre/lnet/klnds/socklnd/socklnd.c:2845 (discriminator 3)
[    7.587002]  ? lock_release+0x135/0x1ec:
						lock_release at kernel/locking/lockdep.c:3942
[    7.587002]  ? _raw_spin_unlock+0x3c/0x4b:
						__raw_spin_unlock at include/linux/spinlock_api_smp.h:152
						 (inlined by) _raw_spin_unlock at kernel/locking/spinlock.c:176
[    7.587002]  lnet_startup_lndni+0x4cd/0x9ec:
						lnet_startup_lndni at drivers/staging/lustre/lnet/lnet/api-ni.c:1304
[    7.587002]  LNetNIInit+0x880/0xa00:
						lnet_startup_lndnis at drivers/staging/lustre/lnet/lnet/api-ni.c:1385
						 (inlined by) LNetNIInit at drivers/staging/lustre/lnet/lnet/api-ni.c:1543
[    7.587002]  ? read_seqcount_retry+0x1b/0x22:
						read_seqcount_retry at include/linux/seqlock.h:222
						
						read_seqcount_retry+0x1b/0x22:
						read_seqcount_retry at include/linux/seqlock.h:222
						
						read_seqcount_retry+0x1b/0x22:
						read_seqcount_retry at include/linux/seqlock.h:222
[    7.587002]  srpc_startup+0x84/0x381:
						srpc_startup at drivers/staging/lustre/lnet/selftest/rpc.c:1613
[    7.587002]  lnet_selftest_init+0x2c4/0x5d9:
						lnet_selftest_init at drivers/staging/lustre/lnet/selftest/module.c:134
[    7.587002]  ? lnet_selftest_exit+0x8d/0x8d:
						lnet_selftest_init at drivers/staging/lustre/lnet/selftest/module.c:90
[    7.587002]  do_one_initcall+0x76/0x1d7:
						__read_once_size at include/linux/compiler.h:188
						 (inlined by) arch_atomic_read at arch/x86/include/asm/atomic.h:31
						 (inlined by) atomic_read at include/asm-generic/atomic-instrumented.h:22
						 (inlined by) static_key_count at include/linux/jump_label.h:194
						 (inlined by) static_key_false at include/linux/jump_label.h:206
						 (inlined by) trace_initcall_finish at include/trace/events/initcall.h:44
						 (inlined by) do_one_initcall at init/main.c:884
[    7.587002]  ? do_early_param+0x78/0x78:
						repair_env_string at init/main.c:250
[    7.587002]  ? rcu_read_unlock_sched_notrace+0x1e/0x2d:
						rcu_read_unlock_sched_notrace at include/linux/rcupdate.h:778
						
						rcu_read_unlock_sched_notrace+0x1e/0x2d:
						rcu_read_unlock_sched_notrace at include/linux/rcupdate.h:778
						
						rcu_read_unlock_sched_notrace+0x1e/0x2d:
						rcu_read_unlock_sched_notrace at include/linux/rcupdate.h:778
						
						rcu_read_unlock_sched_notrace+0x1e/0x2d:
						rcu_read_unlock_sched_notrace at include/linux/rcupdate.h:778
						
						rcu_read_unlock_sched_notrace+0x1e/0x2d:
						rcu_read_unlock_sched_notrace at include/linux/rcupdate.h:778
						
						rcu_read_unlock_sched_notrace+0x1e/0x2d:
						rcu_read_unlock_sched_notrace at include/linux/rcupdate.h:778
						
						rcu_read_unlock_sched_notrace+0x1e/0x2d:
						rcu_read_unlock_sched_notrace at include/linux/rcupdate.h:778
[    7.587002]  ? trace_initcall_level+0x3e/0x64:
						trace_initcall_level at include/trace/events/initcall.h:10 (discriminator 18)
[    7.587002]  kernel_init_freeable+0xfc/0x177:
						do_initcall_level at init/main.c:951
						 (inlined by) do_initcalls at init/main.c:959
						 (inlined by) do_basic_setup at init/main.c:977
						 (inlined by) kernel_init_freeable at init/main.c:1127
[    7.587002]  ? rest_init+0xbe/0xbe:
						kernel_init at init/main.c:1050
[    7.587002]  kernel_init+0xd/0xd0:
						kernel_init at init/main.c:1055
[    7.587002]  ret_from_fork+0x2e/0x38:
						ret_from_fork at arch/x86/entry/entry_32.S:311
[    7.587002] Code: 16 94 ff 8b 15 68 ab 03 82 f7 c2 00 00 0f 00 b8 02 00 00 00 75 08 31 c0 80 e6 ff 0f 95 c0 5d c3 55 89 e5 57 56 53 e8 99 16 94 ff <8b> 78 50 89 d6 6a 00 31 d2 89 c3 b8 50 f0 1b 82 66 83 ff 02 0f
[    7.587002] EIP: cfs_trace_lock_tcd+0xb/0xa0:
						cfs_trace_lock_tcd at drivers/staging/lustre/lnet/libcfs/linux/linux-tracefile.c:149 SS:ESP: 0068:8f5d1d70
[    7.587002] CR2: 0000000000000050
[    7.587002] ---[ end trace 4fbdcb515c61525e ]---
[    7.587002] Kernel panic - not syncing: Fatal exception

Attached the full dmesg, kconfig and reproduce scripts.

Thanks,
Fengguang

View attachment "dmesg-vm-lkp-nhm-dp1-yocto-i386-7:20180416204457:i386-randconfig-s1-201814:4.17.0-rc1:1" of type "text/plain" (37819 bytes)

View attachment ".config" of type "text/plain" (97901 bytes)

View attachment "job-script" of type "text/plain" (4022 bytes)

View attachment "reproduce-vm-lkp-nhm-dp1-yocto-i386-7:20180416204457:i386-randconfig-s1-201814:4.17.0-rc1:1" of type "text/plain" (1897 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ