lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Date:   Fri, 12 Feb 2021 13:54:28 +0800
From:   kernel test robot <oliver.sang@...el.com>
To:     Peter Zijlstra <peterz@...radead.org>
Cc:     Valentin Schneider <valentin.schneider@....com>,
        LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
        lkp@...el.com, aubrey.li@...ux.intel.com, yu.c.chen@...el.com
Subject: [sched]  975707f227: WARNING:at_kernel/sched/core.c:#balance_push


Greeting,

FYI, we noticed the following commit (built with gcc-9):

commit: 975707f227b07a8212060f94447171d15d7a681b ("sched: Prepare to use balance_push in ttwu()")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master


in testcase: rcutorture
version: 
with following parameters:

	runtime: 300s
	test: cpuhotplug
	torture_type: srcu

test-description: rcutorture is rcutorture kernel module load/unload test.
test-url: https://www.kernel.org/doc/Documentation/RCU/torture.txt


on test machine: qemu-system-x86_64 -enable-kvm -cpu SandyBridge -smp 2 -m 8G

caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):


+-------------------------------------------------+------------+------------+
|                                                 | 640f17c824 | 975707f227 |
+-------------------------------------------------+------------+------------+
| WARNING:at_kernel/sched/core.c:#sched_cpu_dying | 6          | 6          |
| RIP:sched_cpu_dying                             | 6          | 6          |
| WARNING:at_kernel/sched/core.c:#balance_push    | 0          | 4          |
| RIP:balance_push                                | 0          | 4          |
+-------------------------------------------------+------------+------------+


If you fix the issue, kindly add following tag
Reported-by: kernel test robot <oliver.sang@...el.com>


[  283.239704] WARNING: CPU: 0 PID: 15 at kernel/sched/core.c:7270 balance_push (kbuild/src/consumer/kernel/sched/core.c:7270) 
[  283.240927] Modules linked in: rcutorture torture
[  283.241410] CPU: 0 PID: 15 Comm: rcub/0 Tainted: G        W         5.11.0-rc3-00008-g975707f227b0 #1
[  283.242335] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.12.0-1 04/01/2014
[  283.243174] RIP: 0010:balance_push (kbuild/src/consumer/kernel/sched/core.c:7270) 
[ 283.243622] Code: ac b9 48 83 05 13 be 00 04 01 c6 05 a0 59 d4 03 01 e8 2f 79 12 00 48 c7 c7 0d dd 10 b9 48 83 05 d0 bd 00 04 01 e8 2e b0 c4 01 <0f> 0b b9 01 00 00 00 31 d2 be 01 00 00 00 48 c7 c7 60 1f ac b9 e8
All code
========
   0:	ac                   	lods   %ds:(%rsi),%al
   1:	b9 48 83 05 13       	mov    $0x13058348,%ecx
   6:	be 00 04 01 c6       	mov    $0xc6010400,%esi
   b:	05 a0 59 d4 03       	add    $0x3d459a0,%eax
  10:	01 e8                	add    %ebp,%eax
  12:	2f                   	(bad)  
  13:	79 12                	jns    0x27
  15:	00 48 c7             	add    %cl,-0x39(%rax)
  18:	c7                   	(bad)  
  19:	0d dd 10 b9 48       	or     $0x48b910dd,%eax
  1e:	83 05 d0 bd 00 04 01 	addl   $0x1,0x400bdd0(%rip)        # 0x400bdf5
  25:	e8 2e b0 c4 01       	callq  0x1c4b058
  2a:*	0f 0b                	ud2    		<-- trapping instruction
  2c:	b9 01 00 00 00       	mov    $0x1,%ecx
  31:	31 d2                	xor    %edx,%edx
  33:	be 01 00 00 00       	mov    $0x1,%esi
  38:	48 c7 c7 60 1f ac b9 	mov    $0xffffffffb9ac1f60,%rdi
  3f:	e8                   	.byte 0xe8

Code starting with the faulting instruction
===========================================
   0:	0f 0b                	ud2    
   2:	b9 01 00 00 00       	mov    $0x1,%ecx
   7:	31 d2                	xor    %edx,%edx
   9:	be 01 00 00 00       	mov    $0x1,%esi
   e:	48 c7 c7 60 1f ac b9 	mov    $0xffffffffb9ac1f60,%rdi
  15:	e8                   	.byte 0xe8
[  283.245436] RSP: 0018:ffff95e54046fcf0 EFLAGS: 00010086
[  283.245977] RAX: 000000000000001d RBX: 0000000000000001 RCX: 0000000000000000
[  283.246678] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffffffb9ad08a0
[  283.247378] RBP: ffff95e54046fd10 R08: 0000000000000000 R09: ffff95e54046fb38
[  283.248104] R10: 0000000000000001 R11: 0000000000000001 R12: ffff95e67fd2a240
[  283.248806] R13: ffff95e544164000 R14: 0000000000000001 R15: ffff95e544164000
[  283.249514] FS:  0000000000000000(0000) GS:ffff95e67fc00000(0000) knlGS:0000000000000000
[  283.250307] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  283.250876] CR2: 00007fdfc891f000 CR3: 0000000193464000 CR4: 00000000000406b0
[  283.251581] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  283.252300] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  283.253002] Call Trace:
[  283.253263] do_balance_callbacks (kbuild/src/consumer/kernel/sched/core.c:3978) 
[  283.253697] rt_mutex_setprio (kbuild/src/consumer/kernel/sched/core.c:5486) 
[  283.254103] task_blocks_on_rt_mutex (kbuild/src/consumer/kernel/locking/rtmutex.c:974 (discriminator 1)) 
[  283.254570] rt_mutex_slowlock (kbuild/src/consumer/kernel/locking/rtmutex.c:1262) 
[  283.254995] ? rcu_boost_kthread (kbuild/src/consumer/kernel/rcu/tree_plugin.h:1052 (discriminator 33)) 
[  283.255425] rcu_boost_kthread (kbuild/src/consumer/kernel/rcu/tree_plugin.h:1033 kbuild/src/consumer/kernel/rcu/tree_plugin.h:1056) 
[  283.255840] ? trace_rcu_utilization (kbuild/src/consumer/kernel/rcu/tree_plugin.h:1043) 
[  283.256301] kthread (kbuild/src/consumer/kernel/kthread.c:292 (discriminator 1)) 
[  283.256637] ? kthread_bind (kbuild/src/consumer/kernel/kthread.c:245) 
[  283.257010] ret_from_fork (kbuild/src/consumer/arch/x86/entry/entry_64.S:302) 
[  283.257379] ---[ end trace c38942c9fa7558e1 ]---
[  283.277895] numa_remove_cpu cpu 1 node 0: mask now 0
[  283.280276] smpboot: CPU 1 is now offline
[  283.293951] x86: Booting SMP configuration:
[  283.294391] smpboot: Booting Node 0 Processor 1 APIC 0x1
[  283.330746] kvm-clock: cpu 1, msr 196b0e041, secondary cpu clock
[  283.330843] masked ExtINT on CPU#1
[  283.331965] numa_add_cpu cpu 1 node 0: mask now 0-1
[  283.336608] kvm-guest: stealtime: cpu 1, msr 23fd16b80
[  283.375484] numa_remove_cpu cpu 1 node 0: mask now 0
[  283.378071] smpboot: CPU 1 is now offline
[  283.393976] x86: Booting SMP configuration:
[  283.394428] smpboot: Booting Node 0 Processor 1 APIC 0x1
[  283.412639] kvm-clock: cpu 1, msr 196b0e041, secondary cpu clock
[  283.412745] masked ExtINT on CPU#1
[  283.413871] numa_add_cpu cpu 1 node 0: mask now 0-1
[  283.418373] kvm-guest: stealtime: cpu 1, msr 23fd16b80
[  283.491884] numa_remove_cpu cpu 1 node 0: mask now 0
[  283.495272] smpboot: CPU 1 is now offline
[  283.525983] x86: Booting SMP configuration:
[  283.526425] smpboot: Booting Node 0 Processor 1 APIC 0x1
[  283.544712] kvm-clock: cpu 1, msr 196b0e041, secondary cpu clock
[  283.544813] masked ExtINT on CPU#1
[  283.545940] numa_add_cpu cpu 1 node 0: mask now 0-1
[  283.550440] kvm-guest: stealtime: cpu 1, msr 23fd16b80
[  283.666301] numa_remove_cpu cpu 1 node 0: mask now 0
[  283.669641] smpboot: CPU 1 is now offline
[  283.701933] x86: Booting SMP configuration:
[  283.702380] smpboot: Booting Node 0 Processor 1 APIC 0x1
[  283.719844] kvm-clock: cpu 1, msr 196b0e041, secondary cpu clock
[  283.719943] masked ExtINT on CPU#1
[  283.721076] numa_add_cpu cpu 1 node 0: mask now 0-1
[  283.726187] kvm-guest: stealtime: cpu 1, msr 23fd16b80
[  283.817853] numa_remove_cpu cpu 1 node 0: mask now 0
[  283.821126] smpboot: CPU 1 is now offline
[  283.854019] x86: Booting SMP configuration:
[  283.854468] smpboot: Booting Node 0 Processor 1 APIC 0x1
[  283.871618] kvm-clock: cpu 1, msr 196b0e041, secondary cpu clock
[  283.871719] masked ExtINT on CPU#1
[  283.872839] numa_add_cpu cpu 1 node 0: mask now 0-1
[  283.878038] kvm-guest: stealtime: cpu 1, msr 23fd16b80
[  283.942215] numa_remove_cpu cpu 1 node 0: mask now 0
[  283.945485] smpboot: CPU 1 is now offline
[  283.961956] x86: Booting SMP configuration:
[  283.962404] smpboot: Booting Node 0 Processor 1 APIC 0x1
[  283.984487] kvm-clock: cpu 1, msr 196b0e041, secondary cpu clock
[  283.984600] masked ExtINT on CPU#1
[  283.986305] numa_add_cpu cpu 1 node 0: mask now 0-1
[  283.990226] kvm-guest: stealtime: cpu 1, msr 23fd16b80
[  284.051485] numa_remove_cpu cpu 1 node 0: mask now 0
[  284.055360] smpboot: CPU 1 is now offline
[  284.117950] x86: Booting SMP configuration:
[  284.118401] smpboot: Booting Node 0 Processor 1 APIC 0x1
[  284.139186] kvm-clock: cpu 1, msr 196b0e041, secondary cpu clock
[  284.139326] masked ExtINT on CPU#1
[  284.142223] numa_add_cpu cpu 1 node 0: mask now 0-1
[  284.146683] kvm-guest: stealtime: cpu 1, msr 23fd16b80
[  284.195744] numa_remove_cpu cpu 1 node 0: mask now 0
[  284.199466] smpboot: CPU 1 is now offline
[  284.261983] x86: Booting SMP configuration:
[  284.262432] smpboot: Booting Node 0 Processor 1 APIC 0x1
[  284.318185] kvm-clock: cpu 1, msr 196b0e041, secondary cpu clock
[  284.318314] masked ExtINT on CPU#1
[  284.320303] numa_add_cpu cpu 1 node 0: mask now 0-1
[  284.326091] kvm-guest: stealtime: cpu 1, msr 23fd16b80
[  284.373794] numa_remove_cpu cpu 1 node 0: mask now 0
[  284.377273] smpboot: CPU 1 is now offline
[  284.397079] x86: Booting SMP configuration:
[  284.397597] smpboot: Booting Node 0 Processor 1 APIC 0x1
[  284.426089] kvm-clock: cpu 1, msr 196b0e041, secondary cpu clock
[  284.426213] masked ExtINT on CPU#1
[  284.428166] numa_add_cpu cpu 1 node 0: mask now 0-1
[  284.434347] kvm-guest: stealtime: cpu 1, msr 23fd16b80
[  284.542180] numa_remove_cpu cpu 1 node 0: mask now 0
[  284.545736] smpboot: CPU 1 is now offline
[  284.561643] x86: Booting SMP configuration:
[  284.562086] smpboot: Booting Node 0 Processor 1 APIC 0x1
[  284.590505] kvm-clock: cpu 1, msr 196b0e041, secondary cpu clock
[  284.590616] masked ExtINT on CPU#1
[  284.592263] numa_add_cpu cpu 1 node 0: mask now 0-1
[  284.600268] kvm-guest: stealtime: cpu 1, msr 23fd16b80
[  284.671227] numa_remove_cpu cpu 1 node 0: mask now 0
[  284.675079] smpboot: CPU 1 is now offline
[  284.769629] x86: Booting SMP configuration:
[  284.770076] smpboot: Booting Node 0 Processor 1 APIC 0x1
[  284.798746] kvm-clock: cpu 1, msr 196b0e041, secondary cpu clock
[  284.798867] masked ExtINT on CPU#1
[  284.800831] numa_add_cpu cpu 1 node 0: mask now 0-1
[  284.811787] kvm-guest: stealtime: cpu 1, msr 23fd16b80
[  284.871512] numa_remove_cpu cpu 1 node 0: mask now 0
[  284.876527] smpboot: CPU 1 is now offline
[  284.891570] x86: Booting SMP configuration:
[  284.892098] smpboot: Booting Node 0 Processor 1 APIC 0x1
[  284.926141] kvm-clock: cpu 1, msr 196b0e041, secondary cpu clock
[  284.926286] masked ExtINT on CPU#1


To reproduce:

        # build kernel
	cd linux
	cp config-5.11.0-rc3-00008-g975707f227b0 .config
	make HOSTCC=gcc-9 CC=gcc-9 ARCH=x86_64 olddefconfig prepare modules_prepare bzImage

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        bin/lkp qemu -k <bzImage> job-script # job-script is attached in this email



Thanks,
Oliver Sang


View attachment "config-5.11.0-rc3-00008-g975707f227b0" of type "text/plain" (145267 bytes)

View attachment "job-script" of type "text/plain" (4282 bytes)

Download attachment "dmesg.xz" of type "application/x-xz" (62748 bytes)

View attachment "rcutorture" of type "text/plain" (7176 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ