lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Message-ID: <Ys4sa3aGu3GAAR30@xsang-OptiPlex-9020>
Date:   Wed, 13 Jul 2022 10:22:35 +0800
From:   kernel test robot <oliver.sang@...el.com>
To:     Longpeng <longpeng2@...wei.com>
Cc:     Thomas Gleixner <tglx@...utronix.de>,
        Sebastian Andrzej Siewior <bigeasy@...utronix.de>,
        Dongli Zhang <dongli.zhang@...cle.com>,
        Valentin Schneider <valentin.schneider@....com>,
        Henry Wang <Henry.Wang@....com>,
        LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
        lkp@...el.com
Subject: [cpu/hotplug]  c7dfb2591b:
 WARNING:at_kernel/rcu/rcutorture.c:#rcu_torture_fwd_prog_cr[rcutorture]


(please be noted from message in parent dmesg attached, we even doubt this is
a fix, instead of a regression. so just FYI and hope to get some guidance from
you. Thanks a lot!)


Greeting,

FYI, we noticed the following commit (built with gcc-11):

commit: c7dfb2591b23856182e2c9da6b1d526ef7fd6b2e ("cpu/hotplug: Allow the CPU in CPU_UP_PREPARE state to be brought up again.")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

in testcase: rcutorture
version: 
with following parameters:

	runtime: 300s
	test: cpuhotplug
	torture_type: tasks-tracing

test-description: rcutorture is rcutorture kernel module load/unload test.
test-url: https://www.kernel.org/doc/Documentation/RCU/torture.txt


on test machine: qemu-system-x86_64 -enable-kvm -cpu SandyBridge -smp 2 -m 16G

caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):


e8a69f12f01f487c c7dfb2591b23856182e2c9da6b1
---------------- ---------------------------
       fail:runs  %reproduction    fail:runs
           |             |             |
           :20         100%          20:20    dmesg.RIP:rcu_torture_fwd_prog_cr[rcutorture]
           :20         100%          20:20    dmesg.WARNING:at_kernel/rcu/rcutorture.c:#rcu_torture_fwd_prog_cr[rcutorture]



If you fix the issue, kindly add following tag
Reported-by: kernel test robot <oliver.sang@...el.com>


however, in parent dmesg (attached), we noticed below:
(parent kernel is built with same config as attached)

  739 [   63.916107][  T456] smpboot: CPU 0 is now offline
  740 [   64.001999][  T456] smpboot: Booting Node 0 Processor 0 APIC 0x0
  741 [   70.096845][  T466] tasks-tracing-torture: rcu_torture_read_exit: Start of episode
  742 [   70.280275][  T466] tasks-tracing-torture: rcu_torture_read_exit: End of episode
  743 [   74.001241][  T456] smpboot: do_boot_cpu failed(-1) to wakeup CPU#0
  744 [   74.010849][  T456] tasks-tracing-torture:torture_onoff task: online 0 failed: errno -5
  745 [   74.028550][  T456] tasks-tracing-torture:torture_onoff task: online 0 failed: errno -5
  746 [   74.055381][  T456] tasks-tracing-torture:torture_onoff task: online 0 failed: errno -5
...
15842 [  345.202204][  T293]
15843 [  135.197493] tasks-tracing-torture:torture_onoff task: online 0 failed: errno -5
15844 [  345.202210][  T293]
15845 [  345.206162][  T293]
15846 [  135.266699] tasks-tracing-torture:torture_onoff task: online 0 failed: errno -5
15847 [  345.206168][  T293]


not sure if this is an issue which fixed by this commit?

and seems below in dmesg from this commit should not be caused by this commit?

[   99.712428][  T460] ------------[ cut here ]------------
[   99.712942][  T460] WARNING: CPU: 1 PID: 460 at kernel/rcu/rcutorture.c:2427 rcu_torture_fwd_prog_cr+0x27d/0x30a [rcutorture]
[   99.713912][  T460] Modules linked in: rcutorture torture mousedev ghash_clmulni_intel bochs drm_vram_helper drm_ttm_helper ttm input_leds psmouse drm_kms_helper parport_pc i2c_piix4 parport evbug fuse drm drm_panel_orientation_quirks
[   99.715729][  T460] CPU: 1 PID: 460 Comm: rcu_torture_fwd Not tainted 5.18.0-rc2-00002-gc7dfb2591b23 #1
[   99.716539][  T460] RIP: 0010:rcu_torture_fwd_prog_cr+0x27d/0x30a [rcutorture]
[   99.717158][  T460] Code: 7c cb ff ff e8 77 6f f9 ff 84 c0 0f 85 86 00 00 00 8a 05 23 55 00 00 84 c0 75 7c e8 ad e9 ff ff 84 c0 75 73 49 83 fd 63 7f 02 <0f> 0b ff 75 d0 48 8b 55 c8 4d 8d 04 1e 4d 89 f1 49 2b 54 24 60 ff
[   99.718819][  T460] RSP: 0018:ffff888130107e78 EFLAGS: 00010293
[   99.719332][  T460] RAX: 0000000000000000 RBX: 000000000000c350 RCX: 0000000000001bb3
[   99.719999][  T460] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffffff81e1b44d
[   99.720666][  T460] RBP: ffff888130107ec0 R08: 0000000000000001 R09: 0000000000000000
[   99.721346][  T460] R10: 0000000000000000 R11: ffff888132822018 R12: ffff888132822000
[   99.722017][  T460] R13: 0000000000000000 R14: 0000000000000ed1 R15: 0000000000000000
[   99.722689][  T460] FS:  0000000000000000(0000) GS:ffff88843fc00000(0000) knlGS:0000000000000000
[   99.723442][  T460] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   99.723998][  T460] CR2: 00007f6c71d68000 CR3: 0000000100376000 CR4: 00000000000406a0
[   99.724698][  T460] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[   99.725374][  T460] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[   99.726046][  T460] Call Trace:
[   99.726325][  T460]  <TASK>
[   99.726576][  T460]  rcu_torture_fwd_prog.cold+0x42/0x10b [rcutorture]
[   99.727143][  T460]  ? rcu_torture_fwd_prog_cbfree+0xc0/0xc0 [rcutorture]
[   99.727741][  T460]  kthread+0xe3/0x100
[   99.728092][  T460]  ? kthread_complete_and_exit+0x40/0x40
[   99.728571][  T460]  ret_from_fork+0x1f/0x30
[   99.728943][  T460]  </TASK>
[   99.729198][  T460] irq event stamp: 409161
[   99.729562][  T460] hardirqs last  enabled at (409169): [<ffffffff8118c52f>] __up_console_sem+0x6f/0x80
[   99.730367][  T460] hardirqs last disabled at (409176): [<ffffffff8118c514>] __up_console_sem+0x54/0x80
[   99.731188][  T460] softirqs last  enabled at (409190): [<ffffffff82200308>] __do_softirq+0x308/0x4b6
[   99.731980][  T460] softirqs last disabled at (409185): [<ffffffff810f7408>] __irq_exit_rcu+0x88/0xc0
[   99.732774][  T460] ---[ end trace 0000000000000000 ]---



To reproduce:

        # build kernel
	cd linux
	cp config-5.18.0-rc2-00002-gc7dfb2591b23 .config
	make HOSTCC=gcc-11 CC=gcc-11 ARCH=x86_64 olddefconfig prepare modules_prepare bzImage modules
	make HOSTCC=gcc-11 CC=gcc-11 ARCH=x86_64 INSTALL_MOD_PATH=<mod-install-dir> modules_install
	cd <mod-install-dir>
	find lib/ | cpio -o -H newc --quiet | gzip > modules.cgz


        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        bin/lkp qemu -k <bzImage> -m modules.cgz job-script # job-script is attached in this email

        # if come across any failure that blocks the test,
        # please remove ~/.lkp and /lkp dir to run from a clean state.



-- 
0-DAY CI Kernel Test Service
https://01.org/lkp



View attachment "config-5.18.0-rc2-00002-gc7dfb2591b23" of type "text/plain" (147614 bytes)

View attachment "job-script" of type "text/plain" (4962 bytes)

Download attachment "dmesg.xz" of type "application/x-xz" (19764 bytes)

View attachment "rcutorture" of type "text/plain" (13344 bytes)

Download attachment "dmesg-parent.xz" of type "application/x-xz" (68760 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ