[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <202407041644.de55c25-oliver.sang@intel.com>
Date: Thu, 4 Jul 2024 16:30:34 +0800
From: kernel test robot <oliver.sang@...el.com>
To: Xavier <xavier_qy@....com>
CC: <oe-lkp@...ts.linux.dev>, <lkp@...el.com>, <linux-kernel@...r.kernel.org>,
<mingo@...hat.com>, <peterz@...radead.org>, <juri.lelli@...hat.com>,
<vincent.guittot@...aro.org>, <dietmar.eggemann@....com>,
<rostedt@...dmis.org>, <bsegall@...gle.com>, <mgorman@...e.de>,
<bristot@...hat.com>, <vschneid@...hat.com>, Xavier <xavier_qy@....com>,
<oliver.sang@...el.com>
Subject: Re: [PATCH-RT sched v2 1/2] RT SCHED: Optimize the enqueue and
dequeue operations for rt_se
Hello,
kernel test robot noticed "WARNING:at_kernel/sched/rt.c:#__enqueue_rt_entity" on:
commit: ed0ed14c2b47993c00c4b3cdceabef535bcef32b ("[PATCH-RT sched v2 1/2] RT SCHED: Optimize the enqueue and dequeue operations for rt_se")
url: https://github.com/intel-lab-lkp/linux/commits/Xavier/RT-SCHED-Optimize-the-enqueue-and-dequeue-operations-for-rt_se/20240630-173825
base: https://git.kernel.org/cgit/linux/kernel/git/tip/tip.git c793a62823d1ce8f70d9cfc7803e3ea436277cda
patch link: https://lore.kernel.org/all/20240629112812.243691-2-xavier_qy@163.com/
patch subject: [PATCH-RT sched v2 1/2] RT SCHED: Optimize the enqueue and dequeue operations for rt_se
in testcase: blktests
version: blktests-x86_64-775a058-1_20240702
with following parameters:
disk: 1SSD
test: block-group-01
compiler: gcc-13
test machine: 4 threads Intel(R) Core(TM) i5-6500 CPU @ 3.20GHz (Skylake) with 32G memory
(please refer to attached dmesg/kmsg for entire log/backtrace)
If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <oliver.sang@...el.com>
| Closes: https://lore.kernel.org/oe-lkp/202407041644.de55c25-oliver.sang@intel.com
[ 54.093440][ C2] ------------[ cut here ]------------
[ 54.094193][ T705] list_add double add: new=ffff888802a8abc0, prev=ffff888802a8abc0, next=ffff8887892c4dd0.
[ 54.098261][ C2] WARNING: CPU: 2 PID: 53 at kernel/sched/rt.c:1415 __enqueue_rt_entity (kernel/sched/rt.c:1415 (discriminator 1))
[ 54.103613][ T705] ------------[ cut here ]------------
[ 54.113477][ C2] Modules linked in: dm_multipath
[ 54.122743][ T705] kernel BUG at lib/list_debug.c:35!
[ 54.128080][ C2] btrfs blake2b_generic
[ 54.132987][ T705] Oops: invalid opcode: 0000 [#1] PREEMPT SMP KASAN PTI
[ 54.138148][ C2] xor zstd_compress
[ 54.142266][ T705] CPU: 3 PID: 705 Comm: multipathd Tainted: G S 6.10.0-rc1-00010-ged0ed14c2b47 #1
[ 54.149087][ C2] raid6_pq libcrc32c
[ 54.152852][ T705] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.8.1 12/05/2017
[ 54.163339][ C2] ipmi_devintf ipmi_msghandler
[ 54.167192][ T705] RIP: 0010:__list_add_valid_or_report (lib/list_debug.c:35 (discriminator 1))
[ 54.175322][ C2] intel_rapl_msr intel_rapl_common
[ 54.180049][ T705] Code: 0b 48 89 f1 48 c7 c7 00 fa 26 84 48 89 de e8 d6 75 f2 fe 0f 0b 48 89 f2 48 89 d9 48 89 ee 48 c7 c7 80 fa 26 84 e8 bf 75 f2 fe <0f> 0b 48 89 f7 48 89 34 24 e8 11 cc 61 ff 48 8b 34 24 e9 71 ff ff
All code
========
0: 0b 48 89 or -0x77(%rax),%ecx
3: f1 icebp
4: 48 c7 c7 00 fa 26 84 mov $0xffffffff8426fa00,%rdi
b: 48 89 de mov %rbx,%rsi
e: e8 d6 75 f2 fe callq 0xfffffffffef275e9
13: 0f 0b ud2
15: 48 89 f2 mov %rsi,%rdx
18: 48 89 d9 mov %rbx,%rcx
1b: 48 89 ee mov %rbp,%rsi
1e: 48 c7 c7 80 fa 26 84 mov $0xffffffff8426fa80,%rdi
25: e8 bf 75 f2 fe callq 0xfffffffffef275e9
2a:* 0f 0b ud2 <-- trapping instruction
2c: 48 89 f7 mov %rsi,%rdi
2f: 48 89 34 24 mov %rsi,(%rsp)
33: e8 11 cc 61 ff callq 0xffffffffff61cc49
38: 48 8b 34 24 mov (%rsp),%rsi
3c: e9 .byte 0xe9
3d: 71 ff jno 0x3e
3f: ff .byte 0xff
Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: 48 89 f7 mov %rsi,%rdi
5: 48 89 34 24 mov %rsi,(%rsp)
9: e8 11 cc 61 ff callq 0xffffffffff61cc1f
e: 48 8b 34 24 mov (%rsp),%rsi
12: e9 .byte 0xe9
13: 71 ff jno 0x14
15: ff .byte 0xff
[ 54.186345][ C2] sd_mod t10_pi
[ 54.191424][ T705] RSP: 0018:ffffc90000327b38 EFLAGS: 00010046
[ 54.211022][ C2] x86_pkg_temp_thermal
[ 54.214447][ T705]
[ 54.220405][ C2] crc64_rocksoft_generic crc64_rocksoft
[ 54.224435][ T705] RAX: 0000000000000058 RBX: ffff8887892c4dd0 RCX: ffffffff82424f4e
[ 54.226632][ C2] intel_powerclamp crc64
[ 54.232145][ T705] RDX: 0000000000000000 RSI: 0000000000000008 RDI: ffff8887893b5380
[ 54.240012][ C2] coretemp sg
[ 54.244217][ T705] RBP: ffff888802a8abc0 R08: 0000000000000001 R09: fffff52000064f22
[ 54.252087][ C2] kvm_intel i915
[ 54.255330][ T705] R10: ffffc90000327917 R11: 205d324320202020 R12: ffff888802a8abc0
[ 54.263200][ C2] kvm crct10dif_pclmul
[ 54.266705][ T705] R13: ffff8887892c4dd0 R14: ffff888802a8ac00 R15: ffff8887892c4dd8
[ 54.274572][ C2] crc32_pclmul crc32c_intel
[ 54.278599][ T705] FS: 00007f1b015ee680(0000) GS:ffff888789380000(0000) knlGS:0000000000000000
[ 54.286469][ C2] drm_buddy ghash_clmulni_intel
[ 54.290934][ T705] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 54.299764][ C2] intel_gtt sha512_ssse3
[ 54.304580][ T705] CR2: 000055e6a99e25f8 CR3: 000000080473e006 CR4: 00000000003706f0
[ 54.311054][ C2] drm_display_helper
[ 54.315255][ T705] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 54.323124][ C2] rapl ttm
[ 54.326976][ T705] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 54.334845][ C2] drm_kms_helper
[ 54.337825][ T705] Call Trace:
[ 54.345696][ C2] ahci mei_wdt
[ 54.349201][ T705] <TASK>
[ 54.352357][ C2] intel_cstate wmi_bmof
[ 54.355687][ T705] ? die (arch/x86/kernel/dumpstack.c:421 arch/x86/kernel/dumpstack.c:434 arch/x86/kernel/dumpstack.c:447)
[ 54.358493][ C2] intel_uncore
[ 54.362610][ T705] ? do_trap (arch/x86/kernel/traps.c:114 arch/x86/kernel/traps.c:155)
[ 54.366202][ C2] binfmt_misc video
[ 54.369533][ T705] ? __list_add_valid_or_report (lib/list_debug.c:35 (discriminator 1))
[ 54.373650][ C2] libahci mei_me
[ 54.377418][ T705] ? do_error_trap (arch/x86/include/asm/traps.h:58 arch/x86/kernel/traps.c:176)
[ 54.383104][ C2] i2c_i801 wmi
[ 54.386607][ T705] ? __list_add_valid_or_report (lib/list_debug.c:35 (discriminator 1))
[ 54.391070][ C2] intel_pch_thermal i2c_smbus
[ 54.394400][ T705] ? handle_invalid_op (arch/x86/kernel/traps.c:214)
[ 54.400087][ C2] mei libata
[ 54.404727][ T705] ? __list_add_valid_or_report (lib/list_debug.c:35 (discriminator 1))
[ 54.409540][ C2] acpi_pad fuse
[ 54.412697][ T705] ? exc_invalid_op (arch/x86/kernel/traps.c:267)
[ 54.418385][ C2] loop drm
[ 54.421803][ T705] ? asm_exc_invalid_op (arch/x86/include/asm/idtentry.h:621)
[ 54.426355][ C2] dm_mod ip_tables
[ 54.429337][ T705] ? llist_add_batch (lib/llist.c:33 (discriminator 14))
[ 54.434240][ C2]
[ 54.437928][ T705] ? __list_add_valid_or_report (lib/list_debug.c:35 (discriminator 1))
[ 54.442661][ C2] CPU: 2 PID: 53 Comm: khugepaged Tainted: G S 6.10.0-rc1-00010-ged0ed14c2b47 #1
[ 54.444859][ T705] ? __list_add_valid_or_report (lib/list_debug.c:35 (discriminator 1))
[ 54.450557][ C2] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.8.1 12/05/2017
[ 54.460974][ T705] __enqueue_rt_entity (include/linux/list.h:150 (discriminator 1) include/linux/list.h:183 (discriminator 1) kernel/sched/rt.c:1419 (discriminator 1))
[ 54.466661][ C2] RIP: 0010:__enqueue_rt_entity (kernel/sched/rt.c:1415 (discriminator 1))
[ 54.474792][ T705] enqueue_rt_entity (kernel/sched/rt.c:1616)
[ 54.479778][ C2] Code: fa 48 c1 ea 03 80 3c 02 00 0f 85 1f 03 00 00 49 8b bf 40 0a 00 00 44 89 ea 48 81 c7 b8 00 00 00 e8 15 72 05 00 e9 23 fa ff ff <0f> 0b e9 9b f6 ff ff 48 89 ee 48 89 df e8 8e d1 ff ff e9 f6 f5 ff
All code
========
0: fa cli
1: 48 c1 ea 03 shr $0x3,%rdx
5: 80 3c 02 00 cmpb $0x0,(%rdx,%rax,1)
9: 0f 85 1f 03 00 00 jne 0x32e
f: 49 8b bf 40 0a 00 00 mov 0xa40(%r15),%rdi
16: 44 89 ea mov %r13d,%edx
19: 48 81 c7 b8 00 00 00 add $0xb8,%rdi
20: e8 15 72 05 00 callq 0x5723a
25: e9 23 fa ff ff jmpq 0xfffffffffffffa4d
2a:* 0f 0b ud2 <-- trapping instruction
2c: e9 9b f6 ff ff jmpq 0xfffffffffffff6cc
31: 48 89 ee mov %rbp,%rsi
34: 48 89 df mov %rbx,%rdi
37: e8 8e d1 ff ff callq 0xffffffffffffd1ca
3c: e9 .byte 0xe9
3d: f6 f5 div %ch
3f: ff .byte 0xff
Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: e9 9b f6 ff ff jmpq 0xfffffffffffff6a2
7: 48 89 ee mov %rbp,%rsi
a: 48 89 df mov %rbx,%rdi
d: e8 8e d1 ff ff callq 0xffffffffffffd1a0
12: e9 .byte 0xe9
13: f6 f5 div %ch
15: ff .byte 0xff
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20240704/202407041644.de55c25-oliver.sang@intel.com
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
Powered by blists - more mailing lists