lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <202407041644.de55c25-oliver.sang@intel.com>
Date: Thu, 4 Jul 2024 16:30:34 +0800
From: kernel test robot <oliver.sang@...el.com>
To: Xavier <xavier_qy@....com>
CC: <oe-lkp@...ts.linux.dev>, <lkp@...el.com>, <linux-kernel@...r.kernel.org>,
	<mingo@...hat.com>, <peterz@...radead.org>, <juri.lelli@...hat.com>,
	<vincent.guittot@...aro.org>, <dietmar.eggemann@....com>,
	<rostedt@...dmis.org>, <bsegall@...gle.com>, <mgorman@...e.de>,
	<bristot@...hat.com>, <vschneid@...hat.com>, Xavier <xavier_qy@....com>,
	<oliver.sang@...el.com>
Subject: Re: [PATCH-RT sched v2 1/2] RT SCHED: Optimize the enqueue and
 dequeue operations for rt_se



Hello,

kernel test robot noticed "WARNING:at_kernel/sched/rt.c:#__enqueue_rt_entity" on:

commit: ed0ed14c2b47993c00c4b3cdceabef535bcef32b ("[PATCH-RT sched v2 1/2] RT SCHED: Optimize the enqueue and dequeue operations for rt_se")
url: https://github.com/intel-lab-lkp/linux/commits/Xavier/RT-SCHED-Optimize-the-enqueue-and-dequeue-operations-for-rt_se/20240630-173825
base: https://git.kernel.org/cgit/linux/kernel/git/tip/tip.git c793a62823d1ce8f70d9cfc7803e3ea436277cda
patch link: https://lore.kernel.org/all/20240629112812.243691-2-xavier_qy@163.com/
patch subject: [PATCH-RT sched v2 1/2] RT SCHED: Optimize the enqueue and dequeue operations for rt_se

in testcase: blktests
version: blktests-x86_64-775a058-1_20240702
with following parameters:

	disk: 1SSD
	test: block-group-01



compiler: gcc-13
test machine: 4 threads Intel(R) Core(TM) i5-6500 CPU @ 3.20GHz (Skylake) with 32G memory

(please refer to attached dmesg/kmsg for entire log/backtrace)



If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <oliver.sang@...el.com>
| Closes: https://lore.kernel.org/oe-lkp/202407041644.de55c25-oliver.sang@intel.com


[   54.093440][    C2] ------------[ cut here ]------------
[   54.094193][  T705] list_add double add: new=ffff888802a8abc0, prev=ffff888802a8abc0, next=ffff8887892c4dd0.
[ 54.098261][ C2] WARNING: CPU: 2 PID: 53 at kernel/sched/rt.c:1415 __enqueue_rt_entity (kernel/sched/rt.c:1415 (discriminator 1)) 
[   54.103613][  T705] ------------[ cut here ]------------
[   54.113477][    C2] Modules linked in: dm_multipath
[   54.122743][  T705] kernel BUG at lib/list_debug.c:35!
[   54.128080][    C2]  btrfs blake2b_generic
[   54.132987][  T705] Oops: invalid opcode: 0000 [#1] PREEMPT SMP KASAN PTI
[   54.138148][    C2]  xor zstd_compress
[   54.142266][  T705] CPU: 3 PID: 705 Comm: multipathd Tainted: G S                 6.10.0-rc1-00010-ged0ed14c2b47 #1
[   54.149087][    C2]  raid6_pq libcrc32c
[   54.152852][  T705] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.8.1 12/05/2017
[   54.163339][    C2]  ipmi_devintf ipmi_msghandler
[ 54.167192][ T705] RIP: 0010:__list_add_valid_or_report (lib/list_debug.c:35 (discriminator 1)) 
[   54.175322][    C2]  intel_rapl_msr intel_rapl_common
[ 54.180049][ T705] Code: 0b 48 89 f1 48 c7 c7 00 fa 26 84 48 89 de e8 d6 75 f2 fe 0f 0b 48 89 f2 48 89 d9 48 89 ee 48 c7 c7 80 fa 26 84 e8 bf 75 f2 fe <0f> 0b 48 89 f7 48 89 34 24 e8 11 cc 61 ff 48 8b 34 24 e9 71 ff ff
All code
========
   0:	0b 48 89             	or     -0x77(%rax),%ecx
   3:	f1                   	icebp  
   4:	48 c7 c7 00 fa 26 84 	mov    $0xffffffff8426fa00,%rdi
   b:	48 89 de             	mov    %rbx,%rsi
   e:	e8 d6 75 f2 fe       	callq  0xfffffffffef275e9
  13:	0f 0b                	ud2    
  15:	48 89 f2             	mov    %rsi,%rdx
  18:	48 89 d9             	mov    %rbx,%rcx
  1b:	48 89 ee             	mov    %rbp,%rsi
  1e:	48 c7 c7 80 fa 26 84 	mov    $0xffffffff8426fa80,%rdi
  25:	e8 bf 75 f2 fe       	callq  0xfffffffffef275e9
  2a:*	0f 0b                	ud2    		<-- trapping instruction
  2c:	48 89 f7             	mov    %rsi,%rdi
  2f:	48 89 34 24          	mov    %rsi,(%rsp)
  33:	e8 11 cc 61 ff       	callq  0xffffffffff61cc49
  38:	48 8b 34 24          	mov    (%rsp),%rsi
  3c:	e9                   	.byte 0xe9
  3d:	71 ff                	jno    0x3e
  3f:	ff                   	.byte 0xff

Code starting with the faulting instruction
===========================================
   0:	0f 0b                	ud2    
   2:	48 89 f7             	mov    %rsi,%rdi
   5:	48 89 34 24          	mov    %rsi,(%rsp)
   9:	e8 11 cc 61 ff       	callq  0xffffffffff61cc1f
   e:	48 8b 34 24          	mov    (%rsp),%rsi
  12:	e9                   	.byte 0xe9
  13:	71 ff                	jno    0x14
  15:	ff                   	.byte 0xff
[   54.186345][    C2]  sd_mod t10_pi
[   54.191424][  T705] RSP: 0018:ffffc90000327b38 EFLAGS: 00010046
[   54.211022][    C2]  x86_pkg_temp_thermal
[   54.214447][  T705]
[   54.220405][    C2]  crc64_rocksoft_generic crc64_rocksoft
[   54.224435][  T705] RAX: 0000000000000058 RBX: ffff8887892c4dd0 RCX: ffffffff82424f4e
[   54.226632][    C2]  intel_powerclamp crc64
[   54.232145][  T705] RDX: 0000000000000000 RSI: 0000000000000008 RDI: ffff8887893b5380
[   54.240012][    C2]  coretemp sg
[   54.244217][  T705] RBP: ffff888802a8abc0 R08: 0000000000000001 R09: fffff52000064f22
[   54.252087][    C2]  kvm_intel i915
[   54.255330][  T705] R10: ffffc90000327917 R11: 205d324320202020 R12: ffff888802a8abc0
[   54.263200][    C2]  kvm crct10dif_pclmul
[   54.266705][  T705] R13: ffff8887892c4dd0 R14: ffff888802a8ac00 R15: ffff8887892c4dd8
[   54.274572][    C2]  crc32_pclmul crc32c_intel
[   54.278599][  T705] FS:  00007f1b015ee680(0000) GS:ffff888789380000(0000) knlGS:0000000000000000
[   54.286469][    C2]  drm_buddy ghash_clmulni_intel
[   54.290934][  T705] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   54.299764][    C2]  intel_gtt sha512_ssse3
[   54.304580][  T705] CR2: 000055e6a99e25f8 CR3: 000000080473e006 CR4: 00000000003706f0
[   54.311054][    C2]  drm_display_helper
[   54.315255][  T705] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[   54.323124][    C2]  rapl ttm
[   54.326976][  T705] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[   54.334845][    C2]  drm_kms_helper
[   54.337825][  T705] Call Trace:
[   54.345696][    C2]  ahci mei_wdt
[   54.349201][  T705]  <TASK>
[   54.352357][    C2]  intel_cstate wmi_bmof
[ 54.355687][ T705] ? die (arch/x86/kernel/dumpstack.c:421 arch/x86/kernel/dumpstack.c:434 arch/x86/kernel/dumpstack.c:447) 
[   54.358493][    C2]  intel_uncore
[ 54.362610][ T705] ? do_trap (arch/x86/kernel/traps.c:114 arch/x86/kernel/traps.c:155) 
[   54.366202][    C2]  binfmt_misc video
[ 54.369533][ T705] ? __list_add_valid_or_report (lib/list_debug.c:35 (discriminator 1)) 
[   54.373650][    C2]  libahci mei_me
[ 54.377418][ T705] ? do_error_trap (arch/x86/include/asm/traps.h:58 arch/x86/kernel/traps.c:176) 
[   54.383104][    C2]  i2c_i801 wmi
[ 54.386607][ T705] ? __list_add_valid_or_report (lib/list_debug.c:35 (discriminator 1)) 
[   54.391070][    C2]  intel_pch_thermal i2c_smbus
[ 54.394400][ T705] ? handle_invalid_op (arch/x86/kernel/traps.c:214) 
[   54.400087][    C2]  mei libata
[ 54.404727][ T705] ? __list_add_valid_or_report (lib/list_debug.c:35 (discriminator 1)) 
[   54.409540][    C2]  acpi_pad fuse
[ 54.412697][ T705] ? exc_invalid_op (arch/x86/kernel/traps.c:267) 
[   54.418385][    C2]  loop drm
[ 54.421803][ T705] ? asm_exc_invalid_op (arch/x86/include/asm/idtentry.h:621) 
[   54.426355][    C2]  dm_mod ip_tables
[ 54.429337][ T705] ? llist_add_batch (lib/llist.c:33 (discriminator 14)) 
[   54.434240][    C2]
[ 54.437928][ T705] ? __list_add_valid_or_report (lib/list_debug.c:35 (discriminator 1)) 
[   54.442661][    C2] CPU: 2 PID: 53 Comm: khugepaged Tainted: G S                 6.10.0-rc1-00010-ged0ed14c2b47 #1
[ 54.444859][ T705] ? __list_add_valid_or_report (lib/list_debug.c:35 (discriminator 1)) 
[   54.450557][    C2] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.8.1 12/05/2017
[ 54.460974][ T705] __enqueue_rt_entity (include/linux/list.h:150 (discriminator 1) include/linux/list.h:183 (discriminator 1) kernel/sched/rt.c:1419 (discriminator 1)) 
[ 54.466661][ C2] RIP: 0010:__enqueue_rt_entity (kernel/sched/rt.c:1415 (discriminator 1)) 
[ 54.474792][ T705] enqueue_rt_entity (kernel/sched/rt.c:1616) 
[ 54.479778][ C2] Code: fa 48 c1 ea 03 80 3c 02 00 0f 85 1f 03 00 00 49 8b bf 40 0a 00 00 44 89 ea 48 81 c7 b8 00 00 00 e8 15 72 05 00 e9 23 fa ff ff <0f> 0b e9 9b f6 ff ff 48 89 ee 48 89 df e8 8e d1 ff ff e9 f6 f5 ff
All code
========
   0:	fa                   	cli    
   1:	48 c1 ea 03          	shr    $0x3,%rdx
   5:	80 3c 02 00          	cmpb   $0x0,(%rdx,%rax,1)
   9:	0f 85 1f 03 00 00    	jne    0x32e
   f:	49 8b bf 40 0a 00 00 	mov    0xa40(%r15),%rdi
  16:	44 89 ea             	mov    %r13d,%edx
  19:	48 81 c7 b8 00 00 00 	add    $0xb8,%rdi
  20:	e8 15 72 05 00       	callq  0x5723a
  25:	e9 23 fa ff ff       	jmpq   0xfffffffffffffa4d
  2a:*	0f 0b                	ud2    		<-- trapping instruction
  2c:	e9 9b f6 ff ff       	jmpq   0xfffffffffffff6cc
  31:	48 89 ee             	mov    %rbp,%rsi
  34:	48 89 df             	mov    %rbx,%rdi
  37:	e8 8e d1 ff ff       	callq  0xffffffffffffd1ca
  3c:	e9                   	.byte 0xe9
  3d:	f6 f5                	div    %ch
  3f:	ff                   	.byte 0xff

Code starting with the faulting instruction
===========================================
   0:	0f 0b                	ud2    
   2:	e9 9b f6 ff ff       	jmpq   0xfffffffffffff6a2
   7:	48 89 ee             	mov    %rbp,%rsi
   a:	48 89 df             	mov    %rbx,%rdi
   d:	e8 8e d1 ff ff       	callq  0xffffffffffffd1a0
  12:	e9                   	.byte 0xe9
  13:	f6 f5                	div    %ch
  15:	ff                   	.byte 0xff


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20240704/202407041644.de55c25-oliver.sang@intel.com



-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki


Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ