Message-ID: <202509231601.ffbd7efd-lkp@intel.com>
Date: Tue, 23 Sep 2025 16:44:58 +0800
From: kernel test robot <oliver.sang@...el.com>
To: Peter Zijlstra <peterz@...radead.org>
CC: <oe-lkp@...ts.linux.dev>, <lkp@...el.com>, <linux-kernel@...r.kernel.org>,
	<sched-ext@...ts.linux.dev>, <aubrey.li@...ux.intel.com>,
	<yu.c.chen@...el.com>, <oliver.sang@...el.com>
Subject: [peterz-queue:sched/cleanup] [sched]  13ea439400:
 WARNING:at_kernel/sched/deadline.c:#task_contending



Hello,

kernel test robot noticed "WARNING:at_kernel/sched/deadline.c:#task_contending" on:

commit: 13ea43940095719bc22d2e129712519fdc307c45 ("sched: Fold sched_class::switch{ing,ed}_{to,from}() into the change pattern")
https://git.kernel.org/cgit/linux/kernel/git/peterz/queue.git sched/cleanup

in testcase: stress-ng
version: stress-ng-x86_64-480b81369-1_20250919
with the following parameters:

	nr_threads: 100%
	testtime: 60s
	test: cyclic
	cpufreq_governor: performance



config: x86_64-rhel-9.4
compiler: gcc-14
test machine: 224 threads 2 sockets Intel(R) Xeon(R) Platinum 8480+ (Sapphire Rapids) with 256G memory

(please refer to attached dmesg/kmsg for entire log/backtrace)



If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add the following tags
| Reported-by: kernel test robot <oliver.sang@...el.com>
| Closes: https://lore.kernel.org/oe-lkp/202509231601.ffbd7efd-lkp@intel.com


We also observed various other issues on this commit, while the parent commit is clean.

=========================================================================================
tbox_group/testcase/rootfs/kconfig/compiler/nr_threads/testtime/test/cpufreq_governor:
  lkp-spr-2sp1/stress-ng/debian-13-x86_64-20250902.cgz/x86_64-rhel-9.4/gcc-14/100%/60s/cyclic/performance

f0b336327a1be3b1 13ea43940095719bc22d2e12971
---------------- ---------------------------
       fail:runs  %reproduction    fail:runs
           |             |             |
           :10          20%           2:10    dmesg.BUG:soft_lockup-CPU##stuck_for#s![diskstats:#]
           :10          10%           1:10    dmesg.BUG:soft_lockup-CPU##stuck_for#s![numa-vmstat:#]
           :10          10%           1:10    dmesg.BUG:soft_lockup-CPU##stuck_for#s![slabinfo:#]
           :10          40%           4:10    dmesg.BUG:soft_lockup-CPU##stuck_for#s![stress-ng-cycli:#]
           :10          20%           2:10    dmesg.BUG:soft_lockup-CPU##stuck_for#s![systemd-journal:#]
           :10          10%           1:10    dmesg.BUG:workqueue_lockup-pool
           :10          10%           1:10    dmesg.INFO:rcu_sched_detected_stalls_on_CPUs/tasks
           :10          10%           1:10    dmesg.INFO:task_blocked_for_more_than#seconds
           :10          90%           9:10    dmesg.Kernel_panic-not_syncing:softlockup:hung_tasks
           :10          10%           1:10    dmesg.RIP:__get_user_8
           :10          50%           5:10    dmesg.RIP:__sched_setscheduler
           :10          10%           1:10    dmesg.RIP:__schedule
           :10          10%           1:10    dmesg.RIP:__x64_sys_clock_nanosleep
           :10          10%           1:10    dmesg.RIP:calculate_sigpending
           :10          10%           1:10    dmesg.RIP:count_memcg_events
           :10          60%           6:10    dmesg.RIP:cpuidle_enter_state
           :10          90%           9:10    dmesg.RIP:dequeue_task_dl
           :10          10%           1:10    dmesg.RIP:do_nanosleep
           :10          10%           1:10    dmesg.RIP:do_user_addr_fault
           :10          50%           5:10    dmesg.RIP:enqueue_dl_entity
           :10          80%           8:10    dmesg.RIP:finish_task_switch
           :10          10%           1:10    dmesg.RIP:folio_add_file_rmap_ptes
           :10          70%           7:10    dmesg.RIP:inactive_task_timer
           :10          90%           9:10    dmesg.RIP:migrate_task_rq_dl
           :10          10%           1:10    dmesg.RIP:native_queued_spin_lock_slowpath
           :10          20%           2:10    dmesg.RIP:next_uptodate_folio
           :10          90%           9:10    dmesg.RIP:smp_call_function_many_cond
           :10          90%           9:10    dmesg.RIP:task_contending
           :10          80%           8:10    dmesg.RIP:task_non_contending
           :10          10%           1:10    dmesg.RIP:x64_sys_call
           :10          90%           9:10    dmesg.WARNING:at_kernel/sched/deadline.c:#dequeue_task_dl
           :10          50%           5:10    dmesg.WARNING:at_kernel/sched/deadline.c:#enqueue_dl_entity
           :10          70%           7:10    dmesg.WARNING:at_kernel/sched/deadline.c:#inactive_task_timer
           :10          90%           9:10    dmesg.WARNING:at_kernel/sched/deadline.c:#migrate_task_rq_dl
           :10          90%           9:10    dmesg.WARNING:at_kernel/sched/deadline.c:#task_contending
           :10          80%           8:10    dmesg.WARNING:at_kernel/sched/deadline.c:#task_non_contending
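
As a reading aid for the table above, here is a minimal Python sketch that parses one row and recomputes the middle column. It rests on two assumptions that the report does not state: an empty failure count before ":10" means zero failures in 10 runs, and %reproduction is the difference between the two failure rates. The helper names (parse_row, rate_delta_pct) are hypothetical and are not part of lkp-tests.

```python
import re

# One table row: "<fail>:<runs>  <pct>%  <fail>:<runs>  <stat name>".
# The failure count may be empty; this sketch treats that as zero (assumption).
ROW = re.compile(r"^\s*(\d*):(\d+)\s+(-?\d+)%\s+(\d*):(\d+)\s+(\S+)\s*$")


def parse_row(line):
    """Return the parsed fields of one comparison row, or None if it does not match."""
    m = ROW.match(line)
    if m is None:
        return None
    p_fail, p_runs, pct, c_fail, c_runs, name = m.groups()
    return {
        "name": name,
        "parent": (int(p_fail or 0), int(p_runs)),
        "commit": (int(c_fail or 0), int(c_runs)),
        "reported_pct": int(pct),
    }


def rate_delta_pct(row):
    """Failure-rate difference (commit minus parent) in percentage points."""
    p_fail, p_runs = row["parent"]
    c_fail, c_runs = row["commit"]
    return round(100 * (c_fail / c_runs - p_fail / p_runs))


sample = (
    "           :10          90%           9:10    "
    "dmesg.WARNING:at_kernel/sched/deadline.c:#task_contending"
)
row = parse_row(sample)
print(row["name"], row["parent"], row["commit"], rate_delta_pct(row))
```

Under those assumptions, the recomputed value for the task_contending row matches the 90% shown in the table.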




[   29.908797][ T2053] EDAC MC5: Giving out device to module i10nm_edac controller Intel_10nm Socket#1 IMC#1: DEV 0000:fe:0d.0 (INTERRUPT)
[   29.924285][ T2053] EDAC MC6: Giving out device to module i10nm_edac controller Intel_10nm Socket#1 IMC#2: DEV 0000:fe:0e.0 (INTERRUPT)
[   29.954811][ T2053] EDAC i10nm: v0.0.6
[   29.979254][   T20] intel_rapl_common: Found RAPL domain package
[   30.004528][  T354] intel_rapl_common: Found RAPL domain package
[   30.020455][  T354] intel_rapl_common: Found RAPL domain dram
LKP: ttyS0: 2334: current_version: 90000c0, target_version: 90000c0
LKP: ttyS0: 2334: skip deploy intel ucode as ucode is same
LKP: ttyS0: 2334: Kernel tests: Boot OK!
LKP: ttyS0: 2334: HOSTNAME lkp-spr-2sp1, MAC a4:bf:01:91:28:12, kernel 6.17.0-rc4-00012-g13ea43940095 1
[   36.334175][ T2362] is_virt=false
[   36.334178][ T2362]
[   41.295461][    C1] Scheduler tracepoints stat_sleep, stat_iowait, stat_blocked and stat_runtime require the kernel parameter schedstats=enable or kernel.sched_schedstats=1
[   41.673528][    T0] Scheduler tracepoints stat_sleep, stat_iowait, stat_blocked and stat_runtime require the kernel parameter schedstats=enable or kernel.sched_schedstats=1
[   41.677090][T10156] Scheduler tracepoints stat_sleep, stat_iowait, stat_blocked and stat_runtime require the kernel parameter schedstats=enable or kernel.sched_schedstats=1
[   41.677096][T10156] sched: DL replenish lagged too much
[   41.677103][  C194] ------------[ cut here ]------------
[   41.677280][  C147] ------------[ cut here ]------------
[   41.677282][  C147] WARNING: CPU: 147 PID: 0 at kernel/sched/deadline.c:227 task_contending (kernel/sched/deadline.c:227 kernel/sched/deadline.c:287 kernel/sched/deadline.c:504)
[   41.677288][  C147] Modules linked in: intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_ifs i10nm_edac skx_edac_common nfit libnvdimm x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel btrfs blake2b_generic kvm xor dax_hmem snd_pcm raid6_pq irqbypass cxl_acpi ast snd_timer ghash_clmulni_intel cxl_port binfmt_misc pmt_telemetry rapl pmt_discovery drm_client_lib intel_th_gth intel_cstate snd isst_if_mmio iaa_crypto ipmi_ssif qat_4xxx nvme isst_if_mbox_pci intel_sdsi pmt_class cxl_core drm_shmem_helper mei_me intel_th_pci i2c_i801 soundcore idxd intel_qat intel_uncore einj pcspkr isst_if_common nvme_core cdc_ether i2c_ismt intel_th mei i2c_smbus intel_vsec idxd_bus acpi_power_meter wmi drm_kms_helper crc8 ipmi_si acpi_ipmi ipmi_devintf ipmi_msghandler acpi_pad pinctrl_emmitsburg pfr_update pfr_telemetry drm fuse nfnetlink
[   41.677338][  C147] CPU: 147 UID: 0 PID: 0 Comm: swapper/147 Tainted: G S                  6.17.0-rc4-00012-g13ea43940095 #1 VOLUNTARY
[   41.677341][  C147] Tainted: [S]=CPU_OUT_OF_SPEC
[   41.677342][  C147] Hardware name: Intel Corporation D50DNP1SBB/D50DNP1SBB, BIOS SE5C7411.86B.8118.D04.2206151341 06/15/2022
[   41.677344][  C147] RIP: 0010:task_contending (kernel/sched/deadline.c:227 kernel/sched/deadline.c:287 kernel/sched/deadline.c:504)
[   41.677347][  C147] Code: 78 fe ff ff 83 f8 01 74 28 85 c0 7f bd be 03 00 00 00 5b e9 a7 de 64 00 0f 0b eb 88 0f 0b e9 23 ff ff ff 0f 0b e9 43 ff ff ff <0f> 0b e9 49 ff ff ff 48 c7 c6 50 e4 34 81 48 8d bb a8 12 00 00 5b
All code
========
   0:	78 fe                	js     0x0
   2:	ff                   	(bad)
   3:	ff 83 f8 01 74 28    	incl   0x287401f8(%rbx)
   9:	85 c0                	test   %eax,%eax
   b:	7f bd                	jg     0xffffffffffffffca
   d:	be 03 00 00 00       	mov    $0x3,%esi
  12:	5b                   	pop    %rbx
  13:	e9 a7 de 64 00       	jmp    0x64debf
  18:	0f 0b                	ud2
  1a:	eb 88                	jmp    0xffffffffffffffa4
  1c:	0f 0b                	ud2
  1e:	e9 23 ff ff ff       	jmp    0xffffffffffffff46
  23:	0f 0b                	ud2
  25:	e9 43 ff ff ff       	jmp    0xffffffffffffff6d
  2a:*	0f 0b                	ud2		<-- trapping instruction
  2c:	e9 49 ff ff ff       	jmp    0xffffffffffffff7a
  31:	48 c7 c6 50 e4 34 81 	mov    $0xffffffff8134e450,%rsi
  38:	48 8d bb a8 12 00 00 	lea    0x12a8(%rbx),%rdi
  3f:	5b                   	pop    %rbx

Code starting with the faulting instruction
===========================================
   0:	0f 0b                	ud2
   2:	e9 49 ff ff ff       	jmp    0xffffffffffffff50
   7:	48 c7 c6 50 e4 34 81 	mov    $0xffffffff8134e450,%rsi
   e:	48 8d bb a8 12 00 00 	lea    0x12a8(%rbx),%rdi
  15:	5b                   	pop    %rbx
[   41.677350][  C147] RSP: 0018:ffa000000eec4e00 EFLAGS: 00010087
[   41.677353][  C147] RAX: ff11001fe02f1140 RBX: ff110040477d42b0 RCX: 0000000000000093
[   41.677354][  C147] RDX: 00000000000e6666 RSI: 0000000000000000 RDI: ff110040477d42b0
[   41.677355][  C147] RBP: 0000000000100009 R08: 0000000000017701 R09: 0000000000000093
[   41.677357][  C147] R10: 0000000000000000 R11: ffa000000eec4ff8 R12: 0000000000000001
[   41.677358][  C147] R13: ff110040477d42b0 R14: 0000000000000088 R15: ffa000000eec4ed0
[   41.677359][  C147] FS:  0000000000000000(0000) GS:ff1100205c4a2000(0000) knlGS:0000000000000000
[   41.677361][  C147] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   41.677362][  C147] CR2: 00007f7ff20b8620 CR3: 00000020a4f97003 CR4: 0000000000f73ef0
[   41.677364][  C147] PKRU: 55555554
[   41.677365][  C147] Call Trace:
[   41.677367][  C147]  <IRQ>
[   41.677370][  C147] enqueue_dl_entity (kernel/sched/deadline.c:62 kernel/sched/deadline.c:80 kernel/sched/deadline.c:1036 kernel/sched/deadline.c:2022)
[   41.677373][  C147] enqueue_task_dl (kernel/sched/deadline.c:62 kernel/sched/deadline.c:2127)
[   41.677376][  C147] enqueue_task (kernel/sched/core.c:2090)
[   41.677378][  C147] ttwu_do_activate (kernel/sched/core.c:3717)
[   41.677381][  C147] try_to_wake_up (kernel/sched/sched.h:1864 kernel/sched/core.c:3980 kernel/sched/core.c:4303)
[   41.677384][  C147] ? __pfx_hrtimer_wakeup (kernel/time/hrtimer.c:1990)
[   41.677388][  C147] hrtimer_wakeup (kernel/time/hrtimer.c:2000)
[   41.677390][  C147] __hrtimer_run_queues (kernel/time/hrtimer.c:1761 kernel/time/hrtimer.c:1825)
[   41.677393][  C147] hrtimer_interrupt (kernel/time/hrtimer.c:1890)
[   41.677395][  C147] __sysvec_apic_timer_interrupt (arch/x86/kernel/apic/apic.c:1039 arch/x86/kernel/apic/apic.c:1056)
[   41.677400][  C147] sysvec_apic_timer_interrupt (arch/x86/kernel/apic/apic.c:1050 (discriminator 35) arch/x86/kernel/apic/apic.c:1050 (discriminator 35))
[   41.677402][  C147]  </IRQ>
[   41.677403][  C147]  <TASK>
[   41.677405][  C147] asm_sysvec_apic_timer_interrupt (arch/x86/include/asm/idtentry.h:702)
[   41.677408][  C147] RIP: 0010:cpuidle_enter_state (drivers/cpuidle/cpuidle.c:292)
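
For readers unfamiliar with the "Code:" line in the trace above: the byte wrapped in angle brackets marks where RIP pointed when the trap fired. The Python sketch below locates that byte and checks that it starts a ud2 (0f 0b), which is the instruction the disassembly marks as trapping. The function name find_trap is hypothetical, and the byte string is copied from the trace; this is only an illustration, not a replacement for scripts/decodecode.

```python
# Byte dump copied from the "Code:" line of the trace.
CODE_LINE = (
    "Code: 78 fe ff ff 83 f8 01 74 28 85 c0 7f bd be 03 00 00 00 5b e9 a7 de "
    "64 00 0f 0b eb 88 0f 0b e9 23 ff ff ff 0f 0b e9 43 ff ff ff <0f> 0b e9 49 "
    "ff ff ff 48 c7 c6 50 e4 34 81 48 8d bb a8 12 00 00 5b"
)


def find_trap(code_line):
    """Return (offset, first_byte, next_byte) for the <xx>-marked byte, or None."""
    tokens = code_line.split("Code:", 1)[1].split()
    for offset, tok in enumerate(tokens):
        if tok.startswith("<") and tok.endswith(">"):
            first = int(tok[1:-1], 16)
            # The byte after the marker, if the dump does not end there.
            nxt = int(tokens[offset + 1], 16) if offset + 1 < len(tokens) else None
            return offset, first, nxt
    return None


offset, first, nxt = find_trap(CODE_LINE)
is_ud2 = (first, nxt) == (0x0F, 0x0B)
print(f"trap at byte offset {offset:#x}, ud2: {is_ud2}")
```

The offset it reports agrees with the "2a:*" marker in the disassembly listing.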


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20250923/202509231601.ffbd7efd-lkp@intel.com



-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki

