[<prev] [next>] [day] [month] [year] [list]
Message-ID: <202504211553.3ba9400-lkp@intel.com>
Date: Mon, 21 Apr 2025 16:12:11 +0800
From: kernel test robot <oliver.sang@...el.com>
To: Arnd Bergmann <arnd@...db.de>
CC: <oe-lkp@...ts.linux.dev>, <lkp@...el.com>, <linux-kernel@...r.kernel.org>,
Ingo Molnar <mingo@...nel.org>, Linus Torvalds
<torvalds@...ux-foundation.org>, <oliver.sang@...el.com>
Subject: [linus:master] [x86/cpu] f388f60ca9:
BUG:soft_lockup-CPU##stuck_for#s![swapper:#]
Hello,
by this commit, we notice big config diff [1]
then in this rcutorture tests, parent runs quite clean, f388f60ca9 shows
various random issues.
=========================================================================================
tbox_group/testcase/rootfs/kconfig/compiler/runtime/test/torture_type:
vm-snb/rcutorture/debian-11.1-i386-20220923.cgz/i386-randconfig-r071-20250410/gcc-12/300s/default/tasks-tracing
fc2d5cbe541032e7 f388f60ca9041a95c9b3f157d31
---------------- ---------------------------
fail:runs %reproduction fail:runs
| | |
:500 30% 149:500 last_state.booting
:500 7% 35:500 dmesg.BUG:kernel_hang_in_boot_stage
:500 9% 45:500 dmesg.BUG:soft_lockup-CPU##stuck_for#s![swapper:#]
:500 10% 51:500 dmesg.BUG:workqueue_lockup-pool
:500 0% 1:500 dmesg.EIP:__timer_delete_sync
:500 1% 5:500 dmesg.EIP:_raw_spin_unlock_irq
:500 0% 2:500 dmesg.EIP:_raw_spin_unlock_irqrestore
:500 0% 1:500 dmesg.EIP:console_emit_next_record
:500 0% 1:500 dmesg.EIP:handle_softirqs
:500 1% 3:500 dmesg.EIP:lock_acquire
:500 0% 2:500 dmesg.EIP:lock_release
:500 0% 1:500 dmesg.EIP:queue_delayed_work_on
:500 9% 45:500 dmesg.EIP:timekeeping_notify
:500 3% 14:500 dmesg.INFO:rcu_preempt_detected_stalls_on_CPUs/tasks
:500 6% 32:500 dmesg.INFO:task_blocked_for_more_than#seconds
:500 1% 3:500 dmesg.IP-Config:Auto-configuration_of_network_failed
:500 9% 45:500 dmesg.Kernel_panic-not_syncing:softlockup:hung_tasks
:500 29% 146:500 dmesg.boot_failures
we don't have enough knowledge to dig deep these issues. so just make this
report to consult with you if these issues are related with config diff.
and if so, is this config diff reasonable by this commit?
below our normal report just FYI.
kernel test robot noticed "BUG:soft_lockup-CPU##stuck_for#s![swapper:#]" on:
commit: f388f60ca9041a95c9b3f157d316ed7c8f297e44 ("x86/cpu: Drop configuration options for early 64-bit CPUs")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
[test failed on linus/master e618ee89561b6b0fdc69f79e6fd0c33375d3e6b4]
[test failed on linux-next/master 01c6df60d5d4ae00cd5c1648818744838bba7763]
in testcase: rcutorture
version:
with following parameters:
runtime: 300s
test: default
torture_type: tasks-tracing
config: i386-randconfig-r071-20250410
compiler: gcc-12
test machine: qemu-system-x86_64 -enable-kvm -cpu SandyBridge -smp 2 -m 16G
(please refer to attached dmesg/kmsg for entire log/backtrace)
If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <oliver.sang@...el.com>
| Closes: https://lore.kernel.org/oe-lkp/202504211553.3ba9400-lkp@intel.com
[ 721.016745][ C0] watchdog: BUG: soft lockup - CPU#0 stuck for 626s! [swapper:1]
[ 721.016779][ C0] CPU#0 Utilization every 96s during lockup:
[ 721.016779][ C0] #1: 39% system, 0% softirq, 0% hardirq, 0% idle
[ 721.016779][ C0] #2: 42% system, 0% softirq, 0% hardirq, 0% idle
[ 721.016779][ C0] #3: 47% system, 0% softirq, 0% hardirq, 0% idle
[ 721.016779][ C0] #4: 34% system, 0% softirq, 0% hardirq, 0% idle
[ 721.016779][ C0] #5: 32% system, 0% softirq, 0% hardirq, 0% idle
[ 721.016779][ C0] Modules linked in:
[ 721.016779][ C0] irq event stamp: 159506
[ 721.016779][ C0] hardirqs last enabled at (159505): timekeeping_notify (arch/x86/include/asm/irqflags.h:42 arch/x86/include/asm/irqflags.h:97 arch/x86/include/asm/irqflags.h:155 include/linux/stop_machine.h:154 include/linux/stop_machine.h:161 kernel/time/timekeeping.c:1521)
[ 721.016779][ C0] hardirqs last disabled at (159506): sysvec_apic_timer_interrupt (arch/x86/kernel/apic/apic.c:1049)
[ 721.016779][ C0] softirqs last enabled at (159174): handle_softirqs (kernel/softirq.c:408 kernel/softirq.c:589)
[ 721.016779][ C0] softirqs last disabled at (159159): __do_softirq (kernel/softirq.c:596)
[ 721.016779][ C0] CPU: 0 UID: 0 PID: 1 Comm: swapper Not tainted 6.14.0-rc3-00037-gf388f60ca904 #1
[ 721.016779][ C0] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.16.2-debian-1.16.2-1 04/01/2014
[ 721.016779][ C0] EIP: timekeeping_notify (kernel/time/timekeeping.c:1522)
[ 721.016779][ C0] Code: 5f e9 ff ff 8d 45 e8 e8 57 d4 ff ff 85 ff 74 16 8b 57 5c 85 d2 74 04 89 f8 ff d2 8b 87 88 00 00 00 e8 d5 3e ff ff 85 f6 75 9b <e8> 7f b9 00 00 31 c0 39 1d a4 70 14 84 0f 95 c0 f7 d8 8b 55 f0 2b
All code
========
0: 5f pop %rdi
1: e9 ff ff 8d 45 jmp 0x458e0005
6: e8 e8 57 d4 ff call 0xffffffffffd457f3
b: ff 85 ff 74 16 8b incl -0x74e98b01(%rbp)
11: 57 push %rdi
12: 5c pop %rsp
13: 85 d2 test %edx,%edx
15: 74 04 je 0x1b
17: 89 f8 mov %edi,%eax
19: ff d2 call *%rdx
1b: 8b 87 88 00 00 00 mov 0x88(%rdi),%eax
21: e8 d5 3e ff ff call 0xffffffffffff3efb
26: 85 f6 test %esi,%esi
28: 75 9b jne 0xffffffffffffffc5
2a:* e8 7f b9 00 00 call 0xb9ae <-- trapping instruction
2f: 31 c0 xor %eax,%eax
31: 39 1d a4 70 14 84 cmp %ebx,-0x7beb8f5c(%rip) # 0xffffffff841470db
37: 0f 95 c0 setne %al
3a: f7 d8 neg %eax
3c: 8b 55 f0 mov -0x10(%rbp),%edx
3f: 2b .byte 0x2b
Code starting with the faulting instruction
===========================================
0: e8 7f b9 00 00 call 0xb984
5: 31 c0 xor %eax,%eax
7: 39 1d a4 70 14 84 cmp %ebx,-0x7beb8f5c(%rip) # 0xffffffff841470b1
d: 0f 95 c0 setne %al
10: f7 d8 neg %eax
12: 8b 55 f0 mov -0x10(%rbp),%edx
15: 2b .byte 0x2b
[ 721.016779][ C0] EAX: 00026f11 EBX: 8316b7e0 ECX: 00000006 EDX: 7e26f13f
[ 721.016779][ C0] ESI: 00000200 EDI: 835e7220 EBP: 86d15ed8 ESP: 86d15ec0
[ 721.016779][ C0] DS: 007b ES: 007b FS: 0000 GS: 0000 SS: 0068 EFLAGS: 00000206
[ 721.016779][ C0] CR0: 80050033 CR2: ffdaa000 CR3: 03a16000 CR4: 000406d0
[ 721.016779][ C0] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
[ 721.016779][ C0] DR6: fffe0ff0 DR7: 00000400
[ 721.016779][ C0] Call Trace:
[ 721.016779][ C0] ? show_regs (arch/x86/kernel/dumpstack.c:478)
[ 721.016779][ C0] ? watchdog_timer_fn (kernel/watchdog.c:767)
[ 721.016779][ C0] ? schedule_work (drivers/usb/core/hub.c:919)
[ 721.016779][ C0] ? __hrtimer_run_queues+0x12f/0x1cf
[ 721.016779][ C0] ? hrtimer_run_queues (kernel/time/hrtimer.c:2023)
[ 721.016779][ C0] ? update_process_times (kernel/time/timer.c:2458 kernel/time/timer.c:2514)
[ 721.016779][ C0] ? tick_periodic (kernel/time/tick-common.c:103)
[ 721.016779][ C0] ? tick_handle_periodic (kernel/time/tick-common.c:144)
[ 721.016779][ C0] ? vmware_sched_clock (arch/x86/kernel/apic/apic.c:1049)
[ 721.016779][ C0] ? __sysvec_apic_timer_interrupt (arch/x86/include/asm/trace/irq_vectors.h:41 arch/x86/include/asm/trace/irq_vectors.h:41 arch/x86/kernel/apic/apic.c:1056)
[ 721.016779][ C0] ? sysvec_apic_timer_interrupt (arch/x86/kernel/apic/apic.c:1049 arch/x86/kernel/apic/apic.c:1049)
[ 721.016779][ C0] ? handle_exception (arch/x86/entry/entry_32.S:1055)
[ 721.016779][ C0] ? vmware_sched_clock (arch/x86/kernel/apic/apic.c:1049)
[ 721.016779][ C0] ? timekeeping_notify (kernel/time/timekeeping.c:1522)
[ 721.016779][ C0] ? vmware_sched_clock (arch/x86/kernel/apic/apic.c:1049)
[ 721.016779][ C0] ? timekeeping_notify (kernel/time/timekeeping.c:1522)
[ 721.016779][ C0] __clocksource_select (kernel/time/clocksource.c:1077 (discriminator 1))
[ 721.016779][ C0] ? boot_override_clock (kernel/time/clocksource.c:1109)
[ 721.016779][ C0] clocksource_select (kernel/time/clocksource.c:1094)
[ 721.016779][ C0] clocksource_done_booting (kernel/time/clocksource.c:1118)
[ 721.016779][ C0] do_one_initcall (init/main.c:1257)
[ 721.016779][ C0] ? rdinit_setup (init/main.c:1305)
[ 721.016779][ C0] do_initcalls (init/main.c:1318 init/main.c:1335)
[ 721.016779][ C0] ? rest_init (init/main.c:1449)
[ 721.016779][ C0] kernel_init_freeable (init/main.c:1572)
[ 721.016779][ C0] kernel_init (init/main.c:1459)
[ 721.016779][ C0] ret_from_fork (arch/x86/kernel/process.c:154)
[ 721.016779][ C0] ? rest_init (init/main.c:1449)
[ 721.016779][ C0] ret_from_fork_asm (arch/x86/entry/entry_32.S:737)
[ 721.016779][ C0] entry_INT80_32 (arch/x86/entry/entry_32.S:945)
[ 721.016779][ C0] Kernel panic - not syncing: softlockup: hung tasks
[ 721.016779][ C0] CPU: 0 UID: 0 PID: 1 Comm: swapper Tainted: G L 6.14.0-rc3-00037-gf388f60ca904 #1
[ 721.016779][ C0] Tainted: [L]=SOFTLOCKUP
[ 721.016779][ C0] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.16.2-debian-1.16.2-1 04/01/2014
[ 721.016779][ C0] Call Trace:
[ 721.016779][ C0] dump_stack_lvl (lib/dump_stack.c:124)
[ 721.016779][ C0] dump_stack (lib/dump_stack.c:130)
[ 721.016779][ C0] panic (kernel/panic.c:258 kernel/panic.c:375)
[ 721.016779][ C0] watchdog_timer_fn (kernel/watchdog.c:740)
[ 721.016779][ C0] ? schedule_work (drivers/usb/core/hub.c:919)
[ 721.016779][ C0] __hrtimer_run_queues+0x12f/0x1cf
[ 721.016779][ C0] hrtimer_run_queues (kernel/time/hrtimer.c:2023)
[ 721.016779][ C0] update_process_times (kernel/time/timer.c:2458 kernel/time/timer.c:2514)
[ 721.016779][ C0] tick_periodic (kernel/time/tick-common.c:103)
[ 721.016779][ C0] tick_handle_periodic (kernel/time/tick-common.c:144)
[ 721.016779][ C0] ? vmware_sched_clock (arch/x86/kernel/apic/apic.c:1049)
[ 721.016779][ C0] __sysvec_apic_timer_interrupt (arch/x86/include/asm/trace/irq_vectors.h:41 arch/x86/include/asm/trace/irq_vectors.h:41 arch/x86/kernel/apic/apic.c:1056)
[ 721.016779][ C0] sysvec_apic_timer_interrupt (arch/x86/kernel/apic/apic.c:1049 arch/x86/kernel/apic/apic.c:1049)
[ 721.016779][ C0] handle_exception (arch/x86/entry/entry_32.S:1055)
[ 721.016779][ C0] EIP: timekeeping_notify (kernel/time/timekeeping.c:1522)
[ 721.016779][ C0] Code: 5f e9 ff ff 8d 45 e8 e8 57 d4 ff ff 85 ff 74 16 8b 57 5c 85 d2 74 04 89 f8 ff d2 8b 87 88 00 00 00 e8 d5 3e ff ff 85 f6 75 9b <e8> 7f b9 00 00 31 c0 39 1d a4 70 14 84 0f 95 c0 f7 d8 8b 55 f0 2b
All code
========
0: 5f pop %rdi
1: e9 ff ff 8d 45 jmp 0x458e0005
6: e8 e8 57 d4 ff call 0xffffffffffd457f3
b: ff 85 ff 74 16 8b incl -0x74e98b01(%rbp)
11: 57 push %rdi
12: 5c pop %rsp
13: 85 d2 test %edx,%edx
15: 74 04 je 0x1b
17: 89 f8 mov %edi,%eax
19: ff d2 call *%rdx
1b: 8b 87 88 00 00 00 mov 0x88(%rdi),%eax
21: e8 d5 3e ff ff call 0xffffffffffff3efb
26: 85 f6 test %esi,%esi
28: 75 9b jne 0xffffffffffffffc5
2a:* e8 7f b9 00 00 call 0xb9ae <-- trapping instruction
2f: 31 c0 xor %eax,%eax
31: 39 1d a4 70 14 84 cmp %ebx,-0x7beb8f5c(%rip) # 0xffffffff841470db
37: 0f 95 c0 setne %al
3a: f7 d8 neg %eax
3c: 8b 55 f0 mov -0x10(%rbp),%edx
3f: 2b .byte 0x2b
Code starting with the faulting instruction
===========================================
0: e8 7f b9 00 00 call 0xb984
5: 31 c0 xor %eax,%eax
7: 39 1d a4 70 14 84 cmp %ebx,-0x7beb8f5c(%rip) # 0xffffffff841470b1
d: 0f 95 c0 setne %al
10: f7 d8 neg %eax
12: 8b 55 f0 mov -0x10(%rbp),%edx
15: 2b .byte 0x2b
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20250421/202504211553.3ba9400-lkp@intel.com
[1]
--- /pkg/linux/i386-randconfig-r071-20250410/gcc-12/fc2d5cbe541032e74a66599ba843803cebbfed0e/.config 2025-04-15 15:41:11.316836213 +0800
+++ /pkg/linux/i386-randconfig-r071-20250410/gcc-12/f388f60ca9041a95c9b3f157d316ed7c8f297e44/.config 2025-04-15 15:41:17.009901645 +0800
@@ -321,7 +321,7 @@ CONFIG_ARCH_CPUIDLE_HALTPOLL=y
# CONFIG_PVH is not set
# CONFIG_PARAVIRT_TIME_ACCOUNTING is not set
CONFIG_PARAVIRT_CLOCK=y
-# CONFIG_M486SX is not set
+CONFIG_M486SX=y
# CONFIG_M486 is not set
# CONFIG_M586 is not set
# CONFIG_M586TSC is not set
@@ -333,7 +333,6 @@ CONFIG_PARAVIRT_CLOCK=y
# CONFIG_MPENTIUM4 is not set
# CONFIG_MK6 is not set
# CONFIG_MK7 is not set
-CONFIG_MK8=y
# CONFIG_MCRUSOE is not set
# CONFIG_MEFFICEON is not set
# CONFIG_MWINCHIPC6 is not set
@@ -344,26 +343,24 @@ CONFIG_MK8=y
# CONFIG_MCYRIXIII is not set
# CONFIG_MVIAC3_2 is not set
# CONFIG_MVIAC7 is not set
-# CONFIG_MCORE2 is not set
# CONFIG_MATOM is not set
CONFIG_X86_GENERIC=y
CONFIG_X86_INTERNODE_CACHE_SHIFT=6
CONFIG_X86_L1_CACHE_SHIFT=6
+CONFIG_X86_F00F_BUG=y
+CONFIG_X86_INVD_BUG=y
+CONFIG_X86_ALIGNMENT_16=y
CONFIG_X86_INTEL_USERCOPY=y
-CONFIG_X86_USE_PPRO_CHECKSUM=y
-CONFIG_X86_TSC=y
-CONFIG_X86_HAVE_PAE=y
-CONFIG_X86_CMPXCHG64=y
-CONFIG_X86_CMOV=y
-CONFIG_X86_MINIMUM_CPU_FAMILY=6
-CONFIG_X86_DEBUGCTLMSR=y
+CONFIG_X86_MINIMUM_CPU_FAMILY=4
CONFIG_IA32_FEAT_CTL=y
CONFIG_X86_VMX_FEATURE_NAMES=y
CONFIG_CPU_SUP_INTEL=y
+CONFIG_CPU_SUP_CYRIX_32=y
CONFIG_CPU_SUP_AMD=y
CONFIG_CPU_SUP_HYGON=y
CONFIG_CPU_SUP_CENTAUR=y
CONFIG_CPU_SUP_TRANSMETA_32=y
+CONFIG_CPU_SUP_UMC_32=y
CONFIG_CPU_SUP_ZHAOXIN=y
CONFIG_CPU_SUP_VORTEX_32=y
CONFIG_HPET_TIMER=y
@@ -410,7 +407,6 @@ CONFIG_X86_MSR=y
# CONFIG_X86_CPUID is not set
# CONFIG_NOHIGHMEM is not set
CONFIG_HIGHMEM4G=y
-# CONFIG_HIGHMEM64G is not set
# CONFIG_VMSPLIT_3G is not set
# CONFIG_VMSPLIT_3G_OPT is not set
CONFIG_VMSPLIT_2G=y
@@ -418,7 +414,6 @@ CONFIG_VMSPLIT_2G=y
# CONFIG_VMSPLIT_1G is not set
CONFIG_PAGE_OFFSET=0x80000000
CONFIG_HIGHMEM=y
-# CONFIG_X86_PAE is not set
# CONFIG_X86_CPA_STATISTICS is not set
CONFIG_ARCH_FLATMEM_ENABLE=y
CONFIG_ARCH_SPARSEMEM_ENABLE=y
@@ -427,6 +422,7 @@ CONFIG_ILLEGAL_POINTER_VALUE=0
# CONFIG_HIGHPTE is not set
CONFIG_X86_CHECK_BIOS_CORRUPTION=y
CONFIG_X86_BOOTPARAM_MEMORY_CORRUPTION_CHECK=y
+# CONFIG_MATH_EMULATION is not set
CONFIG_MTRR=y
CONFIG_MTRR_SANITIZER=y
CONFIG_MTRR_SANITIZER_ENABLE_DEFAULT=0
@@ -472,8 +468,8 @@ CONFIG_USE_X86_SEG_SUPPORT=y
CONFIG_CC_HAS_SLS=y
CONFIG_CC_HAS_RETURN_THUNK=y
CONFIG_CC_HAS_ENTRY_PADDING=y
-CONFIG_FUNCTION_PADDING_CFI=0
-CONFIG_FUNCTION_PADDING_BYTES=4
+CONFIG_FUNCTION_PADDING_CFI=11
+CONFIG_FUNCTION_PADDING_BYTES=16
CONFIG_CPU_MITIGATIONS=y
# CONFIG_MITIGATION_RETPOLINE is not set
# CONFIG_MITIGATION_GDS is not set
@@ -741,7 +737,8 @@ CONFIG_ARCH_HAS_GCOV_PROFILE_ALL=y
CONFIG_HAVE_GCC_PLUGINS=y
# CONFIG_GCC_PLUGINS is not set
CONFIG_FUNCTION_ALIGNMENT_4B=y
-CONFIG_FUNCTION_ALIGNMENT=4
+CONFIG_FUNCTION_ALIGNMENT_16B=y
+CONFIG_FUNCTION_ALIGNMENT=16
# end of General architecture-dependent options
CONFIG_RT_MUTEXES=y
@@ -1114,7 +1111,6 @@ CONFIG_NFC_SHDLC=y
#
# Near Field Communication (NFC) devices
#
-# CONFIG_NFC_MEI_PHY is not set
# CONFIG_NFC_SIM is not set
# CONFIG_NFC_PORT100 is not set
# CONFIG_NFC_PN544_I2C is not set
@@ -1607,9 +1603,7 @@ CONFIG_EEPROM_IDT_89HPESX=y
# CONFIG_CB710_CORE is not set
CONFIG_SENSORS_LIS3_I2C=y
CONFIG_ALTERA_STAPL=y
-CONFIG_INTEL_MEI=y
-CONFIG_INTEL_MEI_ME=y
-# CONFIG_INTEL_MEI_TXE is not set
+# CONFIG_INTEL_MEI is not set
# CONFIG_VMWARE_VMCI is not set
CONFIG_ECHO=y
# CONFIG_MISC_ALCOR_PCI is not set
@@ -3412,7 +3406,6 @@ CONFIG_TQMX86_WDT=y
CONFIG_W83977F_WDT=y
CONFIG_MACHZ_WDT=y
CONFIG_SBC_EPX_C3_WATCHDOG=y
-# CONFIG_INTEL_MEI_WDT is not set
CONFIG_NI903X_WDT=y
# CONFIG_NIC7018_WDT is not set
# CONFIG_MEN_A21_WDT is not set
@@ -5752,7 +5745,6 @@ CONFIG_GENERIC_NET_UTILS=y
# CONFIG_PRIME_NUMBERS is not set
CONFIG_RATIONAL=y
CONFIG_GENERIC_IOMAP=y
-CONFIG_ARCH_USE_CMPXCHG_LOCKREF=y
CONFIG_ARCH_HAS_FAST_MULTIPLIER=y
CONFIG_ARCH_USE_SYM_ANNOTATIONS=y
@@ -6186,7 +6178,6 @@ CONFIG_SAMPLE_VFIO_MDEV_MDPY=y
CONFIG_SAMPLE_VFIO_MDEV_MBOCHS=y
CONFIG_SAMPLE_ANDROID_BINDERFS=y
CONFIG_SAMPLE_VFS=y
-# CONFIG_SAMPLE_INTEL_MEI is not set
# CONFIG_SAMPLE_TPS6594_PFSM is not set
CONFIG_SAMPLE_WATCHDOG=y
CONFIG_SAMPLE_WATCH_QUEUE=y
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
Powered by blists - more mailing lists