lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20211107093845.GB11442@xsang-OptiPlex-9020>
Date:   Sun, 7 Nov 2021 17:38:45 +0800
From:   kernel test robot <oliver.sang@...el.com>
To:     Yafang Shao <laoar.shao@...il.com>
Cc:     lkp@...ts.01.org, lkp@...el.com, mingo@...hat.com,
        peterz@...radead.org, juri.lelli@...hat.com,
        vincent.guittot@...aro.org, dietmar.eggemann@....com,
        rostedt@...dmis.org, bsegall@...gle.com, mgorman@...e.de,
        bristot@...hat.com, linux-kernel@...r.kernel.org,
        Yafang Shao <laoar.shao@...il.com>,
        Valentin Schneider <valentin.schneider@....com>,
        aubrey.li@...ux.intel.com, yu.c.chen@...el.com
Subject: [sched/fair]  64228563c2:
 WARNING:at_kernel/kthread.c:#__kthread_bind_mask



Greeting,

FYI, we noticed the following commit (built with gcc-9):

commit: 64228563c20f024e40e4bdaa51eeec99002c489f ("[RFC PATCH 2/4] sched/fair: Introduce cfs_migration")
url: https://github.com/0day-ci/linux/commits/Yafang-Shao/sched-Introduce-cfs_migration/20211104-225939
base: https://git.kernel.org/cgit/linux/kernel/git/tip/tip.git 8ea9183db4ad8afbcb7089a77c23eaf965b0cacd
patch link: https://lore.kernel.org/lkml/20211104145713.4419-3-laoar.shao@gmail.com

in testcase: leaking-addresses
version: leaking-addresses-x86_64-cf2a85e-1_20211103
with following parameters:

	ucode: 0x28



on test machine: 8 threads 1 sockets Intel(R) Core(TM) i7-4770 CPU @ 3.40GHz with 16G memory

caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):


+--------------------------------------------------+------------+------------+
|                                                  | 812ea7cfb1 | 64228563c2 |
+--------------------------------------------------+------------+------------+
| boot_successes                                   | 10         | 0          |
| boot_failures                                    | 0          | 11         |
| WARNING:at_kernel/kthread.c:#__kthread_bind_mask | 0          | 11         |
| RIP:__kthread_bind_mask                          | 0          | 11         |
+--------------------------------------------------+------------+------------+


If you fix the issue, kindly add following tag
Reported-by: kernel test robot <oliver.sang@...el.com>


[ 3.072411][ T1] WARNING: CPU: 0 PID: 1 at kernel/kthread.c:465 __kthread_bind_mask (kernel/kthread.c:465 (discriminator 1)) 
[    3.073411][    T1] Modules linked in:
[    3.074411][    T1] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 5.15.0-rc4-00071-g64228563c20f #1
[    3.075411][    T1] Hardware name: Dell Inc. OptiPlex 9020/0DNKMN, BIOS A05 12/05/2013
[ 3.076411][ T1] RIP: 0010:__kthread_bind_mask (kernel/kthread.c:465 (discriminator 1)) 
[ 3.077411][ T1] Code: 89 e2 b7 00 66 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 41 55 41 54 55 48 89 f5 89 d6 53 48 89 fb e8 28 bb 01 00 48 85 c0 75 09 <0f> 0b 5b 5d 41 5c 41 5d c3 4c 8d ab a4 0c 00 00 4c 89 ef e8 4b ed
All code
========
   0:	89 e2                	mov    %esp,%edx
   2:	b7 00                	mov    $0x0,%bh
   4:	66 0f 1f 84 00 00 00 	nopw   0x0(%rax,%rax,1)
   b:	00 00 
   d:	0f 1f 44 00 00       	nopl   0x0(%rax,%rax,1)
  12:	41 55                	push   %r13
  14:	41 54                	push   %r12
  16:	55                   	push   %rbp
  17:	48 89 f5             	mov    %rsi,%rbp
  1a:	89 d6                	mov    %edx,%esi
  1c:	53                   	push   %rbx
  1d:	48 89 fb             	mov    %rdi,%rbx
  20:	e8 28 bb 01 00       	callq  0x1bb4d
  25:	48 85 c0             	test   %rax,%rax
  28:	75 09                	jne    0x33
  2a:*	0f 0b                	ud2    		<-- trapping instruction
  2c:	5b                   	pop    %rbx
  2d:	5d                   	pop    %rbp
  2e:	41 5c                	pop    %r12
  30:	41 5d                	pop    %r13
  32:	c3                   	retq   
  33:	4c 8d ab a4 0c 00 00 	lea    0xca4(%rbx),%r13
  3a:	4c 89 ef             	mov    %r13,%rdi
  3d:	e8                   	.byte 0xe8
  3e:	4b ed                	rex.WXB in (%dx),%eax

Code starting with the faulting instruction
===========================================
   0:	0f 0b                	ud2    
   2:	5b                   	pop    %rbx
   3:	5d                   	pop    %rbp
   4:	41 5c                	pop    %r12
   6:	41 5d                	pop    %r13
   8:	c3                   	retq   
   9:	4c 8d ab a4 0c 00 00 	lea    0xca4(%rbx),%r13
  10:	4c 89 ef             	mov    %r13,%rdi
  13:	e8                   	.byte 0xe8
  14:	4b ed                	rex.WXB in (%dx),%eax
[    3.078411][    T1] RSP: 0000:ffffc9000002fdd8 EFLAGS: 00010246
[    3.079411][    T1] RAX: 0000000000000000 RBX: ffff888100aed000 RCX: 0000000000000000
[    3.080411][    T1] RDX: 0000000000000001 RSI: 0000000000000246 RDI: 00000000ffffffff
[    3.081411][    T1] RBP: ffffffff822101e0 R08: ffffffff8284cec8 R09: ffffffff8284cec8
[    3.082411][    T1] R10: 0000000000000000 R11: ffff8883fda278f0 R12: 0000000000000008
[    3.083411][    T1] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[    3.084411][    T1] FS:  0000000000000000(0000) GS:ffff8883fda00000(0000) knlGS:0000000000000000
[    3.085411][    T1] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    3.086411][    T1] CR2: ffff88841ea01000 CR3: 000000041da10001 CR4: 00000000001706f0
[    3.087411][    T1] Call Trace:
[ 3.088413][ T1] kthread_unpark (kernel/kthread.c:478 kernel/kthread.c:570) 
[ 3.089411][ T1] cfs_migration_init (kernel/sched/fair.c:12029 (discriminator 3)) 
[ 3.090412][ T1] ? setup_sched_thermal_decay_shift (kernel/sched/fair.c:12014) 
[ 3.091411][ T1] do_one_initcall (init/main.c:1303) 
[ 3.092412][ T1] kernel_init_freeable (init/main.c:1419 init/main.c:1603) 
[ 3.093412][ T1] ? rest_init (init/main.c:1497) 
[ 3.094411][ T1] kernel_init (init/main.c:1507) 
[ 3.095411][ T1] ret_from_fork (arch/x86/entry/entry_64.S:301) 
[    3.096413][    T1] ---[ end trace 221e592f8f64f075 ]---
[    3.097411][    T1] rcu: Hierarchical SRCU implementation.
[    3.098989][    T5] NMI watchdog: Enabled. Permanently consumes one hw-PMU counter.
[    3.099473][    T1] smp: Bringing up secondary CPUs ...
[    3.100482][    T1] x86: Booting SMP configuration:
[    3.101412][    T1] .... node  #0, CPUs:      #1
[    0.534696][    T0] masked ExtINT on CPU#1
[    3.110490][    T1]  #2
[    0.534696][    T0] masked ExtINT on CPU#2
[    3.117413][    T1]  #3
[    0.534696][    T0] masked ExtINT on CPU#3
[    3.124184][    T1]  #4
[    0.534696][    T0] masked ExtINT on CPU#4
[    3.130933][    T1] MDS CPU bug present and SMT on, data leak possible. See https://www.kernel.org/doc/html/latest/admin-guide/hw-vuln/mds.html for more details.
[    3.131497][    T1]  #5
[    0.534696][    T0] masked ExtINT on CPU#5
[    3.138269][    T1]  #6
[    0.534696][    T0] masked ExtINT on CPU#6
[    3.145166][    T1]  #7
[    0.534696][    T0] masked ExtINT on CPU#7
[    3.152059][    T1] smp: Brought up 1 node, 8 CPUs
[    3.152412][    T1] smpboot: Max logical packages: 1
[    3.153411][    T1] smpboot: Total of 8 processors activated (54273.93 BogoMIPS)
[    3.177958][   T60] node 0 deferred pages initialised in 22ms
[    3.184695][    T1] devtmpfs: initialized
[    3.185443][    T1] x86/mm: Memory block size: 128MB
[    3.187216][    T1] ACPI: PM: Registering ACPI NVS region [mem 0xd1695000-0xd169bfff] (28672 bytes)
[    3.187412][    T1] ACPI: PM: Registering ACPI NVS region [mem 0xda71c000-0xda7fffff] (933888 bytes)
[    3.188450][    T1] clocksource: jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 1911260446275000 ns
[    3.189413][    T1] futex hash table entries: 2048 (order: 5, 131072 bytes, linear)
[    3.190454][    T1] pinctrl core: initialized pinctrl subsystem
[    3.191527][    T1] NET: Registered PF_NETLINK/PF_ROUTE protocol family
[    3.192555][    T1] audit: initializing netlink subsys (disabled)
[    3.193423][   T71] audit: type=2000 audit(1636161192.177:1): state=initialized audit_enabled=0 res=1
[    3.193471][    T1] thermal_sys: Registered thermal governor 'fair_share'
[    3.194412][    T1] thermal_sys: Registered thermal governor 'bang_bang'
[    3.195411][    T1] thermal_sys: Registered thermal governor 'step_wise'
[    3.196411][    T1] thermal_sys: Registered thermal governor 'user_space'
[    3.197420][    T1] cpuidle: using governor menu
[    3.199511][    T1] ACPI: bus type PCI registered
[    3.200412][    T1] acpiphp: ACPI Hot Plug PCI Controller Driver version: 0.5
[    3.201454][    T1] PCI: MMCONFIG for domain 0000 [bus 00-3f] at [mem 0xf8000000-0xfbffffff] (base 0xf8000000)
[    3.202412][    T1] PCI: MMCONFIG at [mem 0xf8000000-0xfbffffff] reserved in E820
[    3.203416][    T1] pmd_set_huge: Cannot satisfy [mem 0xf8000000-0xf8200000] with a huge-page mapping due to MTRR override.
[    3.204444][    T1] PCI: Using configuration type 1 for base access
[    3.205522][    T1] core: PMU erratum BJ122, BV98, HSD29 worked around, HT is on
[    3.207568][    T1] Kprobes globally optimized
[    3.208425][    T1] HugeTLB registered 1.00 GiB page size, pre-allocated 0 pages
[    3.209412][    T1] HugeTLB registered 2.00 MiB page size, pre-allocated 0 pages
[    3.210428][    T1] cryptd: max_cpu_qlen set to 1000
[    3.211440][    T1] ACPI: Added _OSI(Module Device)
[    3.212415][    T1] ACPI: Added _OSI(Processor Device)
[    3.213411][    T1] ACPI: Added _OSI(3.0 _SCP Extensions)
[    3.214411][    T1] ACPI: Added _OSI(Processor Aggregator Device)
[    3.215411][    T1] ACPI: Added _OSI(Linux-Dell-Video)
[    3.216411][    T1] ACPI: Added _OSI(Linux-Lenovo-NV-HDMI-Audio)
[    3.217411][    T1] ACPI: Added _OSI(Linux-HPI-Hybrid-Graphics)
[    3.224227][    T1] ACPI: 6 ACPI AML tables successfully acquired and loaded
[    3.225178][    T1] ACPI: [Firmware Bug]: BIOS _OSI(Linux) query ignored
[    3.225803][    T1] ACPI: Dynamic OEM Table Load:
[    3.226414][    T1] ACPI: SSDT 0xFFFF88841EAB5000 0003D3 (v01 PmRef  Cpu0Cst  00003001 INTL 20120711)
[    3.227913][    T1] ACPI: Dynamic OEM Table Load:
[    3.228414][    T1] ACPI: SSDT 0xFFFF88841EAC4000 0005AA (v01 PmRef  ApIst    00003000 INTL 20120711)
[    3.229916][    T1] ACPI: Dynamic OEM Table Load:
[    3.230413][    T1] ACPI: SSDT 0xFFFF888100EF8800 000119 (v01 PmRef  ApCst    00003000 INTL 20120711)
[    3.232725][    T1] ACPI: Interpreter enabled
[    3.233432][    T1] ACPI: PM: (supports S0 S3 S4 S5)
[    3.234411][    T1] ACPI: Using IOAPIC for interrupt routing
[    3.235430][    T1] PCI: Using host bridge windows from ACPI; if necessary, use "pci=nocrs" and report a bug
[    3.236560][    T1] ACPI: Enabled 9 GPEs in block 00 to 3F
[    3.243366][    T1] ACPI: PCI Root Bridge [PCI0] (domain 0000 [bus 00-3e])
[    3.243414][    T1] acpi PNP0A08:00: _OSC: OS supports [ExtendedConfig ASPM ClockPM Segments MSI HPX-Type3]
[    3.244781][    T1] acpi PNP0A08:00: _OSC: OS now controls [PCIeHotplug SHPCHotplug PME AER PCIeCapability LTR]
[    3.245679][    T1] PCI host bridge to bus 0000:00
[    3.246412][    T1] pci_bus 0000:00: root bus resource [io  0x0000-0x0cf7 window]
[    3.247411][    T1] pci_bus 0000:00: root bus resource [io  0x0d00-0xffff window]
[    3.248411][    T1] pci_bus 0000:00: root bus resource [mem 0x000a0000-0x000bffff window]
[    3.249411][    T1] pci_bus 0000:00: root bus resource [mem 0x000d4000-0x000d7fff window]
[    3.250411][    T1] pci_bus 0000:00: root bus resource [mem 0x000d8000-0x000dbfff window]
[    3.251411][    T1] pci_bus 0000:00: root bus resource [mem 0x000dc000-0x000dffff window]
[    3.252411][    T1] pci_bus 0000:00: root bus resource [mem 0x000e0000-0x000e3fff window]
[    3.253411][    T1] pci_bus 0000:00: root bus resource [mem 0x000e4000-0x000e7fff window]
[    3.254411][    T1] pci_bus 0000:00: root bus resource [mem 0xdf200000-0xfeafffff window]
[    3.255411][    T1] pci_bus 0000:00: root bus resource [bus 00-3e]
[    3.256436][    T1] pci 0000:00:00.0: [8086:0c00] type 00 class 0x060000
[    3.257508][    T1] pci 0000:00:02.0: [8086:0412] type 00 class 0x030000
[    3.258416][    T1] pci 0000:00:02.0: reg 0x10: [mem 0xf7800000-0xf7bfffff 64bit]
[    3.259414][    T1] pci 0000:00:02.0: reg 0x18: [mem 0xe0000000-0xefffffff 64bit pref]


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        sudo bin/lkp install job.yaml           # job file is attached in this email
        bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
        sudo bin/lkp run generated-yaml-file

        # if come across any failure that blocks the test,
        # please remove ~/.lkp and /lkp dir to run from a clean state.



---
0DAY/LKP+ Test Infrastructure                   Open Source Technology Center
https://lists.01.org/hyperkitty/list/lkp@lists.01.org       Intel Corporation

Thanks,
Oliver Sang


View attachment "config-5.15.0-rc4-00071-g64228563c20f" of type "text/plain" (174086 bytes)

View attachment "job-script" of type "text/plain" (5461 bytes)

Download attachment "dmesg.xz" of type "application/x-xz" (20580 bytes)

View attachment "leaking-addresses" of type "text/plain" (3065 bytes)

View attachment "job.yaml" of type "text/plain" (4457 bytes)

View attachment "reproduce" of type "text/plain" (125 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ