lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Date:   Tue, 7 Mar 2023 10:16:47 +0800
From:   kernel test robot <oliver.sang@...el.com>
To:     Zhang Qiao <zhangqiao22@...wei.com>
CC:     <oe-lkp@...ts.linux.dev>, <lkp@...el.com>,
        Vincent Guittot <vincent.guittot@...aro.org>,
        <linux-kernel@...r.kernel.org>, <mingo@...hat.com>,
        <peterz@...radead.org>, <juri.lelli@...hat.com>,
        <dietmar.eggemann@....com>, <rostedt@...dmis.org>,
        <bsegall@...gle.com>, <mgorman@...e.de>, <bristot@...hat.com>,
        <vschneid@...hat.com>, <rkagan@...zon.de>, <zhangqiao22@...wei.com>
Subject: Re: [PATCH v2] sched/fair: sanitize vruntime of entity being migrated


Greeting,

FYI, we noticed WARNING:at_kernel/sched/sched.h:#assert_clock_updated due to commit (built with gcc-11):

commit: 77900e3a6934d2a2e33f26775447e1dceeb1c503 ("[PATCH v2] sched/fair: sanitize vruntime of entity being migrated")
url: https://github.com/intel-lab-lkp/linux/commits/Zhang-Qiao/sched-fair-sanitize-vruntime-of-entity-being-migrated/20230306-205822
base: https://git.kernel.org/cgit/linux/kernel/git/tip/tip.git 7c4a5b89a0b5a57a64b601775b296abf77a9fe97
patch link: https://lore.kernel.org/all/20230306132418.50389-1-zhangqiao22@huawei.com/
patch subject: [PATCH v2] sched/fair: sanitize vruntime of entity being migrated

in testcase: boot

on test machine: qemu-system-x86_64 -enable-kvm -cpu SandyBridge -smp 2 -m 16G

caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):


If you fix the issue, kindly add following tag
| Reported-by: kernel test robot <oliver.sang@...el.com>
| Link: https://lore.kernel.org/oe-lkp/202303070945.c7b65897-oliver.sang@intel.com


[    0.702980][    T1] ------------[ cut here ]------------
[    0.704539][    T1] rq->clock_update_flags < RQCF_ACT_SKIP
[ 0.704551][ T1] WARNING: CPU: 0 PID: 1 at kernel/sched/sched.h:1496 assert_clock_updated+0x21/0x24 
[    0.706944][    T1] Modules linked in:
[    0.706944][    T1] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 6.2.0-rc6-00112-g77900e3a6934 #11
[    0.706944][    T1] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.16.0-debian-1.16.0-5 04/01/2014
[ 0.706944][ T1] RIP: assert_clock_updated+0x21/0x24 
[ 0.706944][ T1] Code: 01 85 c0 7e 03 ff 47 60 c3 83 ff 01 77 1e 80 3d 1a 1c 89 01 00 75 15 48 c7 c7 ce fd 22 82 c6 05 0a 1c 89 01 01 e8 27 ea fc ff <0f> 0b c3 0f 1f 44 00 00 41 55 41 54 49 89 fc 55 53 48 8b af 30 01
All code
========
   0:	01 85 c0 7e 03 ff    	add    %eax,-0xfc8140(%rbp)
   6:	47 60                	rex.RXB (bad) 
   8:	c3                   	retq   
   9:	83 ff 01             	cmp    $0x1,%edi
   c:	77 1e                	ja     0x2c
   e:	80 3d 1a 1c 89 01 00 	cmpb   $0x0,0x1891c1a(%rip)        # 0x1891c2f
  15:	75 15                	jne    0x2c
  17:	48 c7 c7 ce fd 22 82 	mov    $0xffffffff8222fdce,%rdi
  1e:	c6 05 0a 1c 89 01 01 	movb   $0x1,0x1891c0a(%rip)        # 0x1891c2f
  25:	e8 27 ea fc ff       	callq  0xfffffffffffcea51
  2a:*	0f 0b                	ud2    		<-- trapping instruction
  2c:	c3                   	retq   
  2d:	0f 1f 44 00 00       	nopl   0x0(%rax,%rax,1)
  32:	41 55                	push   %r13
  34:	41 54                	push   %r12
  36:	49 89 fc             	mov    %rdi,%r12
  39:	55                   	push   %rbp
  3a:	53                   	push   %rbx
  3b:	48                   	rex.W
  3c:	8b                   	.byte 0x8b
  3d:	af                   	scas   %es:(%rdi),%eax
  3e:	30 01                	xor    %al,(%rcx)

Code starting with the faulting instruction
===========================================
   0:	0f 0b                	ud2    
   2:	c3                   	retq   
   3:	0f 1f 44 00 00       	nopl   0x0(%rax,%rax,1)
   8:	41 55                	push   %r13
   a:	41 54                	push   %r12
   c:	49 89 fc             	mov    %rdi,%r12
   f:	55                   	push   %rbp
  10:	53                   	push   %rbx
  11:	48                   	rex.W
  12:	8b                   	.byte 0x8b
  13:	af                   	scas   %es:(%rdi),%eax
  14:	30 01                	xor    %al,(%rcx)
[    0.706944][    T1] RSP: 0000:ffffc90000013d28 EFLAGS: 00010082
[    0.706944][    T1] RAX: 0000000000000000 RBX: ffff88810cba1fc0 RCX: 0000000000000003
[    0.706944][    T1] RDX: 0000000000000105 RSI: 0000000000000001 RDI: 0000000000000001
[    0.706944][    T1] RBP: ffff88842fc2c580 R08: 0000000000000000 R09: 0000000000000026
[    0.706944][    T1] R10: 0000000000000000 R11: 000000002d2d2d2d R12: ffff88842fc2c600
[    0.706944][    T1] R13: 0000000000000001 R14: 0000000000000001 R15: ffff88810cba1f40
[    0.706944][    T1] FS:  0000000000000000(0000) GS:ffff88842fc00000(0000) knlGS:0000000000000000
[    0.706944][    T1] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    0.706944][    T1] CR2: ffff88843ffff000 CR3: 0000000002412000 CR4: 00000000000406f0
[    0.706944][    T1] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[    0.706944][    T1] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[    0.706944][    T1] Call Trace:
[    0.706944][    T1]  <TASK>
[ 0.706944][ T1] entity_is_long_sleep+0x1b/0x38 
[ 0.706944][ T1] migrate_task_rq_fair (fair.c:?) 
[ 0.706944][ T1] set_task_cpu (??:?) 
[ 0.706944][ T1] try_to_wake_up (core.c:?) 
[ 0.706944][ T1] ? __kthread_bind_mask (kthread.c:?) 
[ 0.706944][ T1] bringup_cpu (cpu.c:?) 
[ 0.706944][ T1] ? cpu_smt_allowed (cpu.c:?) 
[ 0.706944][ T1] cpuhp_invoke_callback (cpu.c:?) 
[ 0.706944][ T1] __cpuhp_invoke_callback_range (cpu.c:?) 
[ 0.706944][ T1] _cpu_up (cpu.c:?) 
[ 0.706944][ T1] cpu_up (cpu.c:?) 
[ 0.706944][ T1] bringup_nonboot_cpus (??:?) 
[ 0.706944][ T1] smp_init (??:?) 
[ 0.706944][ T1] kernel_init_freeable (main.c:?) 
[ 0.706944][ T1] ? rest_init (main.c:?) 
[ 0.706944][ T1] kernel_init (main.c:?) 
[ 0.706944][ T1] ret_from_fork (??:?) 
[    0.706944][    T1]  </TASK>
[    0.706944][    T1] ---[ end trace 0000000000000000 ]---
[    0.707026][    T1] smp: Brought up 1 node, 2 CPUs
[    0.707887][    T1] smpboot: Max logical packages: 1
[    0.708793][    T1] smpboot: Total of 2 processors activated (8779.66 BogoMIPS)
[    0.711760][    T1] devtmpfs: initialized
[    0.711827][    T1] x86/mm: Memory block size: 128MB
[ 0.716871][ T1] calling ipc_ns_init+0x0/0x18 @ 1 
[ 0.717857][ T1] initcall ipc_ns_init+0x0/0x18 returned 0 after 0 usecs 
[ 0.718963][ T1] calling init_mmap_min_addr+0x0/0x1a @ 1 
[ 0.719972][ T1] initcall init_mmap_min_addr+0x0/0x1a returned 0 after 0 usecs 
[ 0.721432][ T1] calling pci_realloc_setup_params+0x0/0x49 @ 1 
[ 0.722492][ T1] initcall pci_realloc_setup_params+0x0/0x49 returned 0 after 0 usecs 
[ 0.723421][ T1] calling inet_frag_wq_init+0x0/0x41 @ 1 
[ 0.726984][ T1] initcall inet_frag_wq_init+0x0/0x41 returned 0 after 4000 usecs 
[ 0.728809][ T1] calling e820__register_nvs_regions+0x0/0x3c @ 1 
[ 0.729914][ T1] initcall e820__register_nvs_regions+0x0/0x3c returned 0 after 0 usecs 
[ 0.730959][ T1] calling cpufreq_register_tsc_scaling+0x0/0x2e @ 1 
[ 0.732104][ T1] initcall cpufreq_register_tsc_scaling+0x0/0x2e returned 0 after 0 usecs 
[ 0.733576][ T1] calling cache_ap_register+0x0/0x2c @ 1 
[ 0.734609][ T1] initcall cache_ap_register+0x0/0x2c returned 0 after 0 usecs 
[ 0.735412][ T1] calling reboot_init+0x0/0x41 @ 1 
[ 0.736372][ T1] initcall reboot_init+0x0/0x41 returned 0 after 0 usecs 
[ 0.737551][ T1] calling init_lapic_sysfs+0x0/0x25 @ 1 
[ 0.738527][ T1] initcall init_lapic_sysfs+0x0/0x25 returned 0 after 0 usecs 
[ 0.739387][ T1] calling alloc_frozen_cpus+0x0/0xc @ 1 
[ 0.740351][ T1] initcall alloc_frozen_cpus+0x0/0xc returned 0 after 0 usecs 
[ 0.743426][ T1] calling cpu_hotplug_pm_sync_init+0x0/0x18 @ 1 
[ 0.744547][ T1] initcall cpu_hotplug_pm_sync_init+0x0/0x18 returned 0 after 0 usecs 
[ 0.745965][ T1] calling wq_sysfs_init+0x0/0x2f @ 1 
[ 0.746922][ T1] initcall wq_sysfs_init+0x0/0x2f returned 0 after 0 usecs 
[ 0.747307][ T1] calling ksysfs_init+0x0/0x9d @ 1 
[ 0.748263][ T1] initcall ksysfs_init+0x0/0x9d returned 0 after 0 usecs 
[ 0.749522][ T1] calling schedutil_gov_init+0x0/0x15 @ 1 
[ 0.750537][ T1] initcall schedutil_gov_init+0x0/0x15 returned 0 after 0 usecs 
[ 0.751416][ T1] calling pm_init+0x0/0x77 @ 1 
[ 0.752336][ T1] initcall pm_init+0x0/0x77 returned 0 after 0 usecs 
[ 0.753480][ T1] calling pm_disk_init+0x0/0x1c @ 1 
[ 0.754434][ T1] initcall pm_disk_init+0x0/0x1c returned 0 after 0 usecs 
[ 0.755378][ T1] calling swsusp_header_init+0x0/0x30 @ 1 
[ 0.756414][ T1] initcall swsusp_header_init+0x0/0x30 returned 0 after 0 usecs 
[ 0.757816][ T1] calling rcu_set_runtime_mode+0x0/0x1b @ 1 
[ 0.758858][ T1] initcall rcu_set_runtime_mode+0x0/0x1b returned 0 after 0 usecs 
[ 0.759395][ T1] calling init_jiffies_clocksource+0x0/0x1c @ 1 
[    0.760446][    T1] clocksource: jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 7645041785100000 ns
[ 0.762235][ T1] initcall init_jiffies_clocksource+0x0/0x1c returned 0 after 0 usecs 
[ 0.763415][ T1] calling futex_init+0x0/0xcf @ 1 
[    0.764361][    T1] futex hash table entries: 512 (order: 3, 32768 bytes, linear)
[ 0.765718][ T1] initcall futex_init+0x0/0xcf returned 0 after 0 usecs 
[ 0.766890][ T1] calling cgroup_wq_init+0x0/0x2d @ 1 
[ 0.767275][ T1] initcall cgroup_wq_init+0x0/0x2d returned 0 after 0 usecs 
[ 0.768502][ T1] calling cgroup1_wq_init+0x0/0x2d @ 1 
[ 0.769581][ T1] initcall cgroup1_wq_init+0x0/0x2d returned 0 after 0 usecs 
[ 0.770943][ T1] calling ftrace_mod_cmd_init+0x0/0x10 @ 1 
[ 0.772006][ T1] initcall ftrace_mod_cmd_init+0x0/0x10 returned 0 after 0 usecs 
[ 0.773409][ T1] calling init_graph_trace+0x0/0x61 @ 1 
[ 0.774336][ T1] initcall init_graph_trace+0x0/0x61 returned 0 after 0 usecs 
[ 0.778960][ T1] calling trace_events_eprobe_init_early+0x0/0x2b @ 1 
[ 0.780166][ T1] initcall trace_events_eprobe_init_early+0x0/0x2b returned 0 after 0 usecs 
[ 0.781739][ T1] calling trace_events_synth_init_early+0x0/0x2b @ 1 
[ 0.782877][ T1] initcall trace_events_synth_init_early+0x0/0x2b returned 0 after 0 usecs 
[ 0.783421][ T1] calling init_kprobe_trace_early+0x0/0x2a @ 1 
[ 0.784494][ T1] initcall init_kprobe_trace_early+0x0/0x2a returned 0 after 0 usecs 
[ 0.785931][ T1] calling memory_failure_init+0x0/0x9a @ 1 
[ 0.786956][ T1] initcall memory_failure_init+0x0/0x9a returned 0 after 0 usecs 
[ 0.788354][ T1] calling cma_init_reserved_areas+0x0/0x2f @ 1 
[ 0.790834][ T1] initcall cma_init_reserved_areas+0x0/0x2f returned 0 after 0 usecs 
[ 0.791422][ T1] calling fsnotify_init+0x0/0x4d @ 1 
[ 0.792399][ T1] initcall fsnotify_init+0x0/0x4d returned 0 after 0 usecs 
[ 0.793578][ T1] calling filelock_init+0x0/0xa4 @ 1 
[ 0.794580][ T1] initcall filelock_init+0x0/0xa4 returned 0 after 0 usecs 
[ 0.795312][ T1] calling init_script_binfmt+0x0/0x1a @ 1 
[ 0.796310][ T1] initcall init_script_binfmt+0x0/0x1a returned 0 after 0 usecs 
[ 0.797654][ T1] calling init_elf_binfmt+0x0/0x1a @ 1 
[ 0.798613][ T1] initcall init_elf_binfmt+0x0/0x1a returned 0 after 0 usecs 
[ 0.799388][ T1] calling init_compat_elf_binfmt+0x0/0x1a @ 1 
[ 0.800453][ T1] initcall init_compat_elf_binfmt+0x0/0x1a returned 0 after 0 usecs 


To reproduce:

        # build kernel
	cd linux
	cp config-6.2.0-rc6-00112-g77900e3a6934 .config
	make HOSTCC=gcc-11 CC=gcc-11 ARCH=x86_64 olddefconfig prepare modules_prepare bzImage modules
	make HOSTCC=gcc-11 CC=gcc-11 ARCH=x86_64 INSTALL_MOD_PATH=<mod-install-dir> modules_install
	cd <mod-install-dir>
	find lib/ | cpio -o -H newc --quiet | gzip > modules.cgz


        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        bin/lkp qemu -k <bzImage> -m modules.cgz job-script # job-script is attached in this email

        # if come across any failure that blocks the test,
        # please remove ~/.lkp and /lkp dir to run from a clean state.



-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests



View attachment "config-6.2.0-rc6-00112-g77900e3a6934" of type "text/plain" (130980 bytes)

View attachment "job-script" of type "text/plain" (5103 bytes)

Download attachment "dmesg.xz" of type "application/x-xz" (28136 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ