Message-ID: <20210322073735.GF32426@xsang-OptiPlex-9020>
Date:   Mon, 22 Mar 2021 15:37:35 +0800
From:   kernel test robot <oliver.sang@...el.com>
To:     Peter Zijlstra <peterz@...radead.org>
Cc:     Ingo Molnar <mingo@...nel.org>,
        Valentin Schneider <valentin.schneider@....com>,
        LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
        lkp@...el.com, ying.huang@...el.com, feng.tang@...el.com,
        zhengjun.xing@...el.com, aubrey.li@...ux.intel.com,
        yu.c.chen@...el.com
Subject: [sched]  9e81889c76:  aim9.signal_test.ops_per_sec -2.9% regression



Greetings,

FYI, we noticed a 2.9% regression in aim9.signal_test.ops_per_sec due to commit:


commit: 9e81889c7648d48dd5fe13f41cbc99f3c362484a ("sched: Fix affine_move_task() self-concurrency")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master


in testcase: aim9
on test machine: 256 threads Intel(R) Genuine Intel(R) CPU 0000 @ 1.30GHz with 112G memory
with the following parameters:

	testtime: 300s
	test: signal_test
	cpufreq_governor: performance
	ucode: 0xffff0190

test-description: Suite IX is the "AIM Independent Resource Benchmark", the famous synthetic benchmark.
test-url: https://sourceforge.net/projects/aimbench/files/aim-suite9/



If you fix the issue, kindly add the following tag:
Reported-by: kernel test robot <oliver.sang@...el.com>


Details are as below:
-------------------------------------------------------------------------------------------------->


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        bin/lkp install                job.yaml  # job file is attached in this email
        bin/lkp split-job --compatible job.yaml
        bin/lkp run                    compatible-job.yaml

=========================================================================================
compiler/cpufreq_governor/kconfig/rootfs/tbox_group/test/testcase/testtime/ucode:
  gcc-9/performance/x86_64-rhel-8.3/debian-10.4-x86_64-20200603.cgz/lkp-knl-f1/signal_test/aim9/300s/0xffff0190

commit: 
  3f1bc119cd ("sched: Optimize migration_cpu_stop()")
  9e81889c76 ("sched: Fix affine_move_task() self-concurrency")

3f1bc119cd7fc987 9e81889c7648d48dd5fe13f41cb 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
    112561            -2.9%     109339        aim9.signal_test.ops_per_sec
    215.24            +1.3%     218.12        aim9.time.system_time
     66.75            -4.1%      64.00        aim9.time.user_time
      0.07 ± 27%     -20.8%       0.05 ±  6%  perf-sched.sch_delay.avg.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64
  52972282            +1.8%   53921623        perf-stat.i.iTLB-load-misses
  52791114            +1.8%   53737428        perf-stat.ps.iTLB-load-misses
    419.00 ±  4%     -39.6%     253.00 ± 29%  interrupts.CPU100.NMI:Non-maskable_interrupts
    419.00 ±  4%     -39.6%     253.00 ± 29%  interrupts.CPU100.PMI:Performance_monitoring_interrupts
    348.33 ± 29%     -41.3%     204.50 ±  4%  interrupts.CPU103.NMI:Non-maskable_interrupts
    348.33 ± 29%     -41.3%     204.50 ±  4%  interrupts.CPU103.PMI:Performance_monitoring_interrupts
    167.17 ±  5%     +81.2%     302.83 ± 21%  interrupts.CPU146.NMI:Non-maskable_interrupts
    167.17 ±  5%     +81.2%     302.83 ± 21%  interrupts.CPU146.PMI:Performance_monitoring_interrupts
     29.50 ±124%     -76.8%       6.83 ± 43%  interrupts.CPU148.RES:Rescheduling_interrupts
    196.50 ± 30%    +173.7%     537.83 ±113%  interrupts.CPU153.NMI:Non-maskable_interrupts
    196.50 ± 30%    +173.7%     537.83 ±113%  interrupts.CPU153.PMI:Performance_monitoring_interrupts
    306.67 ± 32%     -25.2%     229.50 ± 29%  interrupts.CPU167.NMI:Non-maskable_interrupts
    306.67 ± 32%     -25.2%     229.50 ± 29%  interrupts.CPU167.PMI:Performance_monitoring_interrupts
    396.67 ±  3%     -42.6%     227.67 ± 27%  interrupts.CPU179.NMI:Non-maskable_interrupts
    396.67 ±  3%     -42.6%     227.67 ± 27%  interrupts.CPU179.PMI:Performance_monitoring_interrupts
    417.17 ±  3%     -49.8%     209.50 ±  2%  interrupts.CPU183.NMI:Non-maskable_interrupts
    417.17 ±  3%     -49.8%     209.50 ±  2%  interrupts.CPU183.PMI:Performance_monitoring_interrupts
    379.50 ± 20%     -45.5%     206.83        interrupts.CPU188.NMI:Non-maskable_interrupts
    379.50 ± 20%     -45.5%     206.83        interrupts.CPU188.PMI:Performance_monitoring_interrupts
    352.17 ± 28%     -42.1%     204.00 ±  2%  interrupts.CPU189.NMI:Non-maskable_interrupts
    352.17 ± 28%     -42.1%     204.00 ±  2%  interrupts.CPU189.PMI:Performance_monitoring_interrupts
    301.33 ± 18%     -40.3%     179.83 ± 30%  interrupts.CPU236.NMI:Non-maskable_interrupts
    301.33 ± 18%     -40.3%     179.83 ± 30%  interrupts.CPU236.PMI:Performance_monitoring_interrupts
      1598 ±  6%     +11.3%       1779 ±  5%  interrupts.CPU26.CAL:Function_call_interrupts
      1609 ±  3%     +18.3%       1903 ± 11%  interrupts.CPU30.CAL:Function_call_interrupts
      1372 ±  2%     +31.9%       1809 ± 23%  interrupts.CPU51.CAL:Function_call_interrupts
      1477 ± 24%     +64.5%       2430 ± 30%  interrupts.CPU53.CAL:Function_call_interrupts
    260.17 ± 27%     -33.6%     172.83 ±  4%  interrupts.CPU82.NMI:Non-maskable_interrupts
    260.17 ± 27%     -33.6%     172.83 ±  4%  interrupts.CPU82.PMI:Performance_monitoring_interrupts
    299.33 ± 31%     -35.4%     193.33 ±  4%  interrupts.CPU98.NMI:Non-maskable_interrupts
    299.33 ± 31%     -35.4%     193.33 ±  4%  interrupts.CPU98.PMI:Performance_monitoring_interrupts
    364.50 ± 19%     -39.3%     221.17 ± 29%  interrupts.CPU99.NMI:Non-maskable_interrupts
    364.50 ± 19%     -39.3%     221.17 ± 29%  interrupts.CPU99.PMI:Performance_monitoring_interrupts
      5828 ± 12%     -13.9%       5016 ±  4%  interrupts.RES:Rescheduling_interrupts
     92.10            -0.6       91.48        perf-profile.calltrace.cycles-pp.secondary_startup_64_no_verify
      0.81 ±  5%      +0.1        0.90 ±  7%  perf-profile.calltrace.cycles-pp.sighandler
      0.79 ±  6%      +0.1        0.90 ±  7%  perf-profile.calltrace.cycles-pp.kill_pid_info.kill_something_info.__x64_sys_kill.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.89 ±  4%      +0.1        1.02 ±  6%  perf-profile.calltrace.cycles-pp.kill_something_info.__x64_sys_kill.do_syscall_64.entry_SYSCALL_64_after_hwframe.kill
      0.46 ± 45%      +0.2        0.67 ± 13%  perf-profile.calltrace.cycles-pp.ret_from_fork
      0.46 ± 45%      +0.2        0.67 ± 13%  perf-profile.calltrace.cycles-pp.kthread.ret_from_fork
     92.10            -0.6       91.48        perf-profile.children.cycles-pp.secondary_startup_64_no_verify
     92.10            -0.6       91.48        perf-profile.children.cycles-pp.cpu_startup_entry
     92.10            -0.6       91.48        perf-profile.children.cycles-pp.do_idle
      4.69 ± 10%      -0.5        4.22 ±  3%  perf-profile.children.cycles-pp.ktime_get
      0.50 ±  8%      -0.1        0.39 ±  9%  perf-profile.children.cycles-pp.tick_check_broadcast_expired
      0.08 ± 12%      -0.0        0.06 ±  9%  perf-profile.children.cycles-pp.tick_nohz_idle_retain_tick
      0.05 ±  8%      +0.0        0.07 ±  9%  perf-profile.children.cycles-pp.sched_clock_tick
      0.09 ± 36%      +0.1        0.14 ± 31%  perf-profile.children.cycles-pp.drm_atomic_helper_commit
      0.09 ± 36%      +0.1        0.14 ± 31%  perf-profile.children.cycles-pp.commit_tail
      0.09 ± 36%      +0.1        0.14 ± 31%  perf-profile.children.cycles-pp.drm_atomic_helper_commit_tail
      0.09 ± 36%      +0.1        0.14 ± 31%  perf-profile.children.cycles-pp.drm_atomic_helper_commit_planes
      0.09 ± 36%      +0.1        0.14 ± 31%  perf-profile.children.cycles-pp.mgag200_simple_display_pipe_update
      0.09 ± 36%      +0.1        0.14 ± 31%  perf-profile.children.cycles-pp.mgag200_handle_damage
      0.09 ± 36%      +0.1        0.15 ± 30%  perf-profile.children.cycles-pp.drm_atomic_helper_dirtyfb
      0.52 ±  4%      +0.1        0.59 ±  7%  perf-profile.children.cycles-pp.nr_iowait_cpu
      0.68 ±  3%      +0.1        0.76 ±  5%  perf-profile.children.cycles-pp.update_ts_time_stats
      0.33 ±  9%      +0.1        0.43 ± 14%  perf-profile.children.cycles-pp.run_posix_cpu_timers
      0.80 ±  6%      +0.1        0.91 ±  7%  perf-profile.children.cycles-pp.kill_pid_info
      0.89 ±  4%      +0.1        1.03 ±  6%  perf-profile.children.cycles-pp.kill_something_info
      0.54 ±  9%      +0.1        0.67 ± 13%  perf-profile.children.cycles-pp.kthread
      0.54 ± 10%      +0.1        0.68 ± 13%  perf-profile.children.cycles-pp.ret_from_fork
      3.39 ± 14%      -0.5        2.88 ±  5%  perf-profile.self.cycles-pp.ktime_get
      0.50 ±  7%      -0.1        0.38 ±  9%  perf-profile.self.cycles-pp.tick_check_broadcast_expired
      0.07 ± 12%      -0.0        0.04 ± 45%  perf-profile.self.cycles-pp.exit_to_user_mode_prepare
      0.14 ±  4%      -0.0        0.12 ±  9%  perf-profile.self.cycles-pp.arch_cpu_idle_enter
      0.35 ±  4%      +0.0        0.38 ±  5%  perf-profile.self.cycles-pp.irq_enter_rcu
      0.40 ±  2%      +0.1        0.45 ±  3%  perf-profile.self.cycles-pp.update_ts_time_stats
      0.50 ±  4%      +0.1        0.57 ±  7%  perf-profile.self.cycles-pp.nr_iowait_cpu
      0.33 ± 10%      +0.1        0.43 ± 14%  perf-profile.self.cycles-pp.run_posix_cpu_timers
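
As a reading aid for the table above: the %change column for the aim9 rows is consistent with a plain relative change of the second commit's value against the first. The sketch below recomputes it for three rows copied from the table. The helper name pct_change is made up for illustration and is not part of lkp-tests. The perf-profile rows appear to show absolute differences in percentage points instead (for example 92.10 to 91.48 shown as -0.6), so this formula is not applied to them.

```python
def pct_change(base: float, new: float) -> float:
    """Relative change of `new` versus `base`, in percent."""
    return (new - base) / base * 100.0


# (base commit 3f1bc119cd, tested commit 9e81889c76), values from the table.
rows = {
    "aim9.signal_test.ops_per_sec": (112561, 109339),
    "aim9.time.system_time": (215.24, 218.12),
    "aim9.time.user_time": (66.75, 64.00),
}

for name, (base, new) in rows.items():
    # One decimal place, explicit sign, matching the table's layout.
    print(f"{pct_change(base, new):+.1f}%  {name}")
```

Rounded to one decimal place, the three results match the -2.9%, +1.3% and -4.1% reported in the table.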


                                                                                
                            aim9.signal_test.ops_per_sec                        
                                                                                
  118000 +------------------------------------------------------------------+   
         |                            +                                     |   
  116000 |-+                          ::                                    |   
  114000 |-+ +.          +      +.   : :  .+                                |   
         |+.+  +     ++.+ +.++.+  ++.+  :+  + .+                            |   
  112000 |-+   :    :                   +    +                    O         |   
  110000 |-+    :   :                          O             O  O  O O      |   
         |      +.++              O                O  O O  OO  O      O OO O|   
  108000 |-+      O             O    OO OO  OO  O    O   O                  |   
  106000 |-+             OO OO     O       O      O                         |   
         |              O      O                                            |   
  104000 |-+       O  O                                                     |   
  102000 |-+O   O    O                                                      |   
         |O  O O                                                            |   
  100000 +------------------------------------------------------------------+   
                                                                                
                                                                                
[*] bisect-good sample
[O] bisect-bad  sample



Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


---
0DAY/LKP+ Test Infrastructure                   Open Source Technology Center
https://lists.01.org/hyperkitty/list/lkp@lists.01.org       Intel Corporation

Thanks,
Oliver Sang


View attachment "config-5.12.0-rc2-00005-g9e81889c7648" of type "text/plain" (172899 bytes)

View attachment "job-script" of type "text/plain" (7641 bytes)

View attachment "job.yaml" of type "text/plain" (4974 bytes)

View attachment "reproduce" of type "text/plain" (254 bytes)
