Message-ID: <20210513091506.GC20142@xsang-OptiPlex-9020>
Date:   Thu, 13 May 2021 17:15:06 +0800
From:   kernel test robot <oliver.sang@...el.com>
To:     Kees Cook <keescook@...omium.org>
Cc:     Thomas Gleixner <tglx@...utronix.de>,
        LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
        lkp@...el.com, ying.huang@...el.com, feng.tang@...el.com,
        zhengjun.xing@...el.com
Subject: [x86/entry]  fe950f6020:  will-it-scale.per_thread_ops 5.2%
 improvement



Greetings,

FYI, we noticed a 5.2% improvement in will-it-scale.per_thread_ops due to commit:


commit: fe950f6020338c8ac668ef823bb692d36b7542a2 ("x86/entry: Enable random_kstack_offset support")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master


in testcase: will-it-scale
on test machine: 88 threads 2 sockets Intel(R) Xeon(R) Gold 6238M CPU @ 2.10GHz with 128G memory
with following parameters:

	nr_task: 50%
	mode: thread
	test: lseek1
	cpufreq_governor: performance
	ucode: 0x5003006
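
As a quick sanity check on these parameters (a sketch, not part of the original report): assuming "nr_task: 50%" means half of the machine's 88 hardware threads, the test runs 44 tasks, which matches the will-it-scale.44.threads metric in the comparison table. Multiplying the reported per_thread_ops by that task count reproduces the reported workload figures. The helper name task_count is hypothetical.

```python
# Assumption: nr_task=50% is interpreted as 50% of the hardware threads.
HW_THREADS = 88
NR_TASK_PERCENT = 50


def task_count(hw_threads, percent):
    """Number of parallel tasks for a percentage-style nr_task value."""
    return hw_threads * percent // 100


tasks = task_count(HW_THREADS, NR_TASK_PERCENT)

# per_thread_ops values copied from the lseek1 comparison table.
base_workload = 6907369 * tasks  # parent commit 39218ff4c6
new_workload = 7265144 * tasks   # commit fe950f6020

print(tasks, f"{base_workload:.3e}", f"{new_workload:.3e}")
```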

test-description: Will It Scale takes a testcase and runs it from 1 through n parallel copies to see whether the testcase scales. It builds both a process-based and a thread-based version of each test in order to expose any differences between the two.
test-url: https://github.com/antonblanchard/will-it-scale
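
For readers unfamiliar with the benchmark, the following is a rough Python sketch of the idea in thread mode. It assumes the lseek testcases amount to a tight loop of lseek() calls on a per-task temporary file, with completed calls counted per task. The real testcases are written in C in the repository above and run for a fixed time rather than a fixed iteration count; the function names here are hypothetical.

```python
import os
import tempfile
import threading


def lseek_loop(iterations, counts, idx):
    """One parallel copy: seek to offset 0 repeatedly and count the calls."""
    with tempfile.TemporaryFile() as f:
        fd = f.fileno()
        done = 0
        for _ in range(iterations):
            os.lseek(fd, 0, os.SEEK_SET)
            done += 1
        counts[idx] = done


def run_copies(n_copies, iterations=1000):
    """Run n_copies of the loop as threads and return per-thread op counts."""
    counts = [0] * n_copies
    threads = [
        threading.Thread(target=lseek_loop, args=(iterations, counts, i))
        for i in range(n_copies)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return counts


if __name__ == "__main__":
    print(run_copies(2))
```

Each call is a very short syscall, so the result is dominated by syscall entry and exit cost, which is the code path the commit under test modifies.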

In addition to that, the commit also has a significant impact on the following tests:

+------------------+-------------------------------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_thread_ops 5.5% improvement                        |
| test machine     | 88 threads 2 sockets Intel(R) Xeon(R) Gold 6238M CPU @ 2.10GHz with 128G memory     |
| test parameters  | cpufreq_governor=performance                                                        |
|                  | mode=thread                                                                         |
|                  | nr_task=50%                                                                         |
|                  | test=lseek2                                                                         |
|                  | ucode=0x5003006                                                                     |
+------------------+-------------------------------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_process_ops 7.8% improvement                       |
| test machine     | 88 threads 2 sockets Intel(R) Xeon(R) Gold 6238M CPU @ 2.10GHz with 128G memory     |
| test parameters  | cpufreq_governor=performance                                                        |
|                  | mode=process                                                                        |
|                  | nr_task=50%                                                                         |
|                  | test=lseek2                                                                         |
|                  | ucode=0x5003006                                                                     |
+------------------+-------------------------------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_thread_ops 5.5% improvement                        |
| test machine     | 88 threads 2 sockets Intel(R) Xeon(R) Gold 6238M CPU @ 2.10GHz with 128G memory     |
| test parameters  | cpufreq_governor=performance                                                        |
|                  | mode=thread                                                                         |
|                  | nr_task=16                                                                          |
|                  | test=lseek2                                                                         |
|                  | ucode=0x5003006                                                                     |
+------------------+-------------------------------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_process_ops -1.7% regression                       |
| test machine     | 192 threads 4 sockets Intel(R) Xeon(R) Platinum 9242 CPU @ 2.30GHz with 192G memory |
| test parameters  | cpufreq_governor=performance                                                        |
|                  | mode=process                                                                        |
|                  | nr_task=50%                                                                         |
|                  | test=dup1                                                                           |
|                  | ucode=0x5003006                                                                     |
+------------------+-------------------------------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_process_ops 7.9% improvement                       |
| test machine     | 88 threads 2 sockets Intel(R) Xeon(R) Gold 6238M CPU @ 2.10GHz with 128G memory     |
| test parameters  | cpufreq_governor=performance                                                        |
|                  | mode=process                                                                        |
|                  | nr_task=16                                                                          |
|                  | test=lseek2                                                                         |
|                  | ucode=0x5003006                                                                     |
+------------------+-------------------------------------------------------------------------------------+




Details are as below:
-------------------------------------------------------------------------------------------------->


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        bin/lkp install                job.yaml  # job file is attached in this email
        bin/lkp split-job --compatible job.yaml  # generate the yaml file for lkp run
        bin/lkp run                    generated-yaml-file

=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
  gcc-9/performance/x86_64-rhel-8.3/thread/50%/debian-10.4-x86_64-20200603.cgz/lkp-csl-2sp9/lseek1/will-it-scale/0x5003006

commit: 
  39218ff4c6 ("stack: Optionally randomize kernel stack offset each syscall")
  fe950f6020 ("x86/entry: Enable random_kstack_offset support")

39218ff4c625dbf2 fe950f6020338c8ac668ef823bb 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
 3.039e+08            +5.2%  3.197e+08        will-it-scale.44.threads
   6907369            +5.2%    7265144        will-it-scale.per_thread_ops
 3.039e+08            +5.2%  3.197e+08        will-it-scale.workload
      5861 ± 28%     +59.2%       9333 ± 32%  proc-vmstat.numa_hint_faults_local
     38898 ± 96%     -70.1%      11622 ±190%  numa-meminfo.node0.Active
     38725 ± 96%     -70.4%      11481 ±192%  numa-meminfo.node0.Active(anon)
      9681 ± 96%     -70.4%       2865 ±192%  numa-vmstat.node0.nr_active_anon
      9681 ± 96%     -70.4%       2865 ±192%  numa-vmstat.node0.nr_zone_active_anon
     35732 ± 11%     -34.2%      23499 ± 18%  softirqs.CPU57.SCHED
     32110 ± 10%     -40.0%      19252 ± 43%  softirqs.CPU84.SCHED
    223.17 ± 14%     -27.0%     162.83 ± 18%  interrupts.CPU13.RES:Rescheduling_interrupts
      7858           -34.7%       5132 ± 37%  interrupts.CPU15.NMI:Non-maskable_interrupts
      7858           -34.7%       5132 ± 37%  interrupts.CPU15.PMI:Performance_monitoring_interrupts
      0.01 ± 24%    -100.0%       0.00        perf-sched.sch_delay.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.02 ± 46%     -45.0%       0.01 ± 15%  perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
      0.03 ±  7%    -100.0%       0.00        perf-sched.wait_and_delay.avg.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
    459.17 ± 12%    -100.0%       0.00        perf-sched.wait_and_delay.count.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.57 ±115%    -100.0%       0.00        perf-sched.wait_and_delay.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.03 ±  7%    -100.0%       0.00        perf-sched.wait_time.avg.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.57 ±115%    -100.0%       0.00        perf-sched.wait_time.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
 3.728e+10 ±  9%     +13.0%  4.213e+10 ±  2%  perf-stat.i.dTLB-loads
 2.474e+10 ±  9%     +13.8%  2.816e+10 ±  2%  perf-stat.i.dTLB-stores
   1827406            -4.0%    1754013        perf-stat.i.iTLB-loads
 1.244e+11 ±  9%     +13.1%  1.407e+11 ±  2%  perf-stat.i.instructions
      1.02 ±  7%     +11.6%       1.14        perf-stat.i.ipc
      1026 ±  9%     +13.1%       1160 ±  2%  perf-stat.i.metric.M/sec
      0.93            -6.5%       0.87        perf-stat.overall.cpi
      0.00 ±  2%      -0.0        0.00        perf-stat.overall.dTLB-store-miss-rate%
    398.77            +1.6%     404.98        perf-stat.overall.instructions-per-iTLB-miss
      1.08            +7.0%       1.15        perf-stat.overall.ipc
     86.31            -2.1       84.17        perf-stat.overall.node-load-miss-rate%
    131961            +1.7%     134154        perf-stat.overall.path-length
 3.718e+10 ±  9%     +13.0%    4.2e+10 ±  2%  perf-stat.ps.dTLB-loads
 2.467e+10 ±  9%     +13.8%  2.807e+10 ±  2%  perf-stat.ps.dTLB-stores
   1821416            -4.0%    1748171        perf-stat.ps.iTLB-loads
 1.241e+11 ±  9%     +13.0%  1.402e+11 ±  2%  perf-stat.ps.instructions
 4.011e+13            +6.9%  4.288e+13        perf-stat.total.instructions
     36.32            -4.9       31.44 ± 10%  perf-profile.calltrace.cycles-pp.ksys_lseek.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_lseek64
     46.21            -4.6       41.65 ± 10%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__libc_lseek64
     21.18            -4.2       16.96 ± 10%  perf-profile.calltrace.cycles-pp.__fdget_pos.ksys_lseek.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_lseek64
     11.48            -3.9        7.59 ±  9%  perf-profile.calltrace.cycles-pp.__fget_light.__fdget_pos.ksys_lseek.do_syscall_64.entry_SYSCALL_64_after_hwframe
      6.88            -0.8        6.11 ±  9%  perf-profile.calltrace.cycles-pp.__fget_files.__fget_light.__fdget_pos.ksys_lseek.do_syscall_64
      2.44            -0.4        2.02 ± 10%  perf-profile.calltrace.cycles-pp.shmem_file_llseek.ksys_lseek.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_lseek64
      0.00            +1.5        1.46 ±  9%  perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_lseek64
      0.00            +2.8        2.79 ± 10%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_lseek64
     36.85            -4.7       32.16 ± 10%  perf-profile.children.cycles-pp.ksys_lseek
     46.46            -4.4       42.09 ± 10%  perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     21.59            -4.2       17.41 ± 10%  perf-profile.children.cycles-pp.__fdget_pos
     11.85            -3.9        7.94 ±  9%  perf-profile.children.cycles-pp.__fget_light
      7.17            -0.7        6.46 ±  9%  perf-profile.children.cycles-pp.__fget_files
      2.44            -0.4        2.02 ± 10%  perf-profile.children.cycles-pp.shmem_file_llseek
      0.28 ±  3%      +0.1        0.36 ± 12%  perf-profile.children.cycles-pp.rcu_read_unlock_strict
      0.30 ±  4%      +0.7        0.99 ±148%  perf-profile.children.cycles-pp.update_process_times
      4.60            -3.2        1.39 ± 10%  perf-profile.self.cycles-pp.__fget_light
      6.80            -0.7        6.06 ±  9%  perf-profile.self.cycles-pp.__fget_files
      2.25            -0.5        1.78 ± 10%  perf-profile.self.cycles-pp.shmem_file_llseek
      1.26            -0.2        1.06 ± 10%  perf-profile.self.cycles-pp.syscall_enter_from_user_mode
      0.62 ±  4%      -0.1        0.55 ± 10%  perf-profile.self.cycles-pp.testcase
      0.22 ±  2%      -0.0        0.19 ±  9%  perf-profile.self.cycles-pp.__x64_sys_lseek
      0.14 ±  4%      +0.0        0.18 ± 12%  perf-profile.self.cycles-pp.rcu_read_unlock_strict
      0.73            +0.6        1.30 ± 10%  perf-profile.self.cycles-pp.do_syscall_64
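
A note on reading the %change column above, with a small sketch to check it. Rows carrying a percent sign are relative changes against the parent commit. Rows without one (the perf-profile rows and the *-rate% rows) appear to be absolute differences in percentage points; that reading is an inference from the numbers in this report, not something the report states. The function names are hypothetical.

```python
def pct_change(base, new):
    """Relative change of new against base, in percent."""
    return (new - base) / base * 100.0


def point_delta(base, new):
    """Absolute difference, for metrics that are already percentages."""
    return new - base


# will-it-scale.per_thread_ops: reported as +5.2%
print(round(pct_change(6907369, 7265144), 1))

# perf-stat.overall.cpi: reported as -6.5%
print(round(pct_change(0.93, 0.87), 1))

# perf-profile.calltrace.cycles-pp.ksys_lseek...: reported as -4.9
print(round(point_delta(36.32, 31.44), 1))
```

Recomputing from the rounded values shown in the table will not always match the reported change exactly, since the report presumably derives it from unrounded averages.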


                                                                                
                               will-it-scale.44.threads                         
                                                                                
  3.5e+08 +-----------------------------------------------------------------+   
          |O  O OO OOO  O OO OO OOO OO OO OOO OO OO OOO OO OO OO OOO OO OO O|   
    3e+08 |+.++.++.+++.++.++.++.+++.++.++.+++.++.++.+++.++.++.++.+++.++.++  |   
          |                                                                 |   
  2.5e+08 |-+                                                               |   
          |                                                                 |   
    2e+08 |-+                                                               |   
          |                                                                 |   
  1.5e+08 |-+                                                               |   
          |                                                                 |   
    1e+08 |-+                                                               |   
          |                                                                 |   
    5e+07 |-+                                                               |   
          |                                                                 |   
        0 +-----------------------------------------------------------------+   
                                                                                
                                                                                                                                                                
                            will-it-scale.per_thread_ops                        
                                                                                
  8e+06 +-------------------------------------------------------------------+   
        |O  O OO OO O  OO OO OO OO OO OO OOO OO OO OO OO OO OO OO OO OO OO O|   
  7e+06 |+.++.++.++.++.++.++.++.++.++.++.+++.++.++.++.++.++.++.++.++.++.++  |   
  6e+06 |-+                                                                 |   
        |                                                                   |   
  5e+06 |-+                                                                 |   
        |                                                                   |   
  4e+06 |-+                                                                 |   
        |                                                                   |   
  3e+06 |-+                                                                 |   
  2e+06 |-+                                                                 |   
        |                                                                   |   
  1e+06 |-+                                                                 |   
        |                                                                   |   
      0 +-------------------------------------------------------------------+   
                                                                                
                                                                                                                                                                
                                will-it-scale.workload                          
                                                                                
  3.5e+08 +-----------------------------------------------------------------+   
          |O  O OO OOO  O OO OO OOO OO OO OOO OO OO OOO OO OO OO OOO OO OO O|   
    3e+08 |+.++.++.+++.++.++.++.+++.++.++.+++.++.++.+++.++.++.++.+++.++.++  |   
          |                                                                 |   
  2.5e+08 |-+                                                               |   
          |                                                                 |   
    2e+08 |-+                                                               |   
          |                                                                 |   
  1.5e+08 |-+                                                               |   
          |                                                                 |   
    1e+08 |-+                                                               |   
          |                                                                 |   
    5e+07 |-+                                                               |   
          |                                                                 |   
        0 +-----------------------------------------------------------------+   
                                                                                
                                                                                                                                                                
                                                                                
                                                                                
  0.045 +-------------------------------------------------------------------+   
        |                                                                   |   
        |                                      +                            |   
   0.04 |-+                                    :                            |   
        |                                      :                            |   
        |                                     ::                            |   
  0.035 |-+                                   ::                            |   
        |                                     : :  +                        |   
   0.03 |-+      +                            : :  ::                       |   
        |        :: +                         : :  ::   +   .+   +.         |   
        |     +  :: :+           +        +  :  : :  + + +.+  +.+  +     ++.|   
  0.025 |-+   :+:  :  +   .+    + :.+    + + :  +.:   +             +.  +   |   
        |+ +.+  +  +   +.+  :.++  +  :.++   ++    +                   ++    |   
        | +                 +        +                                      |   
   0.02 +-------------------------------------------------------------------+   
                                                                                
                                                                                                                                                                
                                                                                
                                                                                
  0.013 +-------------------------------------------------------------------+   
        |                                                    :   :          |   
  0.012 |-+      +                                          ::   :    +     |   
  0.011 |-+      ::                                         ::  : :   ::    |   
        |.+     + :  .+     +.    +. +.     +          .+  : :  : :   :+.+  |   
   0.01 |-++.+ :   ++  +.+.+  ++  : +  ++  : +    ++  +  +.+  : + :   :  :  |   
        |     ::                +:       + : :    : + :       : : :  :   :  |   
  0.009 |-+   +                  +        +   :  :   +        ::  :  :    ::|   
        |                                     :  :            ::   : :    ::|   
  0.008 |-+                                    : :            :    : :    + |   
  0.007 |-+                                    + :            +    + :      |   
        |                                       :                  : :      |   
  0.006 |-+                                     +                   :       |   
        |                                                           :       |   
  0.005 +-------------------------------------------------------------------+   
                                                                                
                                                                                                                                                                
                                                                                
                                                                                
  600 +---------------------------------------------------------------------+   
      |                                                +    +.   +          |   
      |            +                                   :+   : + + +         |   
  550 |-+   +.+    :           +.+      .+.+   +.   +.+  +.+   +  :        .|   
      |  +.+   +   ::   +   +.+   :   ++    :  : +. :              :      + |   
      |  :      + : +   :   :     :   :     +  :   +               :      : |   
  500 |-+:      : :  : : :  :      :  :      +:                     :     : |   
      |  :      : :  : : +  :      + :        +                     +     : |   
  450 |-+:      : :   +  :  :      : :                               :   :  |   
      |: :       ::       : :      : :                               :   :  |   
      |::        :        ::       : :                                :  :  |   
  400 |::        :        ::        ::                                ++.+  |   
      |::        +        ::        :                                       |   
      | :                  :        :                                       |   
  350 +---------------------------------------------------------------------+   
                                                                                
                                                                                                                                                                
                                                                                
                                                                                
  0.045 +-------------------------------------------------------------------+   
        |                                                                   |   
        |                                      +                            |   
   0.04 |-+                                    :                            |   
        |                                      :                            |   
        |                                     ::                            |   
  0.035 |-+                                   ::                            |   
        |                                     : :  +                        |   
   0.03 |-+      +                            : :  ::                       |   
        |        :: +                         : :  ::   +   .+   +.         |   
        |     +  :: :+           +        +  :  : :  + + +.+  +.+  +     ++.|   
  0.025 |-+   :+:  :  +   .+    + :.+    + + :  +.:   +             +.  +   |   
        |+ +.+  +  +   +.+  :.++  +  :.++   ++    +                   ++    |   
        | +                 +        +                                      |   
   0.02 +-------------------------------------------------------------------+   
                                                                                
                                                                                
[*] bisect-good sample
[O] bisect-bad  sample

***************************************************************************************************
lkp-csl-2sp9: 88 threads 2 sockets Intel(R) Xeon(R) Gold 6238M CPU @ 2.10GHz with 128G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
  gcc-9/performance/x86_64-rhel-8.3/thread/50%/debian-10.4-x86_64-20200603.cgz/lkp-csl-2sp9/lseek2/will-it-scale/0x5003006

commit: 
  39218ff4c6 ("stack: Optionally randomize kernel stack offset each syscall")
  fe950f6020 ("x86/entry: Enable random_kstack_offset support")

39218ff4c625dbf2 fe950f6020338c8ac668ef823bb 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
 3.043e+08            +5.5%  3.212e+08        will-it-scale.44.threads
   6915852            +5.5%    7299206        will-it-scale.per_thread_ops
 3.043e+08            +5.5%  3.212e+08        will-it-scale.workload
     45.50 ±178%     -99.3%       0.33 ±223%  interrupts.109:PCI-MSI.31981642-edge.i40e-eth0-TxRx-73
      1318 ± 17%     -27.1%     961.00 ± 23%  interrupts.CPU17.CAL:Function_call_interrupts
      4441 ± 55%     +56.4%       6947 ± 14%  interrupts.CPU27.NMI:Non-maskable_interrupts
      4441 ± 55%     +56.4%       6947 ± 14%  interrupts.CPU27.PMI:Performance_monitoring_interrupts
    969.67 ± 21%     +44.8%       1404 ± 15%  interrupts.CPU52.CAL:Function_call_interrupts
      1048 ± 17%     +28.6%       1348 ±  9%  interrupts.CPU70.CAL:Function_call_interrupts
     45.00 ±179%     -99.6%       0.17 ±223%  interrupts.CPU73.109:PCI-MSI.31981642-edge.i40e-eth0-TxRx-73
      4684 ± 47%     +52.1%       7122 ± 14%  interrupts.CPU86.NMI:Non-maskable_interrupts
      4684 ± 47%     +52.1%       7122 ± 14%  interrupts.CPU86.PMI:Performance_monitoring_interrupts
      0.02 ±  7%    -100.0%       0.00        perf-sched.wait_and_delay.avg.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
    466.83 ±  6%    -100.0%       0.00        perf-sched.wait_and_delay.count.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.12 ± 35%    -100.0%       0.00        perf-sched.wait_and_delay.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
    440.02 ±222%     -99.7%       1.54 ± 14%  perf-sched.wait_time.avg.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64
    440.03 ±222%     -99.6%       1.55 ± 14%  perf-sched.wait_time.avg.ms.do_syslog.part.0.kmsg_read.vfs_read
      0.02 ±  7%    -100.0%       0.00        perf-sched.wait_time.avg.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      1318 ±222%     -99.8%       3.08 ± 14%  perf-sched.wait_time.max.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64
      1318 ±222%     -99.8%       3.10 ± 14%  perf-sched.wait_time.max.ms.do_syslog.part.0.kmsg_read.vfs_read
      0.12 ± 35%    -100.0%       0.00        perf-sched.wait_time.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
     13627 ±  8%     +30.1%      17730 ± 20%  softirqs.CPU2.RCU
     13987 ±  8%     +24.5%      17410 ± 15%  softirqs.CPU20.RCU
     11774 ± 11%     +32.8%      15633 ± 22%  softirqs.CPU3.RCU
     12955 ±  8%     +34.2%      17380 ± 13%  softirqs.CPU30.RCU
     13872 ±  3%     +23.5%      17126 ± 14%  softirqs.CPU31.RCU
     14367 ±  6%     +13.4%      16292 ±  9%  softirqs.CPU36.RCU
     12724 ± 10%     +31.0%      16667 ±  5%  softirqs.CPU37.RCU
     12093 ±  8%     +20.4%      14559 ± 10%  softirqs.CPU63.RCU
     12010 ±  9%     +22.6%      14726 ±  9%  softirqs.CPU67.RCU
     21091 ± 32%     +50.0%      31645 ± 20%  softirqs.CPU74.SCHED
     13024 ±  8%     +28.1%      16685 ± 14%  softirqs.CPU9.RCU
     35.56 ±  2%      -6.2       29.33 ±  9%  perf-profile.calltrace.cycles-pp.ksys_lseek.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_lseek64
     45.14            -6.2       38.95 ±  9%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__libc_lseek64
     20.70 ±  2%      -4.7       15.96 ±  9%  perf-profile.calltrace.cycles-pp.__fdget_pos.ksys_lseek.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_lseek64
     11.24 ±  2%      -4.4        6.87 ±  9%  perf-profile.calltrace.cycles-pp.__fget_light.__fdget_pos.ksys_lseek.do_syscall_64.entry_SYSCALL_64_after_hwframe
      6.74 ±  2%      -1.1        5.61 ±  9%  perf-profile.calltrace.cycles-pp.__fget_files.__fget_light.__fdget_pos.ksys_lseek.do_syscall_64
      2.41 ±  3%      -0.7        1.71 ±  9%  perf-profile.calltrace.cycles-pp.shmem_file_llseek.ksys_lseek.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_lseek64
      0.00            +1.4        1.36 ±  8%  perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_lseek64
      0.00            +2.6        2.59 ±  9%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_lseek64
     36.05 ±  2%      -6.1       29.96 ±  9%  perf-profile.children.cycles-pp.ksys_lseek
     45.41            -6.0       39.36 ±  9%  perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     21.10 ±  2%      -4.7       16.36 ±  9%  perf-profile.children.cycles-pp.__fdget_pos
     11.59 ±  2%      -4.4        7.20 ±  9%  perf-profile.children.cycles-pp.__fget_light
      7.05 ±  2%      -1.1        5.95 ±  9%  perf-profile.children.cycles-pp.__fget_files
      2.41 ±  3%      -0.7        1.71 ±  9%  perf-profile.children.cycles-pp.shmem_file_llseek
      0.34 ±  4%      -0.0        0.30 ±  8%  perf-profile.children.cycles-pp.update_process_times
      4.45 ±  2%      -3.3        1.18 ± 10%  perf-profile.self.cycles-pp.__fget_light
      6.67 ±  2%      -1.1        5.57 ±  9%  perf-profile.self.cycles-pp.__fget_files
      2.21 ±  3%      -0.7        1.51 ±  8%  perf-profile.self.cycles-pp.shmem_file_llseek
      1.26            -0.2        1.02 ±  8%  perf-profile.self.cycles-pp.syscall_enter_from_user_mode
      0.71            +0.5        1.24 ±  9%  perf-profile.self.cycles-pp.do_syscall_64
  3.05e+10            +5.8%  3.227e+10        perf-stat.i.branch-instructions
 3.079e+08            +4.5%  3.218e+08        perf-stat.i.branch-misses
 3.998e+10            +6.1%  4.244e+10        perf-stat.i.dTLB-loads
     44220            +6.5%      47109 ±  3%  perf-stat.i.dTLB-store-misses
 2.649e+10            +7.2%  2.839e+10        perf-stat.i.dTLB-stores
 2.976e+08            +3.9%  3.093e+08        perf-stat.i.iTLB-load-misses
   1824718            -4.2%    1748236        perf-stat.i.iTLB-loads
 1.334e+11            +6.4%  1.419e+11        perf-stat.i.instructions
    448.74            +2.4%     459.53        perf-stat.i.instructions-per-iTLB-miss
      1.08            +6.7%       1.15        perf-stat.i.ipc
      1101            +6.3%       1171        perf-stat.i.metric.M/sec
      1.01            -0.0        1.00        perf-stat.overall.branch-miss-rate%
      0.93            -6.7%       0.86        perf-stat.overall.cpi
    448.14            +2.4%     458.89        perf-stat.overall.instructions-per-iTLB-miss
      1.08            +7.2%       1.16        perf-stat.overall.ipc
    131961            +1.7%     134156        perf-stat.overall.path-length
  3.04e+10            +5.8%  3.216e+10        perf-stat.ps.branch-instructions
 3.069e+08            +4.5%  3.208e+08        perf-stat.ps.branch-misses
 3.985e+10            +6.1%   4.23e+10        perf-stat.ps.dTLB-loads
     44075            +6.6%      46980 ±  3%  perf-stat.ps.dTLB-store-misses
  2.64e+10            +7.2%  2.829e+10        perf-stat.ps.dTLB-stores
 2.967e+08            +3.9%  3.083e+08        perf-stat.ps.iTLB-load-misses
   1818554            -4.2%    1742364        perf-stat.ps.iTLB-loads
 1.329e+11            +6.4%  1.415e+11        perf-stat.ps.instructions
 4.016e+13            +7.3%  4.309e+13        perf-stat.total.instructions



***************************************************************************************************
lkp-csl-2sp9: 88 threads 2 sockets Intel(R) Xeon(R) Gold 6238M CPU @ 2.10GHz with 128G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
  gcc-9/performance/x86_64-rhel-8.3/process/50%/debian-10.4-x86_64-20200603.cgz/lkp-csl-2sp9/lseek2/will-it-scale/0x5003006

commit: 
  39218ff4c6 ("stack: Optionally randomize kernel stack offset each syscall")
  fe950f6020 ("x86/entry: Enable random_kstack_offset support")

39218ff4c625dbf2 fe950f6020338c8ac668ef823bb 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
 4.093e+08            +7.8%   4.41e+08        will-it-scale.44.processes
   9301344            +7.8%   10022385        will-it-scale.per_process_ops
 4.093e+08            +7.8%   4.41e+08        will-it-scale.workload
    126.29 ± 32%     +77.8%     224.57 ± 22%  interrupts.CPU0.RES:Rescheduling_interrupts
     15.17            +1.5       16.68        mpstat.cpu.all.usr%
    262.82            +2.5%     269.31        turbostat.PkgWatt
     15.00            +6.7%      16.00        vmstat.cpu.us
     15940 ± 11%     +36.9%      21829 ± 23%  softirqs.CPU20.RCU
     14633 ± 13%     +36.8%      20013 ± 17%  softirqs.CPU39.RCU
     17267 ± 13%     +27.1%      21954 ± 19%  softirqs.CPU43.RCU
      0.01 ± 16%    -100.0%       0.00        perf-sched.sch_delay.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.03 ± 10%    -100.0%       0.00        perf-sched.wait_and_delay.avg.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
    529.73 ± 10%     -21.5%     415.67 ± 19%  perf-sched.wait_and_delay.avg.ms.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0.do_sys_poll
    567.00 ±  5%    -100.0%       0.00        perf-sched.wait_and_delay.count.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.57 ±179%    -100.0%       0.00        perf-sched.wait_and_delay.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.03 ± 10%    -100.0%       0.00        perf-sched.wait_time.avg.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
    529.73 ± 10%     -21.5%     415.67 ± 19%  perf-sched.wait_time.avg.ms.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0.do_sys_poll
      0.57 ±179%    -100.0%       0.00        perf-sched.wait_time.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
     12.09            -6.8        5.29 ± 12%  perf-profile.calltrace.cycles-pp.__fdget_pos.ksys_lseek.do_syscall_64.entry_SYSCALL_64_after_hwframe.llseek
     11.05            -6.8        4.29 ± 12%  perf-profile.calltrace.cycles-pp.__fget_light.__fdget_pos.ksys_lseek.do_syscall_64.entry_SYSCALL_64_after_hwframe
     21.28            -6.4       14.86 ± 12%  perf-profile.calltrace.cycles-pp.ksys_lseek.do_syscall_64.entry_SYSCALL_64_after_hwframe.llseek
     35.29            -5.3       30.00 ± 12%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.llseek
      1.41 ±  2%      -0.3        1.09 ± 14%  perf-profile.calltrace.cycles-pp.testcase
      0.00            +1.1        1.09 ± 13%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.llseek
      0.00            +3.1        3.10 ± 11%  perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.llseek
      0.00            +5.2        5.19 ± 11%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.llseek
     12.69            -6.9        5.83 ± 12%  perf-profile.children.cycles-pp.__fdget_pos
     11.06            -6.8        4.29 ± 12%  perf-profile.children.cycles-pp.__fget_light
     21.71            -6.5       15.25 ± 12%  perf-profile.children.cycles-pp.ksys_lseek
     35.62            -5.1       30.57 ± 12%  perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
      1.54 ±  2%      -0.4        1.18 ± 13%  perf-profile.children.cycles-pp.testcase
      1.36            -0.3        1.09 ± 13%  perf-profile.children.cycles-pp.syscall_exit_to_user_mode_prepare
      0.70 ±  2%      -0.2        0.48 ± 13%  perf-profile.children.cycles-pp.rcu_nocb_flush_deferred_wakeup
      2.49 ±  2%      +0.8        3.30 ± 11%  perf-profile.children.cycles-pp.exit_to_user_mode_prepare
     10.62            -6.7        3.93 ± 12%  perf-profile.self.cycles-pp.__fget_light
      1.23 ±  2%      -0.3        0.89 ± 13%  perf-profile.self.cycles-pp.testcase
      1.35            -0.3        1.08 ± 12%  perf-profile.self.cycles-pp.syscall_exit_to_user_mode_prepare
      1.72            -0.3        1.45 ± 12%  perf-profile.self.cycles-pp.syscall_enter_from_user_mode
      0.45 ±  4%      -0.1        0.37 ± 13%  perf-profile.self.cycles-pp.rcu_nocb_flush_deferred_wakeup
      1.79 ±  2%      +0.9        2.69 ± 10%  perf-profile.self.cycles-pp.exit_to_user_mode_prepare
      1.29            +1.0        2.32 ± 12%  perf-profile.self.cycles-pp.do_syscall_64
 2.753e+10            +9.2%  3.007e+10        perf-stat.i.branch-instructions
      1.50            -0.0        1.48        perf-stat.i.branch-miss-rate%
 4.131e+08            +7.6%  4.446e+08        perf-stat.i.branch-misses
      0.94            -9.1%       0.86        perf-stat.i.cpi
 3.984e+10            +9.8%  4.376e+10        perf-stat.i.dTLB-loads
     61463            +9.2%      67090        perf-stat.i.dTLB-store-misses
 2.621e+10           +11.1%  2.912e+10        perf-stat.i.dTLB-stores
 3.961e+08            +7.4%  4.255e+08        perf-stat.i.iTLB-load-misses
   1836428            -3.3%    1776535        perf-stat.i.iTLB-loads
 1.311e+11           +10.0%  1.442e+11        perf-stat.i.instructions
    333.07            +3.4%     344.51        perf-stat.i.instructions-per-iTLB-miss
      1.06           +10.0%       1.17        perf-stat.i.ipc
      1063           +10.0%       1169        perf-stat.i.metric.M/sec
      1.50            -0.0        1.48        perf-stat.overall.branch-miss-rate%
      0.94            -9.1%       0.86        perf-stat.overall.cpi
    331.01            +2.4%     338.86        perf-stat.overall.instructions-per-iTLB-miss
      1.06           +10.0%       1.17        perf-stat.overall.ipc
     96478            +2.1%      98513        perf-stat.overall.path-length
 2.744e+10            +9.2%  2.997e+10        perf-stat.ps.branch-instructions
 4.117e+08            +7.6%  4.432e+08        perf-stat.ps.branch-misses
 3.971e+10            +9.8%  4.361e+10        perf-stat.ps.dTLB-loads
     61264            +9.2%      66871        perf-stat.ps.dTLB-store-misses
 2.613e+10           +11.1%  2.902e+10        perf-stat.ps.dTLB-stores
 3.947e+08            +7.4%  4.241e+08        perf-stat.ps.iTLB-load-misses
   1830250            -3.3%    1770517        perf-stat.ps.iTLB-loads
 1.307e+11           +10.0%  1.437e+11        perf-stat.ps.instructions
 3.948e+13           +10.0%  4.344e+13        perf-stat.total.instructions
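
As a reading aid for the comparison tables: for throughput and perf-stat counters, the %change column appears to be the relative difference between the mean on the base commit (39218ff4c6, left) and the mean on the patched commit (fe950f6020, right). The Python sketch below rechecks two rows of the table above under that assumption. The helper name `pct_change` is hypothetical and is not part of the LKP tooling.

```python
def pct_change(base, patched):
    """Relative change of the patched mean versus the base mean, in percent."""
    return (patched - base) / base * 100.0


# Values copied from the lseek2 / process / 50% table above.
per_process_ops = pct_change(9301344, 10022385)
total_instructions = pct_change(3.948e13, 4.344e13)

print(f"will-it-scale.per_process_ops: {per_process_ops:+.1f}%")
print(f"perf-stat.total.instructions:  {total_instructions:+.1f}%")
```

Both values round to the figures printed in the report (+7.8% and +10.0%).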



***************************************************************************************************
lkp-csl-2sp9: 88 threads 2 sockets Intel(R) Xeon(R) Gold 6238M CPU @ 2.10GHz with 128G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
  gcc-9/performance/x86_64-rhel-8.3/thread/16/debian-10.4-x86_64-20200603.cgz/lkp-csl-2sp9/lseek2/will-it-scale/0x5003006

commit: 
  39218ff4c6 ("stack: Optionally randomize kernel stack offset each syscall")
  fe950f6020 ("x86/entry: Enable random_kstack_offset support")

39218ff4c625dbf2 fe950f6020338c8ac668ef823bb 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
 1.107e+08            +5.5%  1.168e+08        will-it-scale.16.threads
   6920338            +5.5%    7300711        will-it-scale.per_thread_ops
 1.107e+08            +5.5%  1.168e+08        will-it-scale.workload
 2.421e+08 ±158%     -99.1%    2058964 ±  5%  cpuidle.C1.time
    166.58            +1.3%     168.78        turbostat.PkgWatt
      1510 ± 10%     +12.7%       1703 ±  4%  slabinfo.khugepaged_mm_slot.active_objs
      1510 ± 10%     +12.7%       1703 ±  4%  slabinfo.khugepaged_mm_slot.num_objs
     41752           +14.6%      47840 ± 12%  softirqs.CPU23.SCHED
     23632 ±  9%     +23.9%      29282 ±  8%  softirqs.CPU47.SCHED
     24724 ± 17%     +26.0%      31150 ±  7%  softirqs.CPU52.SCHED
     19277 ± 17%     -26.8%      14106 ± 21%  softirqs.CPU8.SCHED
     10.53 ± 10%      -3.6        6.97 ± 12%  perf-profile.calltrace.cycles-pp.__fget_light.__fdget_pos.ksys_lseek.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.00            +1.4        1.39 ± 10%  perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_lseek64
      0.00            +2.6        2.63 ± 10%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_lseek64
     10.87 ± 10%      -3.6        7.32 ± 12%  perf-profile.children.cycles-pp.__fget_light
      0.17 ± 17%      -0.1        0.11 ± 18%  perf-profile.children.cycles-pp.clockevents_program_event
      0.10 ± 12%      -0.1        0.05 ± 73%  perf-profile.children.cycles-pp.tick_nohz_get_sleep_length
      4.17 ± 10%      -3.0        1.19 ± 11%  perf-profile.self.cycles-pp.__fget_light
      0.68 ± 11%      +0.6        1.23 ±  9%  perf-profile.self.cycles-pp.do_syscall_64
      0.02 ± 54%     -84.2%       0.00 ± 85%  perf-sched.sch_delay.avg.ms.schedule_timeout.wait_for_completion.__flush_work.lru_add_drain_all
      0.02 ±132%    -100.0%       0.00        perf-sched.sch_delay.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.03 ± 43%     -61.2%       0.01 ± 40%  perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0.do_sys_poll
      0.03 ± 64%    -100.0%       0.00        perf-sched.wait_and_delay.avg.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
    111.83 ± 32%    -100.0%       0.00        perf-sched.wait_and_delay.count.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.87 ±209%    -100.0%       0.00        perf-sched.wait_and_delay.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.03 ± 65%    -100.0%       0.00        perf-sched.wait_time.avg.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.86 ±211%    -100.0%       0.00        perf-sched.wait_time.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
    169.67 ± 27%    +105.3%     348.33 ± 22%  interrupts.CPU3.TLB:TLB_shootdowns
    146.67 ± 22%     -51.2%      71.50 ± 26%  interrupts.CPU37.NMI:Non-maskable_interrupts
    146.67 ± 22%     -51.2%      71.50 ± 26%  interrupts.CPU37.PMI:Performance_monitoring_interrupts
    130.17 ± 29%     -44.4%      72.33 ± 30%  interrupts.CPU38.NMI:Non-maskable_interrupts
    130.17 ± 29%     -44.4%      72.33 ± 30%  interrupts.CPU38.PMI:Performance_monitoring_interrupts
    155.17 ± 30%    +114.4%     332.67 ± 31%  interrupts.CPU4.TLB:TLB_shootdowns
     42.67 ± 92%     -83.6%       7.00 ± 82%  interrupts.CPU48.RES:Rescheduling_interrupts
     53.17 ± 95%     -88.7%       6.00 ± 36%  interrupts.CPU52.RES:Rescheduling_interrupts
     80.50 ±210%   +1302.5%       1129 ±162%  interrupts.CPU62.RES:Rescheduling_interrupts
    138.00 ± 22%     -47.9%      71.83 ± 53%  interrupts.CPU70.NMI:Non-maskable_interrupts
    138.00 ± 22%     -47.9%      71.83 ± 53%  interrupts.CPU70.PMI:Performance_monitoring_interrupts
    117.83 ± 24%     -45.8%      63.83 ± 30%  interrupts.CPU80.NMI:Non-maskable_interrupts
    117.83 ± 24%     -45.8%      63.83 ± 30%  interrupts.CPU80.PMI:Performance_monitoring_interrupts
    120.00 ± 26%     -46.2%      64.50 ± 28%  interrupts.CPU81.NMI:Non-maskable_interrupts
    120.00 ± 26%     -46.2%      64.50 ± 28%  interrupts.CPU81.PMI:Performance_monitoring_interrupts
  1.12e+10            +6.6%  1.194e+10        perf-stat.i.branch-instructions
      1.08 ±  2%      -0.1        1.02 ±  2%  perf-stat.i.branch-miss-rate%
      0.96            -8.0%       0.89        perf-stat.i.cpi
 1.467e+10            +6.8%  1.567e+10        perf-stat.i.dTLB-loads
 9.704e+09            +8.0%  1.048e+10        perf-stat.i.dTLB-stores
 1.085e+08            +5.2%  1.141e+08        perf-stat.i.iTLB-load-misses
 4.903e+10            +7.2%  5.257e+10        perf-stat.i.instructions
    453.23            +1.8%     461.58        perf-stat.i.instructions-per-iTLB-miss
      1.04            +8.3%       1.13        perf-stat.i.ipc
    404.23            +7.1%     432.74        perf-stat.i.metric.M/sec
      1.07 ±  2%      -0.1        1.02 ±  2%  perf-stat.overall.branch-miss-rate%
      0.96            -7.6%       0.89        perf-stat.overall.cpi
    451.97            +1.9%     460.75        perf-stat.overall.instructions-per-iTLB-miss
      1.04            +8.2%       1.13        perf-stat.overall.ipc
    133400            +1.6%     135521        perf-stat.overall.path-length
 1.116e+10            +6.6%   1.19e+10        perf-stat.ps.branch-instructions
 1.462e+10            +6.8%  1.561e+10        perf-stat.ps.dTLB-loads
 9.672e+09            +8.0%  1.044e+10        perf-stat.ps.dTLB-stores
 1.081e+08            +5.2%  1.137e+08        perf-stat.ps.iTLB-load-misses
 4.886e+10            +7.2%  5.239e+10        perf-stat.ps.instructions
 1.477e+13            +7.2%  1.583e+13        perf-stat.total.instructions
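
Note that the perf-profile rows do not seem to use the same convention as the throughput rows: their middle column looks like an absolute difference in percentage points of sampled cycles, not a relative change. The sketch below checks that reading against two `__fget_light` rows from the table above. The helper name `point_delta` is hypothetical, and the inputs are the rounded means as printed, so only one decimal place is meaningful.

```python
def point_delta(base_pct, patched_pct):
    """Absolute difference in percentage points between two profile shares."""
    return patched_pct - base_pct


# perf-profile.children.cycles-pp.__fget_light (lseek2 / thread / 16)
children_delta = point_delta(10.87, 7.32)
# perf-profile.self.cycles-pp.__fget_light (lseek2 / thread / 16)
self_delta = point_delta(4.17, 1.19)

print(f"children __fget_light: {children_delta:+.1f} points")
print(f"self     __fget_light: {self_delta:+.1f} points")
```

Read this way, a row such as "4.17 -3.0 1.19" says the function's own share of cycles fell by about three points, which is a much larger relative drop than the number alone suggests.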



***************************************************************************************************
lkp-csl-2ap2: 192 threads 4 sockets Intel(R) Xeon(R) Platinum 9242 CPU @ 2.30GHz with 192G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
  gcc-9/performance/x86_64-rhel-8.3/process/50%/debian-10.4-x86_64-20200603.cgz/lkp-csl-2ap2/dup1/will-it-scale/0x5003006

commit: 
  39218ff4c6 ("stack: Optionally randomize kernel stack offset each syscall")
  fe950f6020 ("x86/entry: Enable random_kstack_offset support")

39218ff4c625dbf2 fe950f6020338c8ac668ef823bb 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
 4.653e+08            -1.7%  4.575e+08        will-it-scale.96.processes
   4846984            -1.7%    4765791        will-it-scale.per_process_ops
 4.653e+08            -1.7%  4.575e+08        will-it-scale.workload
      0.84 ±  2%      +0.2        1.04 ±  7%  mpstat.cpu.all.irq%
     50.00            -2.0%      49.00        vmstat.cpu.id
    125.33 ± 57%     -69.8%      37.83 ± 83%  interrupts.CPU164.RES:Rescheduling_interrupts
      7763 ± 11%     -33.1%       5197 ± 48%  interrupts.CPU177.NMI:Non-maskable_interrupts
      7763 ± 11%     -33.1%       5197 ± 48%  interrupts.CPU177.PMI:Performance_monitoring_interrupts
      7750 ± 18%     -58.8%       3194 ± 37%  interrupts.CPU92.NMI:Non-maskable_interrupts
      7750 ± 18%     -58.8%       3194 ± 37%  interrupts.CPU92.PMI:Performance_monitoring_interrupts
     14208 ± 14%     +18.5%      16830 ±  8%  softirqs.CPU15.RCU
     25356 ± 36%     +46.5%      37157 ±  7%  softirqs.CPU164.SCHED
     11031 ± 14%     +31.3%      14481 ± 14%  softirqs.CPU169.RCU
     12602 ±  8%     +17.5%      14804 ±  6%  softirqs.CPU183.RCU
     28486 ± 25%     -44.9%      15685 ± 48%  softirqs.CPU24.SCHED
     15237 ± 12%     +23.6%      18837 ±  7%  softirqs.CPU68.RCU
      0.01 ±  9%     -16.7%       0.01 ± 11%  perf-sched.sch_delay.avg.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64
      0.01 ± 19%     +37.3%       0.01 ± 16%  perf-sched.sch_delay.avg.ms.futex_wait_queue_me.futex_wait.do_futex.__x64_sys_futex
      0.01 ± 23%    -100.0%       0.00        perf-sched.sch_delay.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.01 ± 19%     +37.3%       0.01 ± 16%  perf-sched.sch_delay.max.ms.futex_wait_queue_me.futex_wait.do_futex.__x64_sys_futex
      2.10 ±139%    -100.0%       0.00        perf-sched.wait_and_delay.avg.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      1188 ±  8%    -100.0%       0.00        perf-sched.wait_and_delay.count.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      2621 ±141%    -100.0%       0.00        perf-sched.wait_and_delay.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      2.10 ±139%    -100.0%       0.00        perf-sched.wait_time.avg.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      2621 ±141%    -100.0%       0.00        perf-sched.wait_time.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.03 ±  2%      +7.3%       0.03 ±  5%  perf-stat.i.MPKI
 5.915e+08            -1.3%  5.835e+08        perf-stat.i.branch-misses
     12.12 ±  2%      +0.9       13.00 ±  4%  perf-stat.i.cache-miss-rate%
   1200067 ±  3%     +13.7%    1364854 ±  4%  perf-stat.i.cache-misses
   9856054            +6.4%   10489701        perf-stat.i.cache-references
    281116 ±  2%     -12.7%     245501 ±  4%  perf-stat.i.cycles-between-cache-misses
   4298048            -4.2%    4118045        perf-stat.i.iTLB-loads
     74.85            +3.4%      77.39        perf-stat.i.metric.K/sec
      0.03            +5.5%       0.03        perf-stat.overall.MPKI
    241364 ±  3%     -12.1%     212085 ±  4%  perf-stat.overall.cycles-between-cache-misses
    233746            +1.8%     238022        perf-stat.overall.path-length
 5.897e+08            -1.4%  5.817e+08        perf-stat.ps.branch-misses
   1222801 ±  3%     +13.7%    1390521 ±  4%  perf-stat.ps.cache-misses
  10040040            +5.7%   10608298        perf-stat.ps.cache-references
   4283511            -4.2%    4104019        perf-stat.ps.iTLB-loads
      0.93 ±  9%      +0.2        1.14 ±  4%  perf-profile.calltrace.cycles-pp.fd_install.__x64_sys_dup.do_syscall_64.entry_SYSCALL_64_after_hwframe.dup
      0.00            +0.9        0.94 ±  5%  perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.dup
      0.00            +1.0        0.98 ±  4%  perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close
      0.00            +1.8        1.75 ±  4%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.dup
      0.00            +1.8        1.82 ±  4%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close
     11.57 ±  8%      +2.4       13.96 ±  4%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close
      0.14 ±  9%      +0.0        0.15 ±  3%  perf-profile.children.cycles-pp.perf_trace_sched_stat_runtime
      0.12 ±  9%      +0.0        0.14 ±  5%  perf-profile.children.cycles-pp.perf_prepare_sample
      0.14 ± 10%      +0.0        0.17 ±  3%  perf-profile.children.cycles-pp.update_curr
      0.14 ±  9%      +0.0        0.16 ±  6%  perf-profile.children.cycles-pp.perf_tp_event
      0.38 ± 10%      +0.1        0.45 ±  3%  perf-profile.children.cycles-pp.close@plt
      0.31 ± 11%      +0.1        0.45 ± 40%  perf-profile.children.cycles-pp.update_process_times
      0.31 ± 12%      +0.1        0.46 ± 41%  perf-profile.children.cycles-pp.tick_sched_handle
      0.32 ± 10%      +0.2        0.49 ± 50%  perf-profile.children.cycles-pp.tick_sched_timer
      0.35 ± 10%      +0.2        0.53 ± 49%  perf-profile.children.cycles-pp.__hrtimer_run_queues
      0.58 ± 11%      +1.6        2.13 ±154%  perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
     28.26 ±  8%      +5.5       33.75 ±  4%  perf-profile.children.cycles-pp.do_syscall_64
      0.92 ±  9%      +0.2        1.09 ±  4%  perf-profile.self.cycles-pp.fd_install
      0.89 ±  9%      +0.7        1.62 ±  4%  perf-profile.self.cycles-pp.do_syscall_64



***************************************************************************************************
lkp-csl-2sp9: 88 threads 2 sockets Intel(R) Xeon(R) Gold 6238M CPU @ 2.10GHz with 128G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
  gcc-9/performance/x86_64-rhel-8.3/process/16/debian-10.4-x86_64-20200603.cgz/lkp-csl-2sp9/lseek2/will-it-scale/0x5003006

commit: 
  39218ff4c6 ("stack: Optionally randomize kernel stack offset each syscall")
  fe950f6020 ("x86/entry: Enable random_kstack_offset support")

39218ff4c625dbf2 fe950f6020338c8ac668ef823bb 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
 1.489e+08            +7.9%  1.606e+08        will-it-scale.16.processes
   9303247            +7.9%   10038079        will-it-scale.per_process_ops
 1.489e+08            +7.9%  1.606e+08        will-it-scale.workload
    164.84            +1.4%     167.21        turbostat.PkgWatt
 1.163e+10 ± 63%     -99.6%   41700541 ±179%  cpuidle.C6.time
  14620637 ± 59%     -99.6%      65713 ±156%  cpuidle.C6.usage
     40.08 ± 49%    +107.9%      83.33 ± 14%  sched_debug.cfs_rq:/.removed.runnable_avg.max
      5.85 ± 54%     +91.4%      11.19 ± 20%  sched_debug.cfs_rq:/.removed.runnable_avg.stddev
     40.08 ± 49%    +107.9%      83.33 ± 14%  sched_debug.cfs_rq:/.removed.util_avg.max
      5.85 ± 54%     +83.6%      10.74 ± 20%  sched_debug.cfs_rq:/.removed.util_avg.stddev
     99973 ± 53%     +97.4%     197369 ± 33%  interrupts.CAL:Function_call_interrupts
    488.17 ±  6%     +24.7%     608.67 ± 15%  interrupts.CPU22.CAL:Function_call_interrupts
     14.00 ± 87%    +432.1%      74.50 ± 32%  interrupts.CPU22.RES:Rescheduling_interrupts
    218.17 ±107%     -72.5%      60.00 ± 23%  interrupts.CPU35.NMI:Non-maskable_interrupts
    218.17 ±107%     -72.5%      60.00 ± 23%  interrupts.CPU35.PMI:Performance_monitoring_interrupts
    123.17 ± 42%     -43.0%      70.17 ± 30%  interrupts.CPU37.NMI:Non-maskable_interrupts
    123.17 ± 42%     -43.0%      70.17 ± 30%  interrupts.CPU37.PMI:Performance_monitoring_interrupts
      2.67 ± 91%    +756.2%      22.83 ±161%  interrupts.CPU38.TLB:TLB_shootdowns
      3.50 ± 69%   +1423.8%      53.33 ±184%  interrupts.CPU67.TLB:TLB_shootdowns
     34.00 ± 14%    +163.7%      89.67 ±111%  interrupts.CPU8.RES:Rescheduling_interrupts
    113.50 ± 38%     -39.5%      68.67 ±  8%  interrupts.CPU85.NMI:Non-maskable_interrupts
    113.50 ± 38%     -39.5%      68.67 ±  8%  interrupts.CPU85.PMI:Performance_monitoring_interrupts
    330.50 ± 54%    +131.4%     764.67 ± 30%  interrupts.TLB:TLB_shootdowns
      0.01 ±  5%    -100.0%       0.00        perf-sched.sch_delay.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.03 ±128%     -70.0%       0.01 ± 18%  perf-sched.sch_delay.max.ms.wait_for_partner.fifo_open.do_dentry_open.do_open.isra
      0.03 ±  7%    -100.0%       0.00        perf-sched.wait_and_delay.avg.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
    530.40 ± 14%     +59.3%     845.16 ± 13%  perf-sched.wait_and_delay.avg.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
    186.83 ± 10%    -100.0%       0.00        perf-sched.wait_and_delay.count.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.33 ±159%     -89.4%       0.04 ± 44%  perf-sched.wait_and_delay.max.ms.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt.[unknown]
      0.15 ± 61%    -100.0%       0.00        perf-sched.wait_and_delay.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      0.03 ±128%     -70.0%       0.01 ± 18%  perf-sched.wait_and_delay.max.ms.wait_for_partner.fifo_open.do_dentry_open.do_open.isra
      0.03 ±  7%    -100.0%       0.00        perf-sched.wait_time.avg.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
      2.55 ±  5%     +15.4%       2.95 ± 12%  perf-sched.wait_time.avg.ms.rcu_gp_kthread.kthread.ret_from_fork
    530.39 ± 14%     +59.3%     845.15 ± 13%  perf-sched.wait_time.avg.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
      0.32 ±157%     -87.1%       0.04 ±  5%  perf-sched.wait_time.max.ms.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt.[unknown]
      0.15 ± 61%    -100.0%       0.00        perf-sched.wait_time.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.[unknown]
     10.84 ± 11%      -6.1        4.69 ± 10%  perf-profile.calltrace.cycles-pp.__fdget_pos.ksys_lseek.do_syscall_64.entry_SYSCALL_64_after_hwframe.llseek
      9.92 ± 11%      -6.1        3.81 ± 10%  perf-profile.calltrace.cycles-pp.__fget_light.__fdget_pos.ksys_lseek.do_syscall_64.entry_SYSCALL_64_after_hwframe
     19.15 ± 11%      -6.0       13.17 ± 10%  perf-profile.calltrace.cycles-pp.ksys_lseek.do_syscall_64.entry_SYSCALL_64_after_hwframe.llseek
      1.28 ± 15%      -0.3        0.96 ±  9%  perf-profile.calltrace.cycles-pp.testcase
      0.77 ± 11%      -0.2        0.61 ± 11%  perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state.cpuidle_enter.do_idle
      0.88 ±  9%      -0.1        0.74 ± 11%  perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state.cpuidle_enter.do_idle.cpu_startup_entry
      0.00            +1.0        0.95 ±  9%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.llseek
      0.00            +2.7        2.72 ±  9%  perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.llseek
      0.00            +4.6        4.58 ±  9%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.llseek
     11.38 ± 11%      -6.2        5.17 ± 10%  perf-profile.children.cycles-pp.__fdget_pos
      9.93 ± 11%      -6.1        3.81 ± 10%  perf-profile.children.cycles-pp.__fget_light
     19.53 ± 11%      -6.0       13.53 ± 10%  perf-profile.children.cycles-pp.ksys_lseek
      1.39 ± 14%      -0.4        1.04 ±  9%  perf-profile.children.cycles-pp.testcase
      1.22 ± 13%      -0.3        0.95 ±  9%  perf-profile.children.cycles-pp.syscall_exit_to_user_mode_prepare
      0.63 ± 12%      -0.2        0.42 ± 11%  perf-profile.children.cycles-pp.rcu_nocb_flush_deferred_wakeup
      2.20 ± 11%      +0.7        2.89 ± 10%  perf-profile.children.cycles-pp.exit_to_user_mode_prepare
      9.54 ± 11%      -6.0        3.49 ± 10%  perf-profile.self.cycles-pp.__fget_light
      1.12 ± 14%      -0.3        0.78 ±  9%  perf-profile.self.cycles-pp.testcase
      1.22 ± 13%      -0.3        0.94 ±  9%  perf-profile.self.cycles-pp.syscall_exit_to_user_mode_prepare
      1.53 ± 11%      -0.3        1.27 ± 10%  perf-profile.self.cycles-pp.syscall_enter_from_user_mode
      0.40 ± 10%      -0.1        0.32 ±  9%  perf-profile.self.cycles-pp.rcu_nocb_flush_deferred_wakeup
      1.58 ± 12%      +0.8        2.36 ±  9%  perf-profile.self.cycles-pp.exit_to_user_mode_prepare
      1.16 ± 12%      +0.9        2.04 ± 10%  perf-profile.self.cycles-pp.do_syscall_64
      0.53 ± 50%     -72.9%       0.14 ±  3%  perf-stat.i.MPKI
 1.013e+10            +9.2%  1.107e+10        perf-stat.i.branch-instructions
      1.55            -0.1        1.49        perf-stat.i.branch-miss-rate%
  1.57e+08            +4.6%  1.643e+08        perf-stat.i.branch-misses
   1167685 ± 23%     -35.0%     758467 ±  3%  perf-stat.i.cache-misses
      0.97           -10.3%       0.87        perf-stat.i.cpi
     46007 ± 30%     +46.5%      67405 ±  3%  perf-stat.i.cycles-between-cache-misses
      0.01 ± 67%      -0.0        0.00 ±  6%  perf-stat.i.dTLB-load-miss-rate%
 1.463e+10           +10.0%  1.609e+10        perf-stat.i.dTLB-loads
      0.00 ± 52%      -0.0        0.00 ±  2%  perf-stat.i.dTLB-store-miss-rate%
 9.612e+09           +11.1%  1.068e+10        perf-stat.i.dTLB-stores
 1.445e+08            +7.8%  1.558e+08        perf-stat.i.iTLB-load-misses
 4.825e+10           +10.0%  5.307e+10        perf-stat.i.instructions
    338.28            +2.2%     345.61        perf-stat.i.instructions-per-iTLB-miss
      1.03           +11.5%       1.15        perf-stat.i.ipc
      1.18 ±  3%     -13.8%       1.01 ±  8%  perf-stat.i.metric.K/sec
    390.93           +10.0%     430.08        perf-stat.i.metric.M/sec
      0.53 ± 51%     -73.6%       0.14 ±  3%  perf-stat.overall.MPKI
      1.55            -0.1        1.48        perf-stat.overall.branch-miss-rate%
      0.97           -10.3%       0.87        perf-stat.overall.cpi
     43056 ± 27%     +42.1%      61177 ±  3%  perf-stat.overall.cycles-between-cache-misses
      0.01 ± 68%      -0.0        0.00 ±  6%  perf-stat.overall.dTLB-load-miss-rate%
      0.00 ± 54%      -0.0        0.00        perf-stat.overall.dTLB-store-miss-rate%
    333.83            +2.0%     340.60        perf-stat.overall.instructions-per-iTLB-miss
      1.03           +11.5%       1.15        perf-stat.overall.ipc
     97642            +1.8%      99442        perf-stat.overall.path-length
  1.01e+10            +9.2%  1.103e+10        perf-stat.ps.branch-instructions
 1.565e+08            +4.6%  1.637e+08        perf-stat.ps.branch-misses
   1163211 ± 23%     -35.1%     755495 ±  3%  perf-stat.ps.cache-misses
 1.458e+10           +10.0%  1.604e+10        perf-stat.ps.dTLB-loads
  9.58e+09           +11.1%  1.064e+10        perf-stat.ps.dTLB-stores
 1.441e+08            +7.8%  1.553e+08        perf-stat.ps.iTLB-load-misses
 4.809e+10           +10.0%  5.289e+10        perf-stat.ps.instructions
 1.453e+13            +9.9%  1.597e+13        perf-stat.total.instructions
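
Two of the derived perf-stat metrics can be cross-checked from other rows in the same table. The numbers are consistent with `perf-stat.overall.path-length` being total instructions divided by the workload operation count, and with `cpi` being the reciprocal of `ipc`. Both relationships are inferred from the figures rather than stated in the report, and the function names below are hypothetical. A minimal sketch using the lseek2 / process / 16 values above:

```python
def path_length(total_instructions, workload_ops):
    """Instructions retired per benchmark operation (assumed definition)."""
    return total_instructions / workload_ops


def cpi_from_ipc(ipc):
    """Cycles per instruction as the reciprocal of instructions per cycle."""
    return 1.0 / ipc


# Base commit 39218ff4c6, then patched commit fe950f6020.
base_pl = path_length(1.453e13, 1.489e8)
patched_pl = path_length(1.597e13, 1.606e8)

base_cpi = cpi_from_ipc(1.03)
patched_cpi = cpi_from_ipc(1.15)

print(f"path-length: {base_pl:.0f} -> {patched_pl:.0f}")
print(f"cpi:         {base_cpi:.2f} -> {patched_cpi:.2f}")
```

The recomputed path lengths land within a fraction of a percent of the reported 97642 and 99442 (the inputs are rounded to four significant digits), and the CPI values match the reported 0.97 and 0.87. Under that reading, the patched kernel executes slightly more instructions per lseek operation but retires them at a higher IPC.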





Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


---
0DAY/LKP+ Test Infrastructure                   Open Source Technology Center
https://lists.01.org/hyperkitty/list/lkp@lists.01.org       Intel Corporation

Thanks,
Oliver Sang


View attachment "config-5.12.0-rc6-00004-gfe950f602033" of type "text/plain" (172967 bytes)

View attachment "job-script" of type "text/plain" (7993 bytes)

View attachment "job.yaml" of type "text/plain" (5477 bytes)

View attachment "reproduce" of type "text/plain" (337 bytes)
