lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Date: Fri, 19 Apr 2024 13:49:26 +0800
From: kernel test robot <yujie.liu@...el.com>
To: Linus Torvalds <torvalds@...ux-foundation.org>
CC: <oe-lkp@...ts.linux.dev>, <lkp@...el.com>, <linux-kernel@...r.kernel.org>,
	Thomas Gleixner <tglx@...utronix.de>, Daniel Sneddon
	<daniel.sneddon@...ux.intel.com>, Josh Poimboeuf <jpoimboe@...nel.org>,
	<ying.huang@...el.com>, <feng.tang@...el.com>, <fengwei.yin@...el.com>
Subject: [linus:master] [x86/syscall] 1e3ad78334:
 will-it-scale.per_process_ops 1.4% improvement

Hi Linus,

We noticed that commit 1e3ad78334a6 caused performance fluctuations in
various micro benchmarks. The perf stat metrics related with branch
instructions do have noticeable changes, which may be an expected
result of this commit. We are sending this report to provide these data
and hope it can be helpful for the awareness of overall impact or any
further investigation. Thanks.

kernel test robot noticed a 1.4% improvement of will-it-scale.per_process_ops on:

commit: 1e3ad78334a69b36e107232e337f9d693dcc9df2 ("x86/syscall: Don't force use of indirect calls for system calls")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

testcase: will-it-scale
test machine: 104 threads 2 sockets (Skylake) with 192G memory
parameters:

	nr_task: 16
	mode: process
	test: futex4
	cpufreq_governor: performance


In addition to that, the commit also has significant impact on the following tests:

+------------------+-------------------------------------------------------------------------------------------+
| testcase: change | stress-ng: stress-ng.null.ops_per_sec -4.0% regression                                    |
| test machine     | 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory |
| test parameters  | cpufreq_governor=performance                                                              |
|                  | nr_threads=100%                                                                           |
|                  | test=null                                                                                 |
|                  | testtime=60s                                                                              |
+------------------+-------------------------------------------------------------------------------------------+
| testcase: change | stress-ng: stress-ng.fpunch.ops_per_sec -1.6% regression                                  |
| test machine     | 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory |
| test parameters  | cpufreq_governor=performance                                                              |
|                  | disk=1HDD                                                                                 |
|                  | fs=ext4                                                                                   |
|                  | nr_threads=100%                                                                           |
|                  | test=fpunch                                                                               |
|                  | testtime=60s                                                                              |
+------------------+-------------------------------------------------------------------------------------------+
| testcase: change | unixbench: unixbench.throughput -1.4% regression                                          |
| test machine     | 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory |
| test parameters  | cpufreq_governor=performance                                                              |
|                  | nr_task=100%                                                                              |
|                  | runtime=300s                                                                              |
|                  | test=fsbuffer                                                                             |
+------------------+-------------------------------------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_process_ops -1.1% regression                             |
| test machine     | 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory |
| test parameters  | cpufreq_governor=performance                                                              |
|                  | mode=process                                                                              |
|                  | nr_task=100%                                                                              |
|                  | test=pread1                                                                               |
+------------------+-------------------------------------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_thread_ops -3.4% regression                              |
| test machine     | 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory |
| test parameters  | cpufreq_governor=performance                                                              |
|                  | mode=thread                                                                               |
|                  | nr_task=100%                                                                              |
|                  | test=poll1                                                                                |
+------------------+-------------------------------------------------------------------------------------------+

Details are as below:

The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20240419/202404191333.178a0eed-yujie.liu@intel.com

=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase:
  gcc-13/performance/x86_64-rhel-8.3/process/16/debian-12-x86_64-20240206.cgz/lkp-skl-fpga01/futex4/will-it-scale

commit: 
  0cd01ac5dc ("x86/bugs: Change commas to semicolons in 'spectre_v2' sysfs file")
  1e3ad78334 ("x86/syscall: Don't force use of indirect calls for system calls")

0cd01ac5dcb1e18e 1e3ad78334a69b36e107232e337 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
    860611            -1.4%     848885        proc-vmstat.numa_hit
    753301            -1.6%     741136        proc-vmstat.numa_local
  21797058            +1.4%   22102512        will-it-scale.16.processes
   1362315            +1.4%    1381406        will-it-scale.per_process_ops
  21797058            +1.4%   22102512        will-it-scale.workload
      0.04 ±  7%      -7.4%       0.04        perf-stat.i.MPKI
  1.98e+09           +19.2%   2.36e+09        perf-stat.i.branch-instructions
      1.47            -1.2        0.30        perf-stat.i.branch-miss-rate%
  30820475           -70.4%    9118612        perf-stat.i.branch-misses
      3.45            -4.4%       3.30        perf-stat.i.cpi
 1.504e+10            +5.1%   1.58e+10        perf-stat.i.instructions
      0.29            +4.5%       0.31        perf-stat.i.ipc
      0.05 ±  2%      -4.2%       0.04        perf-stat.overall.MPKI
      1.56            -1.2        0.39        perf-stat.overall.branch-miss-rate%
      3.43            -4.3%       3.28        perf-stat.overall.cpi
      0.29            +4.5%       0.30        perf-stat.overall.ipc
    208138            +3.4%     215312        perf-stat.overall.path-length
 1.973e+09           +19.2%  2.353e+09        perf-stat.ps.branch-instructions
  30729762           -70.4%    9109071        perf-stat.ps.branch-misses
 1.499e+10            +5.1%  1.575e+10        perf-stat.ps.instructions
 4.537e+12            +4.9%  4.759e+12        perf-stat.total.instructions
     12.23            -0.6       11.60        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
     10.09            -0.6        9.51        perf-profile.calltrace.cycles-pp.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
     22.31            -0.4       21.88        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.syscall
      9.25            +0.2        9.43        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.syscall
      8.79            +0.2        9.02        perf-profile.calltrace.cycles-pp.do_futex.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
      7.13            +0.2        7.36        perf-profile.calltrace.cycles-pp.__futex_wait.futex_wait.do_futex.__x64_sys_futex.do_syscall_64
      8.37            +0.3        8.63        perf-profile.calltrace.cycles-pp.futex_wait.do_futex.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe
     12.38            -0.6       11.78        perf-profile.children.cycles-pp.do_syscall_64
     10.12            -0.5        9.57        perf-profile.children.cycles-pp.__x64_sys_futex
     22.63            -0.4       22.20        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.48 ±  2%      -0.0        0.46        perf-profile.children.cycles-pp.get_futex_key
      0.00            +0.2        0.18 ±  2%  perf-profile.children.cycles-pp.x64_sys_call
      9.11            +0.2        9.29        perf-profile.children.cycles-pp.entry_SYSCALL_64
      8.88            +0.2        9.11        perf-profile.children.cycles-pp.do_futex
      7.13            +0.2        7.36        perf-profile.children.cycles-pp.__futex_wait
      8.43            +0.3        8.70        perf-profile.children.cycles-pp.futex_wait
      1.20            -0.7        0.47        perf-profile.self.cycles-pp.__x64_sys_futex
      1.46            -0.2        1.27        perf-profile.self.cycles-pp.do_syscall_64
      0.51            -0.1        0.44        perf-profile.self.cycles-pp.do_futex
      0.38 ±  5%      -0.1        0.32 ±  4%  perf-profile.self.cycles-pp.syscall_exit_to_user_mode
      0.48 ±  2%      -0.0        0.45        perf-profile.self.cycles-pp.get_futex_key
      0.00            +0.1        0.15 ±  2%  perf-profile.self.cycles-pp.x64_sys_call
      7.97            +0.1        8.12        perf-profile.self.cycles-pp.entry_SYSCALL_64
     10.43            +0.2       10.60        perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.72 ±  3%      +0.2        0.94 ±  3%  perf-profile.self.cycles-pp.__futex_wait


***************************************************************************************************
lkp-icl-2sp8: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
=========================================================================================
compiler/cpufreq_governor/disk/fs/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-13/performance/1HDD/btrfs/x86_64-rhel-8.3/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp8/utime/stress-ng/60s

commit: 
  0cd01ac5dc ("x86/bugs: Change commas to semicolons in 'spectre_v2' sysfs file")
  1e3ad78334 ("x86/syscall: Don't force use of indirect calls for system calls")

0cd01ac5dcb1e18e 1e3ad78334a69b36e107232e337 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
    136026 ±  3%     +20.6%     164016 ± 11%  meminfo.DirectMap4k
 5.516e+10            +1.5%  5.598e+10        perf-stat.i.branch-instructions
 5.427e+10            +1.5%  5.508e+10        perf-stat.ps.branch-instructions
    137060 ± 23%     +35.5%     185722 ±  7%  numa-numastat.node0.local_node
     50345 ± 26%     -56.2%      22060 ± 77%  numa-numastat.node0.other_node
    289383 ±  9%     -17.6%     238445 ±  6%  numa-numastat.node1.local_node
     15965 ± 85%    +177.3%      44264 ± 38%  numa-numastat.node1.other_node
    136562 ± 23%     +35.6%     185165 ±  7%  numa-vmstat.node0.numa_local
     50345 ± 26%     -56.2%      22060 ± 77%  numa-vmstat.node0.numa_other
    288523 ±  9%     -17.7%     237526 ±  6%  numa-vmstat.node1.numa_local
     15965 ± 85%    +177.3%      44264 ± 38%  numa-vmstat.node1.numa_other
      1.71            -0.5        1.18        perf-profile.calltrace.cycles-pp.mnt_want_write.vfs_utimes.do_utimes.__x64_sys_utimensat.do_syscall_64
     43.01            -0.3       42.68        perf-profile.calltrace.cycles-pp.user_path_at_empty.do_utimes.__x64_sys_utimensat.do_syscall_64.entry_SYSCALL_64_after_hwframe
     23.61            -0.3       23.34        perf-profile.calltrace.cycles-pp.do_utimes.__x64_sys_utimensat.do_syscall_64.entry_SYSCALL_64_after_hwframe.utimensat
     26.52            -0.2       26.27        perf-profile.calltrace.cycles-pp.__x64_sys_utimensat.do_syscall_64.entry_SYSCALL_64_after_hwframe.utimensat
     16.22            -0.2       16.00        perf-profile.calltrace.cycles-pp.do_utimes.__x64_sys_utime.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
     13.89            -0.2       13.68        perf-profile.calltrace.cycles-pp.user_path_at_empty.do_utimes.__x64_sys_utime.do_syscall_64.entry_SYSCALL_64_after_hwframe
     39.07            -0.2       38.87        perf-profile.calltrace.cycles-pp.do_utimes.__x64_sys_utimensat.do_syscall_64.entry_SYSCALL_64_after_hwframe
     16.75            -0.2       16.56        perf-profile.calltrace.cycles-pp.__x64_sys_utime.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
     15.77            -0.2       15.58        perf-profile.calltrace.cycles-pp.filename_lookup.user_path_at_empty.do_utimes.__x64_sys_utimensat.do_syscall_64
     10.55            -0.2       10.37        perf-profile.calltrace.cycles-pp.getname_flags.user_path_at_empty.do_utimes.__x64_sys_utime.do_syscall_64
     13.78            -0.2       13.60        perf-profile.calltrace.cycles-pp.path_lookupat.filename_lookup.user_path_at_empty.do_utimes.__x64_sys_utimensat
      9.48            -0.2        9.31        perf-profile.calltrace.cycles-pp.strncpy_from_user.getname_flags.user_path_at_empty.do_utimes.__x64_sys_utime
     29.46            -0.1       29.32        perf-profile.calltrace.cycles-pp.utimensat
     25.18            -0.1       25.05        perf-profile.calltrace.cycles-pp.getname_flags.user_path_at_empty.do_utimes.__x64_sys_utimensat.do_syscall_64
     21.74            -0.1       21.62        perf-profile.calltrace.cycles-pp.strncpy_from_user.getname_flags.user_path_at_empty.do_utimes.__x64_sys_utimensat
     27.48            -0.1       27.35        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.utimensat
     43.89            -0.1       43.77        perf-profile.calltrace.cycles-pp.__x64_sys_utimensat.do_syscall_64.entry_SYSCALL_64_after_hwframe
     17.24            -0.1       17.13        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.syscall
     27.21            -0.1       27.11        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.utimensat
     17.10            -0.1       17.00        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
     18.02            -0.1       17.93        perf-profile.calltrace.cycles-pp.syscall
      3.82            -0.1        3.76        perf-profile.calltrace.cycles-pp.__check_object_size.strncpy_from_user.getname_flags.user_path_at_empty.do_utimes
      0.57            -0.0        0.54        perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.61            -0.0        1.58        perf-profile.calltrace.cycles-pp.check_heap_object.__check_object_size.strncpy_from_user.getname_flags.user_path_at_empty
      2.91            -0.0        2.88        perf-profile.calltrace.cycles-pp.filename_lookup.user_path_at_empty.do_utimes.__x64_sys_utime.do_syscall_64
      2.43            -0.0        2.40        perf-profile.calltrace.cycles-pp.path_lookupat.filename_lookup.user_path_at_empty.do_utimes.__x64_sys_utime
     45.81            +0.1       45.96        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe
     45.27            +0.2       45.45        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe
     79.22            -0.7       78.54        perf-profile.children.cycles-pp.do_utimes
     57.10            -0.5       56.56        perf-profile.children.cycles-pp.user_path_at_empty
     70.66            -0.4       70.29        perf-profile.children.cycles-pp.__x64_sys_utimensat
     36.81            -0.3       36.49        perf-profile.children.cycles-pp.getname_flags
     31.75            -0.3       31.45        perf-profile.children.cycles-pp.strncpy_from_user
     20.12            -0.2       19.91        perf-profile.children.cycles-pp.filename_lookup
     17.70            -0.2       17.50        perf-profile.children.cycles-pp.path_lookupat
     16.79            -0.2       16.60        perf-profile.children.cycles-pp.__x64_sys_utime
     29.54            -0.1       29.40        perf-profile.children.cycles-pp.utimensat
     18.34            -0.1       18.25        perf-profile.children.cycles-pp.syscall
     19.31            -0.1       19.22        perf-profile.children.cycles-pp.vfs_utimes
      4.47            -0.1        4.40        perf-profile.children.cycles-pp.__check_object_size
      1.32            -0.1        1.26        perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      3.38            -0.1        3.34        perf-profile.children.cycles-pp.walk_component
      2.56            -0.0        2.52        perf-profile.children.cycles-pp.lookup_fast
      2.08            -0.0        2.04        perf-profile.children.cycles-pp.__d_lookup_rcu
      2.33            -0.0        2.30        perf-profile.children.cycles-pp.check_heap_object
      2.44            -0.0        2.41        perf-profile.children.cycles-pp.complete_walk
      1.07            -0.0        1.05        perf-profile.children.cycles-pp.make_vfsuid
      1.30            -0.0        1.28        perf-profile.children.cycles-pp.path_put
      0.84            +0.0        0.88        perf-profile.children.cycles-pp.syscall_return_via_sysret
      0.00            +0.6        0.63        perf-profile.children.cycles-pp.x64_sys_call
     27.25            -0.2       27.02        perf-profile.self.cycles-pp.strncpy_from_user
      1.30            -0.1        1.22        perf-profile.self.cycles-pp.do_syscall_64
      0.24            -0.0        0.23        perf-profile.self.cycles-pp.may_setattr
      0.12            +0.0        0.15 ±  3%  perf-profile.self.cycles-pp.__x64_sys_utime
      0.84            +0.0        0.88        perf-profile.self.cycles-pp.syscall_return_via_sysret
      0.92            +0.1        1.04        perf-profile.self.cycles-pp.__x64_sys_utimensat
      0.00            +0.5        0.55        perf-profile.self.cycles-pp.x64_sys_call



***************************************************************************************************
lkp-icl-2sp7: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-13/performance/x86_64-rhel-8.3/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp7/readahead/stress-ng/60s

commit: 
  0cd01ac5dc ("x86/bugs: Change commas to semicolons in 'spectre_v2' sysfs file")
  1e3ad78334 ("x86/syscall: Don't force use of indirect calls for system calls")

0cd01ac5dcb1e18e 1e3ad78334a69b36e107232e337 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
 5.631e+10            +2.8%  5.787e+10        perf-stat.i.branch-instructions
  5.54e+10            +2.8%  5.695e+10        perf-stat.ps.branch-instructions
     55177 ± 10%     +36.4%      75281 ± 12%  sched_debug.cfs_rq:/.avg_vruntime.stddev
     55177 ± 10%     +36.4%      75281 ± 12%  sched_debug.cfs_rq:/.min_vruntime.stddev
     46.20            -0.5       45.74        perf-profile.calltrace.cycles-pp.vfs_read.__x64_sys_pread64.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_pread
     35.83            -0.4       35.38        perf-profile.calltrace.cycles-pp.filemap_read.vfs_read.__x64_sys_pread64.do_syscall_64.entry_SYSCALL_64_after_hwframe
     20.24            -0.3       19.90        perf-profile.calltrace.cycles-pp._copy_to_iter.copy_page_to_iter.filemap_read.vfs_read.__x64_sys_pread64
     20.87            -0.3       20.54        perf-profile.calltrace.cycles-pp.copy_page_to_iter.filemap_read.vfs_read.__x64_sys_pread64.do_syscall_64
      1.66            -0.1        1.58        perf-profile.calltrace.cycles-pp.__fdget.ksys_readahead.do_syscall_64.entry_SYSCALL_64_after_hwframe.readahead
      0.66 ±  3%      -0.1        0.60 ±  2%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.readahead.stress_run
      0.63 ±  4%      -0.0        0.58 ±  2%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.readahead
      4.29            -0.0        4.25        perf-profile.calltrace.cycles-pp.__fsnotify_parent.vfs_read.__x64_sys_pread64.do_syscall_64.entry_SYSCALL_64_after_hwframe
      2.20            -0.0        2.16        perf-profile.calltrace.cycles-pp.touch_atime.filemap_read.vfs_read.__x64_sys_pread64.do_syscall_64
      1.88            -0.0        1.85        perf-profile.calltrace.cycles-pp.atime_needs_update.touch_atime.filemap_read.vfs_read.__x64_sys_pread64
      4.33 ±  3%      +0.3        4.68 ±  3%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.readahead
      3.66 ±  3%      +0.4        4.05 ±  3%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.readahead
     46.41            -0.5       45.94        perf-profile.children.cycles-pp.vfs_read
     48.17            -0.5       47.71        perf-profile.children.cycles-pp.__x64_sys_pread64
     36.13            -0.5       35.68        perf-profile.children.cycles-pp.filemap_read
     20.30            -0.3       19.96        perf-profile.children.cycles-pp._copy_to_iter
     20.97            -0.3       20.64        perf-profile.children.cycles-pp.copy_page_to_iter
     55.86            -0.3       55.60        perf-profile.children.cycles-pp.__libc_pread
     24.71            -0.2       24.48        perf-profile.children.cycles-pp.stress_readahead
      2.62            -0.2        2.46        perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      4.54            -0.1        4.45        perf-profile.children.cycles-pp.ksys_readahead
      5.33            -0.0        5.28        perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
      4.40            -0.0        4.36        perf-profile.children.cycles-pp.__fsnotify_parent
      2.28            -0.0        2.26        perf-profile.children.cycles-pp.touch_atime
      2.06            -0.0        2.04        perf-profile.children.cycles-pp.atime_needs_update
      0.08 ±  8%      -0.0        0.05 ±  8%  perf-profile.children.cycles-pp.ktime_get_update_offsets_now
      0.78            +0.0        0.81        perf-profile.children.cycles-pp.posix_fadvise
     59.97            +0.3       60.27        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     58.28            +0.3       58.60        perf-profile.children.cycles-pp.do_syscall_64
     18.84            +0.5       19.32        perf-profile.children.cycles-pp.readahead
      0.00            +1.2        1.22        perf-profile.children.cycles-pp.x64_sys_call
     20.09            -0.3       19.76        perf-profile.self.cycles-pp._copy_to_iter
     24.32            -0.2       24.08        perf-profile.self.cycles-pp.stress_readahead
      2.65            -0.2        2.47        perf-profile.self.cycles-pp.do_syscall_64
      4.84            -0.0        4.80        perf-profile.self.cycles-pp.filemap_read
      5.16            -0.0        5.11        perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
      4.27            -0.0        4.22        perf-profile.self.cycles-pp.__fsnotify_parent
      0.08 ±  6%      -0.0        0.05 ±  7%  perf-profile.self.cycles-pp.ktime_get_update_offsets_now
      1.82            -0.0        1.80        perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.70            +0.0        0.72        perf-profile.self.cycles-pp.__x64_sys_pread64
      0.00            +1.1        1.06        perf-profile.self.cycles-pp.x64_sys_call



***************************************************************************************************
lkp-icl-2sp7: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-13/performance/x86_64-rhel-8.3/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp7/null/stress-ng/60s

commit: 
  0cd01ac5dc ("x86/bugs: Change commas to semicolons in 'spectre_v2' sysfs file")
  1e3ad78334 ("x86/syscall: Don't force use of indirect calls for system calls")

0cd01ac5dcb1e18e 1e3ad78334a69b36e107232e337 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
     19402 ± 14%     +63.7%      31762 ± 28%  sched_debug.cpu.nr_switches.max
      3272 ± 10%     +40.4%       4595 ± 21%  sched_debug.cpu.nr_switches.stddev
      3241           +10.1%       3569 ±  9%  vmstat.system.cs
    162368            -0.9%     160961        vmstat.system.in
   6303220            -3.7%    6068707        proc-vmstat.numa_hit
   6236896            -3.8%    6002419        proc-vmstat.numa_local
   6341375            -3.7%    6107478        proc-vmstat.pgalloc_normal
   6171078            -3.7%    5941105        proc-vmstat.pgfault
   6144519            -3.8%    5913179        proc-vmstat.pgfree
     19272            -3.3%      18627        stress-ng.null.MB_per_sec_/dev/null_write_rate
 2.902e+09            -4.0%  2.787e+09        stress-ng.null.ops
  48365768            -4.0%   46449880        stress-ng.null.ops_per_sec
   5809136            -3.9%    5580207        stress-ng.time.minor_page_faults
      2394            +1.6%       2431        stress-ng.time.system_time
      1324            -2.7%       1289        stress-ng.time.user_time
 3.529e+10           +18.8%   4.19e+10        perf-stat.i.branch-instructions
      0.24 ±  3%      -0.1        0.19 ±  3%  perf-stat.i.branch-miss-rate%
  85202098 ±  4%      -9.4%   77223454 ±  3%  perf-stat.i.branch-misses
      3168 ±  2%     +11.4%       3529 ± 10%  perf-stat.i.context-switches
      1.03            -2.7%       1.00        perf-stat.i.cpi
 1.897e+11            +2.7%  1.949e+11        perf-stat.i.instructions
      0.97            +2.7%       1.00        perf-stat.i.ipc
      3.14            -3.8%       3.03        perf-stat.i.metric.K/sec
    100663            -3.8%      96871        perf-stat.i.minor-faults
    100663            -3.8%      96871        perf-stat.i.page-faults
      0.24 ±  3%      -0.1        0.18 ±  3%  perf-stat.overall.branch-miss-rate%
      1.03            -2.7%       1.00        perf-stat.overall.cpi
      0.97            +2.7%       1.00        perf-stat.overall.ipc
 3.471e+10           +18.7%  4.121e+10        perf-stat.ps.branch-instructions
  83783190 ±  3%      -9.8%   75603241 ±  3%  perf-stat.ps.branch-misses
      3114 ±  2%     +11.0%       3457 ± 10%  perf-stat.ps.context-switches
 1.866e+11            +2.7%  1.916e+11        perf-stat.ps.instructions
     98965            -3.8%      95242        perf-stat.ps.minor-faults
     98966            -3.8%      95242        perf-stat.ps.page-faults
 1.139e+13            +2.6%  1.169e+13        perf-stat.total.instructions
      4.88 ±  2%      -0.3        4.62        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.llseek
      4.94 ±  2%      -0.2        4.74        perf-profile.calltrace.cycles-pp.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
      3.29 ±  2%      -0.2        3.12        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.ioctl
      3.26            -0.1        3.13        perf-profile.calltrace.cycles-pp.setfl.do_fcntl.__x64_sys_fcntl.do_syscall_64.entry_SYSCALL_64_after_hwframe
      3.30            -0.1        3.17        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.fallocate64
      2.48            -0.1        2.36        perf-profile.calltrace.cycles-pp.do_vfs_ioctl.__x64_sys_ioctl.do_syscall_64.entry_SYSCALL_64_after_hwframe.ioctl
      2.34 ±  2%      -0.1        2.21        perf-profile.calltrace.cycles-pp.do_fcntl.__x64_sys_fcntl.do_syscall_64.entry_SYSCALL_64_after_hwframe
      2.12            -0.1        2.01        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64
      0.84 ±  2%      -0.1        0.75 ±  2%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.86 ±  2%      -0.1        0.76 ±  2%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.llseek
      0.89 ±  3%      -0.1        0.80        perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.llseek.stress_run
      1.63 ±  2%      -0.1        1.55        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.write
      0.86 ±  3%      -0.1        0.79 ±  2%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.stress_run
      2.40 ±  2%      -0.1        2.33        perf-profile.calltrace.cycles-pp.do_fcntl.__x64_sys_fcntl.do_syscall_64.entry_SYSCALL_64_after_hwframe.stress_run
      1.26            -0.1        1.20        perf-profile.calltrace.cycles-pp.__put_user_4.do_vfs_ioctl.__x64_sys_ioctl.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.58 ±  3%      -0.1        0.52        perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.ioctl
      1.45            -0.1        1.40        perf-profile.calltrace.cycles-pp._raw_spin_lock.setfl.do_fcntl.__x64_sys_fcntl.do_syscall_64
      0.55            +0.0        0.58        perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.fallocate64
      0.86 ±  3%      +0.1        0.91 ±  2%  perf-profile.calltrace.cycles-pp.__fdget_raw.__x64_sys_fcntl.do_syscall_64.entry_SYSCALL_64_after_hwframe.stress_run
      3.11            +0.1        3.19        perf-profile.calltrace.cycles-pp.__x64_sys_fallocate.do_syscall_64.entry_SYSCALL_64_after_hwframe.fallocate64
      1.50 ±  2%      +0.1        1.60 ±  2%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.fdatasync
      0.70            +0.1        0.83        perf-profile.calltrace.cycles-pp.__fdget.__x64_sys_fallocate.do_syscall_64.entry_SYSCALL_64_after_hwframe.fallocate64
      1.15 ±  2%      +0.1        1.29 ±  2%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.fdatasync
      1.41 ±  2%      +0.1        1.55 ±  2%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.fdatasync.stress_run
      1.18 ±  2%      +0.2        1.35 ±  2%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.fdatasync.stress_run
      3.90 ±  2%      +0.2        4.08 ±  2%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.write.stress_run
      3.95 ±  2%      +0.2        4.14        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.ioctl
      1.36 ±  2%      +0.2        1.55 ±  2%  perf-profile.calltrace.cycles-pp.__fdget.__x64_sys_ioctl.do_syscall_64.entry_SYSCALL_64_after_hwframe.ioctl
      4.08 ±  2%      +0.2        4.31        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.llseek
      4.66 ±  3%      +0.3        4.91        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.ioctl.stress_run
      4.06 ±  3%      +0.3        4.36        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.ioctl.stress_run
      4.58            +0.3        4.91        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.fallocate64
      5.07            +0.3        5.40        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.fallocate64
      4.20 ±  3%      +0.3        4.54 ±  2%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.llseek.stress_run
      6.71 ±  2%      +0.4        7.07        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.stress_run
      0.17 ±141%      +0.4        0.58 ±  2%  perf-profile.calltrace.cycles-pp.__x64_sys_fdatasync.do_syscall_64.entry_SYSCALL_64_after_hwframe.fdatasync.stress_run
      0.00            +0.5        0.55 ±  2%  perf-profile.calltrace.cycles-pp.__x64_sys_fdatasync.do_syscall_64.entry_SYSCALL_64_after_hwframe.fdatasync
      8.22            -0.6        7.61        perf-profile.children.cycles-pp.syscall_exit_to_user_mode
     16.53            -0.6       15.94        perf-profile.children.cycles-pp.entry_SYSCALL_64
     16.36            -0.6       15.80        perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
      5.04            -0.2        4.83        perf-profile.children.cycles-pp.do_fcntl
      5.14 ±  2%      -0.2        4.95        perf-profile.children.cycles-pp.vfs_write
      9.50            -0.2        9.31        perf-profile.children.cycles-pp.__x64_sys_fcntl
      3.54            -0.1        3.40        perf-profile.children.cycles-pp.setfl
      2.62            -0.1        2.48        perf-profile.children.cycles-pp.do_vfs_ioctl
      2.56            -0.1        2.47        perf-profile.children.cycles-pp.stress_null
      1.89            -0.1        1.80        perf-profile.children.cycles-pp.amd_clear_divider
      1.74            -0.1        1.65        perf-profile.children.cycles-pp.__libc_fcntl64
      1.38            -0.1        1.30        perf-profile.children.cycles-pp.__put_user_4
      1.54            -0.1        1.49        perf-profile.children.cycles-pp._raw_spin_lock
      0.44 ±  4%      -0.1        0.39        perf-profile.children.cycles-pp.__munmap
      0.42 ±  4%      -0.1        0.37        perf-profile.children.cycles-pp.__vm_munmap
      0.42 ±  4%      -0.1        0.37        perf-profile.children.cycles-pp.__x64_sys_munmap
      0.40 ±  4%      -0.0        0.36        perf-profile.children.cycles-pp.do_vmi_align_munmap
      2.46            -0.0        2.41        perf-profile.children.cycles-pp.syscall_exit_to_user_mode_prepare
      0.42 ±  4%      -0.0        0.38        perf-profile.children.cycles-pp.do_vmi_munmap
      0.32 ±  4%      -0.0        0.28        perf-profile.children.cycles-pp.unmap_region
      0.24 ±  3%      -0.0        0.22 ±  2%  perf-profile.children.cycles-pp.asm_exc_page_fault
      0.31 ±  3%      -0.0        0.29        perf-profile.children.cycles-pp.__mmap
      0.22 ±  3%      -0.0        0.19        perf-profile.children.cycles-pp.do_user_addr_fault
      0.55            -0.0        0.52        perf-profile.children.cycles-pp.fcntl64@plt
      0.75            -0.0        0.72        perf-profile.children.cycles-pp.security_file_fcntl
      0.29 ±  3%      -0.0        0.27        perf-profile.children.cycles-pp.vm_mmap_pgoff
      0.28 ±  3%      -0.0        0.25        perf-profile.children.cycles-pp.do_mmap
      0.22 ±  2%      -0.0        0.20 ±  2%  perf-profile.children.cycles-pp.exc_page_fault
      0.20 ±  3%      -0.0        0.18 ±  2%  perf-profile.children.cycles-pp.mmap_region
      0.56            -0.0        0.54        perf-profile.children.cycles-pp.null_lseek
      0.53            -0.0        0.51        perf-profile.children.cycles-pp.security_file_ioctl
      0.18 ±  3%      -0.0        0.16 ±  2%  perf-profile.children.cycles-pp.handle_mm_fault
      0.15 ±  7%      -0.0        0.13 ±  2%  perf-profile.children.cycles-pp.tlb_finish_mmu
      0.16 ±  3%      -0.0        0.15 ±  3%  perf-profile.children.cycles-pp.__handle_mm_fault
      0.12 ±  6%      -0.0        0.10        perf-profile.children.cycles-pp.__tlb_batch_free_encoded_pages
      0.07            -0.0        0.06        perf-profile.children.cycles-pp.__anon_vma_prepare
      3.28            +0.1        3.35        perf-profile.children.cycles-pp.__x64_sys_fallocate
      7.43            +0.1        7.51        perf-profile.children.cycles-pp.fdatasync
      4.15            +0.1        4.23        perf-profile.children.cycles-pp.syscall_return_via_sysret
      1.07            +0.1        1.22        perf-profile.children.cycles-pp.__x64_sys_fdatasync
      2.93            +0.4        3.35 ±  2%  perf-profile.children.cycles-pp.__fdget
     52.11            +1.6       53.71        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     47.02            +1.9       48.87        perf-profile.children.cycles-pp.do_syscall_64
      0.00            +3.4        3.40        perf-profile.children.cycles-pp.x64_sys_call
      8.38            -0.7        7.68        perf-profile.self.cycles-pp.do_syscall_64
     15.83            -0.6       15.27        perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
      5.47            -0.3        5.20        perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      5.17            -0.2        4.98        perf-profile.self.cycles-pp.entry_SYSCALL_64
      4.64            -0.2        4.49        perf-profile.self.cycles-pp.llseek
      4.24            -0.1        4.10        perf-profile.self.cycles-pp.ioctl
      4.36            -0.1        4.22        perf-profile.self.cycles-pp.syscall_exit_to_user_mode
      1.94            -0.1        1.84        perf-profile.self.cycles-pp.stress_null
      2.02            -0.1        1.93        perf-profile.self.cycles-pp.fdatasync
      2.18            -0.1        2.10        perf-profile.self.cycles-pp.fallocate64
      2.01            -0.1        1.94        perf-profile.self.cycles-pp.setfl
      1.33            -0.1        1.26        perf-profile.self.cycles-pp.__put_user_4
      1.54            -0.1        1.47        perf-profile.self.cycles-pp.do_fcntl
      1.19            -0.1        1.14 ±  2%  perf-profile.self.cycles-pp.do_vfs_ioctl
      1.34            -0.1        1.29        perf-profile.self.cycles-pp._raw_spin_lock
      2.30            -0.0        2.26        perf-profile.self.cycles-pp.__x64_sys_fcntl
      1.97            -0.0        1.93        perf-profile.self.cycles-pp.syscall_exit_to_user_mode_prepare
      0.97            -0.0        0.94        perf-profile.self.cycles-pp.amd_clear_divider
      0.56            -0.0        0.54        perf-profile.self.cycles-pp.security_file_fcntl
      0.39            -0.0        0.37        perf-profile.self.cycles-pp.fcntl64@plt
      0.30            -0.0        0.29        perf-profile.self.cycles-pp.rw_verify_area
      0.36            -0.0        0.35        perf-profile.self.cycles-pp.security_file_ioctl
      0.44            +0.0        0.48 ±  2%  perf-profile.self.cycles-pp.__x64_sys_fallocate
      0.34            +0.0        0.37 ±  6%  perf-profile.self.cycles-pp.__x64_sys_fdatasync
      4.14            +0.1        4.23        perf-profile.self.cycles-pp.syscall_return_via_sysret
      1.57            +0.1        1.66 ±  2%  perf-profile.self.cycles-pp.__fdget_raw
      2.61            +0.4        3.06 ±  2%  perf-profile.self.cycles-pp.__fdget
      0.00            +2.9        2.89        perf-profile.self.cycles-pp.x64_sys_call



***************************************************************************************************
lkp-icl-2sp7: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-13/performance/x86_64-rhel-8.3/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp7/monte-carlo/stress-ng/60s

commit: 
  0cd01ac5dc ("x86/bugs: Change commas to semicolons in 'spectre_v2' sysfs file")
  1e3ad78334 ("x86/syscall: Don't force use of indirect calls for system calls")

0cd01ac5dcb1e18e 1e3ad78334a69b36e107232e337 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
      1.74            -0.1        1.62        perf-stat.overall.branch-miss-rate%
 1.411e+13            +1.2%  1.427e+13        perf-stat.total.instructions
   2838242            -1.6%    2793803        stress-ng.monte-carlo.samples/sec,_e_using_arc4
   7122323            -1.2%    7036665        stress-ng.monte-carlo.samples/sec,_exp_using_arc4
   3972723            -1.5%    3911813        stress-ng.monte-carlo.samples/sec,_pi_using_arc4
 1.016e+08            -1.3%  1.004e+08        stress-ng.monte-carlo.samples/sec,_pi_using_lcg
   6407021            -1.4%    6319313        stress-ng.monte-carlo.samples/sec,_sin_using_arc4
   7374513            -1.3%    7277983        stress-ng.monte-carlo.samples/sec,_sqrt_using_arc4
   3962914            -1.5%    3904274        stress-ng.monte-carlo.samples/sec,_squircle_using_arc4
      1108            +1.8%       1128        stress-ng.time.system_time
      3.02            -0.3        2.69        perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__getpid.stress_mc_arc4_rand
      3.40 ±  2%      -0.2        3.16 ±  2%  perf-profile.calltrace.cycles-pp.__x64_sys_getpid.do_syscall_64.entry_SYSCALL_64_after_hwframe.__getpid.stress_mc_arc4_rand
      2.93 ±  3%      -0.2        2.71 ±  3%  perf-profile.calltrace.cycles-pp.__task_pid_nr_ns.__x64_sys_getpid.do_syscall_64.entry_SYSCALL_64_after_hwframe.__getpid
     17.72            -0.2       17.52        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__getpid.stress_mc_arc4_rand
      0.94            -0.0        0.91        perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__getpid
      3.13            -0.0        3.10        perf-profile.calltrace.cycles-pp.stress_mc_mwc64_rand
      2.07            -0.0        2.04        perf-profile.calltrace.cycles-pp.stress_mwc64.stress_mc_mwc64_rand
      0.56            -0.0        0.53        perf-profile.calltrace.cycles-pp.stress_monte_carlo_sqrt.stress_mc_arc4_rand
     43.17            +0.5       43.65        perf-profile.calltrace.cycles-pp.stress_mc_arc4_rand
     34.06            +0.5       34.58        perf-profile.calltrace.cycles-pp.__getpid.stress_mc_arc4_rand
     13.54            +0.8       14.33        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__getpid.stress_mc_arc4_rand
     10.40            +1.0       11.40        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__getpid.stress_mc_arc4_rand
      0.00            +1.4        1.38        perf-profile.calltrace.cycles-pp.x64_sys_call.do_syscall_64.entry_SYSCALL_64_after_hwframe.__getpid.stress_mc_arc4_rand
      3.94            -0.4        3.58        perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      3.85 ±  2%      -0.2        3.61 ±  2%  perf-profile.children.cycles-pp.__x64_sys_getpid
      3.15 ±  2%      -0.2        2.93 ±  3%  perf-profile.children.cycles-pp.__task_pid_nr_ns
      7.88            -0.1        7.76        perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
      4.44            -0.1        4.37        perf-profile.children.cycles-pp.stress_mc_xorshift_rand
      1.18            -0.1        1.13        perf-profile.children.cycles-pp.syscall_exit_to_user_mode_prepare
      2.10            -0.0        2.06        perf-profile.children.cycles-pp.stress_monte_carlo_pi
      2.38            -0.0        2.35        perf-profile.children.cycles-pp.stress_monte_carlo_sqrt
      2.32            -0.0        2.29        perf-profile.children.cycles-pp.stress_mwc64
     35.10            +0.6       35.66        perf-profile.children.cycles-pp.__getpid
     27.59            +0.7       28.32        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     25.10            +0.7       25.84        perf-profile.children.cycles-pp.do_syscall_64
      0.00            +1.6        1.61        perf-profile.children.cycles-pp.x64_sys_call
      3.90            -0.2        3.68        perf-profile.self.cycles-pp.do_syscall_64
      2.90 ±  2%      -0.2        2.69 ±  3%  perf-profile.self.cycles-pp.__task_pid_nr_ns
      7.64            -0.1        7.52        perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
      2.08            -0.1        2.00        perf-profile.self.cycles-pp.syscall_exit_to_user_mode
      0.95            -0.0        0.90        perf-profile.self.cycles-pp.syscall_exit_to_user_mode_prepare
      2.76            -0.0        2.72        perf-profile.self.cycles-pp.stress_mc_xorshift_rand
      1.64            -0.0        1.60        perf-profile.self.cycles-pp.stress_monte_carlo_pi
      2.05            -0.0        2.02        perf-profile.self.cycles-pp.stress_mwc64
      0.00            +1.4        1.37        perf-profile.self.cycles-pp.x64_sys_call



***************************************************************************************************
lkp-icl-2sp8: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
=========================================================================================
compiler/cpufreq_governor/disk/fs/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-13/performance/1HDD/ext4/x86_64-rhel-8.3/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp8/fpunch/stress-ng/60s

commit: 
  0cd01ac5dc ("x86/bugs: Change commas to semicolons in 'spectre_v2' sysfs file")
  1e3ad78334 ("x86/syscall: Don't force use of indirect calls for system calls")

0cd01ac5dcb1e18e 1e3ad78334a69b36e107232e337 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
 4.408e+10            +4.9%  4.623e+10        perf-stat.i.branch-instructions
      0.21            -0.0        0.19 ±  3%  perf-stat.overall.branch-miss-rate%
 4.336e+10            +4.9%  4.547e+10        perf-stat.ps.branch-instructions
 1.054e+08            -1.6%  1.037e+08        stress-ng.fpunch.ops
   1756286            -1.6%    1727644        stress-ng.fpunch.ops_per_sec
    879217            -2.0%     861604        stress-ng.time.voluntary_context_switches
     38.90            -0.6       38.29        perf-profile.calltrace.cycles-pp.vfs_write.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_pwrite
     31.84            -0.5       31.30        perf-profile.calltrace.cycles-pp.generic_file_write_iter.vfs_write.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe
     40.48            -0.4       40.04        perf-profile.calltrace.cycles-pp.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_pwrite
     19.84            -0.4       19.48        perf-profile.calltrace.cycles-pp.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
     22.29            -0.4       21.94        perf-profile.calltrace.cycles-pp.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
     16.32            -0.3       16.01        perf-profile.calltrace.cycles-pp.generic_file_write_iter.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
     26.13            -0.3       25.87        perf-profile.calltrace.cycles-pp.write
     23.37            -0.3       23.11        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.write
     47.74            -0.2       47.50        perf-profile.calltrace.cycles-pp.__libc_pwrite
      0.52            -0.2        0.34 ± 70%  perf-profile.calltrace.cycles-pp.__mutex_unlock_slowpath.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
      2.06            -0.1        1.98        perf-profile.calltrace.cycles-pp.up_write.generic_file_write_iter.vfs_write.__x64_sys_pwrite64.do_syscall_64
      1.33            -0.1        1.28        perf-profile.calltrace.cycles-pp.rwsem_wake.up_write.generic_file_write_iter.vfs_write.__x64_sys_pwrite64
      0.64 ±  2%      -0.1        0.59        perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_pwrite
      2.62            -0.0        2.58        perf-profile.calltrace.cycles-pp.simple_write_end.generic_perform_write.generic_file_write_iter.vfs_write.__x64_sys_pwrite64
      1.02            -0.0        0.99        perf-profile.calltrace.cycles-pp.up_write.generic_file_write_iter.vfs_write.ksys_write.do_syscall_64
      0.66            -0.0        0.64        perf-profile.calltrace.cycles-pp.rwsem_wake.up_write.generic_file_write_iter.vfs_write.ksys_write
      1.48            -0.0        1.46        perf-profile.calltrace.cycles-pp.simple_write_end.generic_perform_write.generic_file_write_iter.vfs_write.ksys_write
      0.66            -0.0        0.64        perf-profile.calltrace.cycles-pp.security_file_permission.rw_verify_area.vfs_write.ksys_write.do_syscall_64
      1.06            +0.0        1.08        perf-profile.calltrace.cycles-pp.vfs_fallocate.__x64_sys_fallocate.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
      0.83 ±  3%      +0.1        0.91        perf-profile.calltrace.cycles-pp.__fdget.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_pwrite
      1.60            +0.1        1.71        perf-profile.calltrace.cycles-pp.__x64_sys_fallocate.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
      0.71 ±  2%      +0.1        0.82        perf-profile.calltrace.cycles-pp.__fdget.__x64_sys_fallocate.do_syscall_64.entry_SYSCALL_64_after_hwframe.fallocate64
      3.00            +0.2        3.18        perf-profile.calltrace.cycles-pp.__x64_sys_fallocate.do_syscall_64.entry_SYSCALL_64_after_hwframe.fallocate64
      5.63            +0.2        5.85        perf-profile.calltrace.cycles-pp.syscall
      2.37            +0.2        2.60        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
      2.65            +0.2        2.89        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.syscall
     10.09            +0.3       10.44        perf-profile.calltrace.cycles-pp.fallocate64
      4.37            +0.4        4.74        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.fallocate64
      4.84            +0.4        5.21        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.fallocate64
     59.21            -1.0       58.23        perf-profile.children.cycles-pp.vfs_write
     48.48            -0.9       47.62        perf-profile.children.cycles-pp.generic_file_write_iter
     40.60            -0.4       40.17        perf-profile.children.cycles-pp.__x64_sys_pwrite64
     22.42            -0.4       22.06        perf-profile.children.cycles-pp.ksys_write
     26.21            -0.3       25.96        perf-profile.children.cycles-pp.write
     47.88            -0.2       47.65        perf-profile.children.cycles-pp.__libc_pwrite
      3.21            -0.1        3.10        perf-profile.children.cycles-pp.up_write
      2.62            -0.1        2.52        perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      2.07            -0.1        2.00        perf-profile.children.cycles-pp.rwsem_wake
      4.34            -0.1        4.27        perf-profile.children.cycles-pp.simple_write_end
      1.28            -0.0        1.24        perf-profile.children.cycles-pp.wake_up_q
      0.69            -0.0        0.66        perf-profile.children.cycles-pp.wake_q_add
      1.06            -0.0        1.03 ±  2%  perf-profile.children.cycles-pp.__mutex_unlock_slowpath
      0.84 ±  2%      -0.0        0.81        perf-profile.children.cycles-pp.syscall_exit_to_user_mode_prepare
      0.76            -0.0        0.73        perf-profile.children.cycles-pp._raw_spin_lock_irqsave
      0.65            -0.0        0.62        perf-profile.children.cycles-pp.rwsem_mark_wake
      0.81            -0.0        0.79        perf-profile.children.cycles-pp.try_to_wake_up
      5.70            +0.2        5.92        perf-profile.children.cycles-pp.syscall
      2.03 ±  2%      +0.3        2.29        perf-profile.children.cycles-pp.__fdget
      4.83            +0.3        5.10        perf-profile.children.cycles-pp.__x64_sys_fallocate
     10.18            +0.3       10.52        perf-profile.children.cycles-pp.fallocate64
      0.00            +1.1        1.12        perf-profile.children.cycles-pp.x64_sys_call
      2.68            -0.1        2.55        perf-profile.self.cycles-pp.do_syscall_64
      4.24            -0.1        4.13        perf-profile.self.cycles-pp.fault_in_readable
      2.04            -0.0        1.99        perf-profile.self.cycles-pp.simple_write_end
      4.80            -0.0        4.76        perf-profile.self.cycles-pp.vfs_write
      1.67            -0.0        1.64        perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.66 ±  2%      -0.0        0.63        perf-profile.self.cycles-pp.wake_q_add
      0.72            -0.0        0.70        perf-profile.self.cycles-pp._raw_spin_lock_irqsave
      0.68 ±  3%      -0.0        0.66 ±  2%  perf-profile.self.cycles-pp.syscall_exit_to_user_mode_prepare
      0.50            -0.0        0.48        perf-profile.self.cycles-pp.wake_up_q
      0.59            +0.1        0.66        perf-profile.self.cycles-pp.__x64_sys_fallocate
      0.96            +0.1        1.05        perf-profile.self.cycles-pp.__fdget_pos
      0.57            +0.1        0.68        perf-profile.self.cycles-pp.__x64_sys_pwrite64
      1.87 ±  2%      +0.3        2.14        perf-profile.self.cycles-pp.__fdget
      0.00            +1.0        0.96        perf-profile.self.cycles-pp.x64_sys_call





Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki


Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ