Message-ID: <20211216084755.GA10708@xsang-OptiPlex-9020>
Date:   Thu, 16 Dec 2021 16:47:55 +0800
From:   kernel test robot <oliver.sang@...el.com>
To:     Tom Zanussi <zanussi@...nel.org>
Cc:     Steven Rostedt <rostedt@...dmis.org>,
        LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
        lkp@...el.com, ying.huang@...el.com, feng.tang@...el.com,
        zhengjun.xing@...ux.intel.com, fengwei.yin@...el.com
Subject: [tracing]  85c62c8c37:  unixbench.score 2.4% improvement



Greetings,

FYI, we noticed a 2.4% improvement in unixbench.score due to commit:


commit: 85c62c8c3749eec02ba81217bdcac26867dc262e ("tracing: Have existing event_command.parse() implementations use helpers")
https://git.kernel.org/cgit/linux/kernel/git/rostedt/linux-trace.git ftrace/core

in testcase: unixbench
on test machine: 128 threads 2 sockets Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz with 256G memory
with following parameters:

	runtime: 300s
	nr_task: 30%
	test: pipe
	cpufreq_governor: performance
	ucode: 0xd000280

test-description: UnixBench is the original BYTE UNIX benchmark suite, which aims to test the performance of Unix-like systems.
test-url: https://github.com/kdlucas/byte-unixbench





Details are as below:
-------------------------------------------------------------------------------------------------->


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        sudo bin/lkp install job.yaml           # job file is attached in this email
        bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
        sudo bin/lkp run generated-yaml-file

        # if you come across any failure that blocks the test,
        # please remove the ~/.lkp and /lkp directories to run from a clean state.

=========================================================================================
compiler/cpufreq_governor/kconfig/nr_task/rootfs/runtime/tbox_group/test/testcase/ucode:
  gcc-9/performance/x86_64-rhel-8.3/30%/debian-10.4-x86_64-20200603.cgz/300s/lkp-icl-2sp2/pipe/unixbench/0xd000280

commit: 
  70b3c6b691 ("tracing: Add helper functions to simplify event_command.parse() callback handling")
  85c62c8c37 ("tracing: Have existing event_command.parse() implementations use helpers")

70b3c6b69157039d 85c62c8c3749eec02ba81217bdc 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
     60620            +2.4%      62065        unixbench.score
 2.947e+10            +2.3%  3.016e+10        unixbench.workload
     36554 ±  7%     -10.8%      32602 ±  4%  softirqs.CPU123.SCHED
     46257 ±  4%     -10.3%      41506 ±  7%  softirqs.CPU7.SCHED
     32380 ± 12%     +14.7%      37134 ±  6%  softirqs.CPU81.SCHED
 2.794e+10            +2.3%  2.858e+10        perf-stat.i.branch-instructions
  13343338           -23.8%   10167419 ±  2%  perf-stat.i.branch-misses
    662823           -17.5%     546757        perf-stat.i.cache-misses
   7192051           -20.0%    5751089 ±  2%  perf-stat.i.cache-references
    129484           +35.1%     174921        perf-stat.i.cycles-between-cache-misses
      0.00 ±  2%      -0.0        0.00 ±  2%  perf-stat.i.dTLB-load-miss-rate%
    665427           -46.7%     354693        perf-stat.i.dTLB-load-misses
 4.435e+10            +2.3%  4.537e+10        perf-stat.i.dTLB-loads
 2.665e+10            +2.3%  2.726e+10        perf-stat.i.dTLB-stores
 1.438e+11            +2.3%  1.471e+11        perf-stat.i.instructions
      1.58            +2.0%       1.62        perf-stat.i.ipc
     59.02           -20.2%      47.12 ±  2%  perf-stat.i.metric.K/sec
    772.85            +2.3%     790.56        perf-stat.i.metric.M/sec
     89.73            -2.2       87.53        perf-stat.i.node-load-miss-rate%
    128386           -21.9%     100239 ±  2%  perf-stat.i.node-load-misses
     88458 ±  5%     -34.6%      57889 ±  8%  perf-stat.i.node-store-misses
      0.05           -21.8%       0.04 ±  2%  perf-stat.overall.MPKI
      0.05            -0.0        0.04 ±  2%  perf-stat.overall.branch-miss-rate%
      0.54            -2.3%       0.53        perf-stat.overall.cpi
    117918           +21.1%     142796        perf-stat.overall.cycles-between-cache-misses
      0.00            -0.0        0.00        perf-stat.overall.dTLB-load-miss-rate%
      1.84            +2.4%       1.89        perf-stat.overall.ipc
     86.82            -3.5       83.34        perf-stat.overall.node-load-miss-rate%
     48.41 ±  6%      -9.0       39.39 ±  8%  perf-stat.overall.node-store-miss-rate%
 2.794e+10            +2.3%  2.857e+10        perf-stat.ps.branch-instructions
  13329890           -23.8%   10154768 ±  2%  perf-stat.ps.branch-misses
    661626           -17.5%     545780        perf-stat.ps.cache-misses
   7176900           -20.1%    5735850 ±  2%  perf-stat.ps.cache-references
    665319           -46.7%     354481        perf-stat.ps.dTLB-load-misses
 4.435e+10            +2.3%  4.535e+10        perf-stat.ps.dTLB-loads
 2.665e+10            +2.3%  2.725e+10        perf-stat.ps.dTLB-stores
 1.438e+11            +2.3%   1.47e+11        perf-stat.ps.instructions
    128207           -21.9%     100146 ±  2%  perf-stat.ps.node-load-misses
     88398 ±  5%     -34.6%      57822 ±  8%  perf-stat.ps.node-store-misses
 5.652e+13            +2.3%  5.785e+13        perf-stat.total.instructions
      3.19            -0.3        2.86        perf-profile.calltrace.cycles-pp.security_file_permission.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe
      2.91            -0.3        2.62        perf-profile.calltrace.cycles-pp.security_file_permission.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
      6.37            -0.2        6.13        perf-profile.calltrace.cycles-pp.__entry_text_start.read
     33.05            -0.2       32.82        perf-profile.calltrace.cycles-pp.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
      1.86 ±  2%      -0.2        1.66        perf-profile.calltrace.cycles-pp.common_file_perm.security_file_permission.vfs_read.ksys_read.do_syscall_64
      1.84            -0.2        1.66        perf-profile.calltrace.cycles-pp.common_file_perm.security_file_permission.vfs_write.ksys_write.do_syscall_64
      1.63            -0.1        1.53 ±  2%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irq.pipe_write.new_sync_write.vfs_write.ksys_write
      1.99            -0.1        1.91        perf-profile.calltrace.cycles-pp.main
      0.85 ±  2%      -0.1        0.77 ±  2%  perf-profile.calltrace.cycles-pp.__wake_up_common.__wake_up_common_lock.pipe_write.new_sync_write.vfs_write
      1.26            -0.0        1.22 ±  2%  perf-profile.calltrace.cycles-pp.current_time.file_update_time.pipe_write.new_sync_write.vfs_write
      1.12            +0.0        1.16        perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
      1.73            +0.1        1.80 ±  2%  perf-profile.calltrace.cycles-pp.__fget_light.__fdget_pos.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
      2.91            +0.1        2.98        perf-profile.calltrace.cycles-pp.mutex_lock.pipe_read.new_sync_read.vfs_read.ksys_read
      4.16            +0.1        4.23        perf-profile.calltrace.cycles-pp.atime_needs_update.touch_atime.pipe_read.new_sync_read.vfs_read
      2.02            +0.1        2.10        perf-profile.calltrace.cycles-pp.__fdget_pos.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
      4.89            +0.1        4.99        perf-profile.calltrace.cycles-pp.touch_atime.pipe_read.new_sync_read.vfs_read.ksys_read
      2.42            +0.1        2.53        perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__wake_up_common_lock.pipe_write.new_sync_write.vfs_write
      4.50            +0.1        4.63        perf-profile.calltrace.cycles-pp.__wake_up_common_lock.pipe_write.new_sync_write.vfs_write.ksys_write
     37.34            +0.3       37.63        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
     38.45            +0.3       38.76        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.read
     24.57            +0.4       24.93        perf-profile.calltrace.cycles-pp.pipe_read.new_sync_read.vfs_read.ksys_read.do_syscall_64
     26.85            +0.4       27.25        perf-profile.calltrace.cycles-pp.new_sync_read.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.00            +0.5        0.52        perf-profile.calltrace.cycles-pp._raw_spin_unlock_irqrestore.__wake_up_common_lock.pipe_write.new_sync_write.vfs_write
      6.40            -0.6        5.79        perf-profile.children.cycles-pp.security_file_permission
      4.12            -0.4        3.76        perf-profile.children.cycles-pp.common_file_perm
     50.62            -0.3       50.34        perf-profile.children.cycles-pp.write
     33.42            -0.2       33.22        perf-profile.children.cycles-pp.vfs_write
      7.06            -0.2        6.85        perf-profile.children.cycles-pp.__entry_text_start
      0.90 ±  2%      -0.1        0.81        perf-profile.children.cycles-pp.__wake_up_common
      2.14            -0.1        2.06        perf-profile.children.cycles-pp.main
      1.52            -0.1        1.46        perf-profile.children.cycles-pp.__cond_resched
      1.09            -0.0        1.05        perf-profile.children.cycles-pp.entry_SYSCALL_64_safe_stack
      0.13 ±  5%      -0.0        0.11        perf-profile.children.cycles-pp.write@plt
      1.20            +0.0        1.23        perf-profile.children.cycles-pp.exit_to_user_mode_prepare
      0.56            +0.0        0.61        perf-profile.children.cycles-pp._raw_spin_unlock_irqrestore
      2.67            +0.1        2.73        perf-profile.children.cycles-pp.mutex_unlock
      2.70            +0.1        2.77        perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      4.58            +0.1        4.68        perf-profile.children.cycles-pp.atime_needs_update
      5.11            +0.1        5.22        perf-profile.children.cycles-pp.touch_atime
      2.54            +0.1        2.66        perf-profile.children.cycles-pp._raw_spin_lock_irqsave
      6.30            +0.1        6.41        perf-profile.children.cycles-pp.mutex_lock
      4.87            +0.1        5.00        perf-profile.children.cycles-pp.__wake_up_common_lock
     25.30            +0.4       25.65        perf-profile.children.cycles-pp.pipe_read
     27.07            +0.4       27.48        perf-profile.children.cycles-pp.new_sync_read
      2.44 ±  2%      -0.4        2.08        perf-profile.self.cycles-pp.common_file_perm
      1.98 ±  2%      -0.3        1.72        perf-profile.self.cycles-pp.security_file_permission
      3.06            -0.1        2.95        perf-profile.self.cycles-pp.write
      1.62            -0.1        1.53 ±  2%  perf-profile.self.cycles-pp.__entry_text_start
      3.89            -0.1        3.79        perf-profile.self.cycles-pp.pipe_write
      1.99            -0.1        1.91        perf-profile.self.cycles-pp.main
      1.72            -0.1        1.65        perf-profile.self.cycles-pp.copy_page_from_iter
      2.22            -0.1        2.17        perf-profile.self.cycles-pp.copy_page_to_iter
      0.78 ±  2%      -0.0        0.73        perf-profile.self.cycles-pp.__wake_up_common
      0.32 ±  3%      -0.0        0.29 ±  4%  perf-profile.self.cycles-pp.__might_fault
      0.22 ±  3%      +0.0        0.24        perf-profile.self.cycles-pp.make_kgid
      1.56            +0.0        1.60        perf-profile.self.cycles-pp.new_sync_write
      1.92            +0.0        1.97        perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      1.54            +0.1        1.60        perf-profile.self.cycles-pp.new_sync_read
      3.36            +0.1        3.46        perf-profile.self.cycles-pp.mutex_lock
      2.46            +0.1        2.57        perf-profile.self.cycles-pp._raw_spin_lock_irqsave
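
As a sanity check on how to read the comparison table above: the %change column appears to be the relative difference between the two commit columns, while rows whose metric is itself a percentage (names ending in "%", and the perf-profile rows) appear to show an absolute difference in points. The Python sketch below is not part of the lkp-tests tooling; the function names are made up for illustration, and the input values are copied from the table.

```python
def pct_change(base, new):
    """Relative change from the parent commit (base) to the tested commit (new), in percent."""
    return (new - base) / base * 100.0


def point_diff(base, new):
    """Absolute difference, as the table seems to use for rows already expressed in percent."""
    return new - base


# (parent commit 70b3c6b691, tested commit 85c62c8c37), values copied from the table
relative_rows = {
    "unixbench.score": (60620, 62065),
    "perf-stat.i.branch-misses": (13343338, 10167419),
    "perf-stat.i.dTLB-load-misses": (665427, 354693),
    "perf-stat.i.cycles-between-cache-misses": (129484, 174921),
}
absolute_rows = {
    "perf-stat.i.node-load-miss-rate%": (89.73, 87.53),
    "perf-profile.children.cycles-pp.security_file_permission": (6.40, 5.79),
}

if __name__ == "__main__":
    for name, (base, new) in relative_rows.items():
        print(f"{pct_change(base, new):+6.1f}%  {name}")
    for name, (base, new) in absolute_rows.items():
        print(f"{point_diff(base, new):+6.1f}   {name}")
    # perf-stat.overall.cpi should be roughly the reciprocal of perf-stat.overall.ipc
    for ipc in (1.84, 1.89):
        print(f"ipc {ipc:.2f} -> cpi {1.0 / ipc:.2f}")
```

The printed values can be compared against the %change column by eye; small mismatches in the last digit are possible because the table itself shows rounded numbers.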




Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


---
0DAY/LKP+ Test Infrastructure                   Open Source Technology Center
https://lists.01.org/hyperkitty/list/lkp@lists.01.org       Intel Corporation

Thanks,
Oliver Sang


View attachment "config-5.16.0-rc4-00024-g85c62c8c3749" of type "text/plain" (173551 bytes)

View attachment "job-script" of type "text/plain" (7897 bytes)

View attachment "job.yaml" of type "text/plain" (5364 bytes)

View attachment "reproduce" of type "text/plain" (277 bytes)
