[<prev] [next>] [day] [month] [year] [list]
Message-ID: <fba4a3d6-51fb-397a-0235-87fd60b2c437@intel.com>
Date: Mon, 22 Aug 2022 10:10:25 +0800
From: kernel test robot <yujie.liu@...el.com>
To: Al Viro <viro@...iv.linux.org.uk>
CC: <lkp@...ts.01.org>, kernel test robot <lkp@...el.com>,
"Christian Brauner (Microsoft)" <brauner@...nel.org>,
LKML <linux-kernel@...r.kernel.org>,
Linux Kernel Mailing List <linux-kernel@...r.kernel.org>,
<linux-nvme@...ts.infradead.org>, <linux-aio@...ck.org>,
<linux-fsdevel@...r.kernel.org>, Huang Ying <ying.huang@...el.com>,
<feng.tang@...el.com>, <zhengjun.xing@...ux.intel.com>,
<fengwei.yin@...el.com>
Subject: 164f4064ca: unixbench.score 1.4% improvement
Greeting,
FYI, we noticed a 1.4% improvement of unixbench.score due to commit:
commit: 164f4064ca81eefcea29f7f5dcf394f92be1d0c0 ("keep iocb_flags() result cached in struct file")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
in testcase: unixbench
on test machine: 128 threads 2 sockets Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz with 256G memory
with following parameters:
runtime: 300s
nr_task: 1
test: fstime
cpufreq_governor: performance
ucode: 0xd000363
test-description: UnixBench is the original BYTE UNIX benchmark suite aims to test performance of Unix-like system.
test-url: https://github.com/kdlucas/byte-unixbench
In addition to that, the commit also has impact on the following tests:
+------------------+---------------------------------------------------------------------------------+
| testcase: change | unixbench: unixbench.score 2.1% improvement |
| test machine | 128 threads 2 sockets Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz with 256G memory |
| test parameters | cpufreq_governor=performance |
| | nr_task=1 |
| | runtime=300s |
| | test=fsbuffer |
| | ucode=0xd000363 |
+------------------+---------------------------------------------------------------------------------+
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_task/rootfs/runtime/tbox_group/test/testcase/ucode:
gcc-11/performance/x86_64-rhel-8.3/1/debian-11.1-x86_64-20220510.cgz/300s/lkp-icl-2sp2/fstime/unixbench/0xd000363
commit:
91b94c5d6a ("iocb: delay evaluation of IS_SYNC(...) until we want to check IOCB_DSYNC")
164f4064ca ("keep iocb_flags() result cached in struct file")
91b94c5d6ae55d11 164f4064ca81eefcea29f7f5dcf
---------------- ---------------------------
%stddev %change %stddev
\ | \
3500 +1.4% 3548 unixbench.score
6.657e+08 +1.4% 6.747e+08 unixbench.workload
0.02 ±118% +0.0 0.07 ± 17% perf-profile.self.cycles-pp.rcu_eqs_enter
39089557 +1.8% 39808281 perf-stat.i.cache-references
39018465 +1.8% 39737152 perf-stat.ps.cache-references
***************************************************************************************************
lkp-icl-2sp2: 128 threads 2 sockets Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz with 256G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_task/rootfs/runtime/tbox_group/test/testcase/ucode:
gcc-11/performance/x86_64-rhel-8.3/1/debian-11.1-x86_64-20220510.cgz/300s/lkp-icl-2sp2/fsbuffer/unixbench/0xd000363
commit:
91b94c5d6a ("iocb: delay evaluation of IS_SYNC(...) until we want to check IOCB_DSYNC")
164f4064ca ("keep iocb_flags() result cached in struct file")
91b94c5d6ae55d11 164f4064ca81eefcea29f7f5dcf
---------------- ---------------------------
%stddev %change %stddev
\ | \
2322 +2.1% 2370 unixbench.score
1.845e+08 +2.1% 1.883e+08 unixbench.workload
0.03 ± 3% +0.0 0.04 ± 12% mpstat.cpu.all.soft%
45.67 ± 56% +7248.9% 3356 ± 98% proc-vmstat.numa_hint_faults_local
1.022e+09 +1.3% 1.036e+09 perf-stat.i.branch-instructions
1.455e+09 +1.6% 1.478e+09 perf-stat.i.dTLB-loads
7.985e+08 +1.8% 8.125e+08 perf-stat.i.dTLB-stores
25.59 +1.6% 25.99 perf-stat.i.metric.M/sec
12988 -1.4% 12803 perf-stat.overall.path-length
1.02e+09 +1.3% 1.034e+09 perf-stat.ps.branch-instructions
1.452e+09 +1.6% 1.475e+09 perf-stat.ps.dTLB-loads
7.971e+08 +1.7% 8.109e+08 perf-stat.ps.dTLB-stores
0.25 ± 12% -0.1 0.15 ± 15% perf-profile.children.cycles-pp.error_entry
0.52 ± 6% -0.1 0.43 ± 8% perf-profile.children.cycles-pp.update_sd_lb_stats
0.54 ± 8% -0.1 0.46 ± 9% perf-profile.children.cycles-pp.find_busiest_group
0.09 ± 18% -0.0 0.05 ± 47% perf-profile.children.cycles-pp.trigger_load_balance
0.09 ± 21% -0.0 0.06 ± 46% perf-profile.children.cycles-pp.update_group_capacity
0.11 ± 9% +0.0 0.15 ± 17% perf-profile.children.cycles-pp.cpuidle_governor_latency_req
0.02 ±142% +0.0 0.07 ± 26% perf-profile.children.cycles-pp.cpuidle_reflect
0.05 ± 50% +0.1 0.10 ± 36% perf-profile.children.cycles-pp.tick_program_event
0.02 ±141% +0.1 0.09 ± 11% perf-profile.children.cycles-pp.rcu_eqs_enter
0.72 ± 7% +0.1 0.80 ± 5% perf-profile.children.cycles-pp.native_sched_clock
0.89 ± 6% +0.1 1.00 ± 4% perf-profile.children.cycles-pp.sched_clock_cpu
0.59 ± 6% +0.1 0.70 ± 14% perf-profile.children.cycles-pp._raw_spin_lock_irqsave
0.23 ± 10% -0.1 0.13 ± 14% perf-profile.self.cycles-pp.error_entry
0.09 ± 21% -0.0 0.05 ± 46% perf-profile.self.cycles-pp.update_group_capacity
0.04 ± 45% +0.0 0.06 ± 7% perf-profile.self.cycles-pp.tick_nohz_tick_stopped
0.06 ± 14% +0.0 0.09 ± 11% perf-profile.self.cycles-pp.cpuidle_enter
0.06 ± 14% +0.0 0.10 ± 18% perf-profile.self.cycles-pp.rcu_idle_exit
0.52 ± 6% +0.0 0.56 ± 3% perf-profile.self.cycles-pp._raw_spin_lock_irqsave
0.28 ± 5% +0.0 0.32 ± 7% perf-profile.self.cycles-pp.irq_enter_rcu
0.09 ± 27% +0.1 0.17 ± 15% perf-profile.self.cycles-pp.acpi_idle_enter
0.01 ±223% +0.1 0.09 ± 36% perf-profile.self.cycles-pp.tick_program_event
0.00 +0.1 0.08 ± 14% perf-profile.self.cycles-pp.rcu_eqs_enter
0.68 ± 7% +0.1 0.78 ± 6% perf-profile.self.cycles-pp.native_sched_clock
0.38 ± 9% +0.1 0.53 ± 13% perf-profile.self.cycles-pp.sysvec_apic_timer_interrupt
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://01.org/lkp
View attachment "config-5.19.0-rc1-00006-g164f4064ca81" of type "text/plain" (170371 bytes)
View attachment "job-script" of type "text/plain" (8302 bytes)
View attachment "job.yaml" of type "text/plain" (5660 bytes)
View attachment "reproduce" of type "text/plain" (290 bytes)
Powered by blists - more mailing lists