[<prev] [next>] [day] [month] [year] [list]
Message-ID: <20170206030246.GC18735@yexl-desktop>
Date: Mon, 6 Feb 2017 11:02:46 +0800
From: kernel test robot <xiaolong.ye@...el.com>
To: Jaegeuk Kim <jaegeuk@...nel.org>
Cc: Kinglong Mee <kinglongmee@...il.com>, Chao Yu <yuchao0@...wei.com>,
LKML <linux-kernel@...r.kernel.org>,
Stephen Rothwell <sfr@...b.auug.org.au>, lkp@...org
Subject: [f2fs] eea944687a: -8% regression of aim7.jobs-per-min
Greeting,
FYI, we noticed a -8% regression of aim7.jobs-per-min due to commit:
commit: eea944687acd9061a1e75e3dbe6823dfd2c4034b ("f2fs: drop exist_data for inline_data when truncated to 0")
https://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git master
in testcase: aim7
on test machine: 40 threads Intel(R) Xeon(R) CPU E5-2690 v2 @ 3.00GHz with 384G memory
with following parameters:
disk: 1BRD_48G
fs: f2fs
test: creat-clo
load: 1500
cpufreq_governor: performance
test-description: AIM7 is a traditional UNIX system level benchmark suite which is used to test and measure the performance of multiuser system.
test-url: https://sourceforge.net/projects/aimbench/files/aim-suite7/
Details are as below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone git://git.kernel.org/pub/scm/linux/kernel/git/wfg/lkp-tests.git
cd lkp-tests
bin/lkp install job.yaml # job file is attached in this email
bin/lkp run job.yaml
testcase/path_params/tbox_group/run: aim7/1BRD_48G-f2fs-creat-clo-1500-performance/lkp-ivb-ep01
fb7d218589a77a55 eea944687acd9061a1e75e3dbe
---------------- --------------------------
%stddev change %stddev
\ | \
68975 ± 3% -8% 63748 aim7.jobs-per-min
225590 ± 4% 116% 488016 ± 6% aim7.time.involuntary_context_switches
4549 ± 3% 13% 5122 aim7.time.system_time
130 ± 3% 8% 141 aim7.time.elapsed_time
130 ± 3% 8% 141 aim7.time.elapsed_time.max
686619 ± 13% -36% 441007 ± 5% aim7.time.voluntary_context_switches
2610 7% 2780 turbostat.Avg_MHz
127 5% 133 turbostat.CorWatt
154 4% 161 turbostat.PkgWatt
87.32 4% 91.04 turbostat.%Busy
6237 ± 70% -6e+03 289 ±120% latency_stats.avg.build_free_nids.[f2fs].f2fs_balance_fs_bg.[f2fs].f2fs_balance_fs.[f2fs].f2fs_setattr.[f2fs].notify_change.do_truncate.path_openat.do_filp_open.do_sys_open.SyS_creat.entry_SYSCALL_64_fastpath
22719 ± 64% -2e+04 452 ±110% latency_stats.max.build_free_nids.[f2fs].f2fs_balance_fs_bg.[f2fs].f2fs_balance_fs.[f2fs].f2fs_setattr.[f2fs].notify_change.do_truncate.path_openat.do_filp_open.do_sys_open.SyS_creat.entry_SYSCALL_64_fastpath
246491 ± 71% -2e+05 36668 ±166% latency_stats.sum.build_free_nids.[f2fs].f2fs_balance_fs_bg.[f2fs].f2fs_balance_fs.[f2fs].f2fs_setattr.[f2fs].notify_change.do_truncate.path_openat.do_filp_open.do_sys_open.SyS_creat.entry_SYSCALL_64_fastpath
1.045e+08 ± 9% -8e+07 26256181 ± 15% latency_stats.sum.call_rwsem_down_write_failed.path_openat.do_filp_open.do_sys_open.SyS_creat.do_syscall_64.return_from_SYSCALL_64
0.19 ± 3% 26% 0.24 perf-stat.branch-miss-rate%
1.375e+13 ± 3% 15% 1.575e+13 perf-stat.cpu-cycles
1.187e+09 ± 3% 8% 1.284e+09 perf-stat.node-store-misses
2.348e+09 ± 3% 7% 2.517e+09 perf-stat.branch-misses
516928 ± 7% 7% 553150 perf-stat.minor-faults
516930 ± 7% 7% 553090 perf-stat.page-faults
3.131e+11 4% 3.254e+11 perf-stat.dTLB-stores
42.60 3% 43.92 perf-stat.node-store-miss-rate%
45.70 44.95 perf-stat.node-load-miss-rate%
26.01 -5% 24.70 perf-stat.cache-miss-rate%
1.213e+12 ± 3% -15% 1.029e+12 perf-stat.branch-instructions
5.911e+12 ± 3% -21% 4.66e+12 perf-stat.instructions
1.589e+12 ± 3% -22% 1.236e+12 perf-stat.dTLB-loads
182459 ± 6% -28% 130643 ± 12% perf-stat.instructions-per-iTLB-miss
0.43 -31% 0.30 ± 3% perf-stat.ipc
perf-stat.cpu-cycles
1.65e+13 ++---------------------------------------------------------------+
| |
1.6e+13 O+ O O O O O O
1.55e+13 ++ O O O O O O O O O O O O O O O O O O O O |
| |
1.5e+13 ++ |
| |
1.45e+13 ++ .* |
| .*. *. : *..*. .*. *.. |
1.4e+13 ++ *.* *. * : : : *. *..* *.* : *.*..* |
1.35e+13 ++: : : + : : : : + : : :|
| : : : + : : : : + : : :|
1.3e+13 ++ : : * *..* * :: *
* * * |
1.25e+13 ++---------------------------------------------------------------+
perf-stat.node-store-miss-rate_
44.4 ++-----------------O-------------------------------------------------+
44.2 O+ O O |
| O O O O
44 ++ O O O O O O O O O O O O O O |
43.8 ++ O O O O O |
| |
43.6 ++ |
43.4 ++ |
43.2 ++ |
| *. |
43 ++ .*.*.. .. *.. |
42.8 ++ .*.. *..*. * *.. .* |
*..* + .*.. .*.. *..*. + .*..|
42.6 ++ *..* * * *.. + *..*..* *
42.4 ++----------------------------------------------*--------------------+
perf-stat.ipc
0.44 ++-----------*--*----------------------------*-----------------------+
| .*.*..*..* .*..*.. *..*.*..*.*. *.*..*..*.*..*..*.*..*
0.42 *+ * *. .. |
0.4 ++ * |
| |
0.38 ++ |
| |
0.36 ++ |
| |
0.34 ++ |
0.32 ++ |
| O O O O O O O O |
0.3 ++ O O O O O |
| O O O O O O O O O O O O
0.28 O+--------------------------------------------------------O----------+
aim7.jobs-per-min
74000 *+---------------------------*--------------------------------------+
|: * * *. : * * *
72000 ++ : :: : : :: : |
| : : : : : : : : : : : :|
| : : : : : : : : : : : :|
70000 ++ : : : : : : : : : : : : |
| * : : : : : : : : : : : |
68000 ++ +: *..*..* *..* *.. : *.* : .*.: |
| * *..*.*..*.* :.*. * |
66000 ++ * |
| |
| O O O O |
64000 ++ O O O O O O O O O O O O O O O O O |
O O O O O O
62000 ++------------------------------------------------------------------+
aim7.time.system_time
5300 ++-------------------------------------------------------------------+
5200 O+ O O |
| O O O O O O O O O O O O O O O O O
5100 ++ O O O O O O O |
5000 ++ |
4900 ++ |
4800 ++ |
| *..* *.. |
4700 ++ * .*..* : : : *.*..*.*..* *.. .*.* |
4600 ++ + : * : : : : : *..* : *. : |
4500 ++ * : : : : : : : : : : : |
4400 +++ : : :: : : : : : : :|
|+ : : * *.* : : :: :|
4300 *+ * * * *
4200 ++-------------------------------------------------------------------+
aim7.time.elapsed_time
145 ++--------------------------------------------------------------------+
O O O O O O
| O O O O O O O O O O O O O O O O O |
140 ++ O O O O |
| |
| *.. |
135 ++ * .*..*.*..*..* : *.. .* |
| + : *.*..* *.* *. : *..* : * : |
130 ++ * : : : : : : : : : : : |
| : : : : : : : : : : : :|
| : : : : : : : : : : : :|
125 ++ :: : : : : :: :: |
|: * * *..: * * *
* * |
120 ++--------------------------------------------------------------------+
aim7.time.elapsed_time.max
145 ++--------------------------------------------------------------------+
O O O O O O
| O O O O O O O O O O O O O O O O O |
140 ++ O O O O |
| |
| *.. |
135 ++ * .*..*.*..*..* : *.. .* |
| + : *.*..* *.* *. : *..* : * : |
130 ++ * : : : : : : : : : : : |
| : : : : : : : : : : : :|
| : : : : : : : : : : : :|
125 ++ :: : : : : :: :: |
|: * * *..: * * *
* * |
120 ++--------------------------------------------------------------------+
aim7.time.involuntary_context_switches
600000 ++-----------------------------------------------------------------+
O |
550000 ++ O |
500000 ++ O O O O O O O O O
| O O O O |
450000 ++ O O O O O O O O O O |
400000 ++ O O |
| |
350000 ++ |
300000 ++ |
| *.. *. .*.. |
250000 ++ + .*.*.. .. *..* *..*. .*. .*. .*.. .*. .*. |
200000 *+.* *.*. * *. *. *. *.*. *. *..*
| |
150000 ++-----------------------------------------------------------------+
[*] bisect-good sample
[O] bisect-bad sample
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
Thanks,
Xiaolong
View attachment "config-4.10.0-rc2-00118-geea9446" of type "text/plain" (155593 bytes)
View attachment "job-script" of type "text/plain" (6710 bytes)
View attachment "job.yaml" of type "text/plain" (4368 bytes)
View attachment "reproduce" of type "text/plain" (529 bytes)
Powered by blists - more mailing lists