[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <20161127182153.GE2501@yexl-desktop>
Date: Mon, 28 Nov 2016 02:21:53 +0800
From: kernel test robot <xiaolong.ye@...el.com>
To: Aaron Lu <aaron.lu@...el.com>
Cc: Linus Torvalds <torvalds@...ux-foundation.org>,
Dave Hansen <dave.hansen@...el.com>,
LKML <linux-kernel@...r.kernel.org>, lkp@...org
Subject: [lkp] [mremap] 5d1904204c: will-it-scale.per_thread_ops -13.1%
regression
Greeting,
FYI, we noticed a -13.1% regression of will-it-scale.per_thread_ops due to commit:
commit 5d1904204c99596b50a700f092fe49d78edba400 ("mremap: fix race between mremap() and page cleanning")
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git master
in testcase: will-it-scale
on test machine: 12 threads Intel(R) Core(TM) i7 CPU X 980 @ 3.33GHz with 6G memory
with following parameters:
test: eventfd1
cpufreq_governor: performance
test-description: Will It Scale takes a testcase and runs it from 1 through to n parallel copies to see if the testcase will scale. It builds both a process and threads based test in order to see any differences between the two.
test-url: https://github.com/antonblanchard/will-it-scale
Details are as below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone git://git.kernel.org/pub/scm/linux/kernel/git/wfg/lkp-tests.git
cd lkp-tests
bin/lkp install job.yaml # job file is attached in this email
bin/lkp run job.yaml
=========================================================================================
compiler/cpufreq_governor/kconfig/rootfs/tbox_group/test/testcase:
gcc-6/performance/x86_64-rhel-7.2/debian-x86_64-2016-08-31.cgz/wsm/eventfd1/will-it-scale
commit:
961b708e95 (" fixes for amdgpu, and a bunch of arm drivers.")
5d1904204c ("mremap: fix race between mremap() and page cleanning")
961b708e95181041 5d1904204c99596b50a700f092
---------------- --------------------------
fail:runs %reproduction fail:runs
| | |
%stddev %change %stddev
\ | \
2459656 ± 0% -13.1% 2137017 ± 1% will-it-scale.per_thread_ops
2865527 ± 3% +4.2% 2986100 ± 0% will-it-scale.per_process_ops
0.62 ± 11% -13.2% 0.54 ± 1% will-it-scale.scalability
893.40 ± 0% +1.3% 905.24 ± 0% will-it-scale.time.system_time
169.92 ± 0% -7.0% 158.09 ± 0% will-it-scale.time.user_time
176943 ± 6% +26.1% 223131 ± 11% cpuidle.C1E-NHM.time
10.00 ± 6% -10.9% 8.91 ± 4% turbostat.CPU%c6
30508 ± 1% +3.4% 31541 ± 0% vmstat.system.cs
27239 ± 0% +1.5% 27650 ± 0% vmstat.system.in
2.03 ± 2% -11.6% 1.80 ± 6% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64
4.11 ± 1% -12.0% 3.61 ± 4% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_swapgs
1.70 ± 3% -13.8% 1.46 ± 5% perf-profile.children.cycles-pp.__fget_light
2.03 ± 2% -11.6% 1.80 ± 6% perf-profile.children.cycles-pp.entry_SYSCALL_64
4.11 ± 1% -12.0% 3.61 ± 4% perf-profile.children.cycles-pp.entry_SYSCALL_64_after_swapgs
12.79 ± 1% -10.0% 11.50 ± 6% perf-profile.children.cycles-pp.selinux_file_permission
1.70 ± 3% -13.8% 1.46 ± 5% perf-profile.self.cycles-pp.__fget_light
2.03 ± 2% -11.6% 1.80 ± 6% perf-profile.self.cycles-pp.entry_SYSCALL_64
4.11 ± 1% -12.0% 3.61 ± 4% perf-profile.self.cycles-pp.entry_SYSCALL_64_after_swapgs
5.85 ± 2% -12.5% 5.12 ± 5% perf-profile.self.cycles-pp.selinux_file_permission
1.472e+12 ± 0% -5.5% 1.392e+12 ± 0% perf-stat.branch-instructions
0.89 ± 0% -6.0% 0.83 ± 0% perf-stat.branch-miss-rate%
1.303e+10 ± 0% -11.1% 1.158e+10 ± 0% perf-stat.branch-misses
5.534e+08 ± 4% -6.9% 5.151e+08 ± 1% perf-stat.cache-references
9347877 ± 1% +3.4% 9663609 ± 0% perf-stat.context-switches
2.298e+12 ± 0% -5.6% 2.168e+12 ± 0% perf-stat.dTLB-loads
1.525e+12 ± 1% -5.4% 1.442e+12 ± 0% perf-stat.dTLB-stores
7.795e+12 ± 0% -5.5% 7.363e+12 ± 0% perf-stat.iTLB-loads
6.694e+12 ± 1% -4.5% 6.391e+12 ± 2% perf-stat.instructions
0.93 ± 0% -5.5% 0.88 ± 0% perf-stat.ipc
119024 ± 5% -11.3% 105523 ± 8% sched_debug.cfs_rq:/.exec_clock.max
5933459 ± 19% +24.5% 7385120 ± 3% sched_debug.cpu.nr_switches.max
1684848 ± 15% +20.6% 2032107 ± 3% sched_debug.cpu.nr_switches.stddev
5929704 ± 19% +24.5% 7382036 ± 3% sched_debug.cpu.sched_count.max
1684318 ± 15% +20.6% 2031701 ± 3% sched_debug.cpu.sched_count.stddev
2826278 ± 18% +30.4% 3684493 ± 3% sched_debug.cpu.sched_goidle.max
804195 ± 14% +26.2% 1014783 ± 3% sched_debug.cpu.sched_goidle.stddev
2969365 ± 19% +24.3% 3692180 ± 3% sched_debug.cpu.ttwu_count.max
843614 ± 15% +20.5% 1016263 ± 3% sched_debug.cpu.ttwu_count.stddev
2963657 ± 19% +24.4% 3687897 ± 3% sched_debug.cpu.ttwu_local.max
843104 ± 15% +20.5% 1016333 ± 3% sched_debug.cpu.ttwu_local.stddev
will-it-scale.time.user_time
172 ++--------------------*--------*---*----------------------------------+
170 ++..*....*...*....*. *.. . ..*.... ..*...*.... |
*. *....*. *. *...*
168 ++ |
166 ++ |
| |
164 ++ |
162 ++ |
160 ++ |
| O O |
158 ++ O O O |
156 ++ O |
O O |
154 ++ O O |
152 ++--O-----------------------------------------------------------------+
will-it-scale.time.system_time
912 ++--------------------------------------------------------------------+
910 ++ O O |
O O O |
908 ++ O |
906 ++ O O O |
904 ++ O O |
902 ++ |
| |
900 ++ |
898 ++ |
896 ++ |
894 ++ ..*
*...*.... ..*.... ..*....*...*....*...*...*....*. |
892 ++ *. *...*...*....*...*. |
890 ++--------------------------------------------------------------------+
will-it-scale.per_thread_ops
2.55e+06 ++---------------------------------------------------------------+
2.5e+06 ++ .*...*.. .*...*... ..*. .*.. |
|.. . .. *...*....*. .. .*... .. . |
2.45e+06 *+ * .. * *...*...*
2.4e+06 ++ * |
2.35e+06 ++ |
2.3e+06 ++ |
| |
2.25e+06 ++ |
2.2e+06 ++ O |
2.15e+06 ++ O O O O |
2.1e+06 ++ O O |
O O O |
2.05e+06 ++ O |
2e+06 ++---------------------------------------------------------------+
[*] bisect-good sample
[O] bisect-bad sample
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
Thanks,
Xiaolong
View attachment "config-4.9.0-rc5-00214-g5d19042" of type "text/plain" (153615 bytes)
View attachment "job-script" of type "text/plain" (6498 bytes)
View attachment "job.yaml" of type "text/plain" (4159 bytes)
View attachment "reproduce" of type "text/plain" (142 bytes)
Powered by blists - more mailing lists