Message-ID: <20161227171240.GB10884@yexl-desktop>
Date: Wed, 28 Dec 2016 01:12:40 +0800
From: kernel test robot <xiaolong.ye@...el.com>
To: Thomas Gleixner <tglx@...utronix.de>
Cc: Peter Zijlstra <peterz@...radead.org>,
Sebastian Siewior <bigeasy@...utronix.de>,
LKML <linux-kernel@...r.kernel.org>,
Linus Torvalds <torvalds@...ux-foundation.org>, lkp@...org
Subject: [lkp-developer] [cpu/hotplug] dc280d9362: aim9.add_long.ops_per_sec -6.8% regression
Greetings,
FYI, we noticed a -6.8% regression of aim9.add_long.ops_per_sec due to commit:
commit: dc280d93623927570da279e99393879dbbab39e7 ("cpu/hotplug: Prevent overwriting of callbacks")
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git master
in testcase: aim9
on test machine: 80 threads Intel(R) Xeon(R) CPU E7- 8870 @ 2.40GHz with 128G memory
with the following parameters:
testtime: 300s
test: add_long
cpufreq_governor: performance
test-description: Suite IX is the "AIM Independent Resource Benchmark", the famous synthetic benchmark.
test-url: https://sourceforge.net/projects/aimbench/files/aim-suite9/
Details are below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone git://git.kernel.org/pub/scm/linux/kernel/git/wfg/lkp-tests.git
cd lkp-tests
bin/lkp install job.yaml # job file is attached in this email
bin/lkp run job.yaml
testcase/path_params/tbox_group/run: aim9/300s-add_long-performance/lkp-wsx02
59fefd0890f12716             dc280d93623927570da279e993
----------------             --------------------------
     value %stddev   change       value %stddev  metric
   6372000              -7%     5939700          aim9.add_long.ops_per_sec
    302467                       310046          interrupts.CAL:Function_call_interrupts
     81197                        82290          vmstat.system.in
       252            -100%           0          turbostat.Avg_MHz
     12.01            -100%        0.00          turbostat.%Busy
         0            2e+04       16338 ± 58%    latency_stats.avg.perf_event_alloc.SYSC_perf_event_open.SyS_perf_event_open.entry_SYSCALL_64_fastpath
         0            2e+04       16338 ± 58%    latency_stats.max.perf_event_alloc.SYSC_perf_event_open.SyS_perf_event_open.entry_SYSCALL_64_fastpath
         0            2e+04       16338 ± 58%    latency_stats.sum.perf_event_alloc.SYSC_perf_event_open.SyS_perf_event_open.entry_SYSCALL_64_fastpath
      4622 ±104%     -3e+03        1198          latency_stats.sum.kthread_park.watchdog_park_threads.watchdog_enable_all_cpus.proc_watchdog_update.proc_watchdog_common.proc_nmi_watchdog.proc_sys_call_handler.proc_sys_write.__vfs_write.vfs_write.SyS_write.entry_SYSCALL_64_fastpath
      1.04 ±  3%      1953%       21.38          perf-stat.cache-miss-rate%
     18504              29%       23868 ±  8%    perf-stat.cpu-migrations
   1889012 ± 14%       121%     4170802          perf-stat.node-load-misses
  20313444 ±  3%        83%    37191142          perf-stat.node-stores
  90065697              82%   1.641e+08          perf-stat.cache-misses
      0.41 ±  7%       111%        0.87 ±  8%    perf-stat.ipc
   6119350 ±  7%       158%    15783978          perf-stat.node-store-misses
  33409559              56%    52103995          perf-stat.node-loads
 4.596e+08             -86%    65414036 ±  3%    perf-stat.dTLB-load-misses
 2.003e+10             -14%   1.728e+10          perf-stat.dTLB-stores
 2.335e+11 ±  4%       -77%   5.439e+10          perf-stat.dTLB-loads
  2.36e+12             -35%   1.545e+12          perf-stat.iTLB-loads
 2.328e+12 ±  9%       -40%   1.406e+12 ± 17%    perf-stat.instructions
  3.84e+11             -49%   1.946e+11          perf-stat.branch-instructions
      5.34 ± 13%        39%        7.41          perf-stat.node-load-miss-rate%
 1.929e+08             -51%    94975374          perf-stat.iTLB-load-misses
 1.698e+09             -81%   3.201e+08          perf-stat.branch-misses
     23.14 ±  5%        29%       29.79          perf-stat.node-store-miss-rate%
      0.01             -25%        0.01          perf-stat.iTLB-load-miss-rate%
 8.652e+09             -91%   7.674e+08          perf-stat.cache-references
      0.44             -63%        0.16          perf-stat.branch-miss-rate%
 5.607e+12             -72%   1.596e+12 ±  8%    perf-stat.cpu-cycles
 3.087e+08             -96%    11205248 ±  4%    perf-stat.dTLB-store-misses
      0.20 ±  5%       -39%        0.12 ±  3%    perf-stat.dTLB-load-miss-rate%
      1.52             -96%        0.06 ±  3%    perf-stat.dTLB-store-miss-rate%
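As a sanity check, the headline -6.8% and the perf-stat.cache-miss-rate% row can be recomputed from the raw values in the table above. The following is only a minimal sketch in Python, assuming the numbers are copied by hand from the table; the helper name pct_change is illustrative and is not part of lkp-tests.

```python
def pct_change(base: float, new: float) -> float:
    """Relative change of `new` versus `base`, in percent."""
    return (new - base) / base * 100.0


# (base commit 59fefd0890f12716, tested commit dc280d9362), copied from the table.
ops = (6372000, 5939700)              # aim9.add_long.ops_per_sec
cache_misses = (90065697, 1.641e8)    # perf-stat.cache-misses
cache_refs = (8.652e9, 7.674e8)       # perf-stat.cache-references

ops_change = pct_change(*ops)

# Miss rate in percent = misses / references * 100, once per commit.
miss_rates = [100.0 * m / r for m, r in zip(cache_misses, cache_refs)]

print(f"ops_per_sec change: {ops_change:.1f}%")
print(f"cache-miss-rate%: {miss_rates[0]:.2f} -> {miss_rates[1]:.2f}")
```

With the table values this gives about -6.8% for ops_per_sec and a miss rate going from about 1.04 to about 21.38, matching the reported rows. Most of the miss-rate jump comes from the denominator: cache-references drop by about 91% while cache-misses rise by 82%.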
[Plots omitted. Each plot showed per-sample values of one metric, in this order:]
perf-stat.cpu-cycles
perf-stat.cache-references
perf-stat.branch-instructions
perf-stat.branch-misses
perf-stat.dTLB-loads
perf-stat.dTLB-load-misses
perf-stat.dTLB-stores
perf-stat.dTLB-store-misses
perf-stat.iTLB-loads
perf-stat.iTLB-load-misses
aim9.add_long.ops_per_sec
[*] bisect-good sample
[O] bisect-bad sample
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
Thanks,
Xiaolong
View attachment "config-4.9.0-12224-gdc280d93" of type "text/plain" (155645 bytes)
View attachment "job-script" of type "text/plain" (6465 bytes)
View attachment "job.yaml" of type "text/plain" (4097 bytes)
View attachment "reproduce" of type "text/plain" (103 bytes)