[<prev] [next>] [day] [month] [year] [list]
Message-ID: <202211122213.adebf68c-yujie.liu@intel.com>
Date: Sat, 12 Nov 2022 23:01:43 +0800
From: kernel test robot <yujie.liu@...el.com>
To: Shakeel Butt <shakeelb@...gle.com>
CC: <oe-lkp@...ts.linux.dev>, <lkp@...el.com>,
Andrew Morton <akpm@...ux-foundation.org>,
Michal Koutný <mkoutny@...e.com>,
Johannes Weiner <hannes@...xchg.org>,
Michal Hocko <mhocko@...nel.org>,
Roman Gushchin <roman.gushchin@...ux.dev>,
Muchun Song <songmuchun@...edance.com>,
David Hildenbrand <david@...hat.com>,
Yosry Ahmed <yosryahmed@...gle.com>,
Greg Thelen <gthelen@...gle.com>,
<linux-kernel@...r.kernel.org>, <ying.huang@...el.com>,
<feng.tang@...el.com>, <zhengjun.xing@...ux.intel.com>,
<fengwei.yin@...el.com>
Subject: [linus:master] [memcg] dbb16df644: stress-ng.fanotify.ops_per_sec
6.9% improvement
Greeting,
FYI, we noticed a 6.9% improvement of stress-ng.fanotify.ops_per_sec due to commit:
commit: dbb16df6443c59e8a1ef21c2272fcf387d600ddf ("Revert "memcg: cleanup racy sum avoidance code"")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
in testcase: stress-ng
on test machine: 96 threads 2 sockets Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz (Cascade Lake) with 512G memory
with following parameters:
nr_threads: 10%
disk: 1HDD
testtime: 60s
fs: ext4
class: os
test: fanotify
cpufreq_governor: performance
Details are as below:
=========================================================================================
class/compiler/cpufreq_governor/disk/fs/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
os/gcc-11/performance/1HDD/ext4/x86_64-rhel-8.3/10%/debian-11.1-x86_64-20220510.cgz/lkp-csl-2sp7/fanotify/stress-ng/60s
commit:
a5d2172180 ("mm/zsmalloc: do not attempt to free IS_ERR handle")
dbb16df644 ("Revert "memcg: cleanup racy sum avoidance code"")
a5d2172180e8f94a dbb16df6443c59e8a1ef21c2272
---------------- ---------------------------
%stddev %change %stddev
\ | \
74446473 ± 3% +7.0% 79659479 ± 2% stress-ng.fanotify.ops
1233866 ± 3% +6.9% 1318614 ± 2% stress-ng.fanotify.ops_per_sec
19099410 ± 2% +6.9% 20408525 stress-ng.time.file_system_outputs
12033 +7.3% 12907 stress-ng.time.involuntary_context_switches
1619 +4.2% 1687 stress-ng.time.percent_of_cpu_this_job_got
6716 ± 5% -12.6% 5871 ± 2% stress-ng.time.voluntary_context_switches
0.71 ± 29% -95.5% 0.03 ± 14% iostat.cpu.iowait
0.72 ± 29% -0.7 0.03 ± 16% mpstat.cpu.all.iowait%
4322 ± 16% -21.1% 3409 ± 3% meminfo.Active(file)
3447 ± 21% -25.7% 2560 ± 4% meminfo.Buffers
2278 ± 21% -71.3% 653.67 ± 62% numa-vmstat.node0.nr_written
2667 ± 31% -81.1% 503.67 ± 74% numa-vmstat.node1.nr_written
3464 ± 20% -26.0% 2563 ± 4% vmstat.memory.buff
3559 -4.0% 3417 vmstat.system.cs
0.37 ± 8% +0.1 0.50 ± 15% perf-profile.children.cycles-pp.fanotify_should_merge
1.08 ± 9% +0.2 1.28 ± 10% perf-profile.children.cycles-pp.fanotify_merge
0.08 ± 88% +0.5 0.56 ± 42% perf-profile.children.cycles-pp.start_kernel
0.08 ± 88% +0.5 0.56 ± 42% perf-profile.children.cycles-pp.arch_call_rest_init
0.08 ± 88% +0.5 0.56 ± 42% perf-profile.children.cycles-pp.rest_init
0.36 ± 8% +0.1 0.49 ± 15% perf-profile.self.cycles-pp.fanotify_should_merge
1080 ± 16% -21.2% 851.50 ± 3% proc-vmstat.nr_active_file
2388563 ± 2% +6.8% 2551331 proc-vmstat.nr_dirtied
4950 ± 26% -76.6% 1157 ± 67% proc-vmstat.nr_written
1080 ± 16% -21.2% 851.50 ± 3% proc-vmstat.nr_zone_active_file
6102953 +3.4% 6309059 proc-vmstat.numa_hit
6016520 +3.4% 6222216 proc-vmstat.numa_local
6102815 +3.4% 6308704 proc-vmstat.pgalloc_normal
5988882 +3.4% 6192473 proc-vmstat.pgfree
6.114e+09 +3.2% 6.307e+09 perf-stat.i.branch-instructions
3040 -4.8% 2895 perf-stat.i.context-switches
4.852e+10 +3.6% 5.027e+10 perf-stat.i.cpu-cycles
4.547e+09 +2.7% 4.668e+09 perf-stat.i.dTLB-stores
2.965e+10 +3.1% 3.057e+10 perf-stat.i.instructions
2151 +4.2% 2243 perf-stat.i.instructions-per-iTLB-miss
0.51 +3.6% 0.52 perf-stat.i.metric.GHz
653.24 +3.6% 676.88 perf-stat.i.metric.K/sec
198.34 +3.0% 204.22 perf-stat.i.metric.M/sec
36433263 +5.1% 38301409 perf-stat.i.node-load-misses
2010 +5.0% 2111 perf-stat.overall.instructions-per-iTLB-miss
6.021e+09 +3.2% 6.211e+09 perf-stat.ps.branch-instructions
2986 -4.6% 2847 perf-stat.ps.context-switches
4.779e+10 +3.6% 4.951e+10 perf-stat.ps.cpu-cycles
4.479e+09 +2.7% 4.598e+09 perf-stat.ps.dTLB-stores
2.92e+10 +3.1% 3.011e+10 perf-stat.ps.instructions
35888487 +5.1% 37728073 perf-stat.ps.node-load-misses
2922671 +3.8% 3032931 ± 2% perf-stat.ps.node-loads
1.873e+12 +3.4% 1.937e+12 perf-stat.total.instructions
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://01.org/lkp
View attachment "config-6.0.0-rc2-00171-gdbb16df6443c" of type "text/plain" (164374 bytes)
View attachment "job-script" of type "text/plain" (8530 bytes)
View attachment "job.yaml" of type "text/plain" (5477 bytes)
View attachment "reproduce" of type "text/plain" (536 bytes)
Powered by blists - more mailing lists