[<prev] [next>] [day] [month] [year] [list]
Message-ID: <202210141305.f6f7bdb9-oliver.sang@intel.com>
Date: Fri, 14 Oct 2022 13:39:59 +0800
From: kernel test robot <oliver.sang@...el.com>
To: "Darrick J. Wong" <djwong@...nel.org>
CC: <lkp@...ts.01.org>, <lkp@...el.com>,
Dave Chinner <dchinner@...hat.com>,
<linux-kernel@...r.kernel.org>, <linux-xfs@...r.kernel.org>,
<ying.huang@...el.com>, <feng.tang@...el.com>,
<zhengjun.xing@...ux.intel.com>, <fengwei.yin@...el.com>
Subject: [xfs] 2ed5b09b3e: aim7.jobs-per-min 13.9% improvement
Greeting,
FYI, we noticed a 13.9% improvement of aim7.jobs-per-min due to commit:
commit: 2ed5b09b3e8fc274ae8fecd6ab7c5106a364bed1 ("xfs: make inode attribute forks a permanent part of struct xfs_inode")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
in testcase: aim7
on test machine: 128 threads 2 sockets Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz (Ice Lake) with 256G memory
with following parameters:
disk: 1BRD_48G
fs: xfs
test: disk_rd
load: 9000
cpufreq_governor: performance
test-description: AIM7 is a traditional UNIX system level benchmark suite which is used to test and measure the performance of multiuser system.
test-url: https://sourceforge.net/projects/aimbench/files/aim-suite7/
Details are as below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
=========================================================================================
compiler/cpufreq_governor/disk/fs/kconfig/load/rootfs/tbox_group/test/testcase:
gcc-11/performance/1BRD_48G/xfs/x86_64-rhel-8.3/9000/debian-11.1-x86_64-20220510.cgz/lkp-icl-2sp2/disk_rd/aim7
commit:
732436ef91 ("xfs: convert XFS_IFORK_PTR to a static inline helper")
2ed5b09b3e ("xfs: make inode attribute forks a permanent part of struct xfs_inode")
732436ef916b4f33 2ed5b09b3e8fc274ae8fecd6ab7
---------------- ---------------------------
%stddev %change %stddev
\ | \
518200 +13.9% 590052 ± 4% aim7.jobs-per-min
104.74 -12.0% 92.22 ± 4% aim7.time.elapsed_time
104.74 -12.0% 92.22 ± 4% aim7.time.elapsed_time.max
1268209 ± 3% -21.5% 995207 ± 7% aim7.time.involuntary_context_switches
11845 -13.5% 10249 ± 5% aim7.time.system_time
96456 ± 2% -6.4% 90310 ± 5% aim7.time.voluntary_context_switches
128756 ± 4% -12.4% 112833 ± 5% meminfo.AnonHugePages
12.03 ± 3% +2.4 14.42 ± 2% mpstat.cpu.all.idle%
15141 ± 2% -8.0% 13934 ± 2% vmstat.system.cs
13.59 ± 2% +18.4% 16.08 ± 2% iostat.cpu.idle
85.83 -3.0% 83.26 iostat.cpu.system
0.94 ± 12% -20.7% 0.74 ± 12% sched_debug.cfs_rq:/.h_nr_running.stddev
1714 ± 11% -21.0% 1354 ± 12% sched_debug.cfs_rq:/.runnable_avg.avg
4309 ± 11% -22.9% 3321 ± 13% sched_debug.cfs_rq:/.runnable_avg.max
0.93 ± 12% -20.7% 0.74 ± 12% sched_debug.cpu.nr_running.stddev
1052162 ± 3% -8.0% 967689 ± 2% proc-vmstat.numa_hit
935991 ± 3% -9.0% 852146 ± 2% proc-vmstat.numa_local
83514 ± 22% -38.3% 51505 ± 34% proc-vmstat.numa_pte_updates
1063781 ± 3% -9.1% 967461 ± 2% proc-vmstat.pgalloc_normal
979883 ± 3% -9.4% 887981 ± 2% proc-vmstat.pgfault
814030 -6.5% 760890 ± 2% proc-vmstat.pgfree
15209 ± 2% -9.2% 13811 ± 2% perf-stat.i.context-switches
16.08 -14.6% 13.72 ± 5% perf-stat.i.cpi
2.882e+11 -4.3% 2.758e+11 perf-stat.i.cpu-cycles
0.11 ± 5% +16.1% 0.12 ± 9% perf-stat.i.ipc
2.24 -4.4% 2.15 ± 2% perf-stat.i.metric.GHz
17.40 -12.3% 15.26 ± 5% perf-stat.overall.cpi
0.00 ± 2% -0.0 0.00 ± 4% perf-stat.overall.dTLB-store-miss-rate%
0.06 +14.3% 0.07 ± 4% perf-stat.overall.ipc
15213 ± 2% -8.4% 13934 ± 2% perf-stat.ps.context-switches
2.92e+11 -2.5% 2.846e+11 perf-stat.ps.cpu-cycles
26.57 ± 4% -26.5 0.09 ±223% perf-profile.calltrace.cycles-pp.touch_atime.filemap_read.xfs_file_buffered_read.xfs_file_read_iter.new_sync_read
26.31 ± 4% -26.2 0.09 ±223% perf-profile.calltrace.cycles-pp.atime_needs_update.touch_atime.filemap_read.xfs_file_buffered_read.xfs_file_read_iter
44.22 -24.2 20.01 ± 4% perf-profile.calltrace.cycles-pp.filemap_read.xfs_file_buffered_read.xfs_file_read_iter.new_sync_read.vfs_read
6.70 ± 5% -2.8 3.85 ± 11% perf-profile.calltrace.cycles-pp.filemap_get_read_batch.filemap_get_pages.filemap_read.xfs_file_buffered_read.xfs_file_read_iter
98.12 -0.5 97.64 perf-profile.calltrace.cycles-pp.xfs_file_buffered_read.xfs_file_read_iter.new_sync_read.vfs_read.ksys_read
98.20 -0.5 97.74 perf-profile.calltrace.cycles-pp.xfs_file_read_iter.new_sync_read.vfs_read.ksys_read.do_syscall_64
98.34 -0.4 97.92 perf-profile.calltrace.cycles-pp.new_sync_read.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe
98.68 -0.4 98.32 perf-profile.calltrace.cycles-pp.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
98.83 -0.3 98.49 perf-profile.calltrace.cycles-pp.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
98.99 -0.3 98.68 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
99.09 -0.3 98.81 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.read
99.40 -0.2 99.19 perf-profile.calltrace.cycles-pp.read
34.96 +9.8 44.78 perf-profile.calltrace.cycles-pp.down_read.xfs_ilock.xfs_file_buffered_read.xfs_file_read_iter.new_sync_read
35.32 +10.0 45.32 perf-profile.calltrace.cycles-pp.xfs_ilock.xfs_file_buffered_read.xfs_file_read_iter.new_sync_read.vfs_read
18.23 +13.3 31.54 ± 4% perf-profile.calltrace.cycles-pp.up_read.xfs_iunlock.xfs_file_buffered_read.xfs_file_read_iter.new_sync_read
18.45 +13.6 32.07 ± 4% perf-profile.calltrace.cycles-pp.xfs_iunlock.xfs_file_buffered_read.xfs_file_read_iter.new_sync_read.vfs_read
26.54 ± 4% -26.2 0.32 ± 30% perf-profile.children.cycles-pp.atime_needs_update
26.58 ± 4% -26.2 0.36 ± 26% perf-profile.children.cycles-pp.touch_atime
44.28 -24.2 20.09 ± 4% perf-profile.children.cycles-pp.filemap_read
6.74 ± 5% -2.9 3.88 ± 10% perf-profile.children.cycles-pp.filemap_get_read_batch
98.15 -0.5 97.66 perf-profile.children.cycles-pp.xfs_file_buffered_read
98.21 -0.5 97.76 perf-profile.children.cycles-pp.xfs_file_read_iter
98.36 -0.4 97.94 perf-profile.children.cycles-pp.new_sync_read
98.70 -0.4 98.34 perf-profile.children.cycles-pp.vfs_read
98.84 -0.3 98.51 perf-profile.children.cycles-pp.ksys_read
99.56 -0.2 99.37 perf-profile.children.cycles-pp.read
99.37 -0.1 99.24 perf-profile.children.cycles-pp.do_syscall_64
99.43 -0.1 99.31 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
0.05 +0.0 0.06 ± 7% perf-profile.children.cycles-pp.do_filp_open
0.05 +0.0 0.06 ± 7% perf-profile.children.cycles-pp.path_openat
0.06 ± 6% +0.0 0.07 ± 5% perf-profile.children.cycles-pp.exit_to_user_mode_loop
0.05 +0.0 0.06 ± 7% perf-profile.children.cycles-pp.times
0.08 ± 5% +0.0 0.10 ± 9% perf-profile.children.cycles-pp.__might_sleep
0.06 ± 9% +0.0 0.07 ± 5% perf-profile.children.cycles-pp.__close
0.06 +0.0 0.08 ± 6% perf-profile.children.cycles-pp.open64
0.10 ± 5% +0.0 0.11 ± 6% perf-profile.children.cycles-pp.exit_to_user_mode_prepare
0.06 ± 6% +0.0 0.08 perf-profile.children.cycles-pp.__x64_sys_openat
0.06 ± 6% +0.0 0.08 perf-profile.children.cycles-pp.do_sys_openat2
0.16 ± 2% +0.0 0.19 ± 6% perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
0.05 ± 8% +0.0 0.08 ± 20% perf-profile.children.cycles-pp.generic_file_read_iter
0.14 ± 4% +0.0 0.17 ± 5% perf-profile.children.cycles-pp.syscall_exit_to_user_mode
0.08 ± 8% +0.0 0.11 ± 6% perf-profile.children.cycles-pp.write
0.11 ± 5% +0.0 0.15 ± 16% perf-profile.children.cycles-pp.__fdget_pos
0.00 +0.1 0.05 perf-profile.children.cycles-pp._raw_spin_lock
0.11 ± 7% +0.1 0.16 ± 10% perf-profile.children.cycles-pp.xas_load
0.00 +0.1 0.06 ± 9% perf-profile.children.cycles-pp.thread_group_cputime_adjusted
0.00 +0.1 0.06 ± 8% perf-profile.children.cycles-pp.do_sys_times
0.00 +0.1 0.06 ± 8% perf-profile.children.cycles-pp.task_work_run
0.00 +0.1 0.06 ± 9% perf-profile.children.cycles-pp.__x64_sys_times
0.00 +0.1 0.07 ± 16% perf-profile.children.cycles-pp.xas_start
0.14 ± 21% +0.1 0.23 ± 42% perf-profile.children.cycles-pp.ksys_write
35.29 +10.0 45.28 perf-profile.children.cycles-pp.down_read
35.33 +10.0 45.33 perf-profile.children.cycles-pp.xfs_ilock
18.40 ± 2% +13.5 31.89 ± 4% perf-profile.children.cycles-pp.up_read
18.46 +13.6 32.10 ± 4% perf-profile.children.cycles-pp.xfs_iunlock
26.37 ± 4% -26.1 0.22 ± 42% perf-profile.self.cycles-pp.atime_needs_update
6.60 ± 5% -2.9 3.70 ± 11% perf-profile.self.cycles-pp.filemap_get_read_batch
0.06 ± 7% +0.0 0.08 ± 7% perf-profile.self.cycles-pp.__cond_resched
0.07 ± 5% +0.0 0.09 ± 6% perf-profile.self.cycles-pp.xas_load
0.07 +0.0 0.10 ± 21% perf-profile.self.cycles-pp.xfs_file_read_iter
0.07 ± 7% +0.0 0.10 ± 10% perf-profile.self.cycles-pp.do_syscall_64
0.04 ± 44% +0.0 0.08 ± 9% perf-profile.self.cycles-pp.xfs_file_buffered_read
0.12 ± 7% +0.0 0.16 ± 4% perf-profile.self.cycles-pp.new_sync_read
0.02 ±141% +0.1 0.08 ± 20% perf-profile.self.cycles-pp.generic_file_read_iter
0.00 +0.1 0.07 ± 15% perf-profile.self.cycles-pp.xas_start
0.06 ± 8% +0.2 0.21 ± 12% perf-profile.self.cycles-pp.xfs_iunlock
3.85 ± 4% +3.2 7.09 ± 6% perf-profile.self.cycles-pp.filemap_read
35.06 +10.0 45.02 perf-profile.self.cycles-pp.down_read
18.33 ± 2% +13.4 31.76 ± 4% perf-profile.self.cycles-pp.up_read
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://01.org/lkp
View attachment "config-5.19.0-rc5-00034-g2ed5b09b3e8f" of type "text/plain" (163182 bytes)
View attachment "job-script" of type "text/plain" (8047 bytes)
View attachment "job.yaml" of type "text/plain" (5455 bytes)
View attachment "reproduce" of type "text/plain" (731 bytes)
Powered by blists - more mailing lists