Message-ID: <20220527092432.GE11731@xsang-OptiPlex-9020>
Date: Fri, 27 May 2022 17:24:32 +0800
From: kernel test robot <oliver.sang@...el.com>
To: Jens Axboe <axboe@...nel.dk>
Cc: LKML <linux-kernel@...r.kernel.org>, io-uring@...r.kernel.org,
lkp@...ts.01.org, lkp@...el.com, ying.huang@...el.com,
feng.tang@...el.com, zhengjun.xing@...ux.intel.com,
fengwei.yin@...el.com, guobing.chen@...el.com,
ming.a.chen@...el.com, frank.du@...el.com, Shuhua.Fan@...el.com,
wangyang.guo@...el.com, Wenhuan.Huang@...el.com,
jessica.ji@...el.com, shan.kang@...el.com, guangli.li@...el.com,
tiejun.li@...el.com, yu.ma@...el.com, dapeng1.mi@...el.com,
jiebin.sun@...el.com, gengxin.xie@...el.com, fan.zhao@...el.com
Subject: [io_uring] 584b0180f0: phoronix-test-suite.fio.SequentialWrite.IO_uring.Yes.Yes.1MB.DefaultTestDirectory.mb_s -10.2% regression
Greetings,
FYI, we noticed a -10.2% regression of phoronix-test-suite.fio.SequentialWrite.IO_uring.Yes.Yes.1MB.DefaultTestDirectory.mb_s due to commit:
commit: 584b0180f0f4d67d7145950fe68c625f06c88b10 ("io_uring: move read/write file prep state into actual opcode handler")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
in testcase: phoronix-test-suite
on test machine: 96 threads 2 sockets Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 512G memory
with the following parameters:
test: fio-1.14.1
option_a: Sequential Write
option_b: IO_uring
option_c: Yes
option_d: Yes
option_e: 1MB
option_f: Default Test Directory
cpufreq_governor: performance
ucode: 0x500320a
test-description: The Phoronix Test Suite is the most comprehensive testing and benchmarking platform available that provides an extensible framework for which new tests can be easily added.
test-url: http://www.phoronix-test-suite.com/
If you fix the issue, kindly add the following tag:
Reported-by: kernel test robot <oliver.sang@...el.com>
Details are as below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file
# if you come across any failure that blocks the test,
# please remove the ~/.lkp and /lkp directories to run from a clean state.
=========================================================================================
compiler/cpufreq_governor/kconfig/option_a/option_b/option_c/option_d/option_e/option_f/rootfs/tbox_group/test/testcase/ucode:
gcc-11/performance/x86_64-rhel-8.3/Sequential Write/IO_uring/Yes/Yes/1MB/Default Test Directory/debian-x86_64-phoronix/lkp-csl-2sp7/fio-1.14.1/phoronix-test-suite/0x500320a
commit:
a3e4bc23d5 ("io_uring: defer splice/tee file validity check until command issue")
584b0180f0 ("io_uring: move read/write file prep state into actual opcode handler")
a3e4bc23d5470b2b 584b0180f0f4d67d7145950fe68
---------------- ---------------------------
%stddev %change %stddev
\ | \
1081 -10.2% 971.00 phoronix-test-suite.fio.SequentialWrite.IO_uring.Yes.Yes.1MB.DefaultTestDirectory.iops
1084 -10.2% 974.67 phoronix-test-suite.fio.SequentialWrite.IO_uring.Yes.Yes.1MB.DefaultTestDirectory.mb_s
118.42 +132.0% 274.70 ± 55% phoronix-test-suite.time.elapsed_time
118.42 +132.0% 274.70 ± 55% phoronix-test-suite.time.elapsed_time.max
1317 ± 19% +921.5% 13458 ± 53% phoronix-test-suite.time.involuntary_context_switches
185595 +23.8% 229715 ± 17% phoronix-test-suite.time.minor_page_faults
68.33 +2031.5% 1456 ± 3% phoronix-test-suite.time.percent_of_cpu_this_job_got
58.62 +6771.2% 4028 ± 58% phoronix-test-suite.time.system_time
244.97 +10.1% 269.72 pmeter.Average_Active_Power
1655992 ± 78% +1356.2% 24114501 ± 50% numa-numastat.node1.local_node
1662758 ± 78% +1374.2% 24512157 ± 50% numa-numastat.node1.numa_hit
958669 +10.8% 1062574 ± 7% meminfo.Active
843569 +12.0% 945049 ± 8% meminfo.Active(anon)
61229 ± 3% -60.2% 24352 ± 21% meminfo.Writeback
96.73 -14.0 82.71 mpstat.cpu.all.idle%
1.75 ± 15% -0.3 1.43 ± 16% mpstat.cpu.all.irq%
0.80 +14.4 15.19 ± 3% mpstat.cpu.all.sys%
0.33 -0.0 0.28 ± 11% mpstat.cpu.all.usr%
258678 ± 7% -37.0% 162850 ± 17% numa-meminfo.node0.Dirty
56023 ± 12% -85.5% 8096 ± 50% numa-meminfo.node0.Writeback
2978 ± 44% +350.6% 13421 ± 82% numa-meminfo.node1.Active(anon)
1.83 ± 73% +8.1e+06% 148245 ± 22% numa-meminfo.node1.Dirty
96.33 -14.4% 82.50 vmstat.cpu.id
746.67 ± 2% -40.8% 441.83 ± 53% vmstat.io.bi
1.00 +1350.0% 14.50 ± 15% vmstat.procs.r
187622 ± 3% +2.6% 192531 vmstat.system.in
79.33 ± 9% +490.3% 468.33 ± 3% turbostat.Avg_MHz
4.77 ± 15% +13.3 18.11 ± 5% turbostat.Busy%
1688 ± 7% +53.6% 2593 ± 2% turbostat.Bzy_MHz
13881106 ± 34% +181.8% 39121066 ± 64% turbostat.C1E
22867090 ± 3% +134.6% 53644530 ± 54% turbostat.IRQ
49.00 +5.4% 51.67 ± 2% turbostat.PkgTmp
120.78 +16.5% 140.68 turbostat.PkgWatt
64647 ± 7% -36.9% 40763 ± 17% numa-vmstat.node0.nr_dirty
13891 ± 12% -85.1% 2064 ± 50% numa-vmstat.node0.nr_writeback
78537 ± 6% -45.5% 42771 ± 17% numa-vmstat.node0.nr_zone_write_pending
744.50 ± 44% +350.7% 3355 ± 82% numa-vmstat.node1.nr_active_anon
6.00 ± 52% +3.6e+08% 21761586 ± 53% numa-vmstat.node1.nr_dirtied
0.00 +3.7e+106% 37065 ± 22% numa-vmstat.node1.nr_dirty
5.50 ± 48% +3.9e+08% 21697670 ± 53% numa-vmstat.node1.nr_written
744.50 ± 44% +350.7% 3355 ± 82% numa-vmstat.node1.nr_zone_active_anon
0.00 +4e+106% 40201 ± 22% numa-vmstat.node1.nr_zone_write_pending
1662586 ± 78% +1374.3% 24512265 ± 50% numa-vmstat.node1.numa_hit
1655820 ± 78% +1356.4% 24114609 ± 50% numa-vmstat.node1.numa_local
17009 ± 36% +3730.9% 651618 ± 64% sched_debug.cfs_rq:/.min_vruntime.avg
33445 ± 30% +2211.0% 772918 ± 59% sched_debug.cfs_rq:/.min_vruntime.max
11326 ± 45% +4758.2% 550285 ± 70% sched_debug.cfs_rq:/.min_vruntime.min
3907 ± 21% +2044.3% 83779 ± 38% sched_debug.cfs_rq:/.min_vruntime.stddev
91.33 ± 33% +55.1% 141.65 ± 37% sched_debug.cfs_rq:/.runnable_avg.avg
7037 ± 70% +823.4% 64981 ± 67% sched_debug.cfs_rq:/.spread0.max
-15244 +934.3% -157672 sched_debug.cfs_rq:/.spread0.min
3930 ± 21% +2031.5% 83782 ± 38% sched_debug.cfs_rq:/.spread0.stddev
90.43 ± 34% +53.1% 138.43 ± 37% sched_debug.cfs_rq:/.util_avg.avg
135.84 ± 21% +529.3% 854.87 ± 67% sched_debug.cpu.curr->pid.avg
1000 ± 10% +376.5% 4767 ± 59% sched_debug.cpu.nr_switches.min
210892 +12.0% 236262 ± 8% proc-vmstat.nr_active_anon
18617 +7.7% 20053 proc-vmstat.nr_kernel_stack
53538 +3.3% 55319 proc-vmstat.nr_slab_unreclaimable
15260 ± 4% -60.1% 6094 ± 21% proc-vmstat.nr_writeback
210892 +12.0% 236262 ± 8% proc-vmstat.nr_zone_active_anon
9868 ± 31% +227.2% 32291 ± 60% proc-vmstat.numa_hint_faults
9609 ± 28% +156.6% 24657 ± 62% proc-vmstat.numa_hint_faults_local
416.00 ± 8% +259.4% 1495 ± 54% proc-vmstat.numa_huge_pte_updates
259.00 ±156% +40441.1% 105001 ± 46% proc-vmstat.numa_pages_migrated
230799 ± 7% +252.9% 814389 ± 53% proc-vmstat.numa_pte_updates
292996 +7.3% 314293 proc-vmstat.pgactivate
867707 +73.7% 1507465 ± 39% proc-vmstat.pgfault
259.00 ±156% +40441.1% 105001 ± 46% proc-vmstat.pgmigrate_success
311.33 ± 7% +2155.2% 7021 ± 65% proc-vmstat.pgrotated
30.29 ± 42% -73.8% 7.93 ±110% perf-stat.i.MPKI
7.08e+08 +283.6% 2.716e+09 perf-stat.i.branch-instructions
2.91 ± 44% -1.9 1.04 ± 83% perf-stat.i.branch-miss-rate%
34699379 -10.6% 31027580 ± 3% perf-stat.i.cache-misses
6.802e+09 ± 10% +554.9% 4.455e+10 ± 3% perf-stat.i.cpu-cycles
33.73 ± 25% +1103.3% 405.85 ± 5% perf-stat.i.cpu-migrations
1102 ± 33% +132.4% 2562 ± 22% perf-stat.i.cycles-between-cache-misses
0.22 ± 50% -0.2 0.07 ±137% perf-stat.i.dTLB-load-miss-rate%
9.67e+08 ± 4% +270.6% 3.584e+09 ± 2% perf-stat.i.dTLB-loads
4.994e+08 ± 2% -6.6% 4.664e+08 ± 2% perf-stat.i.dTLB-stores
3.503e+09 +284.6% 1.347e+10 perf-stat.i.instructions
3126 ± 5% +327.7% 13371 ± 7% perf-stat.i.instructions-per-iTLB-miss
0.52 ± 9% -38.4% 0.32 ± 8% perf-stat.i.ipc
70851 ± 10% +554.6% 463814 ± 3% perf-stat.i.metric.GHz
23543856 +202.1% 71118618 perf-stat.i.metric.M/sec
24.45 ± 5% +17.6 42.05 ± 5% perf-stat.i.node-load-miss-rate%
154141 ± 4% +1027.3% 1737599 ± 11% perf-stat.i.node-load-misses
6780078 -41.7% 3950788 ± 5% perf-stat.i.node-loads
19.06 ± 16% +32.5 51.53 ± 7% perf-stat.i.node-store-miss-rate%
54242 ± 17% +4415.1% 2449115 ± 15% perf-stat.i.node-store-misses
5188725 -41.8% 3018300 ± 12% perf-stat.i.node-stores
20.79 ± 28% -81.7% 3.80 ± 31% perf-stat.overall.MPKI
2.06 ± 28% -1.8 0.31 ± 39% perf-stat.overall.branch-miss-rate%
1.94 ± 10% +70.2% 3.31 ± 2% perf-stat.overall.cpi
195.90 ± 8% +632.9% 1435 perf-stat.overall.cycles-between-cache-misses
0.10 ± 48% -0.1 0.01 ±116% perf-stat.overall.dTLB-load-miss-rate%
3154 ± 6% +306.3% 12813 ± 8% perf-stat.overall.instructions-per-iTLB-miss
0.52 ± 11% -41.9% 0.30 ± 2% perf-stat.overall.ipc
2.23 ± 4% +28.3 30.55 ± 10% perf-stat.overall.node-load-miss-rate%
1.04 ± 17% +43.8 44.81 ± 15% perf-stat.overall.node-store-miss-rate%
7.02e+08 +284.8% 2.702e+09 perf-stat.ps.branch-instructions
34381260 -10.2% 30861100 ± 3% perf-stat.ps.cache-misses
6.746e+09 ± 10% +556.9% 4.431e+10 ± 3% perf-stat.ps.cpu-cycles
33.43 ± 25% +1107.4% 403.66 ± 5% perf-stat.ps.cpu-migrations
9.587e+08 ± 4% +271.8% 3.565e+09 ± 2% perf-stat.ps.dTLB-loads
4.951e+08 ± 2% -6.3% 4.64e+08 ± 2% perf-stat.ps.dTLB-stores
3.473e+09 +285.9% 1.34e+10 perf-stat.ps.instructions
152903 ± 4% +1030.3% 1728344 ± 11% perf-stat.ps.node-load-misses
6717519 -41.5% 3929544 ± 5% perf-stat.ps.node-loads
53840 ± 17% +4424.7% 2436140 ± 15% perf-stat.ps.node-store-misses
5140848 -41.6% 3001971 ± 12% perf-stat.ps.node-stores
4.129e+11 +801.9% 3.724e+12 ± 56% perf-stat.total.instructions
61.17 ± 3% -49.8 11.37 ± 16% perf-profile.calltrace.cycles-pp.secondary_startup_64_no_verify
60.62 ± 3% -49.4 11.26 ± 16% perf-profile.calltrace.cycles-pp.cpu_startup_entry.secondary_startup_64_no_verify
60.58 ± 3% -49.3 11.26 ± 16% perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.secondary_startup_64_no_verify
59.76 ± 3% -48.6 11.19 ± 16% perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.secondary_startup_64_no_verify
55.62 ± 2% -44.8 10.84 ± 17% perf-profile.calltrace.cycles-pp.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry.secondary_startup_64_no_verify
55.13 ± 3% -44.4 10.73 ± 17% perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry
41.00 ± 4% -31.5 9.54 ± 20% perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
40.57 ± 3% -31.0 9.53 ± 20% perf-profile.calltrace.cycles-pp.mwait_idle_with_hints.intel_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call
13.53 ± 4% -12.4 1.10 ± 9% perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
14.04 ± 21% -12.1 1.96 ± 4% perf-profile.calltrace.cycles-pp.generic_perform_write.ext4_buffered_write_iter.io_write.io_issue_sqe.io_wq_submit_work
13.80 ± 10% -12.1 1.73 ± 9% perf-profile.calltrace.cycles-pp.kthread.ret_from_fork
10.98 ± 5% -10.0 0.98 ± 9% perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call
11.01 ± 9% -9.6 1.41 ± 11% perf-profile.calltrace.cycles-pp.worker_thread.kthread.ret_from_fork
10.97 ± 9% -9.6 1.40 ± 11% perf-profile.calltrace.cycles-pp.process_one_work.worker_thread.kthread.ret_from_fork
10.78 ± 9% -9.4 1.38 ± 11% perf-profile.calltrace.cycles-pp.loop_process_work.process_one_work.worker_thread.kthread.ret_from_fork
10.60 ± 10% -9.2 1.35 ± 11% perf-profile.calltrace.cycles-pp.lo_write_simple.loop_process_work.process_one_work.worker_thread.kthread
10.32 ± 9% -9.0 1.31 ± 12% perf-profile.calltrace.cycles-pp.do_iter_write.lo_write_simple.loop_process_work.process_one_work.worker_thread
10.00 ± 9% -8.7 1.28 ± 12% perf-profile.calltrace.cycles-pp.do_iter_readv_writev.do_iter_write.lo_write_simple.loop_process_work.process_one_work
9.93 ± 9% -8.7 1.28 ± 12% perf-profile.calltrace.cycles-pp.generic_file_write_iter.do_iter_readv_writev.do_iter_write.lo_write_simple.loop_process_work
9.59 ± 9% -8.4 1.24 ± 13% perf-profile.calltrace.cycles-pp.__generic_file_write_iter.generic_file_write_iter.do_iter_readv_writev.do_iter_write.lo_write_simple
9.41 ± 9% -8.2 1.22 ± 13% perf-profile.calltrace.cycles-pp.generic_perform_write.__generic_file_write_iter.generic_file_write_iter.do_iter_readv_writev.do_iter_write
9.02 ± 8% -8.1 0.95 ± 7% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe
9.02 ± 8% -8.1 0.95 ± 7% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe
8.21 ± 8% -7.3 0.88 ± 7% perf-profile.calltrace.cycles-pp.__x64_sys_fadvise64.do_syscall_64.entry_SYSCALL_64_after_hwframe
8.21 ± 8% -7.3 0.88 ± 7% perf-profile.calltrace.cycles-pp.ksys_fadvise64_64.__x64_sys_fadvise64.do_syscall_64.entry_SYSCALL_64_after_hwframe
8.21 ± 8% -7.3 0.88 ± 7% perf-profile.calltrace.cycles-pp.generic_fadvise.ksys_fadvise64_64.__x64_sys_fadvise64.do_syscall_64.entry_SYSCALL_64_after_hwframe
8.16 ± 9% -7.1 1.07 ± 14% perf-profile.calltrace.cycles-pp.copy_page_from_iter_atomic.generic_perform_write.__generic_file_write_iter.generic_file_write_iter.do_iter_readv_writev
7.36 ± 6% -6.7 0.62 ± 9% perf-profile.calltrace.cycles-pp.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state.cpuidle_enter
7.17 ± 7% -6.6 0.62 ± 9% perf-profile.calltrace.cycles-pp.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state
5.18 ± 21% -4.5 0.72 ± 6% perf-profile.calltrace.cycles-pp.copy_page_from_iter_atomic.generic_perform_write.ext4_buffered_write_iter.io_write.io_issue_sqe
5.08 ± 21% -4.4 0.70 ± 5% perf-profile.calltrace.cycles-pp.copyin.copy_page_from_iter_atomic.generic_perform_write.ext4_buffered_write_iter.io_write
5.05 ± 21% -4.4 0.70 ± 6% perf-profile.calltrace.cycles-pp.copy_user_enhanced_fast_string.copyin.copy_page_from_iter_atomic.generic_perform_write.ext4_buffered_write_iter
4.69 ± 8% -4.3 0.36 ± 70% perf-profile.calltrace.cycles-pp.invalidate_mapping_pagevec.generic_fadvise.ksys_fadvise64_64.__x64_sys_fadvise64.do_syscall_64
5.02 ± 21% -4.3 0.75 ± 4% perf-profile.calltrace.cycles-pp.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter.io_write.io_issue_sqe
0.00 +0.6 0.61 ± 2% perf-profile.calltrace.cycles-pp.rwsem_spin_on_owner.rwsem_down_write_slowpath.ext4_buffered_write_iter.io_write.io_issue_sqe
0.00 +2.0 2.03 ± 2% perf-profile.calltrace.cycles-pp.rwsem_spin_on_owner.rwsem_optimistic_spin.rwsem_down_write_slowpath.ext4_buffered_write_iter.io_write
28.19 ± 7% +59.2 87.44 ± 2% perf-profile.calltrace.cycles-pp.ret_from_fork
14.40 ± 21% +71.3 85.71 ± 2% perf-profile.calltrace.cycles-pp.io_wqe_worker.ret_from_fork
14.25 ± 21% +71.4 85.64 ± 2% perf-profile.calltrace.cycles-pp.io_worker_handle_work.io_wqe_worker.ret_from_fork
14.23 ± 21% +71.4 85.63 ± 2% perf-profile.calltrace.cycles-pp.io_issue_sqe.io_wq_submit_work.io_worker_handle_work.io_wqe_worker.ret_from_fork
14.23 ± 21% +71.4 85.64 ± 2% perf-profile.calltrace.cycles-pp.io_wq_submit_work.io_worker_handle_work.io_wqe_worker.ret_from_fork
14.22 ± 21% +71.4 85.63 ± 2% perf-profile.calltrace.cycles-pp.io_write.io_issue_sqe.io_wq_submit_work.io_worker_handle_work.io_wqe_worker
14.10 ± 21% +71.5 85.62 ± 2% perf-profile.calltrace.cycles-pp.ext4_buffered_write_iter.io_write.io_issue_sqe.io_wq_submit_work.io_worker_handle_work
0.00 +80.9 80.92 ± 2% perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.ext4_buffered_write_iter.io_write
0.00 +83.0 82.99 ± 2% perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.ext4_buffered_write_iter.io_write.io_issue_sqe
0.00 +83.6 83.63 ± 2% perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.ext4_buffered_write_iter.io_write.io_issue_sqe.io_wq_submit_work
61.17 ± 3% -49.8 11.37 ± 16% perf-profile.children.cycles-pp.secondary_startup_64_no_verify
61.17 ± 3% -49.8 11.37 ± 16% perf-profile.children.cycles-pp.cpu_startup_entry
61.17 ± 3% -49.8 11.37 ± 16% perf-profile.children.cycles-pp.do_idle
60.34 ± 3% -49.0 11.30 ± 16% perf-profile.children.cycles-pp.cpuidle_idle_call
56.14 ± 3% -45.2 10.95 ± 17% perf-profile.children.cycles-pp.cpuidle_enter
56.08 ± 3% -45.1 10.94 ± 17% perf-profile.children.cycles-pp.cpuidle_enter_state
41.25 ± 4% -31.6 9.64 ± 20% perf-profile.children.cycles-pp.intel_idle
41.16 ± 4% -31.5 9.64 ± 20% perf-profile.children.cycles-pp.mwait_idle_with_hints
23.64 ± 10% -20.4 3.22 ± 5% perf-profile.children.cycles-pp.generic_perform_write
13.80 ± 10% -12.1 1.73 ± 9% perf-profile.children.cycles-pp.kthread
13.48 ± 8% -11.8 1.72 ± 9% perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
13.41 ± 5% -11.6 1.80 ± 9% perf-profile.children.cycles-pp.copy_page_from_iter_atomic
11.71 ± 6% -10.2 1.55 ± 9% perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
11.01 ± 9% -9.6 1.41 ± 11% perf-profile.children.cycles-pp.worker_thread
10.97 ± 9% -9.6 1.40 ± 11% perf-profile.children.cycles-pp.process_one_work
10.78 ± 9% -9.4 1.38 ± 11% perf-profile.children.cycles-pp.loop_process_work
10.61 ± 10% -9.3 1.35 ± 12% perf-profile.children.cycles-pp.lo_write_simple
10.19 ± 7% -9.1 1.12 ± 5% perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
10.18 ± 7% -9.1 1.12 ± 5% perf-profile.children.cycles-pp.do_syscall_64
10.32 ± 9% -9.0 1.32 ± 12% perf-profile.children.cycles-pp.do_iter_write
10.08 ± 9% -8.8 1.32 ± 12% perf-profile.children.cycles-pp.generic_file_write_iter
10.00 ± 9% -8.7 1.28 ± 12% perf-profile.children.cycles-pp.do_iter_readv_writev
9.75 ± 9% -8.5 1.28 ± 13% perf-profile.children.cycles-pp.__generic_file_write_iter
8.21 ± 8% -7.3 0.88 ± 7% perf-profile.children.cycles-pp.__x64_sys_fadvise64
8.21 ± 8% -7.3 0.88 ± 7% perf-profile.children.cycles-pp.ksys_fadvise64_64
8.21 ± 8% -7.3 0.88 ± 7% perf-profile.children.cycles-pp.generic_fadvise
7.93 ± 6% -6.8 1.16 ± 10% perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt
7.76 ± 7% -6.6 1.15 ± 10% perf-profile.children.cycles-pp.hrtimer_interrupt
5.14 ± 21% -4.4 0.72 ± 5% perf-profile.children.cycles-pp.copyin
5.13 ± 21% -4.4 0.72 ± 5% perf-profile.children.cycles-pp.copy_user_enhanced_fast_string
5.02 ± 21% -4.3 0.75 ± 4% perf-profile.children.cycles-pp.ext4_da_write_begin
4.70 ± 8% -4.2 0.52 ± 9% perf-profile.children.cycles-pp.invalidate_mapping_pagevec
4.60 ± 10% -3.8 0.80 ± 15% perf-profile.children.cycles-pp.__hrtimer_run_queues
3.98 ± 9% -3.5 0.44 ± 7% perf-profile.children.cycles-pp.__softirqentry_text_start
3.54 ± 13% -3.2 0.30 ± 13% perf-profile.children.cycles-pp.menu_select
3.71 ± 15% -3.2 0.54 ± 3% perf-profile.children.cycles-pp.pagecache_get_page
3.51 ± 9% -3.2 0.36 ± 10% perf-profile.children.cycles-pp.__filemap_fdatawrite_range
3.51 ± 9% -3.2 0.36 ± 10% perf-profile.children.cycles-pp.filemap_fdatawrite_wbc
3.51 ± 9% -3.1 0.37 ± 6% perf-profile.children.cycles-pp.do_writepages
3.51 ± 9% -3.1 0.37 ± 6% perf-profile.children.cycles-pp.ext4_writepages
3.50 ± 9% -3.1 0.37 ± 6% perf-profile.children.cycles-pp.mpage_prepare_extent_to_map
3.67 ± 16% -3.1 0.54 ± 3% perf-profile.children.cycles-pp.__filemap_get_folio
2.87 ± 10% -2.6 0.29 ± 11% perf-profile.children.cycles-pp.mpage_process_page_bufs
3.08 ± 11% -2.4 0.66 ± 15% perf-profile.children.cycles-pp.tick_sched_timer
2.71 ± 52% -2.4 0.30 ± 23% perf-profile.children.cycles-pp.ktime_get
2.70 ± 10% -2.4 0.30 ± 5% perf-profile.children.cycles-pp.smpboot_thread_fn
2.66 ± 21% -2.3 0.33 ± 3% perf-profile.children.cycles-pp.generic_write_end
2.61 ± 11% -2.3 0.29 ± 6% perf-profile.children.cycles-pp.run_ksoftirqd
2.57 ± 11% -2.3 0.28 ± 7% perf-profile.children.cycles-pp.blk_complete_reqs
2.56 ± 11% -2.3 0.28 ± 7% perf-profile.children.cycles-pp.blk_mq_end_request
2.56 ± 11% -2.3 0.28 ± 7% perf-profile.children.cycles-pp.blk_update_request
2.54 ± 11% -2.3 0.28 ± 7% perf-profile.children.cycles-pp.ext4_end_bio
2.54 ± 11% -2.3 0.28 ± 7% perf-profile.children.cycles-pp.ext4_finish_bio
2.45 ± 9% -2.2 0.24 ± 9% perf-profile.children.cycles-pp.mpage_submit_page
2.48 ± 21% -2.2 0.30 ± 2% perf-profile.children.cycles-pp.__block_commit_write
2.40 ± 15% -1.8 0.58 ± 14% perf-profile.children.cycles-pp.tick_sched_handle
2.06 ± 34% -1.8 0.24 ± 15% perf-profile.children.cycles-pp.clockevents_program_event
2.26 ± 13% -1.7 0.57 ± 13% perf-profile.children.cycles-pp.update_process_times
1.80 ± 9% -1.6 0.17 ± 8% perf-profile.children.cycles-pp.ext4_bio_write_page
1.78 ± 11% -1.6 0.18 ± 7% perf-profile.children.cycles-pp.folio_end_writeback
1.86 ± 21% -1.6 0.28 ± 6% perf-profile.children.cycles-pp.filemap_add_folio
1.78 ± 14% -1.6 0.20 ± 13% perf-profile.children.cycles-pp.__irq_exit_rcu
1.80 ± 22% -1.5 0.26 ± 6% perf-profile.children.cycles-pp.ext4_block_write_begin
1.69 ± 12% -1.5 0.20 ± 17% perf-profile.children.cycles-pp.mapping_evict_folio
1.63 ± 22% -1.5 0.14 ± 12% perf-profile.children.cycles-pp.tick_nohz_get_sleep_length
1.68 ± 12% -1.5 0.20 ± 17% perf-profile.children.cycles-pp.filemap_release_folio
1.55 ± 10% -1.4 0.16 ± 7% perf-profile.children.cycles-pp.__folio_end_writeback
1.50 ± 6% -1.3 0.15 ± 5% perf-profile.children.cycles-pp._raw_spin_lock_irqsave
1.41 ± 11% -1.3 0.14 ± 8% perf-profile.children.cycles-pp.remove_mapping
1.38 ± 11% -1.2 0.14 ± 7% perf-profile.children.cycles-pp.__remove_mapping
1.28 ± 14% -1.1 0.14 ± 10% perf-profile.children.cycles-pp.release_pages
1.24 ± 27% -1.1 0.11 ± 17% perf-profile.children.cycles-pp.tick_nohz_next_event
1.26 ± 21% -1.1 0.19 ± 5% perf-profile.children.cycles-pp.__filemap_add_folio
1.26 ± 9% -1.1 0.19 ± 4% perf-profile.children.cycles-pp.__schedule
1.19 ± 14% -1.1 0.13 ± 13% perf-profile.children.cycles-pp.__pagevec_release
1.15 ± 15% -1.0 0.13 ± 25% perf-profile.children.cycles-pp.try_to_free_buffers
1.11 ± 16% -1.0 0.10 ± 15% perf-profile.children.cycles-pp.__folio_start_writeback
1.17 ± 24% -1.0 0.18 ± 8% perf-profile.children.cycles-pp.create_empty_buffers
1.35 ± 8% -1.0 0.40 ± 13% perf-profile.children.cycles-pp.perf_tp_event
1.42 ± 13% -0.9 0.47 ± 13% perf-profile.children.cycles-pp.scheduler_tick
1.05 ± 7% -0.9 0.12 ± 12% perf-profile.children.cycles-pp.__mod_lruvec_page_state
1.31 ± 8% -0.9 0.38 ± 13% perf-profile.children.cycles-pp.perf_event_output_forward
1.31 ± 8% -0.9 0.39 ± 13% perf-profile.children.cycles-pp.__perf_event_overflow
1.06 ± 6% -0.9 0.14 ± 9% perf-profile.children.cycles-pp.xas_load
1.17 ± 8% -0.8 0.34 ± 14% perf-profile.children.cycles-pp.perf_prepare_sample
0.96 ± 22% -0.8 0.14 ± 6% perf-profile.children.cycles-pp.mark_buffer_dirty
0.94 ± 21% -0.8 0.15 ± 6% perf-profile.children.cycles-pp.folio_alloc
1.12 ± 8% -0.8 0.32 ± 15% perf-profile.children.cycles-pp.perf_callchain
1.11 ± 8% -0.8 0.32 ± 15% perf-profile.children.cycles-pp.get_perf_callchain
0.92 ± 19% -0.8 0.15 ± 5% perf-profile.children.cycles-pp.__alloc_pages
0.90 ± 18% -0.8 0.14 ± 9% perf-profile.children.cycles-pp.fault_in_iov_iter_readable
0.86 ± 10% -0.8 0.09 ± 11% perf-profile.children.cycles-pp._raw_spin_lock
0.89 ± 27% -0.7 0.14 ± 7% perf-profile.children.cycles-pp.alloc_page_buffers
0.80 ± 10% -0.7 0.07 ± 15% perf-profile.children.cycles-pp.irq_work_run_list
0.81 ± 17% -0.7 0.07 ± 21% perf-profile.children.cycles-pp.irq_enter_rcu
0.86 ± 18% -0.7 0.13 ± 9% perf-profile.children.cycles-pp.fault_in_readable
0.88 ± 8% -0.7 0.16 ± 4% perf-profile.children.cycles-pp.schedule
0.84 ± 29% -0.7 0.14 ± 9% perf-profile.children.cycles-pp.kmem_cache_alloc
0.78 ± 9% -0.7 0.07 ± 16% perf-profile.children.cycles-pp.asm_sysvec_irq_work
0.78 ± 9% -0.7 0.07 ± 16% perf-profile.children.cycles-pp.sysvec_irq_work
0.78 ± 9% -0.7 0.07 ± 16% perf-profile.children.cycles-pp.__sysvec_irq_work
0.78 ± 9% -0.7 0.07 ± 16% perf-profile.children.cycles-pp.irq_work_single
0.78 ± 9% -0.7 0.07 ± 16% perf-profile.children.cycles-pp.irq_work_run
0.78 ± 9% -0.7 0.07 ± 16% perf-profile.children.cycles-pp._printk
0.78 ± 9% -0.7 0.07 ± 16% perf-profile.children.cycles-pp.vprintk_emit
0.78 ± 9% -0.7 0.07 ± 16% perf-profile.children.cycles-pp.console_unlock
0.78 ± 9% -0.7 0.07 ± 16% perf-profile.children.cycles-pp.call_console_drivers
0.77 ± 19% -0.7 0.06 ± 47% perf-profile.children.cycles-pp.tick_irq_enter
0.84 ± 29% -0.7 0.14 ± 9% perf-profile.children.cycles-pp.alloc_buffer_head
0.80 ± 16% -0.7 0.10 ± 10% perf-profile.children.cycles-pp.shmem_write_begin
0.78 ± 24% -0.7 0.09 ± 30% perf-profile.children.cycles-pp.kmem_cache_free
0.75 ± 10% -0.7 0.06 ± 11% perf-profile.children.cycles-pp.serial8250_console_write
0.75 ± 10% -0.7 0.06 ± 11% perf-profile.children.cycles-pp.uart_console_write
0.77 ± 14% -0.7 0.09 ± 8% perf-profile.children.cycles-pp.lapic_next_deadline
0.76 ± 9% -0.7 0.08 ± 12% perf-profile.children.cycles-pp.__filemap_remove_folio
0.76 ± 17% -0.7 0.10 ± 10% perf-profile.children.cycles-pp.shmem_getpage_gfp
0.72 ± 55% -0.7 0.06 ± 47% perf-profile.children.cycles-pp.tick_nohz_irq_exit
0.72 ± 9% -0.7 0.06 ± 11% perf-profile.children.cycles-pp.wait_for_xmitr
0.72 ± 10% -0.7 0.06 ± 11% perf-profile.children.cycles-pp.serial8250_console_putchar
0.73 ± 23% -0.6 0.08 ± 29% perf-profile.children.cycles-pp.free_buffer_head
0.70 ± 19% -0.6 0.06 ± 19% perf-profile.children.cycles-pp.rebalance_domains
0.69 ± 14% -0.6 0.06 ± 17% perf-profile.children.cycles-pp.perf_mux_hrtimer_handler
0.73 ± 21% -0.6 0.12 ± 6% perf-profile.children.cycles-pp.get_page_from_freelist
0.71 ± 8% -0.6 0.09 ± 5% perf-profile.children.cycles-pp.native_irq_return_iret
0.64 ± 21% -0.6 0.06 ± 14% perf-profile.children.cycles-pp.free_unref_page_list
0.67 ± 23% -0.6 0.10 ± 10% perf-profile.children.cycles-pp.__folio_mark_dirty
0.59 ± 8% -0.6 0.02 ± 99% perf-profile.children.cycles-pp.io_serial_in
0.62 ± 15% -0.6 0.06 ± 11% perf-profile.children.cycles-pp.folio_clear_dirty_for_io
0.62 ± 14% -0.6 0.06 ± 11% perf-profile.children.cycles-pp.sched_clock_cpu
0.60 ± 20% -0.5 0.09 ± 7% perf-profile.children.cycles-pp.folio_add_lru
0.54 ± 16% -0.5 0.06 ± 9% perf-profile.children.cycles-pp.native_sched_clock
0.50 ± 7% -0.5 0.04 ± 71% perf-profile.children.cycles-pp.read_tsc
0.50 ± 10% -0.4 0.06 ± 6% perf-profile.children.cycles-pp.__might_resched
0.55 ± 38% -0.4 0.11 ± 25% perf-profile.children.cycles-pp.start_kernel
0.50 ± 17% -0.4 0.07 ± 10% perf-profile.children.cycles-pp.xas_store
0.49 ± 11% -0.4 0.06 ± 11% perf-profile.children.cycles-pp.__mod_lruvec_state
0.51 ± 19% -0.4 0.08 ± 8% perf-profile.children.cycles-pp.__pagevec_lru_add
0.54 ± 18% -0.4 0.11 ± 11% perf-profile.children.cycles-pp.load_balance
0.47 ± 49% -0.4 0.05 ± 46% perf-profile.children.cycles-pp.ktime_get_update_offsets_now
0.50 ± 21% -0.4 0.08 ± 5% perf-profile.children.cycles-pp.rmqueue
0.46 ± 14% -0.4 0.04 ± 44% perf-profile.children.cycles-pp.irqtime_account_irq
0.47 ± 22% -0.4 0.06 ± 7% perf-profile.children.cycles-pp.ext4_da_get_block_prep
0.43 ± 34% -0.4 0.03 ±102% perf-profile.children.cycles-pp.memcg_slab_free_hook
0.46 ± 8% -0.4 0.06 ± 17% perf-profile.children.cycles-pp.jbd2_journal_try_to_free_buffers
0.47 ± 12% -0.4 0.08 ± 13% perf-profile.children.cycles-pp.perf_callchain_user
0.44 ± 7% -0.4 0.06 ± 8% perf-profile.children.cycles-pp.ksys_read
0.62 ± 7% -0.4 0.24 ± 18% perf-profile.children.cycles-pp.perf_callchain_kernel
0.44 ± 7% -0.4 0.06 ± 8% perf-profile.children.cycles-pp.vfs_read
0.43 ± 8% -0.4 0.06 ± 13% perf-profile.children.cycles-pp.jbd2_journal_grab_journal_head
0.39 ± 7% -0.4 0.03 ±100% perf-profile.children.cycles-pp.free_pcppages_bulk
0.38 ± 8% -0.4 0.02 ± 99% perf-profile.children.cycles-pp._raw_spin_lock_irq
0.40 ± 13% -0.4 0.05 ± 7% perf-profile.children.cycles-pp.__mod_memcg_lruvec_state
0.43 ± 9% -0.3 0.08 ± 8% perf-profile.children.cycles-pp.try_to_wake_up
0.38 ± 10% -0.3 0.04 ± 44% perf-profile.children.cycles-pp.__mod_node_page_state
0.41 ± 13% -0.3 0.07 ± 15% perf-profile.children.cycles-pp.__get_user_nocheck_8
0.40 ± 20% -0.3 0.08 ± 11% perf-profile.children.cycles-pp.find_busiest_group
0.65 ± 9% -0.3 0.33 ± 17% perf-profile.children.cycles-pp.update_curr
0.51 ± 7% -0.3 0.19 ± 17% perf-profile.children.cycles-pp.unwind_next_frame
0.38 ± 21% -0.3 0.06 ± 9% perf-profile.children.cycles-pp.__mem_cgroup_charge
0.37 ± 20% -0.3 0.06 ± 9% perf-profile.children.cycles-pp.__pagevec_lru_add_fn
0.63 ± 9% -0.3 0.32 ± 17% perf-profile.children.cycles-pp.perf_trace_sched_stat_runtime
0.38 ± 17% -0.3 0.08 ± 12% perf-profile.children.cycles-pp.update_sd_lb_stats
0.38 ± 6% -0.3 0.08 ± 8% perf-profile.children.cycles-pp.__libc_start_main
0.32 ± 12% -0.3 0.02 ± 99% perf-profile.children.cycles-pp.read
0.32 ± 21% -0.3 0.04 ± 45% perf-profile.children.cycles-pp.rmqueue_bulk
0.31 ± 42% -0.3 0.04 ± 71% perf-profile.children.cycles-pp.memcg_slab_post_alloc_hook
0.26 ± 25% -0.2 0.02 ± 99% perf-profile.children.cycles-pp.folio_account_dirtied
0.29 ± 20% -0.2 0.07 ± 14% perf-profile.children.cycles-pp.update_sg_lb_stats
0.27 ± 7% -0.2 0.08 ± 7% perf-profile.children.cycles-pp.asm_exc_page_fault
0.24 ± 10% -0.2 0.08 ± 22% perf-profile.children.cycles-pp.__unwind_start
0.18 ± 16% -0.2 0.03 ±100% perf-profile.children.cycles-pp.__orc_find
0.16 ± 16% -0.1 0.04 ± 71% perf-profile.children.cycles-pp.ksys_write
0.16 ± 16% -0.1 0.04 ± 71% perf-profile.children.cycles-pp.vfs_write
0.16 ± 16% -0.1 0.04 ± 71% perf-profile.children.cycles-pp.new_sync_write
0.15 ± 17% -0.1 0.02 ± 99% perf-profile.children.cycles-pp.__libc_write
0.17 ± 18% -0.1 0.07 ± 6% perf-profile.children.cycles-pp.schedule_timeout
0.15 ± 24% -0.1 0.09 ± 13% perf-profile.children.cycles-pp.pick_next_task_fair
0.00 +2.6 2.64 ± 2% perf-profile.children.cycles-pp.rwsem_spin_on_owner
28.20 ± 7% +59.2 87.44 ± 2% perf-profile.children.cycles-pp.ret_from_fork
14.40 ± 21% +71.3 85.71 ± 2% perf-profile.children.cycles-pp.io_wqe_worker
14.25 ± 21% +71.4 85.64 ± 2% perf-profile.children.cycles-pp.io_worker_handle_work
14.23 ± 21% +71.4 85.63 ± 2% perf-profile.children.cycles-pp.io_issue_sqe
14.23 ± 21% +71.4 85.64 ± 2% perf-profile.children.cycles-pp.io_wq_submit_work
14.22 ± 21% +71.4 85.63 ± 2% perf-profile.children.cycles-pp.io_write
14.10 ± 21% +71.5 85.62 ± 2% perf-profile.children.cycles-pp.ext4_buffered_write_iter
0.00 +80.9 80.95 ± 2% perf-profile.children.cycles-pp.osq_lock
0.00 +83.0 83.00 ± 2% perf-profile.children.cycles-pp.rwsem_optimistic_spin
0.00 +83.6 83.63 ± 2% perf-profile.children.cycles-pp.rwsem_down_write_slowpath
40.90 ± 3% -31.3 9.63 ± 20% perf-profile.self.cycles-pp.mwait_idle_with_hints
8.22 ± 9% -7.1 1.08 ± 14% perf-profile.self.cycles-pp.copy_page_from_iter_atomic
5.02 ± 20% -4.3 0.71 ± 5% perf-profile.self.cycles-pp.copy_user_enhanced_fast_string
2.31 ± 61% -2.0 0.26 ± 25% perf-profile.self.cycles-pp.ktime_get
1.89 ± 12% -1.7 0.17 ± 8% perf-profile.self.cycles-pp.cpuidle_enter_state
1.64 ± 15% -1.5 0.14 ± 20% perf-profile.self.cycles-pp.menu_select
1.42 ± 20% -1.3 0.15 ± 7% perf-profile.self.cycles-pp.__block_commit_write
1.26 ± 7% -1.1 0.14 ± 6% perf-profile.self.cycles-pp._raw_spin_lock_irqsave
0.86 ± 5% -0.7 0.12 ± 13% perf-profile.self.cycles-pp.xas_load
0.83 ± 17% -0.7 0.12 ± 10% perf-profile.self.cycles-pp.fault_in_readable
0.77 ± 14% -0.7 0.09 ± 8% perf-profile.self.cycles-pp.lapic_next_deadline
0.69 ± 9% -0.6 0.08 ± 16% perf-profile.self.cycles-pp._raw_spin_lock
0.70 ± 8% -0.6 0.09 ± 5% perf-profile.self.cycles-pp.native_irq_return_iret
0.59 ± 8% -0.6 0.02 ± 99% perf-profile.self.cycles-pp.io_serial_in
0.50 ± 6% -0.5 0.03 ±100% perf-profile.self.cycles-pp.read_tsc
0.51 ± 16% -0.5 0.06 ± 9% perf-profile.self.cycles-pp.native_sched_clock
0.46 ± 10% -0.4 0.06 ± 8% perf-profile.self.cycles-pp.__might_resched
0.43 ± 11% -0.4 0.03 ± 70% perf-profile.self.cycles-pp.ext4_bio_write_page
0.42 ± 8% -0.4 0.05 ± 8% perf-profile.self.cycles-pp.jbd2_journal_grab_journal_head
0.38 ± 11% -0.3 0.03 ± 70% perf-profile.self.cycles-pp.__mod_node_page_state
0.36 ± 15% -0.3 0.02 ± 99% perf-profile.self.cycles-pp.release_pages
0.37 ± 62% -0.3 0.05 ± 45% perf-profile.self.cycles-pp.ktime_get_update_offsets_now
0.21 ± 19% -0.2 0.06 ± 8% perf-profile.self.cycles-pp.update_sg_lb_stats
0.18 ± 16% -0.2 0.03 ±100% perf-profile.self.cycles-pp.__orc_find
0.19 ± 8% -0.1 0.08 ± 15% perf-profile.self.cycles-pp.unwind_next_frame
0.00 +2.6 2.62 ± 2% perf-profile.self.cycles-pp.rwsem_spin_on_owner
0.00 +80.4 80.42 ± 2% perf-profile.self.cycles-pp.osq_lock
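For readers recomputing the comparison above, here is a minimal Python sketch (not part of the lkp tooling) of how the middle column can be read. It assumes that a change ending in "%" is relative to the first commit's value, that a change without "%" on a metric that is itself a percentage is a plain difference in points, and that a perf-profile.calltrace name lists the innermost function first. The helper names are hypothetical. Because the report prints rounded averages, figures recomputed from the displayed values can differ from the printed change in the last digit.

```python
# Hypothetical helpers for reading the comparison rows; not lkp code.

CALLTRACE_PREFIX = "perf-profile.calltrace.cycles-pp."


def relative_change(base: float, new: float) -> float:
    """Percent change of `new` relative to `base` (rows whose change ends in '%')."""
    return (new - base) / base * 100.0


def point_change(base: float, new: float) -> float:
    """Plain difference, for metrics that are already percentages (no '%' suffix)."""
    return new - base


def call_chain(metric: str) -> list:
    """Return the frames of a calltrace metric, outermost caller first.

    Assumption: the metric name lists the innermost function first and
    separates frames with '.'. Symbols that themselves contain dots
    (for example compiler-generated suffixes) would be split incorrectly.
    """
    if not metric.startswith(CALLTRACE_PREFIX):
        raise ValueError("not a perf-profile.calltrace metric: " + metric)
    frames = metric[len(CALLTRACE_PREFIX):].split(".")
    return frames[::-1]


if __name__ == "__main__":
    # iops row: 1081 -> 971.00
    print(f"{relative_change(1081, 971.00):+.1f}%")
    # mpstat.cpu.all.idle% row: 96.73 -> 82.71
    print(f"{point_change(96.73, 82.71):+.1f}")
    # osq_lock row from the calltrace section
    print(" -> ".join(call_chain(
        "perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin."
        "rwsem_down_write_slowpath.ext4_buffered_write_iter.io_write")))
```

Read this way, the largest new entry in the profile (osq_lock, +80.9) sits under rwsem_down_write_slowpath, which is reached from ext4_buffered_write_iter via io_write.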
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://01.org/lkp
View attachment "config-5.18.0-rc1-00003-g584b0180f0f4" of type "text/plain" (162671 bytes)
View attachment "job-script" of type "text/plain" (7959 bytes)
View attachment "job.yaml" of type "text/plain" (5139 bytes)
View attachment "reproduce" of type "text/plain" (296 bytes)