Message-ID: <202412061635.d962a313-lkp@intel.com>
Date: Fri, 6 Dec 2024 17:21:53 +0800
From: kernel test robot <oliver.sang@...el.com>
To: Brian Foster <bfoster@...hat.com>
CC: <oe-lkp@...ts.linux.dev>, <lkp@...el.com>, <linux-kernel@...r.kernel.org>,
Theodore Ts'o <tytso@....edu>, <linux-ext4@...r.kernel.org>,
<oliver.sang@...el.com>
Subject: [linus:master] [ext4] c7fc0366c6: stress-ng.metamix.ops_per_sec 22.9% improvement
Hello,
kernel test robot noticed a 22.9% improvement of stress-ng.metamix.ops_per_sec on:
commit: c7fc0366c65628fd69bfc310affec4918199aae2 ("ext4: partial zero eof block on unaligned inode size extension")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
testcase: stress-ng
config: x86_64-rhel-9.4
compiler: gcc-12
test machine: 8 threads 1 sockets Intel(R) Core(TM) i7-3770K CPU @ 3.50GHz (Ivy Bridge) with 16G memory
parameters:
	nr_threads: 100%
	disk: 1HDD
	testtime: 60s
	fs: ext4
	test: metamix
	cpufreq_governor: performance
Details are as below:
-------------------------------------------------------------------------------------------------->
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20241206/202412061635.d962a313-lkp@intel.com
=========================================================================================
compiler/cpufreq_governor/disk/fs/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
gcc-12/performance/1HDD/ext4/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-ivb-d01/metamix/stress-ng/60s
commit:
25f51ea8ac ("ext4: disambiguate the return value of ext4_dio_write_end_io()")
c7fc0366c6 ("ext4: partial zero eof block on unaligned inode size extension")
25f51ea8ac8144af c7fc0366c65628fd69bfc310aff
---------------- ---------------------------
%stddev %change %stddev
\ | \
553.38 ± 4% +12.1% 620.15 ± 2% uptime.idle
26348 ± 3% -40.4% 15711 ± 7% meminfo.Dirty
16924 ± 2% -32.5% 11430 ± 7% meminfo.Writeback
6.91 ± 9% +163.1% 18.17 ± 7% iostat.cpu.idle
90.85 -13.0% 79.04 iostat.cpu.iowait
1.45 +33.6% 1.93 iostat.cpu.system
1.117e+08 ± 4% +15.9% 1.295e+08 ± 6% sched_debug.cfs_rq:/.avg_vruntime.avg
8990780 ± 48% +201.7% 27121462 ± 23% sched_debug.cfs_rq:/.avg_vruntime.min
1.117e+08 ± 4% +15.9% 1.295e+08 ± 6% sched_debug.cfs_rq:/.min_vruntime.avg
8990780 ± 48% +201.7% 27121462 ± 23% sched_debug.cfs_rq:/.min_vruntime.min
4.41 ± 14% +11.6 16.01 ± 9% mpstat.cpu.all.idle%
93.54 -12.1 81.39 mpstat.cpu.all.iowait%
0.23 ± 2% +0.1 0.31 ± 2% mpstat.cpu.all.irq%
0.07 ± 2% +0.0 0.07 mpstat.cpu.all.soft%
1.13 +0.4 1.54 mpstat.cpu.all.sys%
6.94 ± 9% +162.5% 18.21 ± 7% vmstat.cpu.id
90.85 -13.0% 79.05 vmstat.cpu.wa
31576 +20.8% 38129 vmstat.io.bo
64.71 ± 3% -33.6% 42.98 ± 8% vmstat.procs.b
2893 +2.9% 2976 vmstat.system.in
1911 ± 2% +21.8% 2328 stress-ng.metamix.ops
30.37 +22.9% 37.32 ± 2% stress-ng.metamix.ops_per_sec
4111968 ± 2% +20.4% 4952304 stress-ng.time.file_system_outputs
24323 +8.5% 26386 stress-ng.time.minor_page_faults
6.17 ± 6% +54.1% 9.50 ± 5% stress-ng.time.percent_of_cpu_this_job_got
23964 ± 2% +18.6% 28423 stress-ng.time.voluntary_context_switches
514496 ± 2% +20.4% 619656 proc-vmstat.nr_dirtied
6606 ± 3% -42.4% 3807 ± 5% proc-vmstat.nr_dirty
4338 +3.4% 4486 ± 2% proc-vmstat.nr_page_table_pages
4266 ± 2% -33.9% 2820 ± 7% proc-vmstat.nr_writeback
514089 ± 2% +20.5% 619358 proc-vmstat.nr_written
10850 ± 3% -39.1% 6607 ± 6% proc-vmstat.nr_zone_write_pending
666231 ± 2% +15.4% 769072 proc-vmstat.numa_hit
666231 ± 2% +15.5% 769184 proc-vmstat.numa_local
698850 ± 2% +14.8% 802298 proc-vmstat.pgalloc_normal
678127 ± 3% +15.7% 784461 ± 2% proc-vmstat.pgfree
2088116 ± 2% +19.9% 2503923 proc-vmstat.pgpgout
94023007 +15.2% 1.083e+08 perf-stat.i.branch-instructions
2405282 +15.0% 2765956 perf-stat.i.branch-misses
14.62 ± 2% -1.2 13.38 perf-stat.i.cache-miss-rate%
583290 ± 2% +17.5% 685524 ± 2% perf-stat.i.cache-misses
3648640 ± 2% +29.2% 4714050 perf-stat.i.cache-references
5.461e+08 +24.7% 6.808e+08 ± 2% perf-stat.i.cpu-cycles
164.88 ± 2% +24.0% 204.37 perf-stat.i.cpu-migrations
833.08 +7.0% 891.37 ± 2% perf-stat.i.cycles-between-cache-misses
4.545e+08 +16.6% 5.301e+08 perf-stat.i.instructions
0.61 -1.1% 0.60 perf-stat.i.ipc
15.99 -1.4 14.54 perf-stat.overall.cache-miss-rate%
1.20 +6.9% 1.28 perf-stat.overall.cpi
936.52 ± 2% +6.1% 993.24 perf-stat.overall.cycles-between-cache-misses
0.83 -6.4% 0.78 perf-stat.overall.ipc
92548187 +15.2% 1.066e+08 perf-stat.ps.branch-instructions
2367842 +15.0% 2723489 perf-stat.ps.branch-misses
574185 ± 2% +17.5% 674749 ± 2% perf-stat.ps.cache-misses
3591758 ± 2% +29.2% 4640021 perf-stat.ps.cache-references
5.375e+08 +24.7% 6.701e+08 ± 2% perf-stat.ps.cpu-cycles
162.31 ± 2% +23.9% 201.17 perf-stat.ps.cpu-migrations
4.474e+08 +16.6% 5.218e+08 perf-stat.ps.instructions
2.874e+10 +15.8% 3.328e+10 ± 2% perf-stat.total.instructions
0.02 ± 9% +25.2% 0.02 ± 6% perf-sched.sch_delay.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
0.48 ± 44% +216.9% 1.52 ±106% perf-sched.sch_delay.avg.ms.io_schedule.bit_wait_io.__wait_on_bit.out_of_line_wait_on_bit
0.00 ± 16% +148.1% 0.01 ± 76% perf-sched.sch_delay.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
0.02 ± 10% +26.8% 0.03 ± 8% perf-sched.sch_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
7.94 ± 52% -76.1% 1.90 ±131% perf-sched.sch_delay.max.ms.__cond_resched.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write
0.02 ± 55% +1263.0% 0.21 ±185% perf-sched.sch_delay.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
252.86 ± 13% -46.6% 134.96 ± 19% perf-sched.wait_and_delay.avg.ms.io_schedule.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range
291.92 ± 4% -38.6% 179.21 ± 17% perf-sched.wait_and_delay.avg.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle
45.27 ± 9% -11.1% 40.23 ± 7% perf-sched.wait_and_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
26.86 ± 2% +22.0% 32.76 ± 6% perf-sched.wait_and_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
292.00 ± 12% +71.9% 501.83 ± 16% perf-sched.wait_and_delay.count.io_schedule.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range
323.83 ± 7% +29.0% 417.83 ± 12% perf-sched.wait_and_delay.count.jbd2_log_wait_commit.ext4_sync_file.do_fsync.__x64_sys_fdatasync
2817 ± 2% -17.3% 2329 ± 8% perf-sched.wait_and_delay.count.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
999.12 ± 4% +71.7% 1715 ± 20% perf-sched.wait_and_delay.max.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle
909.77 ± 11% +88.7% 1716 ± 20% perf-sched.wait_and_delay.max.ms.jbd2_log_wait_commit.__jbd2_journal_force_commit.jbd2_journal_force_commit.ext4_sync_file
970.08 ± 4% +51.7% 1471 ± 28% perf-sched.wait_and_delay.max.ms.jbd2_log_wait_commit.ext4_sync_file.do_fsync.__x64_sys_fdatasync
252.77 ± 13% -46.6% 134.90 ± 19% perf-sched.wait_time.avg.ms.io_schedule.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range
291.02 ± 4% -38.8% 178.11 ± 17% perf-sched.wait_time.avg.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle
21.25 ±192% +2117.6% 471.24 ± 90% perf-sched.wait_time.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.do_unlinkat
45.22 ± 9% -11.1% 40.19 ± 7% perf-sched.wait_time.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
26.84 ± 2% +22.0% 32.73 ± 6% perf-sched.wait_time.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
998.61 ± 4% +71.3% 1710 ± 20% perf-sched.wait_time.max.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle
908.10 ± 12% +88.5% 1711 ± 20% perf-sched.wait_time.max.ms.jbd2_log_wait_commit.__jbd2_journal_force_commit.jbd2_journal_force_commit.ext4_sync_file
969.71 ± 4% +51.4% 1468 ± 28% perf-sched.wait_time.max.ms.jbd2_log_wait_commit.ext4_sync_file.do_fsync.__x64_sys_fdatasync
28.45 ±194% +3281.7% 962.20 ± 56% perf-sched.wait_time.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.do_unlinkat
40.18 ± 3% -4.3 35.85 ± 2% perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.common_startup_64
40.18 ± 3% -4.3 35.85 ± 2% perf-profile.calltrace.cycles-pp.start_secondary.common_startup_64
44.88 ± 3% -4.3 40.58 perf-profile.calltrace.cycles-pp.common_startup_64
40.10 ± 3% -4.3 35.80 ± 2% perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.common_startup_64
36.18 ± 4% -4.2 31.96 ± 2% perf-profile.calltrace.cycles-pp.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary
39.12 ± 3% -4.2 34.91 ± 2% perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.common_startup_64
39.55 ± 4% -4.1 35.44 perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry
22.45 ± 4% -1.7 20.71 ± 4% perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
3.22 ± 14% -1.6 1.58 ± 15% perf-profile.calltrace.cycles-pp.poll_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
6.97 ± 7% -1.6 5.40 ± 5% perf-profile.calltrace.cycles-pp.__x64_sys_fdatasync.do_syscall_64.entry_SYSCALL_64_after_hwframe.fdatasync.stress_metamix
6.97 ± 7% -1.6 5.40 ± 5% perf-profile.calltrace.cycles-pp.do_fsync.__x64_sys_fdatasync.do_syscall_64.entry_SYSCALL_64_after_hwframe.fdatasync
6.96 ± 7% -1.6 5.40 ± 5% perf-profile.calltrace.cycles-pp.ext4_sync_file.do_fsync.__x64_sys_fdatasync.do_syscall_64.entry_SYSCALL_64_after_hwframe
7.03 ± 7% -1.6 5.46 ± 6% perf-profile.calltrace.cycles-pp.fdatasync.stress_metamix
6.99 ± 7% -1.6 5.43 ± 6% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.fdatasync.stress_metamix
6.99 ± 7% -1.6 5.43 ± 6% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.fdatasync.stress_metamix
6.17 ± 7% -1.5 4.71 ± 7% perf-profile.calltrace.cycles-pp.file_write_and_wait_range.ext4_sync_file.do_fsync.__x64_sys_fdatasync.do_syscall_64
5.84 ± 8% -1.3 4.51 ± 7% perf-profile.calltrace.cycles-pp.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file.do_fsync.__x64_sys_fdatasync
5.84 ± 8% -1.3 4.51 ± 7% perf-profile.calltrace.cycles-pp.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file.do_fsync
5.83 ± 8% -1.3 4.51 ± 7% perf-profile.calltrace.cycles-pp.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file
5.82 ± 8% -1.3 4.51 ± 7% perf-profile.calltrace.cycles-pp.ext4_writepages.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range
5.82 ± 9% -1.3 4.50 ± 7% perf-profile.calltrace.cycles-pp.ext4_do_writepages.ext4_writepages.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range
5.87 ± 9% -0.9 4.99 ± 6% perf-profile.calltrace.cycles-pp.llseek.stress_metamix
3.42 ± 6% -0.6 2.78 ± 5% perf-profile.calltrace.cycles-pp.mpage_map_and_submit_extent.ext4_do_writepages.ext4_writepages.do_writepages.filemap_fdatawrite_wbc
3.20 ± 5% -0.6 2.64 ± 14% perf-profile.calltrace.cycles-pp.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
2.42 ± 6% -0.5 1.96 ± 15% perf-profile.calltrace.cycles-pp.process_one_work.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
2.09 ± 8% -0.4 1.66 ± 9% perf-profile.calltrace.cycles-pp.mpage_map_and_submit_buffers.mpage_map_and_submit_extent.ext4_do_writepages.ext4_writepages.do_writepages
1.00 ± 16% -0.4 0.59 ± 45% perf-profile.calltrace.cycles-pp.mpage_prepare_extent_to_map.ext4_do_writepages.ext4_writepages.do_writepages.filemap_fdatawrite_wbc
1.79 ± 5% -0.4 1.39 ± 15% perf-profile.calltrace.cycles-pp.entry_SYSRETQ_unsafe_stack.llseek.stress_metamix
1.37 ± 16% -0.3 1.03 ± 11% perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.llseek.stress_metamix
1.62 ± 8% -0.3 1.34 ± 9% perf-profile.calltrace.cycles-pp.mpage_submit_folio.mpage_map_and_submit_buffers.mpage_map_and_submit_extent.ext4_do_writepages.ext4_writepages
0.89 ± 16% -0.2 0.68 ± 17% perf-profile.calltrace.cycles-pp.sched_balance_rq.sched_balance_domains.handle_softirqs.__irq_exit_rcu.sysvec_apic_timer_interrupt
0.00 +0.6 0.62 ± 17% perf-profile.calltrace.cycles-pp.start_this_handle.jbd2__journal_start.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
0.00 +0.8 0.76 ± 13% perf-profile.calltrace.cycles-pp.mark_buffer_dirty.__block_commit_write.block_write_end.ext4_da_do_write_end.generic_perform_write
0.00 +0.8 0.76 ± 25% perf-profile.calltrace.cycles-pp.__ext4_get_inode_loc.ext4_get_inode_loc.ext4_reserve_inode_write.__ext4_mark_inode_dirty.ext4_da_do_write_end
0.00 +0.8 0.76 ± 24% perf-profile.calltrace.cycles-pp.mpage_prepare_extent_to_map.ext4_do_writepages.ext4_normal_submit_inode_data_buffers.jbd2_journal_commit_transaction.kjournald2
0.00 +0.8 0.78 ± 23% perf-profile.calltrace.cycles-pp.ext4_get_inode_loc.ext4_reserve_inode_write.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write
0.00 +0.8 0.78 ± 19% perf-profile.calltrace.cycles-pp.crc32c_pcl_intel_update.ext4_inode_csum.ext4_inode_csum_set.ext4_fill_raw_inode.ext4_do_update_inode
0.00 +0.8 0.78 ± 23% perf-profile.calltrace.cycles-pp.ext4_do_writepages.ext4_normal_submit_inode_data_buffers.jbd2_journal_commit_transaction.kjournald2.kthread
0.00 +0.8 0.78 ± 23% perf-profile.calltrace.cycles-pp.ext4_normal_submit_inode_data_buffers.jbd2_journal_commit_transaction.kjournald2.kthread.ret_from_fork
0.19 ±142% +0.9 1.10 ± 21% perf-profile.calltrace.cycles-pp.jbd2_journal_commit_transaction.kjournald2.kthread.ret_from_fork.ret_from_fork_asm
0.19 ±142% +0.9 1.10 ± 21% perf-profile.calltrace.cycles-pp.kjournald2.kthread.ret_from_fork.ret_from_fork_asm
0.00 +1.0 0.96 ± 9% perf-profile.calltrace.cycles-pp.jbd2__journal_start.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write
0.00 +1.0 0.97 ± 16% perf-profile.calltrace.cycles-pp.__block_commit_write.block_write_end.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
0.00 +1.0 1.03 ± 16% perf-profile.calltrace.cycles-pp.ext4_reserve_inode_write.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
0.00 +1.0 1.03 ± 16% perf-profile.calltrace.cycles-pp.block_write_end.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write
0.00 +1.1 1.08 ± 8% perf-profile.calltrace.cycles-pp.ext4_inode_csum.ext4_inode_csum_set.ext4_fill_raw_inode.ext4_do_update_inode.ext4_mark_iloc_dirty
0.00 +1.1 1.08 ± 18% perf-profile.calltrace.cycles-pp.memset_orig.zero_user_segments.__ext4_block_zero_page_range.ext4_da_do_write_end.generic_perform_write
0.00 +1.2 1.15 ± 9% perf-profile.calltrace.cycles-pp.ext4_inode_csum_set.ext4_fill_raw_inode.ext4_do_update_inode.ext4_mark_iloc_dirty.__ext4_mark_inode_dirty
0.00 +1.2 1.16 ± 19% perf-profile.calltrace.cycles-pp.zero_user_segments.__ext4_block_zero_page_range.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
0.00 +1.7 1.70 ± 11% perf-profile.calltrace.cycles-pp.ext4_fill_raw_inode.ext4_do_update_inode.ext4_mark_iloc_dirty.__ext4_mark_inode_dirty.ext4_da_do_write_end
0.00 +1.7 1.71 ± 10% perf-profile.calltrace.cycles-pp.__ext4_block_zero_page_range.ext4_zero_partial_blocks.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
0.00 +1.8 1.78 ± 9% perf-profile.calltrace.cycles-pp.ext4_zero_partial_blocks.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write
0.00 +1.9 1.88 ± 13% perf-profile.calltrace.cycles-pp.__ext4_block_zero_page_range.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write
0.00 +2.2 2.16 ± 9% perf-profile.calltrace.cycles-pp.ext4_do_update_inode.ext4_mark_iloc_dirty.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write
0.00 +2.3 2.34 ± 8% perf-profile.calltrace.cycles-pp.ext4_mark_iloc_dirty.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
0.00 +3.4 3.42 ± 5% perf-profile.calltrace.cycles-pp.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write
43.79 ± 4% +5.7 49.45 perf-profile.calltrace.cycles-pp.stress_metamix
13.98 ± 10% +8.6 22.60 ± 3% perf-profile.calltrace.cycles-pp.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write.do_syscall_64
15.52 ± 10% +8.7 24.25 ± 3% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.write.stress_metamix
15.46 ± 10% +8.7 24.21 ± 3% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.write.stress_metamix
14.55 ± 10% +8.8 23.32 ± 2% perf-profile.calltrace.cycles-pp.ext4_buffered_write_iter.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
14.97 ± 10% +8.8 23.79 ± 2% perf-profile.calltrace.cycles-pp.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
15.16 ± 10% +8.8 24.01 ± 2% perf-profile.calltrace.cycles-pp.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write.stress_metamix
17.88 ± 10% +8.9 26.79 ± 2% perf-profile.calltrace.cycles-pp.write.stress_metamix
0.00 +10.0 10.04 ± 6% perf-profile.calltrace.cycles-pp.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write
40.18 ± 3% -4.3 35.85 ± 2% perf-profile.children.cycles-pp.start_secondary
44.88 ± 3% -4.3 40.58 perf-profile.children.cycles-pp.common_startup_64
44.88 ± 3% -4.3 40.58 perf-profile.children.cycles-pp.cpu_startup_entry
44.87 ± 3% -4.3 40.57 perf-profile.children.cycles-pp.do_idle
43.72 ± 3% -4.2 39.52 perf-profile.children.cycles-pp.cpuidle_idle_call
40.17 ± 4% -4.1 36.03 perf-profile.children.cycles-pp.cpuidle_enter_state
40.30 ± 4% -4.1 36.18 perf-profile.children.cycles-pp.cpuidle_enter
22.45 ± 4% -1.7 20.71 ± 4% perf-profile.children.cycles-pp.intel_idle
3.44 ± 14% -1.7 1.79 ± 16% perf-profile.children.cycles-pp.poll_idle
7.44 ± 7% -1.6 5.84 ± 6% perf-profile.children.cycles-pp.do_fsync
7.43 ± 7% -1.6 5.84 ± 6% perf-profile.children.cycles-pp.ext4_sync_file
7.01 ± 7% -1.6 5.45 ± 6% perf-profile.children.cycles-pp.__x64_sys_fdatasync
7.07 ± 7% -1.5 5.54 ± 6% perf-profile.children.cycles-pp.fdatasync
6.20 ± 7% -1.5 4.74 ± 7% perf-profile.children.cycles-pp.file_write_and_wait_range
5.89 ± 8% -1.3 4.54 ± 7% perf-profile.children.cycles-pp.do_writepages
5.87 ± 8% -1.3 4.54 ± 7% perf-profile.children.cycles-pp.__filemap_fdatawrite_range
5.87 ± 8% -1.3 4.54 ± 7% perf-profile.children.cycles-pp.filemap_fdatawrite_wbc
5.86 ± 8% -1.3 4.54 ± 7% perf-profile.children.cycles-pp.ext4_writepages
6.07 ± 9% -0.9 5.19 ± 6% perf-profile.children.cycles-pp.llseek
3.42 ± 6% -0.6 2.78 ± 6% perf-profile.children.cycles-pp.mpage_map_and_submit_extent
3.21 ± 5% -0.6 2.64 ± 14% perf-profile.children.cycles-pp.worker_thread
3.42 ± 10% -0.5 2.90 ± 9% perf-profile.children.cycles-pp.handle_softirqs
2.42 ± 6% -0.5 1.96 ± 15% perf-profile.children.cycles-pp.process_one_work
2.09 ± 8% -0.4 1.66 ± 9% perf-profile.children.cycles-pp.mpage_map_and_submit_buffers
0.34 ± 16% -0.2 0.13 ± 55% perf-profile.children.cycles-pp.mpage_process_page_bufs
0.50 ± 23% -0.2 0.33 ± 28% perf-profile.children.cycles-pp.native_irq_return_iret
0.24 ± 31% -0.1 0.13 ± 36% perf-profile.children.cycles-pp.ext4_file_read_iter
0.25 ± 20% -0.1 0.15 ± 8% perf-profile.children.cycles-pp.khugepaged
0.24 ± 17% -0.1 0.15 ± 8% perf-profile.children.cycles-pp.hpage_collapse_scan_pmd
0.24 ± 17% -0.1 0.15 ± 8% perf-profile.children.cycles-pp.khugepaged_scan_mm_slot
0.23 ± 13% -0.1 0.15 ± 12% perf-profile.children.cycles-pp.collapse_huge_page
0.17 ± 13% -0.0 0.14 ± 7% perf-profile.children.cycles-pp.__collapse_huge_page_copy
0.07 ± 51% +0.1 0.14 ± 27% perf-profile.children.cycles-pp.tmigr_quick_check
0.11 ± 48% +0.1 0.21 ± 15% perf-profile.children.cycles-pp.tmigr_requires_handle_remote
0.00 +0.1 0.14 ± 41% perf-profile.children.cycles-pp.jbd2_journal_dirty_metadata
0.12 ± 72% +0.2 0.27 ± 19% perf-profile.children.cycles-pp.inode_to_bdi
0.13 ± 55% +0.2 0.32 ± 25% perf-profile.children.cycles-pp.jbd2_journal_get_write_access
0.01 ±223% +0.2 0.21 ± 45% perf-profile.children.cycles-pp.add_transaction_credits
0.30 ± 42% +0.2 0.50 ± 14% perf-profile.children.cycles-pp.__find_get_block
0.06 ± 75% +0.2 0.27 ± 20% perf-profile.children.cycles-pp.folio_mark_accessed
0.31 ± 41% +0.2 0.54 ± 13% perf-profile.children.cycles-pp.bdev_getblk
0.03 ±145% +0.2 0.28 ± 27% perf-profile.children.cycles-pp.__ext4_handle_dirty_metadata
0.01 ±223% +0.2 0.26 ± 27% perf-profile.children.cycles-pp.stop_this_handle
0.05 ±165% +0.3 0.31 ± 13% perf-profile.children.cycles-pp.ext4_inode_journal_mode
0.00 +0.3 0.28 ± 26% perf-profile.children.cycles-pp.filemap_dirty_folio
0.37 ± 15% +0.3 0.65 ± 23% perf-profile.children.cycles-pp.__cond_resched
0.14 ± 52% +0.3 0.43 ± 18% perf-profile.children.cycles-pp.__ext4_journal_get_write_access
0.00 +0.3 0.29 ± 33% perf-profile.children.cycles-pp.folio_redirty_for_writepage
0.36 ± 25% +0.4 0.75 ± 10% perf-profile.children.cycles-pp.filemap_get_entry
0.10 ± 80% +0.4 0.55 ± 9% perf-profile.children.cycles-pp.jbd2_journal_stop
0.00 +0.5 0.45 ± 18% perf-profile.children.cycles-pp.jbd2_journal_file_inode
0.10 ± 80% +0.5 0.61 ± 11% perf-profile.children.cycles-pp.__ext4_journal_stop
0.22 ± 57% +0.6 0.86 ± 27% perf-profile.children.cycles-pp.__ext4_get_inode_loc
0.22 ± 58% +0.6 0.87 ± 27% perf-profile.children.cycles-pp.ext4_get_inode_loc
0.44 ± 27% +0.7 1.10 ± 21% perf-profile.children.cycles-pp.kjournald2
0.44 ± 28% +0.7 1.10 ± 21% perf-profile.children.cycles-pp.jbd2_journal_commit_transaction
0.05 ±103% +0.7 0.72 ± 11% perf-profile.children.cycles-pp.start_this_handle
0.42 ± 15% +0.8 1.18 ± 13% perf-profile.children.cycles-pp.crc32c_pcl_intel_update
0.00 +0.8 0.78 ± 23% perf-profile.children.cycles-pp.ext4_normal_submit_inode_data_buffers
0.28 ± 41% +1.0 1.23 ± 17% perf-profile.children.cycles-pp.ext4_reserve_inode_write
0.08 ± 74% +1.0 1.04 ± 6% perf-profile.children.cycles-pp.jbd2__journal_start
0.26 ± 23% +1.0 1.23 ± 5% perf-profile.children.cycles-pp.ext4_inode_csum
0.28 ± 19% +1.0 1.32 ± 6% perf-profile.children.cycles-pp.ext4_inode_csum_set
3.99 ± 13% +1.1 5.13 ± 6% perf-profile.children.cycles-pp.__filemap_get_folio
3.80 ± 18% +1.2 4.95 ± 6% perf-profile.children.cycles-pp.memset_orig
3.80 ± 18% +1.4 5.20 ± 5% perf-profile.children.cycles-pp.zero_user_segments
0.40 ± 19% +1.6 1.96 ± 11% perf-profile.children.cycles-pp.ext4_fill_raw_inode
0.00 +1.8 1.80 ± 9% perf-profile.children.cycles-pp.ext4_zero_partial_blocks
0.55 ± 16% +2.0 2.55 ± 7% perf-profile.children.cycles-pp.ext4_do_update_inode
0.58 ± 19% +2.2 2.78 ± 6% perf-profile.children.cycles-pp.ext4_mark_iloc_dirty
0.85 ± 16% +3.2 4.03 ± 4% perf-profile.children.cycles-pp.__ext4_mark_inode_dirty
0.00 +3.6 3.61 ± 9% perf-profile.children.cycles-pp.__ext4_block_zero_page_range
38.33 ± 4% +5.5 43.88 ± 2% perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
38.13 ± 4% +5.6 43.69 ± 2% perf-profile.children.cycles-pp.do_syscall_64
43.79 ± 4% +5.7 49.45 perf-profile.children.cycles-pp.stress_metamix
13.98 ± 10% +8.6 22.61 ± 3% perf-profile.children.cycles-pp.generic_perform_write
14.57 ± 10% +8.8 23.33 ± 2% perf-profile.children.cycles-pp.ext4_buffered_write_iter
15.14 ± 9% +8.8 23.92 ± 2% perf-profile.children.cycles-pp.vfs_write
15.35 ± 9% +8.8 24.15 ± 2% perf-profile.children.cycles-pp.ksys_write
18.28 ± 9% +8.8 27.11 ± 2% perf-profile.children.cycles-pp.write
0.00 +10.0 10.04 ± 6% perf-profile.children.cycles-pp.ext4_da_do_write_end
22.43 ± 4% -1.7 20.71 ± 4% perf-profile.self.cycles-pp.intel_idle
3.24 ± 15% -1.6 1.69 ± 18% perf-profile.self.cycles-pp.poll_idle
0.34 ± 16% -0.2 0.13 ± 53% perf-profile.self.cycles-pp.mpage_process_page_bufs
0.50 ± 23% -0.2 0.33 ± 28% perf-profile.self.cycles-pp.native_irq_return_iret
0.24 ± 31% -0.1 0.12 ± 30% perf-profile.self.cycles-pp.ext4_file_read_iter
0.01 ±223% +0.1 0.10 ± 33% perf-profile.self.cycles-pp.mmap_region
0.00 +0.1 0.11 ± 34% perf-profile.self.cycles-pp.jbd2_journal_dirty_metadata
0.01 ±223% +0.1 0.15 ± 55% perf-profile.self.cycles-pp.jbd2_journal_get_write_access
0.10 ± 85% +0.1 0.24 ± 20% perf-profile.self.cycles-pp.inode_to_bdi
0.00 +0.1 0.14 ± 32% perf-profile.self.cycles-pp.jbd2__journal_start
0.00 +0.1 0.14 ± 25% perf-profile.self.cycles-pp.jbd2_journal_file_inode
0.18 ± 43% +0.2 0.34 ± 11% perf-profile.self.cycles-pp.__cond_resched
0.05 ±102% +0.2 0.22 ± 26% perf-profile.self.cycles-pp.ext4_do_update_inode
0.07 ± 78% +0.2 0.24 ± 43% perf-profile.self.cycles-pp.__ext4_get_inode_loc
0.06 ± 75% +0.2 0.24 ± 33% perf-profile.self.cycles-pp.folio_mark_accessed
0.00 +0.2 0.18 ± 43% perf-profile.self.cycles-pp.ext4_da_do_write_end
0.00 +0.2 0.19 ± 38% perf-profile.self.cycles-pp.ext4_mark_iloc_dirty
0.03 ±142% +0.2 0.22 ± 32% perf-profile.self.cycles-pp.ext4_inode_csum
0.00 +0.2 0.20 ± 45% perf-profile.self.cycles-pp.add_transaction_credits
0.00 +0.2 0.20 ± 38% perf-profile.self.cycles-pp.__ext4_block_zero_page_range
0.00 +0.2 0.21 ± 37% perf-profile.self.cycles-pp.stop_this_handle
0.71 ± 19% +0.2 0.93 ± 15% perf-profile.self.cycles-pp._raw_spin_lock
0.10 ± 55% +0.2 0.34 ± 9% perf-profile.self.cycles-pp.filemap_get_entry
0.05 ±165% +0.2 0.29 ± 12% perf-profile.self.cycles-pp.ext4_inode_journal_mode
0.07 ± 50% +0.3 0.34 ± 32% perf-profile.self.cycles-pp.zero_user_segments
0.00 +0.3 0.34 ± 7% perf-profile.self.cycles-pp.start_this_handle
0.05 ±103% +0.5 0.50 ± 24% perf-profile.self.cycles-pp.ext4_fill_raw_inode
0.19 ± 19% +0.7 0.89 ± 9% perf-profile.self.cycles-pp.crc32c_pcl_intel_update
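The %change column above can be cross-checked against the two mean columns. The following is a minimal sketch in Python, assuming (as the values in this table suggest) that ordinary metrics report a relative change, while metrics that are already percentages (the mpstat `%` rows and the perf-profile rows) report an absolute difference in percentage points. The function names are illustrative and are not part of lkp-tests.

```python
def relative_change(base: float, new: float) -> float:
    """Relative change in percent, as shown for rows such as
    stress-ng.metamix.ops_per_sec or meminfo.Dirty."""
    return (new - base) / base * 100.0


def point_delta(base: float, new: float) -> float:
    """Absolute difference in percentage points, as apparently used for
    rows whose values are already percentages (assumption, see lead-in)."""
    return new - base


if __name__ == "__main__":
    # Means copied from the table: parent commit 25f51ea8ac vs c7fc0366c6.
    print(f"stress-ng.metamix.ops_per_sec: {relative_change(30.37, 37.32):+.1f}%")
    print(f"meminfo.Dirty:                 {relative_change(26348, 15711):+.1f}%")
    print(f"mpstat.cpu.all.idle%:          {point_delta(4.41, 16.01):+.1f}")
```

With the means listed in the table this reproduces the reported +22.9%, -40.4% and +11.6 figures. Other rows may differ in the last digit, because the table shows rounded means.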
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki