[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <202411301528.342383d8-lkp@intel.com>
Date: Sat, 30 Nov 2024 16:07:10 +0800
From: kernel test robot <oliver.sang@...el.com>
To: Rik van Riel <riel@...riel.com>
CC: <oe-lkp@...ts.linux.dev>, <lkp@...el.com>, <linux-kernel@...r.kernel.org>,
Ingo Molnar <mingo@...nel.org>, Andy Lutomirski <luto@...nel.org>, "Peter
Zijlstra" <peterz@...radead.org>, Linus Torvalds
<torvalds@...ux-foundation.org>, <oliver.sang@...el.com>
Subject: [linus:master] [x86/mm/tlb] 7e33001b8b:
will-it-scale.per_thread_ops 20.7% improvement
Hello,
in this test, we don't have CONFIG_DEBUG_VM.
# CONFIG_DEBUG_VM is not set
below report is just FYI.
kernel test robot noticed a 20.7% improvement of will-it-scale.per_thread_ops on:
commit: 7e33001b8b9a78062679e0fdf5b0842a49063135 ("x86/mm/tlb: Put cpumask_test_cpu() check in switch_mm_irqs_off() under CONFIG_DEBUG_VM")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
testcase: will-it-scale
config: x86_64-rhel-9.4
compiler: gcc-12
test machine: 104 threads 2 sockets (Skylake) with 192G memory
parameters:
nr_task: 100%
mode: thread
test: context_switch1
cpufreq_governor: performance
Details are as below:
-------------------------------------------------------------------------------------------------->
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20241130/202411301528.342383d8-lkp@intel.com
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase:
gcc-12/performance/x86_64-rhel-9.4/thread/100%/debian-12-x86_64-20240206.cgz/lkp-skl-fpga01/context_switch1/will-it-scale
commit:
abe949ce0f ("Merge tag 'v6.12-rc7' into x86/mm, to pick up fixes before applying new patches")
7e33001b8b ("x86/mm/tlb: Put cpumask_test_cpu() check in switch_mm_irqs_off() under CONFIG_DEBUG_VM")
abe949ce0f9be943 7e33001b8b9a78062679e0fdf5b
---------------- ---------------------------
%stddev %change %stddev
\ | \
28.28 ± 5% +4.7 32.96 ± 2% mpstat.cpu.all.usr%
68316 ± 45% -43.0% 38971 ± 63% numa-numastat.node0.other_node
68316 ± 45% -43.0% 38971 ± 63% numa-vmstat.node0.numa_other
0.01 ± 72% +140.0% 0.01 ± 18% perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown].[unknown]
16400085 ± 5% +22.8% 20143765 ± 2% vmstat.system.cs
17103848 +20.7% 20638715 will-it-scale.104.threads
164459 +20.7% 198448 will-it-scale.per_thread_ops
17103848 +20.7% 20638715 will-it-scale.workload
384.50 ± 28% -67.6% 124.60 ± 36% perf-c2c.DRAM.local
1068 ± 8% -77.7% 238.40 ± 29% perf-c2c.DRAM.remote
8314 ± 8% -50.2% 4139 ± 30% perf-c2c.HITM.local
8446 ± 8% -49.1% 4297 ± 28% perf-c2c.HITM.total
0.21 ± 8% -23.3% 0.16 ± 6% sched_debug.cfs_rq:/.h_nr_running.stddev
10757 ± 18% -15.3% 9106 ± 7% sched_debug.cfs_rq:/.load.avg
0.83 +52.0% 1.27 ± 13% sched_debug.cpu.nr_running.min
0.24 ± 4% -41.3% 0.14 ± 11% sched_debug.cpu.nr_running.stddev
24327344 +20.3% 29272455 sched_debug.cpu.nr_switches.avg
25987818 +14.9% 29865135 sched_debug.cpu.nr_switches.max
21699605 ± 3% +21.5% 26368088 sched_debug.cpu.nr_switches.min
1360372 ± 13% -58.0% 571428 ± 41% sched_debug.cpu.nr_switches.stddev
22.94 ± 11% -32.0% 15.60 ± 25% sched_debug.cpu.nr_uninterruptible.max
0.35 ± 26% -43.3% 0.20 ± 39% perf-stat.i.MPKI
1.888e+10 ± 5% +21.0% 2.286e+10 ± 2% perf-stat.i.branch-instructions
2.648e+08 ± 5% +24.1% 3.287e+08 ± 2% perf-stat.i.branch-misses
26642185 ± 15% -28.3% 19102179 ± 24% perf-stat.i.cache-misses
1.078e+08 ± 10% -28.1% 77522633 ± 17% perf-stat.i.cache-references
16520503 ± 5% +22.9% 20296309 ± 2% perf-stat.i.context-switches
2.99 -16.4% 2.50 perf-stat.i.cpi
10422 ± 10% +57.2% 16379 ± 37% perf-stat.i.cycles-between-cache-misses
9.4e+10 ± 5% +21.2% 1.139e+11 ± 2% perf-stat.i.instructions
0.34 +18.6% 0.40 perf-stat.i.ipc
158.85 ± 5% +22.9% 195.15 ± 2% perf-stat.i.metric.K/sec
0.28 ± 12% -40.7% 0.17 ± 24% perf-stat.overall.MPKI
1.40 +0.0 1.44 perf-stat.overall.branch-miss-rate%
2.98 -16.6% 2.49 perf-stat.overall.cpi
10679 ± 10% +51.1% 16135 ± 34% perf-stat.overall.cycles-between-cache-misses
0.34 +19.9% 0.40 perf-stat.overall.ipc
1710303 -1.5% 1684995 perf-stat.overall.path-length
1.882e+10 ± 5% +21.0% 2.278e+10 ± 2% perf-stat.ps.branch-instructions
2.639e+08 ± 5% +24.2% 3.277e+08 ± 2% perf-stat.ps.branch-misses
26554647 ± 15% -28.3% 19035653 ± 24% perf-stat.ps.cache-misses
1.074e+08 ± 10% -28.0% 77345133 ± 17% perf-stat.ps.cache-references
16466063 ± 5% +22.9% 20230564 ± 2% perf-stat.ps.context-switches
9.369e+10 ± 5% +21.2% 1.135e+11 ± 2% perf-stat.ps.instructions
2.925e+13 +18.9% 3.478e+13 perf-stat.total.instructions
10.59 ± 5% -6.1 4.44 ± 3% perf-profile.calltrace.cycles-pp.__switch_to_asm.read
9.09 ± 5% -5.5 3.55 ± 3% perf-profile.calltrace.cycles-pp.__switch_to_asm.write
6.14 ± 2% -1.0 5.11 perf-profile.calltrace.cycles-pp.__schedule.schedule.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
6.30 -1.0 5.30 perf-profile.calltrace.cycles-pp.schedule.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
10.11 -0.6 9.49 perf-profile.calltrace.cycles-pp.__schedule.schedule.pipe_read.vfs_read.ksys_read
10.30 -0.6 9.74 perf-profile.calltrace.cycles-pp.schedule.pipe_read.vfs_read.ksys_read.do_syscall_64
8.56 -0.5 8.04 perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
0.81 +0.1 0.93 perf-profile.calltrace.cycles-pp.switch_fpu_return.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
0.60 +0.1 0.72 perf-profile.calltrace.cycles-pp._copy_to_iter.copy_page_to_iter.pipe_read.vfs_read.ksys_read
0.79 +0.1 0.93 perf-profile.calltrace.cycles-pp.fdget_pos.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
0.54 ± 4% +0.1 0.68 perf-profile.calltrace.cycles-pp.prepare_task_switch.__schedule.schedule.pipe_read.vfs_read
0.73 ± 2% +0.2 0.89 perf-profile.calltrace.cycles-pp.copy_page_to_iter.pipe_read.vfs_read.ksys_read.do_syscall_64
0.68 ± 2% +0.2 0.84 perf-profile.calltrace.cycles-pp._copy_from_iter.copy_page_from_iter.pipe_write.vfs_write.ksys_write
0.66 ± 2% +0.2 0.83 perf-profile.calltrace.cycles-pp.__rseq_handle_notify_resume.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
0.79 +0.2 0.96 perf-profile.calltrace.cycles-pp.fdget_pos.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
0.67 +0.2 0.85 perf-profile.calltrace.cycles-pp.prepare_to_wait_event.pipe_read.vfs_read.ksys_read.do_syscall_64
0.52 ± 2% +0.2 0.71 perf-profile.calltrace.cycles-pp.mutex_lock.pipe_read.vfs_read.ksys_read.do_syscall_64
0.82 ± 3% +0.2 1.03 ± 3% perf-profile.calltrace.cycles-pp.update_curr.dequeue_entity.dequeue_entities.dequeue_task_fair.__schedule
0.97 ± 2% +0.2 1.20 perf-profile.calltrace.cycles-pp.copy_page_from_iter.pipe_write.vfs_write.ksys_write.do_syscall_64
0.95 ± 2% +0.2 1.20 ± 2% perf-profile.calltrace.cycles-pp.__rseq_handle_notify_resume.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
0.77 ± 3% +0.2 1.01 ± 2% perf-profile.calltrace.cycles-pp.requeue_delayed_entity.enqueue_task.try_to_wake_up.autoremove_wake_function.__wake_up_common
0.78 ± 4% +0.2 1.03 ± 3% perf-profile.calltrace.cycles-pp.update_curr.check_preempt_wakeup_fair.wakeup_preempt.try_to_wake_up.autoremove_wake_function
1.17 +0.2 1.42 perf-profile.calltrace.cycles-pp.switch_fpu_return.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
0.66 ± 2% +0.3 0.93 perf-profile.calltrace.cycles-pp.atime_needs_update.touch_atime.pipe_read.vfs_read.ksys_read
1.60 +0.3 1.87 perf-profile.calltrace.cycles-pp.restore_fpregs_from_fpstate.switch_fpu_return.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.92 +0.3 2.19 ± 2% perf-profile.calltrace.cycles-pp.dequeue_entities.dequeue_task_fair.__schedule.schedule.pipe_read
0.83 ± 2% +0.3 1.14 perf-profile.calltrace.cycles-pp.touch_atime.pipe_read.vfs_read.ksys_read.do_syscall_64
2.15 +0.3 2.47 ± 2% perf-profile.calltrace.cycles-pp.dequeue_task_fair.__schedule.schedule.pipe_read.vfs_read
1.08 ± 3% +0.3 1.41 perf-profile.calltrace.cycles-pp.enqueue_task.try_to_wake_up.autoremove_wake_function.__wake_up_common.__wake_up_sync_key
1.60 ± 2% +0.3 1.94 ± 2% perf-profile.calltrace.cycles-pp.dequeue_entity.dequeue_entities.dequeue_task_fair.__schedule.schedule
1.58 ± 2% +0.4 2.00 perf-profile.calltrace.cycles-pp.put_prev_entity.pick_next_task_fair.__pick_next_task.__schedule.schedule
1.06 ± 23% +0.4 1.49 perf-profile.calltrace.cycles-pp.set_next_entity.pick_next_task_fair.__pick_next_task.__schedule.schedule
1.85 ± 2% +0.4 2.30 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.read
1.36 ± 3% +0.5 1.83 perf-profile.calltrace.cycles-pp.check_preempt_wakeup_fair.wakeup_preempt.try_to_wake_up.autoremove_wake_function.__wake_up_common
2.01 +0.5 2.48 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.write
0.08 ±223% +0.5 0.59 perf-profile.calltrace.cycles-pp.prepare_task_switch.__schedule.schedule.syscall_exit_to_user_mode.do_syscall_64
0.08 ±223% +0.5 0.59 perf-profile.calltrace.cycles-pp.update_load_avg.dequeue_entity.dequeue_entities.dequeue_task_fair.__schedule
1.36 ± 3% +0.5 1.87 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_safe_stack.read
1.28 ± 16% +0.5 1.80 ± 2% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_safe_stack.write
0.00 +0.5 0.52 perf-profile.calltrace.cycles-pp.os_xsave.read
2.42 ± 2% +0.5 2.95 perf-profile.calltrace.cycles-pp.pick_next_task_fair.__pick_next_task.__schedule.schedule.syscall_exit_to_user_mode
1.56 ± 3% +0.5 2.11 perf-profile.calltrace.cycles-pp.wakeup_preempt.try_to_wake_up.autoremove_wake_function.__wake_up_common.__wake_up_sync_key
1.82 ± 2% +0.5 2.37 perf-profile.calltrace.cycles-pp.pick_task_fair.pick_next_task_fair.__pick_next_task.__schedule.schedule
2.55 ± 2% +0.6 3.10 perf-profile.calltrace.cycles-pp.__pick_next_task.__schedule.schedule.syscall_exit_to_user_mode.do_syscall_64
0.00 +0.6 0.55 ± 2% perf-profile.calltrace.cycles-pp.pick_eevdf.pick_task_fair.pick_next_task_fair.__pick_next_task.__schedule
0.62 ± 2% +0.7 1.28 perf-profile.calltrace.cycles-pp.rseq_ip_fixup.__rseq_handle_notify_resume.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
2.88 +0.7 3.58 perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
4.66 ± 2% +0.8 5.43 perf-profile.calltrace.cycles-pp.entry_SYSRETQ_unsafe_stack.write
14.94 +0.8 15.76 perf-profile.calltrace.cycles-pp.pipe_read.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe
2.82 ± 2% +0.8 3.66 perf-profile.calltrace.cycles-pp.pick_next_task_fair.__pick_next_task.__schedule.schedule.pipe_read
4.45 ± 6% +0.9 5.31 ± 6% perf-profile.calltrace.cycles-pp.try_to_wake_up.autoremove_wake_function.__wake_up_common.__wake_up_sync_key.pipe_write
2.97 ± 2% +0.9 3.85 perf-profile.calltrace.cycles-pp.__pick_next_task.__schedule.schedule.pipe_read.vfs_read
4.88 ± 2% +0.9 5.77 perf-profile.calltrace.cycles-pp.entry_SYSRETQ_unsafe_stack.read
4.70 ± 6% +1.0 5.66 ± 5% perf-profile.calltrace.cycles-pp.autoremove_wake_function.__wake_up_common.__wake_up_sync_key.pipe_write.vfs_write
6.10 +1.0 7.08 perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.read
4.87 ± 6% +1.0 5.89 ± 5% perf-profile.calltrace.cycles-pp.__wake_up_common.__wake_up_sync_key.pipe_write.vfs_write.ksys_write
6.16 +1.1 7.21 perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.write
15.81 +1.1 16.87 perf-profile.calltrace.cycles-pp.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
5.28 ± 5% +1.1 6.38 ± 5% perf-profile.calltrace.cycles-pp.__wake_up_sync_key.pipe_write.vfs_write.ksys_write.do_syscall_64
17.09 +1.4 18.47 perf-profile.calltrace.cycles-pp.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
8.48 ± 4% +2.0 10.52 ± 2% perf-profile.calltrace.cycles-pp.pipe_write.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
9.18 ± 3% +2.2 11.36 ± 2% perf-profile.calltrace.cycles-pp.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
19.80 ± 2% +2.2 22.00 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
20.67 +2.4 23.05 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
10.43 ± 3% +2.5 12.90 ± 2% perf-profile.calltrace.cycles-pp.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
23.09 +2.8 25.89 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.write
24.12 +3.0 27.11 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.read
19.71 ± 5% -11.6 8.12 ± 3% perf-profile.children.cycles-pp.__switch_to_asm
4.83 ± 6% -4.8 0.06 perf-profile.children.cycles-pp.switch_mm_irqs_off
16.33 -1.6 14.71 perf-profile.children.cycles-pp.__schedule
16.61 -1.6 15.04 perf-profile.children.cycles-pp.schedule
0.43 ± 6% -0.3 0.12 ± 22% perf-profile.children.cycles-pp.ttwu_do_activate
0.54 ± 3% -0.2 0.35 ± 7% perf-profile.children.cycles-pp.enqueue_task_fair
0.18 ± 7% -0.1 0.03 ± 82% perf-profile.children.cycles-pp.enqueue_entity
0.05 +0.0 0.06 perf-profile.children.cycles-pp.init_wait_entry
0.06 ± 7% +0.0 0.08 perf-profile.children.cycles-pp.security_file_permission
0.05 ± 8% +0.0 0.07 perf-profile.children.cycles-pp.__x64_sys_read
0.08 ± 5% +0.0 0.10 ± 3% perf-profile.children.cycles-pp.__x64_sys_write
0.09 ± 4% +0.0 0.11 ± 4% perf-profile.children.cycles-pp.kill_fasync
0.11 ± 6% +0.0 0.13 ± 3% perf-profile.children.cycles-pp.dequeue_task
0.04 ± 44% +0.0 0.06 ± 6% perf-profile.children.cycles-pp.psi_account_irqtime
0.05 ± 8% +0.0 0.07 ± 6% perf-profile.children.cycles-pp.timestamp_truncate
0.10 ± 3% +0.0 0.12 ± 3% perf-profile.children.cycles-pp.resched_curr
0.08 ± 5% +0.0 0.11 perf-profile.children.cycles-pp.mem_cgroup_handle_over_high
0.16 ± 4% +0.0 0.18 ± 2% perf-profile.children.cycles-pp.__calc_delta
0.09 +0.0 0.12 ± 4% perf-profile.children.cycles-pp.blkcg_maybe_throttle_current
0.05 ± 7% +0.0 0.08 ± 4% perf-profile.children.cycles-pp.raw_spin_rq_unlock
0.37 ± 3% +0.0 0.40 ± 2% perf-profile.children.cycles-pp.update_rq_clock_task
0.15 ± 2% +0.0 0.19 ± 2% perf-profile.children.cycles-pp.__rdgsbase_inactive
0.14 ± 6% +0.0 0.18 ± 6% perf-profile.children.cycles-pp.dl_scaled_delta_exec
0.11 ± 3% +0.0 0.15 ± 2% perf-profile.children.cycles-pp.pick_next_task
0.12 ± 9% +0.0 0.17 ± 4% perf-profile.children.cycles-pp.ktime_get_coarse_real_ts64
0.01 ±223% +0.0 0.06 ± 6% perf-profile.children.cycles-pp.cgroup_rstat_updated
0.00 +0.1 0.05 perf-profile.children.cycles-pp.rcu_note_context_switch
0.00 +0.1 0.05 perf-profile.children.cycles-pp.write@plt
0.16 ± 2% +0.1 0.21 ± 2% perf-profile.children.cycles-pp.finish_wait
0.24 ± 3% +0.1 0.29 ± 2% perf-profile.children.cycles-pp.testcase
0.36 ± 2% +0.1 0.42 ± 2% perf-profile.children.cycles-pp.avg_vruntime
0.26 ± 3% +0.1 0.32 ± 3% perf-profile.children.cycles-pp.x64_sys_call
0.33 ± 2% +0.1 0.39 perf-profile.children.cycles-pp.__dequeue_entity
0.27 +0.1 0.33 perf-profile.children.cycles-pp.__put_user_8
0.15 ± 3% +0.1 0.21 ± 2% perf-profile.children.cycles-pp.syscall_exit_to_user_mode_prepare
0.25 +0.1 0.31 ± 2% perf-profile.children.cycles-pp.update_curr_se
0.22 ± 3% +0.1 0.29 ± 5% perf-profile.children.cycles-pp.inode_needs_update_time
0.31 +0.1 0.39 ± 2% perf-profile.children.cycles-pp._raw_spin_lock
0.31 ± 5% +0.1 0.39 ± 6% perf-profile.children.cycles-pp.update_curr_dl_se
0.35 ± 2% +0.1 0.42 perf-profile.children.cycles-pp.rep_movs_alternative
1.45 +0.1 1.53 perf-profile.children.cycles-pp.enqueue_task
0.16 ± 2% +0.1 0.24 ± 2% perf-profile.children.cycles-pp._raw_spin_unlock_irqrestore
0.27 ± 3% +0.1 0.36 ± 4% perf-profile.children.cycles-pp.file_update_time
0.32 +0.1 0.41 perf-profile.children.cycles-pp.update_min_vruntime
0.36 ± 2% +0.1 0.45 perf-profile.children.cycles-pp.anon_pipe_buf_release
0.33 ± 4% +0.1 0.41 perf-profile.children.cycles-pp.rw_verify_area
0.29 ± 3% +0.1 0.38 ± 3% perf-profile.children.cycles-pp.native_sched_clock
0.36 ± 3% +0.1 0.45 ± 2% perf-profile.children.cycles-pp.raw_spin_rq_lock_nested
0.32 ± 3% +0.1 0.41 ± 2% perf-profile.children.cycles-pp.sched_clock
0.23 ± 4% +0.1 0.33 perf-profile.children.cycles-pp.rcu_all_qs
0.35 ± 3% +0.1 0.45 ± 3% perf-profile.children.cycles-pp.update_entity_lag
0.37 ± 2% +0.1 0.48 ± 2% perf-profile.children.cycles-pp.sched_clock_cpu
0.50 ± 2% +0.1 0.61 perf-profile.children.cycles-pp.mutex_unlock
0.44 ± 2% +0.1 0.56 perf-profile.children.cycles-pp.___perf_sw_event
0.45 ± 2% +0.1 0.57 perf-profile.children.cycles-pp.__get_user_8
0.50 +0.1 0.62 perf-profile.children.cycles-pp.rseq_update_cpu_node_id
0.26 ± 4% +0.1 0.39 perf-profile.children.cycles-pp.current_time
0.46 ± 3% +0.1 0.59 ± 4% perf-profile.children.cycles-pp.update_rq_clock
0.64 ± 2% +0.1 0.78 perf-profile.children.cycles-pp._copy_to_iter
0.80 ± 3% +0.1 0.93 perf-profile.children.cycles-pp.__update_load_avg_se
0.55 ± 2% +0.1 0.69 perf-profile.children.cycles-pp.rseq_get_rseq_cs
0.42 ± 2% +0.1 0.57 ± 2% perf-profile.children.cycles-pp.vruntime_eligible
0.67 +0.2 0.82 perf-profile.children.cycles-pp.__update_load_avg_cfs_rq
0.45 ± 2% +0.2 0.60 perf-profile.children.cycles-pp.fput
0.46 ± 2% +0.2 0.62 perf-profile.children.cycles-pp.__enqueue_entity
0.74 ± 2% +0.2 0.91 perf-profile.children.cycles-pp.copy_page_to_iter
0.73 ± 2% +0.2 0.90 perf-profile.children.cycles-pp._copy_from_iter
0.78 ± 2% +0.2 0.96 perf-profile.children.cycles-pp.os_xsave
0.40 ± 2% +0.2 0.58 perf-profile.children.cycles-pp.__cond_resched
0.68 +0.2 0.86 perf-profile.children.cycles-pp.prepare_to_wait_event
0.36 ± 3% +0.2 0.56 ± 2% perf-profile.children.cycles-pp.__wrgsbase_inactive
0.91 +0.2 1.14 perf-profile.children.cycles-pp._raw_spin_lock_irqsave
0.98 ± 2% +0.2 1.22 perf-profile.children.cycles-pp.copy_page_from_iter
0.77 ± 2% +0.2 1.02 ± 2% perf-profile.children.cycles-pp.requeue_delayed_entity
1.04 ± 2% +0.3 1.29 perf-profile.children.cycles-pp.rseq_ip_fixup
1.02 ± 3% +0.3 1.28 perf-profile.children.cycles-pp.prepare_task_switch
1.60 +0.3 1.88 perf-profile.children.cycles-pp.restore_fpregs_from_fpstate
1.93 +0.3 2.20 ± 2% perf-profile.children.cycles-pp.dequeue_entities
0.68 ± 3% +0.3 0.96 perf-profile.children.cycles-pp.atime_needs_update
1.28 ± 2% +0.3 1.56 perf-profile.children.cycles-pp.set_next_entity
1.60 +0.3 1.91 perf-profile.children.cycles-pp.fdget_pos
0.84 ± 2% +0.3 1.15 perf-profile.children.cycles-pp.touch_atime
2.16 +0.3 2.48 ± 2% perf-profile.children.cycles-pp.dequeue_task_fair
1.64 ± 2% +0.3 1.98 ± 2% perf-profile.children.cycles-pp.dequeue_entity
1.07 ± 2% +0.3 1.42 perf-profile.children.cycles-pp.pick_eevdf
0.82 ± 2% +0.4 1.18 perf-profile.children.cycles-pp.__switch_to
0.88 +0.4 1.23 perf-profile.children.cycles-pp.mutex_lock
1.98 +0.4 2.35 perf-profile.children.cycles-pp.switch_fpu_return
1.64 ± 2% +0.4 2.05 perf-profile.children.cycles-pp.__rseq_handle_notify_resume
1.65 ± 2% +0.4 2.08 perf-profile.children.cycles-pp.put_prev_entity
1.42 ± 2% +0.4 1.86 ± 2% perf-profile.children.cycles-pp.check_preempt_wakeup_fair
2.46 +0.5 2.91 perf-profile.children.cycles-pp.update_load_avg
1.64 ± 2% +0.5 2.13 perf-profile.children.cycles-pp.wakeup_preempt
1.64 ± 2% +0.5 2.14 perf-profile.children.cycles-pp.entry_SYSCALL_64_safe_stack
1.90 ± 2% +0.6 2.46 perf-profile.children.cycles-pp.pick_task_fair
2.68 ± 2% +0.6 3.31 perf-profile.children.cycles-pp.update_curr
15.07 +0.9 15.92 perf-profile.children.cycles-pp.pipe_read
4.50 ± 6% +0.9 5.38 ± 6% perf-profile.children.cycles-pp.try_to_wake_up
4.71 ± 6% +1.0 5.67 ± 5% perf-profile.children.cycles-pp.autoremove_wake_function
4.88 ± 6% +1.0 5.90 ± 5% perf-profile.children.cycles-pp.__wake_up_common
15.83 +1.1 16.89 perf-profile.children.cycles-pp.vfs_read
5.29 ± 5% +1.1 6.39 ± 5% perf-profile.children.cycles-pp.__wake_up_sync_key
4.97 +1.3 6.31 perf-profile.children.cycles-pp.entry_SYSCALL_64
5.38 ± 2% +1.4 6.77 perf-profile.children.cycles-pp.pick_next_task_fair
17.15 +1.4 18.57 perf-profile.children.cycles-pp.ksys_read
5.53 ± 2% +1.4 6.97 perf-profile.children.cycles-pp.__pick_next_task
10.36 ± 3% +1.8 12.16 perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
8.53 ± 4% +2.0 10.58 ± 2% perf-profile.children.cycles-pp.pipe_write
12.35 +2.0 14.40 perf-profile.children.cycles-pp.syscall_return_via_sysret
9.22 ± 3% +2.2 11.40 ± 2% perf-profile.children.cycles-pp.vfs_write
10.50 ± 3% +2.5 12.98 ± 2% perf-profile.children.cycles-pp.ksys_write
40.54 +4.6 45.15 perf-profile.children.cycles-pp.do_syscall_64
47.48 +5.8 53.33 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
19.70 ± 5% -11.6 8.10 ± 3% perf-profile.self.cycles-pp.__switch_to_asm
4.83 ± 6% -4.8 0.05 ± 9% perf-profile.self.cycles-pp.switch_mm_irqs_off
0.05 +0.0 0.06 perf-profile.self.cycles-pp.__set_next_task_fair
0.07 ± 5% +0.0 0.08 perf-profile.self.cycles-pp.kill_fasync
0.06 +0.0 0.07 ± 5% perf-profile.self.cycles-pp.security_file_permission
0.10 ± 4% +0.0 0.12 perf-profile.self.cycles-pp.dequeue_task
0.09 ± 4% +0.0 0.10 ± 4% perf-profile.self.cycles-pp.__wake_up_sync_key
0.16 ± 3% +0.0 0.17 ± 2% perf-profile.self.cycles-pp.__calc_delta
0.16 ± 2% +0.0 0.18 ± 2% perf-profile.self.cycles-pp.dequeue_entities
0.05 ± 8% +0.0 0.07 ± 6% perf-profile.self.cycles-pp.timestamp_truncate
0.08 ± 5% +0.0 0.11 ± 4% perf-profile.self.cycles-pp.mem_cgroup_handle_over_high
0.24 ± 3% +0.0 0.26 ± 2% perf-profile.self.cycles-pp.enqueue_task_fair
0.10 ± 4% +0.0 0.12 ± 3% perf-profile.self.cycles-pp.resched_curr
0.09 ± 8% +0.0 0.11 ± 13% perf-profile.self.cycles-pp.update_rq_clock
0.07 ± 5% +0.0 0.10 ± 4% perf-profile.self.cycles-pp.blkcg_maybe_throttle_current
0.10 ± 3% +0.0 0.13 ± 3% perf-profile.self.cycles-pp.rseq_get_rseq_cs
0.04 ± 45% +0.0 0.07 perf-profile.self.cycles-pp.requeue_delayed_entity
0.03 ± 70% +0.0 0.06 ± 6% perf-profile.self.cycles-pp.file_update_time
0.10 ± 4% +0.0 0.13 ± 3% perf-profile.self.cycles-pp.copy_page_to_iter
0.13 ± 4% +0.0 0.16 ± 3% perf-profile.self.cycles-pp.enqueue_task
0.34 ± 3% +0.0 0.37 ± 2% perf-profile.self.cycles-pp.update_rq_clock_task
0.21 ± 2% +0.0 0.24 perf-profile.self.cycles-pp.dequeue_entity
0.15 ± 2% +0.0 0.18 ± 2% perf-profile.self.cycles-pp.__rdgsbase_inactive
0.11 ± 6% +0.0 0.14 ± 2% perf-profile.self.cycles-pp.__rseq_handle_notify_resume
0.13 ± 7% +0.0 0.17 ± 4% perf-profile.self.cycles-pp.dl_scaled_delta_exec
0.02 ± 99% +0.0 0.06 perf-profile.self.cycles-pp.psi_account_irqtime
0.12 ± 4% +0.0 0.15 perf-profile.self.cycles-pp.wakeup_preempt
0.18 ± 5% +0.0 0.21 ± 8% perf-profile.self.cycles-pp.update_curr_dl_se
0.10 +0.0 0.14 ± 2% perf-profile.self.cycles-pp.pick_next_task
0.14 ± 5% +0.0 0.18 ± 3% perf-profile.self.cycles-pp.touch_atime
0.13 ± 3% +0.0 0.17 ± 7% perf-profile.self.cycles-pp.inode_needs_update_time
0.22 ± 4% +0.0 0.26 ± 2% perf-profile.self.cycles-pp.testcase
0.11 ± 9% +0.0 0.15 ± 3% perf-profile.self.cycles-pp.ktime_get_coarse_real_ts64
0.16 ± 3% +0.0 0.21 ± 2% perf-profile.self.cycles-pp.__pick_next_task
0.01 ±223% +0.0 0.06 ± 6% perf-profile.self.cycles-pp.cgroup_rstat_updated
0.28 ± 2% +0.0 0.33 perf-profile.self.cycles-pp.__dequeue_entity
0.24 ± 2% +0.0 0.29 ± 2% perf-profile.self.cycles-pp.x64_sys_call
0.00 +0.1 0.05 perf-profile.self.cycles-pp.update_entity_lag
0.20 ± 2% +0.1 0.26 perf-profile.self.cycles-pp.put_prev_entity
0.20 ± 3% +0.1 0.25 perf-profile.self.cycles-pp.finish_task_switch
0.00 +0.1 0.05 ± 7% perf-profile.self.cycles-pp.finish_wait
0.22 ± 3% +0.1 0.28 ± 2% perf-profile.self.cycles-pp.dequeue_task_fair
0.28 ± 3% +0.1 0.33 ± 2% perf-profile.self.cycles-pp.rep_movs_alternative
0.22 ± 3% +0.1 0.28 perf-profile.self.cycles-pp.rseq_ip_fixup
0.23 ± 2% +0.1 0.28 ± 2% perf-profile.self.cycles-pp.update_curr_se
0.36 ± 2% +0.1 0.41 ± 3% perf-profile.self.cycles-pp.avg_vruntime
0.24 ± 2% +0.1 0.30 perf-profile.self.cycles-pp.set_next_entity
0.13 ± 2% +0.1 0.19 perf-profile.self.cycles-pp.syscall_exit_to_user_mode_prepare
0.25 ± 2% +0.1 0.31 ± 2% perf-profile.self.cycles-pp.copy_page_from_iter
0.26 ± 2% +0.1 0.32 perf-profile.self.cycles-pp.__put_user_8
0.19 ± 3% +0.1 0.25 perf-profile.self.cycles-pp.autoremove_wake_function
0.26 ± 2% +0.1 0.33 perf-profile.self.cycles-pp.schedule
0.17 ± 4% +0.1 0.23 ± 2% perf-profile.self.cycles-pp.__wake_up_common
0.01 ±223% +0.1 0.08 ± 6% perf-profile.self.cycles-pp.raw_spin_rq_unlock
0.14 ± 3% +0.1 0.21 perf-profile.self.cycles-pp._raw_spin_unlock_irqrestore
0.18 ± 2% +0.1 0.25 perf-profile.self.cycles-pp.__cond_resched
0.30 ± 2% +0.1 0.37 perf-profile.self.cycles-pp.update_min_vruntime
0.25 ± 5% +0.1 0.32 perf-profile.self.cycles-pp.rw_verify_area
0.53 ± 3% +0.1 0.61 perf-profile.self.cycles-pp.entry_SYSCALL_64_safe_stack
0.28 +0.1 0.36 perf-profile.self.cycles-pp.ksys_write
0.30 ± 2% +0.1 0.38 ± 3% perf-profile.self.cycles-pp._raw_spin_lock
0.18 ± 4% +0.1 0.26 ± 2% perf-profile.self.cycles-pp.current_time
0.33 ± 8% +0.1 0.41 ± 6% perf-profile.self.cycles-pp.try_to_wake_up
0.28 ± 4% +0.1 0.37 ± 3% perf-profile.self.cycles-pp.native_sched_clock
0.35 +0.1 0.44 perf-profile.self.cycles-pp.anon_pipe_buf_release
0.32 ± 3% +0.1 0.41 perf-profile.self.cycles-pp.check_preempt_wakeup_fair
0.20 ± 5% +0.1 0.30 ± 2% perf-profile.self.cycles-pp.rcu_all_qs
0.47 +0.1 0.57 perf-profile.self.cycles-pp._copy_to_iter
0.38 ± 2% +0.1 0.49 ± 2% perf-profile.self.cycles-pp.___perf_sw_event
0.47 ± 2% +0.1 0.57 perf-profile.self.cycles-pp.mutex_unlock
0.38 ± 2% +0.1 0.48 ± 3% perf-profile.self.cycles-pp.switch_fpu_return
0.37 ± 2% +0.1 0.47 perf-profile.self.cycles-pp.prepare_to_wait_event
0.36 ± 3% +0.1 0.48 perf-profile.self.cycles-pp.fput
0.44 ± 2% +0.1 0.55 perf-profile.self.cycles-pp.__get_user_8
0.59 ± 2% +0.1 0.71 perf-profile.self.cycles-pp.vfs_write
0.49 ± 2% +0.1 0.61 perf-profile.self.cycles-pp.rseq_update_cpu_node_id
0.59 ± 3% +0.1 0.72 perf-profile.self.cycles-pp.pick_next_task_fair
0.77 ± 2% +0.1 0.90 perf-profile.self.cycles-pp.__update_load_avg_se
0.39 ± 2% +0.1 0.52 perf-profile.self.cycles-pp.vruntime_eligible
0.50 ± 3% +0.1 0.63 perf-profile.self.cycles-pp.vfs_read
0.38 ± 2% +0.1 0.52 ± 2% perf-profile.self.cycles-pp.atime_needs_update
0.50 ± 4% +0.1 0.64 perf-profile.self.cycles-pp.pick_task_fair
0.62 +0.1 0.76 perf-profile.self.cycles-pp.__update_load_avg_cfs_rq
0.56 ± 3% +0.1 0.71 perf-profile.self.cycles-pp.prepare_task_switch
0.62 ± 2% +0.1 0.76 perf-profile.self.cycles-pp._copy_from_iter
0.45 ± 2% +0.1 0.60 perf-profile.self.cycles-pp.__enqueue_entity
0.36 ± 2% +0.2 0.52 perf-profile.self.cycles-pp.ksys_read
0.98 ± 2% +0.2 1.15 perf-profile.self.cycles-pp.update_load_avg
0.78 ± 2% +0.2 0.95 perf-profile.self.cycles-pp.os_xsave
0.36 ± 3% +0.2 0.56 ± 2% perf-profile.self.cycles-pp.__wrgsbase_inactive
0.53 ± 2% +0.2 0.74 perf-profile.self.cycles-pp.mutex_lock
0.67 ± 2% +0.2 0.89 ± 2% perf-profile.self.cycles-pp.pick_eevdf
0.72 ± 2% +0.2 0.94 perf-profile.self.cycles-pp.__switch_to
0.88 ± 2% +0.2 1.11 perf-profile.self.cycles-pp._raw_spin_lock_irqsave
1.11 ± 2% +0.3 1.37 perf-profile.self.cycles-pp.update_curr
1.60 +0.3 1.87 perf-profile.self.cycles-pp.restore_fpregs_from_fpstate
1.30 +0.3 1.58 perf-profile.self.cycles-pp.syscall_exit_to_user_mode
1.08 +0.3 1.39 perf-profile.self.cycles-pp.pipe_read
1.59 +0.3 1.90 perf-profile.self.cycles-pp.fdget_pos
1.32 ± 2% +0.4 1.74 ± 2% perf-profile.self.cycles-pp.pipe_write
0.92 ± 2% +0.4 1.34 ± 2% perf-profile.self.cycles-pp.do_syscall_64
1.16 ± 4% +0.7 1.82 ± 3% perf-profile.self.cycles-pp.__schedule
4.43 +1.2 5.64 perf-profile.self.cycles-pp.entry_SYSCALL_64
7.03 +1.3 8.28 perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
10.28 ± 3% +1.8 12.05 perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
12.22 +2.0 14.25 perf-profile.self.cycles-pp.syscall_return_via_sysret
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
Powered by blists - more mailing lists