[<prev] [next>] [day] [month] [year] [list]
Message-ID: <20210414024526.GA21236@xsang-OptiPlex-9020>
Date: Wed, 14 Apr 2021 10:45:26 +0800
From: kernel test robot <oliver.sang@...el.com>
To: Like Xu <like.xu@...ux.intel.com>
Cc: Peter Zijlstra <peterz@...radead.org>,
kernel test robot <lkp@...el.com>,
LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
ying.huang@...el.com, feng.tang@...el.com, zhengjun.xing@...el.com
Subject: [x86/perf] abd562df94: stress-ng.timer.ops_per_sec 6.1% improvement
Greeting,
FYI, we noticed a 6.1% improvement of stress-ng.timer.ops_per_sec due to commit:
commit: abd562df94d19d0a9769971a35801b3f4991715d ("x86/perf: Use static_call for x86_pmu.guest_get_msrs")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
in testcase: stress-ng
on test machine: 96 threads Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 512G memory
with following parameters:
nr_threads: 100%
disk: 1HDD
testtime: 60s
class: interrupt
test: timer
cpufreq_governor: performance
ucode: 0x5003006
Details are as below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml
bin/lkp run compatible-job.yaml
=========================================================================================
class/compiler/cpufreq_governor/disk/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime/ucode:
interrupt/gcc-9/performance/1HDD/x86_64-rhel-8.3/100%/debian-10.4-x86_64-20200603.cgz/lkp-csl-2sp7/timer/stress-ng/60s/0x5003006
commit:
9a7832ce3d ("perf/x86/intel/uncore: With > 8 nodes, get pci bus die id from NUMA info")
abd562df94 ("x86/perf: Use static_call for x86_pmu.guest_get_msrs")
9a7832ce3d920426 abd562df94d19d0a9769971a358
---------------- ---------------------------
%stddev %change %stddev
\ | \
7378 +3.7% 7650 stress-ng.time.percent_of_cpu_this_job_got
1.622e+09 +6.1% 1.721e+09 stress-ng.timer.ops
27012179 +6.1% 28671694 stress-ng.timer.ops_per_sec
16.51 -2.3 14.20 ± 2% mpstat.cpu.all.irq%
24897453 +7.2% 26690698 vmstat.system.in
2254 ±161% +248.6% 7857 ± 78% proc-vmstat.numa_pages_migrated
2254 ±161% +248.6% 7857 ± 78% proc-vmstat.pgmigrate_success
2.09 ± 54% +182.7% 5.92 ± 39% sched_debug.cfs_rq:/.removed.runnable_avg.avg
16.77 ± 44% +112.3% 35.62 ± 25% sched_debug.cfs_rq:/.removed.runnable_avg.stddev
2.09 ± 54% +177.6% 5.81 ± 38% sched_debug.cfs_rq:/.removed.util_avg.avg
16.77 ± 44% +106.1% 34.57 ± 23% sched_debug.cfs_rq:/.removed.util_avg.stddev
30.33 ± 6% +39.8% 42.42 ± 17% sched_debug.cpu.nr_uninterruptible.max
1849 ± 37% -57.4% 787.67 ± 21% interrupts.CPU1.CAL:Function_call_interrupts
1585 ± 35% -53.0% 745.00 ± 37% interrupts.CPU2.CAL:Function_call_interrupts
77.00 ± 14% +376.6% 367.00 ±130% interrupts.CPU28.RES:Rescheduling_interrupts
1019 ± 65% -45.9% 551.33 ± 20% interrupts.CPU52.CAL:Function_call_interrupts
1058 ± 49% -47.0% 560.50 ± 20% interrupts.CPU75.CAL:Function_call_interrupts
116.67 ± 16% -33.4% 77.67 ± 23% interrupts.CPU75.RES:Rescheduling_interrupts
1019 ± 61% -45.6% 554.83 ± 20% interrupts.CPU76.CAL:Function_call_interrupts
115.17 ± 54% -38.5% 70.83 ± 4% interrupts.CPU76.RES:Rescheduling_interrupts
0.64 ± 23% -37.4% 0.40 ± 10% perf-stat.i.MPKI
2.853e+10 +6.3% 3.034e+10 perf-stat.i.branch-instructions
3.319e+08 +5.5% 3.501e+08 perf-stat.i.branch-misses
69976777 ± 25% -52.1% 33544338 ± 18% perf-stat.i.cache-references
1.76 -7.5% 1.63 perf-stat.i.cpi
2.441e+11 -1.8% 2.395e+11 perf-stat.i.cpu-cycles
158.86 -3.0% 154.16 perf-stat.i.cpu-migrations
0.01 ± 17% -0.0 0.00 ± 24% perf-stat.i.dTLB-load-miss-rate%
1683057 ± 17% -45.8% 911716 ± 30% perf-stat.i.dTLB-load-misses
4.055e+10 +6.5% 4.32e+10 perf-stat.i.dTLB-loads
22577249 ± 8% +14.5% 25859832 ± 3% perf-stat.i.dTLB-store-misses
2.411e+10 +6.6% 2.57e+10 perf-stat.i.dTLB-stores
2.813e+08 ± 2% +5.2% 2.959e+08 perf-stat.i.iTLB-load-misses
952108 +6.8% 1017001 perf-stat.i.iTLB-loads
1.364e+11 +6.4% 1.452e+11 perf-stat.i.instructions
0.57 +7.7% 0.62 perf-stat.i.ipc
2.54 -1.8% 2.50 perf-stat.i.metric.GHz
971.81 +6.4% 1034 perf-stat.i.metric.M/sec
453092 ± 37% -49.7% 228059 ± 38% perf-stat.i.node-loads
1389772 ± 29% -52.0% 666567 ± 53% perf-stat.i.node-store-misses
0.51 ± 26% -55.0% 0.23 ± 18% perf-stat.overall.MPKI
1.79 -7.8% 1.65 perf-stat.overall.cpi
0.00 ± 17% -0.0 0.00 ± 30% perf-stat.overall.dTLB-load-miss-rate%
0.56 +8.4% 0.61 perf-stat.overall.ipc
98.54 -2.3 96.20 ± 2% perf-stat.overall.node-store-miss-rate%
2.807e+10 +6.4% 2.986e+10 perf-stat.ps.branch-instructions
3.266e+08 +5.5% 3.445e+08 perf-stat.ps.branch-misses
68956196 ± 25% -52.0% 33103941 ± 18% perf-stat.ps.cache-references
2.402e+11 -1.8% 2.358e+11 perf-stat.ps.cpu-cycles
156.99 -3.0% 152.29 perf-stat.ps.cpu-migrations
1686029 ± 17% -45.6% 917216 ± 29% perf-stat.ps.dTLB-load-misses
3.99e+10 +6.5% 4.251e+10 perf-stat.ps.dTLB-loads
22220401 ± 8% +14.5% 25451152 ± 3% perf-stat.ps.dTLB-store-misses
2.373e+10 +6.6% 2.529e+10 perf-stat.ps.dTLB-stores
2.768e+08 +5.2% 2.912e+08 perf-stat.ps.iTLB-load-misses
937147 +6.8% 1000585 perf-stat.ps.iTLB-loads
1.343e+11 +6.4% 1.429e+11 perf-stat.ps.instructions
449302 ± 37% -49.3% 227603 ± 38% perf-stat.ps.node-loads
1367419 ± 29% -52.0% 656210 ± 53% perf-stat.ps.node-store-misses
8.555e+12 +6.4% 9.102e+12 perf-stat.total.instructions
10.31 ± 14% -3.6 6.74 ± 8% perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.syscall_enter_from_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
9.28 ± 14% -3.2 6.04 ± 7% perf-profile.calltrace.cycles-pp.syscall_enter_from_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
17.22 ± 2% -2.9 14.30 ± 2% perf-profile.calltrace.cycles-pp.posixtimer_rearm.dequeue_signal.get_signal.arch_do_signal_or_restart.exit_to_user_mode_prepare
19.20 -2.7 16.47 ± 2% perf-profile.calltrace.cycles-pp.dequeue_signal.get_signal.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode
5.14 ± 3% -2.7 2.46 ± 17% perf-profile.calltrace.cycles-pp.ktime_get.common_hrtimer_rearm.posixtimer_rearm.dequeue_signal.get_signal
6.53 ± 15% -2.6 3.91 ± 7% perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.syscall_enter_from_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
20.90 -2.6 18.30 perf-profile.calltrace.cycles-pp.get_signal.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
6.20 ± 3% -2.6 3.62 ± 11% perf-profile.calltrace.cycles-pp.common_hrtimer_rearm.posixtimer_rearm.dequeue_signal.get_signal.arch_do_signal_or_restart
5.91 ± 15% -2.5 3.43 ± 7% perf-profile.calltrace.cycles-pp.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.syscall_enter_from_user_mode.do_syscall_64
5.88 ± 15% -2.5 3.40 ± 7% perf-profile.calltrace.cycles-pp.__sysvec_apic_timer_interrupt.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.syscall_enter_from_user_mode
4.79 ± 24% -1.9 2.90 ± 10% perf-profile.calltrace.cycles-pp.__x64_sys_timer_getoverrun.do_syscall_64.entry_SYSCALL_64_after_hwframe
7.99 ± 10% -1.8 6.19 ± 3% perf-profile.calltrace.cycles-pp.hrtimer_interrupt.__sysvec_apic_timer_interrupt.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
2.22 ± 6% -1.3 0.97 ± 18% perf-profile.calltrace.cycles-pp.ktime_get_update_offsets_now.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
37.07 -1.1 36.00 perf-profile.calltrace.cycles-pp.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
37.73 -1.0 36.74 perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
2.56 ± 27% -1.0 1.61 ± 14% perf-profile.calltrace.cycles-pp.__lock_timer.__x64_sys_timer_getoverrun.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.90 ± 14% -0.8 1.08 ± 7% perf-profile.calltrace.cycles-pp.clockevents_program_event.hrtimer_interrupt.__sysvec_apic_timer_interrupt.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt
2.04 ± 3% -0.8 1.29 ± 9% perf-profile.calltrace.cycles-pp.ktime_get.clockevents_program_event.hrtimer_start_range_ns.posixtimer_rearm.dequeue_signal
38.61 -0.7 37.96 perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
1.28 ± 7% -0.5 0.76 ± 11% perf-profile.calltrace.cycles-pp.ktime_get.clockevents_program_event.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt
0.94 ± 16% -0.4 0.58 ± 5% perf-profile.calltrace.cycles-pp._raw_spin_unlock_irqrestore.__x64_sys_timer_getoverrun.do_syscall_64.entry_SYSCALL_64_after_hwframe
5.26 -0.3 4.95 perf-profile.calltrace.cycles-pp.clockevents_program_event.hrtimer_start_range_ns.posixtimer_rearm.dequeue_signal.get_signal
8.91 -0.2 8.69 perf-profile.calltrace.cycles-pp.hrtimer_start_range_ns.posixtimer_rearm.dequeue_signal.get_signal.arch_do_signal_or_restart
0.84 -0.1 0.72 ± 2% perf-profile.calltrace.cycles-pp.clockevents_program_min_delta.hrtimer_start_range_ns.posixtimer_rearm.dequeue_signal.get_signal
0.63 +0.1 0.68 perf-profile.calltrace.cycles-pp.__get_user_nocheck_4.__fpu__restore_sig.restore_sigcontext.__x64_sys_rt_sigreturn.do_syscall_64
0.54 +0.1 0.60 perf-profile.calltrace.cycles-pp.__get_user_nocheck_4.copy_fpstate_to_sigframe.__setup_rt_frame.arch_do_signal_or_restart.exit_to_user_mode_prepare
0.66 ± 2% +0.1 0.72 perf-profile.calltrace.cycles-pp.read_tsc.ktime_get.clockevents_program_event.hrtimer_start_range_ns.posixtimer_rearm
0.62 ± 3% +0.1 0.69 perf-profile.calltrace.cycles-pp.enqueue_hrtimer.hrtimer_start_range_ns.posixtimer_rearm.dequeue_signal.get_signal
0.67 +0.1 0.74 perf-profile.calltrace.cycles-pp.read_tsc.ktime_get.common_hrtimer_rearm.posixtimer_rearm.dequeue_signal
0.62 +0.1 0.69 perf-profile.calltrace.cycles-pp.syscall_return_via_sysret
0.87 +0.1 0.94 perf-profile.calltrace.cycles-pp.__set_current_blocked.signal_setup_done.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode
0.59 ± 4% +0.1 0.67 ± 2% perf-profile.calltrace.cycles-pp.__irqentry_text_end
0.79 ± 2% +0.1 0.87 perf-profile.calltrace.cycles-pp.hrtimer_forward.common_hrtimer_rearm.posixtimer_rearm.dequeue_signal.get_signal
1.02 +0.1 1.11 perf-profile.calltrace.cycles-pp.signal_setup_done.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
0.85 ± 4% +0.1 0.94 perf-profile.calltrace.cycles-pp.__dequeue_signal.dequeue_signal.get_signal.arch_do_signal_or_restart.exit_to_user_mode_prepare
1.08 ± 2% +0.1 1.19 perf-profile.calltrace.cycles-pp.__set_current_blocked.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.12 +0.1 1.23 perf-profile.calltrace.cycles-pp.__get_user_nocheck_8.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.77 +0.1 0.87 perf-profile.calltrace.cycles-pp.copy_init_pkru_to_fpregs.fpu__clear.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode
0.88 +0.1 0.99 perf-profile.calltrace.cycles-pp.__put_user_nocheck_4.copy_fpstate_to_sigframe.__setup_rt_frame.arch_do_signal_or_restart.exit_to_user_mode_prepare
1.03 ± 2% +0.1 1.14 ± 2% perf-profile.calltrace.cycles-pp.copy_user_enhanced_fast_string._copy_from_user.restore_sigcontext.__x64_sys_rt_sigreturn.do_syscall_64
1.10 +0.1 1.22 ± 2% perf-profile.calltrace.cycles-pp._copy_from_user.restore_altstack.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.46 +0.2 1.63 ± 2% perf-profile.calltrace.cycles-pp._copy_from_user.restore_sigcontext.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.42 +0.2 1.59 ± 2% perf-profile.calltrace.cycles-pp.restore_altstack.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.57 ± 10% +0.2 0.78 ± 3% perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
0.87 ± 14% +0.2 1.09 ± 8% perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.__x64_sys_rt_sigpending.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.39 ± 71% +0.3 0.65 ± 5% perf-profile.calltrace.cycles-pp.irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
1.48 ± 7% +0.3 1.77 ± 5% perf-profile.calltrace.cycles-pp.posix_timer_fn.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt
1.49 ± 8% +0.3 1.78 ± 5% perf-profile.calltrace.cycles-pp.lapic_next_deadline.clockevents_program_event.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt
0.79 ± 18% +0.4 1.19 ± 13% perf-profile.calltrace.cycles-pp.copy_user_enhanced_fast_string._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.68 ± 22% +0.4 1.08 ± 14% perf-profile.calltrace.cycles-pp.__sysvec_apic_timer_interrupt.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.copy_user_generic_unrolled
2.79 +0.4 3.20 perf-profile.calltrace.cycles-pp.lapic_next_deadline.clockevents_program_event.hrtimer_start_range_ns.posixtimer_rearm.dequeue_signal
0.62 ± 25% +0.4 1.03 ± 17% perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.copy_user_enhanced_fast_string._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64
0.68 ± 23% +0.4 1.09 ± 14% perf-profile.calltrace.cycles-pp.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.copy_user_generic_unrolled._copy_to_user
2.54 ± 6% +0.4 2.96 ± 5% perf-profile.calltrace.cycles-pp.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
0.18 ±141% +0.4 0.62 ± 18% perf-profile.calltrace.cycles-pp.___might_sleep.__might_fault._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64
0.19 ±141% +0.5 0.65 ± 18% perf-profile.calltrace.cycles-pp.__sysvec_apic_timer_interrupt.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.copy_user_enhanced_fast_string
0.19 ±141% +0.5 0.65 ± 18% perf-profile.calltrace.cycles-pp.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.copy_user_enhanced_fast_string._copy_to_user
1.23 ± 15% +0.5 1.69 ± 18% perf-profile.calltrace.cycles-pp.__might_fault._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64.entry_SYSCALL_64_after_hwframe
4.29 +0.5 4.75 perf-profile.calltrace.cycles-pp.fpu__clear.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
0.72 ± 21% +0.5 1.19 ± 14% perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.75 ± 23% +0.5 1.23 ± 14% perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.copy_user_generic_unrolled._copy_to_user.__x64_sys_rt_sigpending
0.22 ±141% +0.5 0.74 ± 17% perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.copy_user_enhanced_fast_string._copy_to_user.__x64_sys_rt_sigpending
0.21 ±141% +0.5 0.75 ± 15% perf-profile.calltrace.cycles-pp.__sysvec_apic_timer_interrupt.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt._copy_to_user
0.22 ±141% +0.5 0.75 ± 15% perf-profile.calltrace.cycles-pp.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt._copy_to_user.__x64_sys_rt_sigpending
0.00 +0.5 0.54 ± 2% perf-profile.calltrace.cycles-pp.recalc_sigpending.dequeue_signal.get_signal.arch_do_signal_or_restart.exit_to_user_mode_prepare
0.00 +0.5 0.55 ± 3% perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.syscall_exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
0.24 ±141% +0.6 0.86 ± 15% perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64
8.05 ± 3% +0.7 8.73 perf-profile.calltrace.cycles-pp.copy_fpstate_to_sigframe.__setup_rt_frame.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode
1.21 ± 17% +0.7 1.92 ± 11% perf-profile.calltrace.cycles-pp.copy_user_generic_unrolled._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64.entry_SYSCALL_64_after_hwframe
6.71 +0.8 7.46 perf-profile.calltrace.cycles-pp.__fpu__restore_sig.restore_sigcontext.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.97 ± 23% +0.8 1.81 ± 23% perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.copy_user_generic_unrolled._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64
10.17 ± 2% +0.9 11.06 perf-profile.calltrace.cycles-pp.__setup_rt_frame.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
8.76 +1.0 9.75 perf-profile.calltrace.cycles-pp.restore_sigcontext.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
13.12 +1.5 14.58 perf-profile.calltrace.cycles-pp.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
4.57 ± 17% +2.5 7.03 ± 14% perf-profile.calltrace.cycles-pp._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64.entry_SYSCALL_64_after_hwframe
7.80 ± 14% +2.9 10.68 ± 12% perf-profile.calltrace.cycles-pp.__x64_sys_rt_sigpending.do_syscall_64.entry_SYSCALL_64_after_hwframe
10.54 ± 3% -4.9 5.63 ± 13% perf-profile.children.cycles-pp.ktime_get
10.09 ± 14% -3.4 6.67 ± 7% perf-profile.children.cycles-pp.syscall_enter_from_user_mode
20.36 -3.2 17.12 ± 2% perf-profile.children.cycles-pp.hrtimer_interrupt
21.14 -3.1 18.01 ± 2% perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt
4.95 ± 4% -2.9 2.02 ± 19% perf-profile.children.cycles-pp.ktime_get_update_offsets_now
17.26 ± 2% -2.9 14.34 ± 2% perf-profile.children.cycles-pp.posixtimer_rearm
23.48 -2.9 20.63 perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
19.25 -2.7 16.53 ± 2% perf-profile.children.cycles-pp.dequeue_signal
20.94 -2.6 18.35 perf-profile.children.cycles-pp.get_signal
6.24 ± 3% -2.6 3.65 ± 11% perf-profile.children.cycles-pp.common_hrtimer_rearm
28.04 -2.5 25.50 perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
11.95 ± 7% -2.4 9.54 ± 4% perf-profile.children.cycles-pp.asm_call_sysvec_on_stack
4.94 ± 24% -2.0 2.99 ± 10% perf-profile.children.cycles-pp.__x64_sys_timer_getoverrun
12.07 -1.3 10.76 perf-profile.children.cycles-pp.clockevents_program_event
37.11 -1.1 36.04 perf-profile.children.cycles-pp.arch_do_signal_or_restart
37.84 -1.0 36.88 perf-profile.children.cycles-pp.exit_to_user_mode_prepare
38.73 -0.6 38.10 perf-profile.children.cycles-pp.syscall_exit_to_user_mode
8.98 -0.2 8.78 perf-profile.children.cycles-pp.hrtimer_start_range_ns
0.84 -0.1 0.73 ± 2% perf-profile.children.cycles-pp.clockevents_program_min_delta
0.10 ± 4% +0.0 0.12 ± 3% perf-profile.children.cycles-pp.do_sigaltstack
0.09 ± 7% +0.0 0.11 ± 4% perf-profile.children.cycles-pp.irqentry_exit_to_user_mode
0.13 ± 5% +0.0 0.15 ± 3% perf-profile.children.cycles-pp.rcu_all_qs
0.11 ± 7% +0.0 0.13 ± 5% perf-profile.children.cycles-pp.irq_enter_rcu
0.22 ± 3% +0.0 0.24 ± 2% perf-profile.children.cycles-pp.complete_signal
0.17 ± 2% +0.0 0.19 ± 2% perf-profile.children.cycles-pp._cond_resched
0.18 ± 3% +0.0 0.21 ± 4% perf-profile.children.cycles-pp.prepare_signal
0.23 ± 2% +0.0 0.25 perf-profile.children.cycles-pp.hrtimer_reprogram
0.30 ± 4% +0.0 0.33 ± 2% perf-profile.children.cycles-pp.tick_program_event
0.43 ± 3% +0.0 0.47 ± 3% perf-profile.children.cycles-pp.entry_SYSCALL_64_safe_stack
0.27 ± 3% +0.0 0.31 ± 2% perf-profile.children.cycles-pp.__hrtimer_get_next_event
0.21 ± 3% +0.0 0.25 ± 5% perf-profile.children.cycles-pp.get_xsave_addr
0.57 +0.1 0.62 ± 2% perf-profile.children.cycles-pp.__local_bh_enable_ip
0.48 ± 3% +0.1 0.53 perf-profile.children.cycles-pp.timerqueue_add
0.60 ± 2% +0.1 0.66 perf-profile.children.cycles-pp.__lock_task_sighand
0.50 +0.1 0.56 ± 2% perf-profile.children.cycles-pp.__hrtimer_next_event_base
0.52 ± 2% +0.1 0.58 perf-profile.children.cycles-pp.recalc_sigpending_tsk
0.73 ± 2% +0.1 0.80 ± 2% perf-profile.children.cycles-pp.fpregs_mark_activate
0.64 ± 3% +0.1 0.71 perf-profile.children.cycles-pp.enqueue_hrtimer
0.90 ± 2% +0.1 0.98 perf-profile.children.cycles-pp.__set_task_blocked
0.59 ± 4% +0.1 0.67 ± 2% perf-profile.children.cycles-pp.__irqentry_text_end
0.79 ± 2% +0.1 0.87 perf-profile.children.cycles-pp.hrtimer_forward
0.88 ± 4% +0.1 0.97 perf-profile.children.cycles-pp.__dequeue_signal
1.02 +0.1 1.11 perf-profile.children.cycles-pp.signal_setup_done
0.78 +0.1 0.88 perf-profile.children.cycles-pp.copy_init_pkru_to_fpregs
0.94 ± 2% +0.1 1.05 ± 2% perf-profile.children.cycles-pp.native_sched_clock
0.99 ± 2% +0.1 1.10 ± 2% perf-profile.children.cycles-pp.sched_clock
1.25 +0.1 1.36 perf-profile.children.cycles-pp.__get_user_nocheck_8
1.02 +0.1 1.14 perf-profile.children.cycles-pp.__put_user_nocheck_4
1.23 ± 4% +0.1 1.36 ± 2% perf-profile.children.cycles-pp.irq_exit_rcu
1.30 +0.1 1.42 perf-profile.children.cycles-pp.__get_user_nocheck_4
1.24 +0.1 1.37 perf-profile.children.cycles-pp.recalc_sigpending
1.11 ± 2% +0.1 1.24 ± 2% perf-profile.children.cycles-pp.sched_clock_cpu
0.83 ± 7% +0.2 0.99 ± 5% perf-profile.children.cycles-pp.__might_sleep
1.88 +0.2 2.05 perf-profile.children.cycles-pp._raw_spin_lock_irq
1.43 +0.2 1.60 ± 2% perf-profile.children.cycles-pp.restore_altstack
1.50 ± 2% +0.2 1.68 perf-profile.children.cycles-pp.irqtime_account_irq
2.01 +0.2 2.19 perf-profile.children.cycles-pp.__set_current_blocked
2.74 +0.2 2.93 perf-profile.children.cycles-pp.read_tsc
1.82 +0.2 2.03 perf-profile.children.cycles-pp.syscall_return_via_sysret
2.20 +0.2 2.44 perf-profile.children.cycles-pp.send_sigqueue
0.65 ± 11% +0.3 0.92 ± 3% perf-profile.children.cycles-pp.syscall_exit_to_user_mode_prepare
1.37 ± 9% +0.3 1.65 ± 4% perf-profile.children.cycles-pp.sync_regs
3.36 +0.3 3.66 perf-profile.children.cycles-pp.posix_timer_fn
2.64 +0.3 2.94 perf-profile.children.cycles-pp._copy_from_user
1.42 ± 6% +0.3 1.77 ± 7% perf-profile.children.cycles-pp.___might_sleep
5.91 +0.4 6.30 ± 2% perf-profile.children.cycles-pp.__hrtimer_run_queues
4.33 +0.5 4.79 perf-profile.children.cycles-pp.fpu__clear
2.77 ± 8% +0.7 3.45 ± 9% perf-profile.children.cycles-pp.__might_fault
8.16 ± 3% +0.7 8.86 perf-profile.children.cycles-pp.copy_fpstate_to_sigframe
6.89 +0.7 7.62 perf-profile.children.cycles-pp.lapic_next_deadline
2.77 ± 7% +0.7 3.50 ± 5% perf-profile.children.cycles-pp.copy_user_enhanced_fast_string
6.79 +0.8 7.55 perf-profile.children.cycles-pp.__fpu__restore_sig
10.22 ± 2% +0.9 11.12 perf-profile.children.cycles-pp.__setup_rt_frame
8.77 +0.9 9.68 perf-profile.children.cycles-pp.native_irq_return_iret
8.78 +1.0 9.78 perf-profile.children.cycles-pp.restore_sigcontext
2.93 ± 9% +1.1 4.07 ± 6% perf-profile.children.cycles-pp.copy_user_generic_unrolled
13.19 +1.5 14.66 perf-profile.children.cycles-pp.__x64_sys_rt_sigreturn
4.77 ± 17% +2.6 7.38 ± 13% perf-profile.children.cycles-pp._copy_to_user
8.03 ± 13% +3.0 11.01 ± 12% perf-profile.children.cycles-pp.__x64_sys_rt_sigpending
8.37 ± 5% -5.1 3.27 ± 23% perf-profile.self.cycles-pp.ktime_get
4.29 ± 4% -2.9 1.35 ± 29% perf-profile.self.cycles-pp.ktime_get_update_offsets_now
0.13 ± 3% +0.0 0.15 ± 3% perf-profile.self.cycles-pp.__set_task_blocked
0.12 ± 3% +0.0 0.14 ± 3% perf-profile.self.cycles-pp.signal_setup_done
0.11 ± 4% +0.0 0.13 ± 3% perf-profile.self.cycles-pp.rcu_all_qs
0.11 ± 5% +0.0 0.13 ± 3% perf-profile.self.cycles-pp.sched_clock_cpu
0.18 ± 2% +0.0 0.19 ± 2% perf-profile.self.cycles-pp.restore_altstack
0.16 ± 3% +0.0 0.18 ± 2% perf-profile.self.cycles-pp.__hrtimer_get_next_event
0.10 ± 5% +0.0 0.12 ± 3% perf-profile.self.cycles-pp.do_sigaltstack
0.17 ± 5% +0.0 0.19 ± 2% perf-profile.self.cycles-pp.irq_exit_rcu
0.19 ± 2% +0.0 0.21 perf-profile.self.cycles-pp.hrtimer_reprogram
0.17 ± 2% +0.0 0.19 ± 4% perf-profile.self.cycles-pp.prepare_signal
0.21 ± 3% +0.0 0.23 ± 2% perf-profile.self.cycles-pp.complete_signal
0.29 ± 4% +0.0 0.32 ± 2% perf-profile.self.cycles-pp.tick_program_event
0.21 ± 2% +0.0 0.24 ± 3% perf-profile.self.cycles-pp.sysvec_apic_timer_interrupt
0.29 +0.0 0.32 perf-profile.self.cycles-pp.posix_timer_fn
0.40 ± 3% +0.0 0.44 ± 3% perf-profile.self.cycles-pp.entry_SYSCALL_64_safe_stack
0.32 ± 2% +0.0 0.35 ± 3% perf-profile.self.cycles-pp.dequeue_signal
0.20 ± 2% +0.0 0.24 ± 6% perf-profile.self.cycles-pp.get_xsave_addr
0.30 ± 3% +0.0 0.34 ± 2% perf-profile.self.cycles-pp._copy_from_user
0.44 ± 2% +0.0 0.48 perf-profile.self.cycles-pp.__set_current_blocked
0.40 ± 3% +0.0 0.44 ± 2% perf-profile.self.cycles-pp.irqtime_account_irq
0.54 +0.0 0.59 ± 2% perf-profile.self.cycles-pp.__local_bh_enable_ip
0.61 ± 3% +0.0 0.66 ± 3% perf-profile.self.cycles-pp.hrtimer_start_range_ns
0.44 ± 2% +0.0 0.49 perf-profile.self.cycles-pp.restore_sigcontext
0.63 +0.1 0.68 perf-profile.self.cycles-pp.get_signal
0.41 ± 4% +0.1 0.46 perf-profile.self.cycles-pp.timerqueue_add
0.63 +0.1 0.68 ± 2% perf-profile.self.cycles-pp.clockevents_program_event
0.29 ± 5% +0.1 0.35 perf-profile.self.cycles-pp.syscall_exit_to_user_mode
0.49 ± 3% +0.1 0.54 perf-profile.self.cycles-pp.recalc_sigpending_tsk
0.50 +0.1 0.56 ± 2% perf-profile.self.cycles-pp.__hrtimer_next_event_base
0.74 +0.1 0.80 perf-profile.self.cycles-pp.recalc_sigpending
0.55 +0.1 0.61 perf-profile.self.cycles-pp.copy_init_pkru_to_fpregs
0.69 ± 2% +0.1 0.76 ± 2% perf-profile.self.cycles-pp.fpregs_mark_activate
0.52 ± 3% +0.1 0.59 perf-profile.self.cycles-pp.arch_do_signal_or_restart
0.65 ± 2% +0.1 0.72 perf-profile.self.cycles-pp.send_sigqueue
0.61 +0.1 0.68 ± 2% perf-profile.self.cycles-pp.__x64_sys_rt_sigreturn
0.57 ± 4% +0.1 0.65 ± 2% perf-profile.self.cycles-pp.__irqentry_text_end
0.39 +0.1 0.46 ± 6% perf-profile.self.cycles-pp.do_syscall_64
0.76 ± 3% +0.1 0.84 ± 2% perf-profile.self.cycles-pp.__dequeue_signal
0.78 ± 2% +0.1 0.86 perf-profile.self.cycles-pp.hrtimer_forward
0.84 +0.1 0.93 ± 2% perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
0.59 ± 4% +0.1 0.68 perf-profile.self.cycles-pp.__might_sleep
0.72 ± 2% +0.1 0.82 perf-profile.self.cycles-pp.exit_to_user_mode_prepare
0.31 ± 5% +0.1 0.41 ± 2% perf-profile.self.cycles-pp.syscall_exit_to_user_mode_prepare
0.90 +0.1 1.01 ± 2% perf-profile.self.cycles-pp.native_sched_clock
1.21 +0.1 1.32 perf-profile.self.cycles-pp.__get_user_nocheck_8
1.25 +0.1 1.37 perf-profile.self.cycles-pp.__get_user_nocheck_4
0.98 +0.1 1.10 perf-profile.self.cycles-pp.__put_user_nocheck_4
0.56 ± 4% +0.1 0.68 ± 7% perf-profile.self.cycles-pp.__might_fault
1.07 ± 2% +0.1 1.20 ± 2% perf-profile.self.cycles-pp.hrtimer_interrupt
1.80 +0.2 1.96 perf-profile.self.cycles-pp._raw_spin_lock_irq
0.80 ± 4% +0.2 0.98 ± 3% perf-profile.self.cycles-pp.asm_sysvec_apic_timer_interrupt
0.65 ± 8% +0.2 0.84 ± 6% perf-profile.self.cycles-pp.__x64_sys_rt_sigpending
2.93 +0.2 3.11 perf-profile.self.cycles-pp._raw_spin_unlock_irqrestore
2.68 +0.2 2.87 perf-profile.self.cycles-pp.read_tsc
1.10 ± 2% +0.2 1.29 ± 3% perf-profile.self.cycles-pp.___might_sleep
1.94 +0.2 2.14 perf-profile.self.cycles-pp.__setup_rt_frame
1.81 +0.2 2.02 perf-profile.self.cycles-pp.syscall_return_via_sysret
0.36 ± 12% +0.2 0.58 ± 13% perf-profile.self.cycles-pp._copy_to_user
1.34 ± 9% +0.3 1.61 ± 4% perf-profile.self.cycles-pp.sync_regs
2.97 +0.3 3.30 perf-profile.self.cycles-pp.fpu__clear
2.11 ± 3% +0.4 2.47 ± 2% perf-profile.self.cycles-pp.copy_user_enhanced_fast_string
4.35 +0.4 4.73 perf-profile.self.cycles-pp.copy_fpstate_to_sigframe
1.93 ± 3% +0.5 2.40 ± 2% perf-profile.self.cycles-pp.copy_user_generic_unrolled
4.39 +0.5 4.89 perf-profile.self.cycles-pp.__fpu__restore_sig
6.87 +0.7 7.59 perf-profile.self.cycles-pp.lapic_next_deadline
8.75 +0.9 9.65 perf-profile.self.cycles-pp.native_irq_return_iret
stress-ng.timer.ops_per_sec
3e+07 +----------------------------------------------------------------+
| O |
2.95e+07 |-+ O O O O O |
| O O O O O O O O |
2.9e+07 |-O O O O O O O O O O |
| O O O O O |
2.85e+07 |-+ O O O O |
| .+ |
2.8e+07 |.+. .+. .+ + + |
| + + : + +.+. .+ .+ |
2.75e+07 |-+ : +.+ + : + : +. |
| :+ : + :+ +.+.+. .+.+.+ |
2.7e+07 |-+ + + + + + |
| +.+ |
2.65e+07 +----------------------------------------------------------------+
[*] bisect-good sample
[O] bisect-bad sample
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
---
0DAY/LKP+ Test Infrastructure Open Source Technology Center
https://lists.01.org/hyperkitty/list/lkp@lists.01.org Intel Corporation
Thanks,
Oliver Sang
View attachment "config-5.11.0-rc2-00003-gabd562df94d1" of type "text/plain" (172366 bytes)
View attachment "job-script" of type "text/plain" (8206 bytes)
View attachment "job.yaml" of type "text/plain" (5723 bytes)
View attachment "reproduce" of type "text/plain" (407 bytes)
Powered by blists - more mailing lists