Message-ID: <878r7nr4ju.fsf@intel.com>
Date: Fri, 27 Oct 2023 17:09:25 -0700
From: Vinicius Costa Gomes <vinicius.gomes@...el.com>
To: kernel test robot <oliver.sang@...el.com>, Xabier Marquiegui
<reibax@...il.com>
Cc: oe-lkp@...ts.linux.dev, lkp@...el.com, Linux Memory Management List
<linux-mm@...ck.org>, "David S. Miller" <davem@...emloft.net>,
netdev@...r.kernel.org, linux-kselftest@...r.kernel.org,
ying.huang@...el.com, feng.tang@...el.com, fengwei.yin@...el.com,
oliver.sang@...el.com
Subject: Re: [linux-next:master] [ptp] 403376ddb4:
stress-ng.clock.ops_per_sec -12.7% regression
Hi,

kernel test robot <oliver.sang@...el.com> writes:
> Hello,
>
> kernel test robot noticed a -12.7% regression of stress-ng.clock.ops_per_sec on:
>
>
> commit: 403376ddb4221be9db5326ae334773807df71ffe ("ptp: add debugfs interface to see applied channel masks")
> https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master
>
I guess this is somewhat expected. Looking at the stress-ng clock
stressor, it calls open() on a PTP device (/dev/ptp0), which creates a
new event queue and, since 403376ddb4221b, also a new debugfs entry.
That is where the regression seems to be coming from.
But to check whether there was any impact on paths that are expected
to be called more frequently, I ran a quick-and-dirty program that
calls clock_gettime() a bunch of times:
~/src/benches $ sudo time ./gettime -d /dev/ptp2 -c 1000000
0.45user 2.44system 0:02.89elapsed 99%CPU (0avgtext+0avgdata 1408maxresident)k
And after reverting that commit (and 75a384ceda93df):
~/src/benches $ sudo time ./gettime -d /dev/ptp2 -c 1000000
0.44user 2.45system 0:02.90elapsed 99%CPU (0avgtext+0avgdata 1408maxresident)k
So, yeah, the paths that I expect to be used more often do not seem to
have regressed, and I am not too worried about this.
Anyway, thanks for the report.
> testcase: stress-ng
> test machine: 36 threads 1 sockets Intel(R) Core(TM) i9-10980XE CPU @ 3.00GHz (Cascade Lake) with 128G memory
> parameters:
>
> nr_threads: 1
> disk: 1HDD
> testtime: 60s
> class: interrupt
> test: clock
> cpufreq_governor: performance
>
>
>
>
> If you fix the issue in a separate patch/commit (i.e. not just a new version of
> the same patch/commit), kindly add following tags
> | Reported-by: kernel test robot <oliver.sang@...el.com>
> | Closes: https://lore.kernel.org/oe-lkp/202310271441.b9fb0896-oliver.sang@intel.com
>
>
> Details are as below:
> -------------------------------------------------------------------------------------------------->
>
>
> The kernel config and materials to reproduce are available at:
> https://download.01.org/0day-ci/archive/20231027/202310271441.b9fb0896-oliver.sang@intel.com
>
> =========================================================================================
> class/compiler/cpufreq_governor/disk/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
> interrupt/gcc-12/performance/1HDD/x86_64-rhel-8.3/1/debian-11.1-x86_64-20220510.cgz/lkp-csl-d02/clock/stress-ng/60s
>
> commit:
> c5a445b1e9 ("ptp: support event queue reader channel masks")
> 403376ddb4 ("ptp: add debugfs interface to see applied channel masks")
>
> c5a445b1e9347b14 403376ddb4221be9db5326ae334
> ---------------- ---------------------------
> %stddev %change %stddev
> \ | \
> 2.50 +8.3% 2.71 iostat.cpu.system
> 0.05 ± 3% +0.0 0.06 mpstat.cpu.all.soft%
> 1.84 +0.2 2.04 mpstat.cpu.all.sys%
> 200469 -10.7% 178950 vmstat.system.cs
> 135153 -7.8% 124597 vmstat.system.in
> 2804514 -12.7% 2448917 stress-ng.clock.ops
> 46741 -12.7% 40815 stress-ng.clock.ops_per_sec
> 78.00 +2.6% 80.00 stress-ng.time.percent_of_cpu_this_job_got
> 5606454 -12.7% 4896355 stress-ng.time.voluntary_context_switches
> 18583 ± 5% +31.1% 24360 ± 9% turbostat.C1
> 2.90 ± 2% +0.1 3.03 turbostat.C1E%
> 5353747 -13.0% 4655189 turbostat.POLL
> 0.28 -0.0 0.24 turbostat.POLL%
> 10.11 +1.4% 10.25 turbostat.RAMWatt
> 88825 -6.6% 82944 proc-vmstat.nr_active_anon
> 11367 +4.6% 11892 proc-vmstat.nr_mapped
> 96532 -6.6% 90119 proc-vmstat.nr_shmem
> 19819 +4.1% 20633 proc-vmstat.nr_slab_reclaimable
> 88825 -6.6% 82944 proc-vmstat.nr_zone_active_anon
> 420661 +27.6% 536903 ± 35% proc-vmstat.numa_hit
> 420556 +27.4% 535725 ± 36% proc-vmstat.numa_local
> 133275 -6.9% 124053 proc-vmstat.pgactivate
> 488366 ± 2% +60.4% 783374 proc-vmstat.pgalloc_normal
> 227687 +3.2% 234860 proc-vmstat.pgfault
> 323139 ± 3% +94.6% 628694 proc-vmstat.pgfree
> 0.00 ±141% +325.0% 0.01 ± 26% perf-sched.sch_delay.avg.ms.kjournald2.kthread.ret_from_fork.ret_from_fork_asm
> 0.01 ±217% +455.3% 0.04 ± 68% perf-sched.sch_delay.max.ms.__cond_resched.__kmem_cache_alloc_node.__kmalloc_node.alloc_cpumask_var_node.clock_was_set
> 0.10 ± 14% +66.8% 0.16 ± 15% perf-sched.sch_delay.max.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep_timens.__x64_sys_clock_nanosleep
> 0.00 ±141% +325.0% 0.01 ± 26% perf-sched.sch_delay.max.ms.kjournald2.kthread.ret_from_fork.ret_from_fork_asm
> 0.91 ± 21% -53.8% 0.42 ± 55% perf-sched.sch_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
> 1.27 +18.7% 1.51 perf-sched.total_wait_and_delay.average.ms
> 1.27 +18.7% 1.51 perf-sched.total_wait_time.average.ms
> 18.82 ± 20% +59.4% 29.99 ± 27% perf-sched.wait_and_delay.avg.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.ksys_write
> 1278 ± 2% +16.3% 1487 ± 3% perf-sched.wait_and_delay.count.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
> 220.33 ± 5% +14.1% 251.33 ± 2% perf-sched.wait_and_delay.count.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
> 0.20 ± 13% +63.7% 0.33 ± 14% perf-sched.wait_and_delay.max.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep_timens.__x64_sys_clock_nanosleep
> 18.82 ± 20% +59.4% 29.99 ± 27% perf-sched.wait_time.avg.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.ksys_write
> 0.01 ±208% +433.3% 0.05 ± 64% perf-sched.wait_time.max.ms.__cond_resched.__kmem_cache_alloc_node.__kmalloc_node.alloc_cpumask_var_node.clock_was_set
> 0.10 ± 12% +60.7% 0.16 ± 13% perf-sched.wait_time.max.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep_timens.__x64_sys_clock_nanosleep
> 0.13 +35.1% 0.17 ± 2% perf-stat.i.MPKI
> 9.417e+08 +4.1% 9.8e+08 perf-stat.i.branch-instructions
> 4.24 ± 2% +0.6 4.81 ± 2% perf-stat.i.cache-miss-rate%
> 1107890 +23.8% 1371103 ± 2% perf-stat.i.cache-misses
> 18969540 +28.6% 24388966 perf-stat.i.cache-references
> 210406 -11.1% 187069 perf-stat.i.context-switches
> 1.15 -3.5% 1.11 perf-stat.i.cpi
> 185.83 ± 2% +7.7% 200.19 ± 4% perf-stat.i.cpu-migrations
> 56299 ± 6% -64.8% 19830 ± 6% perf-stat.i.cycles-between-cache-misses
> 1.249e+09 +2.7% 1.283e+09 perf-stat.i.dTLB-loads
> 0.00 ± 6% +0.0 0.00 ± 4% perf-stat.i.dTLB-store-miss-rate%
> 7425 ± 5% +12.8% 8373 ± 4% perf-stat.i.dTLB-store-misses
> 7.573e+08 +2.3% 7.746e+08 perf-stat.i.dTLB-stores
> 6674913 ± 2% -5.5% 6305796 perf-stat.i.iTLB-load-misses
> 5.03e+09 +3.1% 5.184e+09 perf-stat.i.instructions
> 1159 +5.5% 1222 ± 2% perf-stat.i.instructions-per-iTLB-miss
> 0.89 +3.0% 0.92 perf-stat.i.ipc
> 536.86 +28.0% 687.18 perf-stat.i.metric.K/sec
> 81.89 +3.0% 84.38 perf-stat.i.metric.M/sec
> 2127 +5.2% 2239 ± 2% perf-stat.i.minor-faults
> 106399 ± 2% +36.0% 144713 ± 3% perf-stat.i.node-loads
> 106511 ± 5% +71.6% 182771 ± 3% perf-stat.i.node-stores
> 2127 +5.2% 2239 ± 2% perf-stat.i.page-faults
> 0.22 +20.1% 0.26 ± 2% perf-stat.overall.MPKI
> 1.09 -2.6% 1.06 perf-stat.overall.cpi
> 4955 -18.8% 4021 ± 2% perf-stat.overall.cycles-between-cache-misses
> 0.00 ± 5% +0.0 0.00 ± 4% perf-stat.overall.dTLB-store-miss-rate%
> 753.96 +9.1% 822.64 ± 2% perf-stat.overall.instructions-per-iTLB-miss
> 0.92 +2.7% 0.94 perf-stat.overall.ipc
> 9.27e+08 +4.1% 9.648e+08 perf-stat.ps.branch-instructions
> 1090868 +23.8% 1350235 ± 2% perf-stat.ps.cache-misses
> 18670301 +28.6% 24005580 perf-stat.ps.cache-references
> 207054 -11.1% 184104 perf-stat.ps.context-switches
> 182.86 ± 2% +7.7% 197.01 ± 4% perf-stat.ps.cpu-migrations
> 1.229e+09 +2.7% 1.263e+09 perf-stat.ps.dTLB-loads
> 7309 ± 5% +12.8% 8244 ± 4% perf-stat.ps.dTLB-store-misses
> 7.454e+08 +2.3% 7.624e+08 perf-stat.ps.dTLB-stores
> 6568626 ± 2% -5.5% 6205894 perf-stat.ps.iTLB-load-misses
> 4.951e+09 +3.1% 5.103e+09 perf-stat.ps.instructions
> 2094 +5.2% 2204 ± 2% perf-stat.ps.minor-faults
> 104758 ± 2% +36.0% 142499 ± 3% perf-stat.ps.node-loads
> 104880 ± 5% +71.6% 179969 ± 3% perf-stat.ps.node-stores
> 2094 +5.2% 2204 ± 2% perf-stat.ps.page-faults
> 3.124e+11 +3.9% 3.245e+11 perf-stat.total.instructions
> 9.42 ± 5% -2.2 7.20 ± 8% perf-profile.calltrace.cycles-pp.clock_gettime
> 9.26 ± 5% -2.2 7.08 ± 8% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.clock_gettime
> 9.24 ± 5% -2.2 7.07 ± 8% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.clock_gettime
> 8.84 ± 5% -2.2 6.67 ± 9% perf-profile.calltrace.cycles-pp.e1000e_read_systim.e1000e_phc_gettimex.pc_clock_gettime.__x64_sys_clock_gettime.do_syscall_64
> 9.20 ± 5% -2.2 7.04 ± 8% perf-profile.calltrace.cycles-pp.__x64_sys_clock_gettime.do_syscall_64.entry_SYSCALL_64_after_hwframe.clock_gettime
> 8.96 ± 5% -2.2 6.81 ± 8% perf-profile.calltrace.cycles-pp.e1000e_phc_gettimex.pc_clock_gettime.__x64_sys_clock_gettime.do_syscall_64.entry_SYSCALL_64_after_hwframe
> 9.12 ± 5% -2.1 6.98 ± 8% perf-profile.calltrace.cycles-pp.pc_clock_gettime.__x64_sys_clock_gettime.do_syscall_64.entry_SYSCALL_64_after_hwframe.clock_gettime
> 10.96 ± 2% -2.0 8.98 ± 8% perf-profile.calltrace.cycles-pp.syscall
> 10.98 ± 3% -1.7 9.28 ± 9% perf-profile.calltrace.cycles-pp.clock_adjtime
> 10.09 ± 3% -1.5 8.55 ± 10% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.clock_adjtime
> 9.94 ± 3% -1.5 8.42 ± 10% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.clock_adjtime
> 9.68 ± 3% -1.5 8.20 ± 10% perf-profile.calltrace.cycles-pp.__do_sys_clock_adjtime.do_syscall_64.entry_SYSCALL_64_after_hwframe.clock_adjtime
> 8.53 ± 4% -1.3 7.22 ± 9% perf-profile.calltrace.cycles-pp.do_adjtimex.__do_sys_clock_adjtime.do_syscall_64.entry_SYSCALL_64_after_hwframe.clock_adjtime
> 6.44 ± 5% -1.2 5.20 ± 10% perf-profile.calltrace.cycles-pp.poll_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
> 6.22 ± 4% -1.1 5.12 ± 11% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.syscall
> 5.59 ± 3% -1.0 4.59 ± 11% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
> 6.42 ± 3% -0.9 5.51 ± 10% perf-profile.calltrace.cycles-pp.timekeeping_inject_offset.do_adjtimex.__do_sys_clock_adjtime.do_syscall_64.entry_SYSCALL_64_after_hwframe
> 5.86 ± 3% -0.9 5.00 ± 9% perf-profile.calltrace.cycles-pp.clock_was_set.timekeeping_inject_offset.do_adjtimex.__do_sys_clock_adjtime.do_syscall_64
> 3.34 ± 5% -0.7 2.68 ± 11% perf-profile.calltrace.cycles-pp.__x64_sys_clock_gettime.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
> 2.99 ± 9% -0.6 2.34 ± 14% perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.poll_idle.cpuidle_enter_state.cpuidle_enter
> 2.47 ± 13% -0.5 1.94 ± 15% perf-profile.calltrace.cycles-pp.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.poll_idle.cpuidle_enter_state
> 3.31 ± 3% -0.5 2.83 ± 8% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.clock_was_set.timekeeping_inject_offset.do_adjtimex.__do_sys_clock_adjtime
> 2.78 ± 3% -0.4 2.42 ± 10% perf-profile.calltrace.cycles-pp.timer_create
> 2.37 ± 3% -0.3 2.07 ± 10% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.timer_create
> 2.33 ± 3% -0.3 2.03 ± 10% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.timer_create
> 1.64 ± 3% -0.3 1.36 ± 10% perf-profile.calltrace.cycles-pp.add_device_randomness.do_adjtimex.__do_sys_clock_adjtime.do_syscall_64.entry_SYSCALL_64_after_hwframe
> 1.43 ± 3% -0.3 1.16 ± 11% perf-profile.calltrace.cycles-pp.blake2s_compress.blake2s_update.add_device_randomness.do_adjtimex.__do_sys_clock_adjtime
> 1.52 ± 3% -0.3 1.26 ± 11% perf-profile.calltrace.cycles-pp.blake2s_update.add_device_randomness.do_adjtimex.__do_sys_clock_adjtime.do_syscall_64
> 1.23 ± 8% -0.2 0.99 ± 12% perf-profile.calltrace.cycles-pp.posix_cpu_clock_get.__x64_sys_clock_gettime.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
> 1.02 ± 5% -0.1 0.88 ± 12% perf-profile.calltrace.cycles-pp.read_tsc.ktime_get_update_offsets_now.clock_was_set.timekeeping_inject_offset.do_adjtimex
> 0.74 ± 4% -0.1 0.61 ± 13% perf-profile.calltrace.cycles-pp.ttwu_do_activate.try_to_wake_up.hrtimer_wakeup.__hrtimer_run_queues.hrtimer_interrupt
> 0.47 ± 45% +0.4 0.90 ± 17% perf-profile.calltrace.cycles-pp.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
> 0.00 +0.7 0.69 ± 10% perf-profile.calltrace.cycles-pp.lookup_one_len.start_creating.__debugfs_create_file.ptp_open.posix_clock_open
> 0.00 +0.7 0.72 ± 12% perf-profile.calltrace.cycles-pp.dput.simple_recursive_removal.debugfs_remove.ptp_release.posix_clock_release
> 0.00 +0.7 0.75 ± 14% perf-profile.calltrace.cycles-pp.lookup_one_len.start_creating.debugfs_create_dir.ptp_open.posix_clock_open
> 0.00 +0.8 0.75 ± 12% perf-profile.calltrace.cycles-pp.start_creating.__debugfs_create_file.ptp_open.posix_clock_open.chrdev_open
> 0.00 +0.9 0.86 ± 15% perf-profile.calltrace.cycles-pp.start_creating.debugfs_create_dir.ptp_open.posix_clock_open.chrdev_open
> 0.00 +0.9 0.91 ± 14% perf-profile.calltrace.cycles-pp.mnt_get_count.mntput_no_expire.simple_recursive_removal.debugfs_remove.ptp_release
> 0.00 +1.0 1.05 ± 13% perf-profile.calltrace.cycles-pp.mntput_no_expire.simple_recursive_removal.debugfs_remove.ptp_release.posix_clock_release
> 0.00 +1.3 1.26 ± 12% perf-profile.calltrace.cycles-pp.__debugfs_create_file.ptp_open.posix_clock_open.chrdev_open.do_dentry_open
> 0.00 +1.5 1.47 ± 13% perf-profile.calltrace.cycles-pp.debugfs_create_dir.ptp_open.posix_clock_open.chrdev_open.do_dentry_open
> 0.00 +2.4 2.42 ± 11% perf-profile.calltrace.cycles-pp.simple_recursive_removal.debugfs_remove.ptp_release.posix_clock_release.__fput
> 0.00 +3.0 2.95 ± 11% perf-profile.calltrace.cycles-pp.debugfs_remove.ptp_release.posix_clock_release.__fput.__x64_sys_close
> 1.74 ± 5% +3.0 4.71 ± 8% perf-profile.calltrace.cycles-pp.do_dentry_open.do_open.path_openat.do_filp_open.do_sys_openat2
> 2.90 ± 6% +3.0 5.87 ± 9% perf-profile.calltrace.cycles-pp.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat.do_syscall_64
> 1.35 ± 6% +3.0 4.33 ± 8% perf-profile.calltrace.cycles-pp.posix_clock_open.chrdev_open.do_dentry_open.do_open.path_openat
> 1.42 ± 6% +3.0 4.41 ± 8% perf-profile.calltrace.cycles-pp.chrdev_open.do_dentry_open.do_open.path_openat.do_filp_open
> 2.02 ± 5% +3.0 5.01 ± 8% perf-profile.calltrace.cycles-pp.do_open.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat
> 2.95 ± 6% +3.0 5.95 ± 9% perf-profile.calltrace.cycles-pp.do_filp_open.do_sys_openat2.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
> 3.51 ± 6% +3.0 6.51 ± 9% perf-profile.calltrace.cycles-pp.do_sys_openat2.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe.open64
> 3.54 ± 6% +3.0 6.55 ± 9% perf-profile.calltrace.cycles-pp.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe.open64
> 1.18 ± 6% +3.0 4.21 ± 8% perf-profile.calltrace.cycles-pp.ptp_open.posix_clock_open.chrdev_open.do_dentry_open.do_open
> 3.57 ± 6% +3.0 6.61 ± 9% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.open64
> 3.75 ± 6% +3.1 6.82 ± 10% perf-profile.calltrace.cycles-pp.open64
> 3.58 ± 6% +3.1 6.66 ± 10% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.open64
> 0.93 ± 11% +3.2 4.16 ± 11% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__close
> 0.90 ± 11% +3.2 4.13 ± 11% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close
> 0.87 ± 11% +3.2 4.10 ± 12% perf-profile.calltrace.cycles-pp.__x64_sys_close.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close
> 1.06 ± 8% +3.2 4.30 ± 11% perf-profile.calltrace.cycles-pp.__close
> 0.51 ± 45% +3.4 3.91 ± 12% perf-profile.calltrace.cycles-pp.__fput.__x64_sys_close.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close
> 0.00 +3.6 3.59 ± 11% perf-profile.calltrace.cycles-pp.ptp_release.posix_clock_release.__fput.__x64_sys_close.do_syscall_64
> 0.00 +3.7 3.68 ± 11% perf-profile.calltrace.cycles-pp.posix_clock_release.__fput.__x64_sys_close.do_syscall_64.entry_SYSCALL_64_after_hwframe
> 12.58 ± 4% -2.8 9.74 ± 9% perf-profile.children.cycles-pp.__x64_sys_clock_gettime
> 9.42 ± 5% -2.2 7.20 ± 8% perf-profile.children.cycles-pp.clock_gettime
> 8.84 ± 5% -2.2 6.67 ± 9% perf-profile.children.cycles-pp.e1000e_read_systim
> 8.97 ± 5% -2.2 6.81 ± 8% perf-profile.children.cycles-pp.e1000e_phc_gettimex
> 9.12 ± 5% -2.1 6.98 ± 8% perf-profile.children.cycles-pp.pc_clock_gettime
> 10.92 ± 3% -1.8 9.07 ± 8% perf-profile.children.cycles-pp.syscall
> 11.24 ± 3% -1.7 9.51 ± 10% perf-profile.children.cycles-pp.clock_adjtime
> 9.69 ± 3% -1.5 8.21 ± 10% perf-profile.children.cycles-pp.__do_sys_clock_adjtime
> 7.19 ± 4% -1.4 5.80 ± 10% perf-profile.children.cycles-pp.poll_idle
> 8.53 ± 4% -1.3 7.22 ± 9% perf-profile.children.cycles-pp.do_adjtimex
> 6.42 ± 3% -0.9 5.51 ± 10% perf-profile.children.cycles-pp.timekeeping_inject_offset
> 5.93 ± 3% -0.9 5.06 ± 9% perf-profile.children.cycles-pp.clock_was_set
> 4.47 ± 3% -0.5 3.98 ± 7% perf-profile.children.cycles-pp._raw_spin_lock_irqsave
> 2.37 ± 3% -0.4 1.95 ± 9% perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
> 2.88 ± 2% -0.4 2.50 ± 10% perf-profile.children.cycles-pp.timer_create
> 2.08 ± 3% -0.3 1.74 ± 9% perf-profile.children.cycles-pp.read_tsc
> 1.65 ± 3% -0.3 1.37 ± 11% perf-profile.children.cycles-pp.add_device_randomness
> 1.43 ± 3% -0.3 1.16 ± 11% perf-profile.children.cycles-pp.blake2s_compress
> 1.53 ± 3% -0.3 1.26 ± 11% perf-profile.children.cycles-pp.blake2s_update
> 1.74 ± 5% -0.3 1.48 ± 8% perf-profile.children.cycles-pp.hrtimer_wakeup
> 1.24 ± 8% -0.2 1.00 ± 12% perf-profile.children.cycles-pp.posix_cpu_clock_get
> 1.70 ± 5% -0.2 1.46 ± 8% perf-profile.children.cycles-pp.try_to_wake_up
> 1.81 ± 6% -0.2 1.59 ± 11% perf-profile.children.cycles-pp.ktime_get_update_offsets_now
> 1.14 ± 3% -0.2 0.96 ± 10% perf-profile.children.cycles-pp.ttwu_do_activate
> 0.91 ± 3% -0.2 0.73 ± 13% perf-profile.children.cycles-pp.rep_movs_alternative
> 1.20 ± 6% -0.2 1.02 ± 9% perf-profile.children.cycles-pp.lapic_next_deadline
> 1.04 ± 4% -0.2 0.87 ± 13% perf-profile.children.cycles-pp.hrtimer_start_range_ns
> 1.00 ± 4% -0.1 0.86 ± 10% perf-profile.children.cycles-pp.activate_task
> 0.96 ± 5% -0.1 0.83 ± 10% perf-profile.children.cycles-pp.syscall_exit_to_user_mode
> 0.46 ± 7% -0.1 0.33 ± 15% perf-profile.children.cycles-pp._raw_spin_unlock_irqrestore
> 0.54 ± 8% -0.1 0.42 ± 16% perf-profile.children.cycles-pp.clockevents_program_min_delta
> 1.04 ± 4% -0.1 0.94 ± 6% perf-profile.children.cycles-pp.memset_orig
> 0.52 ± 8% -0.1 0.42 ± 10% perf-profile.children.cycles-pp.entry_SYSCALL_64_safe_stack
> 0.22 ± 17% -0.1 0.14 ± 34% perf-profile.children.cycles-pp.posix_get_monotonic_timespec
> 0.22 ± 20% -0.1 0.14 ± 20% perf-profile.children.cycles-pp.ktime_get_real_ts64
> 0.11 ± 20% -0.1 0.05 ± 51% perf-profile.children.cycles-pp.__cgroup_account_cputime
> 0.22 ± 9% -0.1 0.16 ± 16% perf-profile.children.cycles-pp.posix_get_boottime_timespec
> 0.12 ± 7% -0.0 0.08 ± 27% perf-profile.children.cycles-pp.update_curr_fair
> 0.06 ± 46% +0.0 0.10 ± 25% perf-profile.children.cycles-pp.memcpy_orig
> 0.01 ±223% +0.1 0.07 ± 21% perf-profile.children.cycles-pp.rcu_nocb_try_bypass
> 0.00 +0.1 0.06 ± 14% perf-profile.children.cycles-pp.default_pointer
> 0.00 +0.1 0.07 ± 15% perf-profile.children.cycles-pp.__srcu_read_unlock
> 0.00 +0.1 0.08 ± 32% perf-profile.children.cycles-pp.iput
> 0.00 +0.1 0.08 ± 23% perf-profile.children.cycles-pp.lookup_one_common
> 0.00 +0.1 0.09 ± 18% perf-profile.children.cycles-pp.inode_init_once
> 0.00 +0.1 0.09 ± 22% perf-profile.children.cycles-pp.__srcu_read_lock
> 0.00 +0.1 0.09 ± 26% perf-profile.children.cycles-pp.up_write
> 0.00 +0.1 0.09 ± 20% perf-profile.children.cycles-pp.__d_lookup
> 0.00 +0.1 0.10 ± 19% perf-profile.children.cycles-pp.setup_object
> 0.14 ± 20% +0.1 0.24 ± 13% perf-profile.children.cycles-pp.get_obj_cgroup_from_current
> 0.00 +0.1 0.10 ± 27% perf-profile.children.cycles-pp.shrink_dcache_parent
> 0.02 ±142% +0.1 0.12 ± 9% perf-profile.children.cycles-pp.lockref_put_or_lock
> 0.00 +0.1 0.10 ± 20% perf-profile.children.cycles-pp.d_lookup
> 0.00 +0.1 0.10 ± 18% perf-profile.children.cycles-pp.__wake_up_common_lock
> 0.00 +0.1 0.11 ± 17% perf-profile.children.cycles-pp.lookup_dcache
> 0.00 +0.1 0.11 ± 22% perf-profile.children.cycles-pp.shuffle_freelist
> 0.00 +0.1 0.12 ± 17% perf-profile.children.cycles-pp.d_instantiate
> 0.00 +0.1 0.12 ± 24% perf-profile.children.cycles-pp.down_write
> 0.10 ± 36% +0.1 0.23 ± 29% perf-profile.children.cycles-pp.obj_cgroup_charge
> 0.00 +0.1 0.14 ± 24% perf-profile.children.cycles-pp.allocate_slab
> 0.00 +0.1 0.14 ± 36% perf-profile.children.cycles-pp.memcg_list_lru_alloc
> 0.00 +0.1 0.14 ± 34% perf-profile.children.cycles-pp.d_walk
> 0.00 +0.1 0.14 ± 5% perf-profile.children.cycles-pp.number
> 0.00 +0.2 0.16 ± 16% perf-profile.children.cycles-pp.__destroy_inode
> 0.00 +0.2 0.16 ± 16% perf-profile.children.cycles-pp.dentry_unlink_inode
> 0.10 ± 26% +0.2 0.26 ± 18% perf-profile.children.cycles-pp.___slab_alloc
> 0.00 +0.2 0.17 ± 17% perf-profile.children.cycles-pp.destroy_inode
> 0.00 +0.2 0.18 ± 13% perf-profile.children.cycles-pp.fsnotify_grab_connector
> 0.27 ± 20% +0.2 0.44 ± 12% perf-profile.children.cycles-pp.__call_rcu_common
> 0.34 ± 14% +0.2 0.52 ± 15% perf-profile.children.cycles-pp.mod_objcg_state
> 0.00 +0.2 0.19 ± 8% perf-profile.children.cycles-pp.inode_init_always
> 0.00 +0.2 0.22 ± 15% perf-profile.children.cycles-pp.fsnotify_destroy_marks
> 0.40 ± 12% +0.3 0.66 ± 18% perf-profile.children.cycles-pp.memcg_slab_post_alloc_hook
> 0.00 +0.3 0.26 ± 22% perf-profile.children.cycles-pp.d_invalidate
> 0.00 +0.3 0.28 ± 10% perf-profile.children.cycles-pp.__d_add
> 0.00 +0.3 0.30 ± 11% perf-profile.children.cycles-pp.simple_lookup
> 0.00 +0.3 0.32 ± 15% perf-profile.children.cycles-pp.evict
> 0.55 ± 9% +0.4 0.90 ± 17% perf-profile.children.cycles-pp.smpboot_thread_fn
> 0.14 ± 24% +0.4 0.50 ± 15% perf-profile.children.cycles-pp._find_next_bit
> 0.00 +0.4 0.37 ± 11% perf-profile.children.cycles-pp.sprintf
> 0.00 +0.4 0.37 ± 11% perf-profile.children.cycles-pp.vsnprintf
> 0.00 +0.4 0.43 ± 15% perf-profile.children.cycles-pp.slab_pre_alloc_hook
> 0.00 +0.6 0.58 ± 12% perf-profile.children.cycles-pp.__d_alloc
> 1.69 ± 7% +0.6 2.30 ± 14% perf-profile.children.cycles-pp.__do_softirq
> 1.15 ± 4% +0.6 1.80 ± 17% perf-profile.children.cycles-pp.rcu_do_batch
> 1.26 ± 4% +0.6 1.91 ± 15% perf-profile.children.cycles-pp.rcu_core
> 0.00 +0.7 0.70 ± 14% perf-profile.children.cycles-pp.alloc_inode
> 0.00 +0.8 0.76 ± 10% perf-profile.children.cycles-pp.d_alloc
> 0.00 +0.8 0.84 ± 11% perf-profile.children.cycles-pp.d_alloc_parallel
> 0.59 ± 6% +0.9 1.46 ± 5% perf-profile.children.cycles-pp._raw_spin_lock
> 0.00 +0.9 0.91 ± 11% perf-profile.children.cycles-pp.new_inode
> 0.00 +1.0 1.01 ± 15% perf-profile.children.cycles-pp.kmem_cache_alloc_lru
> 0.00 +1.0 1.05 ± 11% perf-profile.children.cycles-pp.__dentry_kill
> 0.00 +1.1 1.10 ± 11% perf-profile.children.cycles-pp.dentry_kill
> 0.00 +1.2 1.16 ± 11% perf-profile.children.cycles-pp.__lookup_slow
> 0.10 ± 30% +1.2 1.31 ± 10% perf-profile.children.cycles-pp.dput
> 0.00 +1.3 1.26 ± 12% perf-profile.children.cycles-pp.__debugfs_create_file
> 0.00 +1.4 1.44 ± 13% perf-profile.children.cycles-pp.mnt_get_count
> 0.00 +1.4 1.44 ± 11% perf-profile.children.cycles-pp.lookup_one_len
> 0.00 +1.5 1.47 ± 13% perf-profile.children.cycles-pp.debugfs_create_dir
> 0.01 ±223% +1.6 1.60 ± 13% perf-profile.children.cycles-pp.mntput_no_expire
> 0.00 +1.6 1.61 ± 11% perf-profile.children.cycles-pp.start_creating
> 0.00 +2.4 2.43 ± 12% perf-profile.children.cycles-pp.simple_recursive_removal
> 0.00 +3.0 2.96 ± 11% perf-profile.children.cycles-pp.debugfs_remove
> 1.75 ± 5% +3.0 4.71 ± 8% perf-profile.children.cycles-pp.do_dentry_open
> 1.35 ± 6% +3.0 4.33 ± 8% perf-profile.children.cycles-pp.posix_clock_open
> 2.93 ± 5% +3.0 5.91 ± 9% perf-profile.children.cycles-pp.path_openat
> 1.42 ± 6% +3.0 4.41 ± 8% perf-profile.children.cycles-pp.chrdev_open
> 2.03 ± 5% +3.0 5.02 ± 8% perf-profile.children.cycles-pp.do_open
> 2.97 ± 6% +3.0 5.98 ± 9% perf-profile.children.cycles-pp.do_filp_open
> 3.54 ± 6% +3.0 6.56 ± 9% perf-profile.children.cycles-pp.do_sys_openat2
> 3.57 ± 6% +3.0 6.59 ± 9% perf-profile.children.cycles-pp.__x64_sys_openat
> 1.18 ± 6% +3.0 4.22 ± 8% perf-profile.children.cycles-pp.ptp_open
> 3.79 ± 6% +3.1 6.84 ± 10% perf-profile.children.cycles-pp.open64
> 0.87 ± 11% +3.2 4.10 ± 12% perf-profile.children.cycles-pp.__x64_sys_close
> 1.09 ± 9% +3.2 4.34 ± 11% perf-profile.children.cycles-pp.__close
> 0.59 ± 11% +3.3 3.92 ± 12% perf-profile.children.cycles-pp.__fput
> 0.31 ± 19% +3.4 3.68 ± 11% perf-profile.children.cycles-pp.posix_clock_release
> 0.18 ± 15% +3.4 3.59 ± 11% perf-profile.children.cycles-pp.ptp_release
> 8.84 ± 5% -2.2 6.67 ± 9% perf-profile.self.cycles-pp.e1000e_read_systim
> 3.55 ± 7% -0.8 2.72 ± 14% perf-profile.self.cycles-pp.poll_idle
> 4.42 ± 3% -0.5 3.92 ± 7% perf-profile.self.cycles-pp._raw_spin_lock_irqsave
> 2.32 ± 4% -0.4 1.90 ± 9% perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
> 2.01 ± 3% -0.3 1.70 ± 8% perf-profile.self.cycles-pp.read_tsc
> 1.78 ± 5% -0.3 1.50 ± 9% perf-profile.self.cycles-pp.syscall
> 1.43 ± 3% -0.3 1.16 ± 11% perf-profile.self.cycles-pp.blake2s_compress
> 1.08 ± 5% -0.2 0.88 ± 15% perf-profile.self.cycles-pp._copy_to_user
> 1.20 ± 6% -0.2 1.02 ± 9% perf-profile.self.cycles-pp.lapic_next_deadline
> 0.56 ± 4% -0.1 0.44 ± 17% perf-profile.self.cycles-pp.rep_movs_alternative
> 0.42 ± 8% -0.1 0.30 ± 16% perf-profile.self.cycles-pp._raw_spin_unlock_irqrestore
> 0.52 ± 8% -0.1 0.42 ± 10% perf-profile.self.cycles-pp.entry_SYSCALL_64_safe_stack
> 0.38 ± 10% -0.1 0.30 ± 19% perf-profile.self.cycles-pp.syscall_enter_from_user_mode
> 0.44 ± 9% -0.1 0.36 ± 9% perf-profile.self.cycles-pp.__schedule
> 0.28 ± 14% -0.1 0.20 ± 10% perf-profile.self.cycles-pp.exit_to_user_mode_prepare
> 0.09 ± 29% -0.1 0.04 ± 72% perf-profile.self.cycles-pp.do_adjtimex
> 0.21 ± 13% -0.0 0.16 ± 10% perf-profile.self.cycles-pp.__update_load_avg_se
> 0.12 ± 8% -0.0 0.08 ± 22% perf-profile.self.cycles-pp.update_curr_fair
> 0.13 ± 20% -0.0 0.10 ± 10% perf-profile.self.cycles-pp.apparmor_capable
> 0.06 ± 50% +0.0 0.10 ± 16% perf-profile.self.cycles-pp.get_obj_cgroup_from_current
> 0.02 ±144% +0.1 0.07 ± 17% perf-profile.self.cycles-pp.do_filp_open
> 0.17 ± 16% +0.1 0.23 ± 12% perf-profile.self.cycles-pp.put_timespec64
> 0.02 ±141% +0.1 0.08 ± 27% perf-profile.self.cycles-pp.___slab_alloc
> 0.00 +0.1 0.07 ± 15% perf-profile.self.cycles-pp.__srcu_read_unlock
> 0.02 ±141% +0.1 0.09 ± 24% perf-profile.self.cycles-pp.lockref_put_or_lock
> 0.44 ± 9% +0.1 0.52 ± 9% perf-profile.self.cycles-pp.kmem_cache_free
> 0.00 +0.1 0.09 ± 21% perf-profile.self.cycles-pp.inode_init_once
> 0.00 +0.1 0.09 ± 21% perf-profile.self.cycles-pp.__srcu_read_lock
> 0.00 +0.1 0.09 ± 25% perf-profile.self.cycles-pp.up_write
> 0.00 +0.1 0.09 ± 21% perf-profile.self.cycles-pp.__d_lookup
> 0.00 +0.1 0.09 ± 26% perf-profile.self.cycles-pp.down_write
> 0.00 +0.1 0.09 ± 27% perf-profile.self.cycles-pp.slab_pre_alloc_hook
> 0.01 ±223% +0.1 0.11 ± 51% perf-profile.self.cycles-pp.mntput_no_expire
> 0.00 +0.1 0.11 ± 9% perf-profile.self.cycles-pp.kmem_cache_alloc_lru
> 0.00 +0.1 0.12 ± 12% perf-profile.self.cycles-pp.inode_init_always
> 0.00 +0.1 0.14 ± 5% perf-profile.self.cycles-pp.number
> 0.26 ± 19% +0.2 0.44 ± 18% perf-profile.self.cycles-pp.memcg_slab_post_alloc_hook
> 0.11 ± 29% +0.3 0.39 ± 18% perf-profile.self.cycles-pp._find_next_bit
> 0.58 ± 7% +0.8 1.42 ± 5% perf-profile.self.cycles-pp._raw_spin_lock
> 0.00 +1.1 1.14 ± 14% perf-profile.self.cycles-pp.mnt_get_count
>
>
>
>
> Disclaimer:
> Results have been estimated based on internal Intel analysis and are provided
> for informational purposes only. Any difference in system hardware or software
> design or configuration may affect actual performance.
>
>
> --
> 0-DAY CI Kernel Test Service
> https://github.com/intel/lkp-tests/wiki
>
--
Vinicius