lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Message-ID: <202407241523.14878db1-oliver.sang@intel.com>
Date: Wed, 24 Jul 2024 15:57:29 +0800
From: kernel test robot <oliver.sang@...el.com>
To: Jesper Dangaard Brouer <hawk@...nel.org>
CC: <oe-lkp@...ts.linux.dev>, <lkp@...el.com>, <linux-kernel@...r.kernel.org>,
	Tejun Heo <tj@...nel.org>, <linux-trace-kernel@...r.kernel.org>,
	<cgroups@...r.kernel.org>, <ying.huang@...el.com>, <feng.tang@...el.com>,
	<fengwei.yin@...el.com>, <oliver.sang@...el.com>
Subject: [linus:master] [cgroup/rstat]  21c38a3bd4: stress-ng.msg.ops_per_sec
 -17.7% regression


hi, Jesper Dangaard Brouer, and all,

we made some investigation for this regression and confirmed the code change
in 21c38a3bd4 is irrelevant to stress-ng performance regression, instead, it's
a code alignment issue.

if we applied below patch for both this commit and parent, the regression will
disappear.

diff --git a/Kbuild b/Kbuild
index 464b34a08f51..829d13a010b7 100644
--- a/Kbuild
+++ b/Kbuild
@@ -80,11 +80,11 @@ obj-y                       += init/
 obj-y                  += usr/
 obj-y                  += arch/$(SRCARCH)/
 obj-y                  += $(ARCH_CORE)
+obj-y                  += ipc/
 obj-y                  += kernel/
 obj-y                  += certs/
 obj-y                  += mm/
 obj-y                  += fs/
-obj-y                  += ipc/
 obj-y                  += security/
 obj-y                  += crypto/
 obj-$(CONFIG_BLOCK)    += block/


we still make out below formal report FYI. and Feng Tang (Cced) is working on a
patch to mitigate this kind of less meaningful performance changes due to code
alignment.


Hello,

kernel test robot noticed a -17.7% regression of stress-ng.msg.ops_per_sec on:


commit: 21c38a3bd4ee3fb7337d013a638302fb5e5f9dc2 ("cgroup/rstat: add cgroup_rstat_cpu_lock helpers and tracepoints")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

testcase: stress-ng
test machine: 256 threads 2 sockets GENUINE INTEL(R) XEON(R) (Sierra Forest) with 128G memory
parameters:

	nr_threads: 100%
	testtime: 60s
	test: msg
	cpufreq_governor: performance



Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20240724/202407241523.14878db1-oliver.sang@intel.com

=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-13/performance/x86_64-rhel-8.3/100%/debian-12-x86_64-20240206.cgz/lkp-srf-2sp1/msg/stress-ng/60s

commit: 
  c1457d9aad ("selftests/cgroup: Drop define _GNU_SOURCE")
  21c38a3bd4 ("cgroup/rstat: add cgroup_rstat_cpu_lock helpers and tracepoints")

c1457d9aad5ee2fe 21c38a3bd4ee3fb7337d013a638 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
  11013328           -14.0%    9475268        cpuidle..usage
    338790 ±  2%     -11.5%     299752 ±  2%  meminfo.Active
    338760 ±  2%     -11.5%     299723 ±  2%  meminfo.Active(anon)
    300946 ±  3%     -20.9%     238050 ± 26%  numa-meminfo.node1.Active
    300932 ±  3%     -20.9%     238045 ± 26%  numa-meminfo.node1.Active(anon)
    611478 ± 14%     -21.7%     478711 ± 21%  numa-meminfo.node1.Shmem
     75304 ±  3%     -21.4%      59175 ± 26%  numa-vmstat.node1.nr_active_anon
    153254 ± 14%     -21.8%     119804 ± 21%  numa-vmstat.node1.nr_shmem
     75303 ±  3%     -21.4%      59175 ± 26%  numa-vmstat.node1.nr_zone_active_anon
     71.66            +8.9%      78.05        vmstat.cpu.id
     68.36 ± 12%     -20.3%      54.46 ±  8%  vmstat.procs.r
    250061           -18.3%     204251        vmstat.system.cs
    287875           -18.4%     235023        vmstat.system.in
      0.50            -0.1        0.43 ±  3%  mpstat.cpu.all.irq%
      0.12 ±  2%      -0.0        0.10        mpstat.cpu.all.soft%
     27.05            -6.3       20.79        mpstat.cpu.all.sys%
      1.58            -0.3        1.31        mpstat.cpu.all.usr%
     33.01           -23.2%      25.34        mpstat.max_utilization_pct
    831.17 ±  5%     -23.7%     634.00 ±  7%  perf-c2c.DRAM.local
     27497           -26.4%      20232 ±  2%  perf-c2c.DRAM.remote
     95989           -22.4%      74505        perf-c2c.HITM.local
     22829           -27.2%      16620 ±  2%  perf-c2c.HITM.remote
    118818           -23.3%      91125        perf-c2c.HITM.total
 7.763e+08           -17.7%  6.391e+08        stress-ng.msg.ops
  12937976           -17.7%   10650694        stress-ng.msg.ops_per_sec
     20708           -25.9%      15335        stress-ng.time.involuntary_context_switches
      7333           -22.5%       5682        stress-ng.time.percent_of_cpu_this_job_got
      4240           -22.7%       3279        stress-ng.time.system_time
    195.77           -19.8%     157.09        stress-ng.time.user_time
   7897285           -18.3%    6450672        stress-ng.time.voluntary_context_switches
     84383 ±  3%     -11.6%      74585 ±  3%  proc-vmstat.nr_active_anon
    947441            -1.6%     932675        proc-vmstat.nr_file_pages
    270183            -2.2%     264202        proc-vmstat.nr_inactive_anon
    141423 ±  3%      -8.4%     129608 ±  2%  proc-vmstat.nr_mapped
    180981            -8.2%     166203        proc-vmstat.nr_shmem
     84383 ±  3%     -11.6%      74585 ±  3%  proc-vmstat.nr_zone_active_anon
    270183            -2.2%     264202        proc-vmstat.nr_zone_inactive_anon
   2030924            -7.1%    1886998        proc-vmstat.numa_hit
   1766802            -8.1%    1622970        proc-vmstat.numa_local
   2118829            -7.0%    1971226        proc-vmstat.pgalloc_normal
   1014503            -5.2%     961850        proc-vmstat.pgfault
   1776085 ±  2%      -6.8%    1655949 ±  2%  proc-vmstat.pgfree
    658538 ±  2%     -40.3%     392887        sched_debug.cfs_rq:/.avg_vruntime.avg
   1004627 ±  2%     -38.4%     618736 ±  8%  sched_debug.cfs_rq:/.avg_vruntime.max
    463772 ±  8%     -34.4%     304006        sched_debug.cfs_rq:/.avg_vruntime.min
     55077 ±  4%     -25.7%      40920        sched_debug.cfs_rq:/.avg_vruntime.stddev
    658538 ±  2%     -40.3%     392887        sched_debug.cfs_rq:/.min_vruntime.avg
   1004627 ±  2%     -38.4%     618736 ±  8%  sched_debug.cfs_rq:/.min_vruntime.max
    463772 ±  8%     -34.4%     304006        sched_debug.cfs_rq:/.min_vruntime.min
     55077 ±  4%     -25.7%      40920        sched_debug.cfs_rq:/.min_vruntime.stddev
    193.88 ±  5%     -19.0%     156.99 ±  6%  sched_debug.cfs_rq:/.runnable_avg.avg
    156.63 ±  6%     -11.0%     139.47 ±  5%  sched_debug.cfs_rq:/.runnable_avg.stddev
    193.51 ±  5%     -19.0%     156.69 ±  6%  sched_debug.cfs_rq:/.util_avg.avg
    156.53 ±  6%     -11.2%     139.06 ±  5%  sched_debug.cfs_rq:/.util_avg.stddev
     29.90 ± 11%     -28.2%      21.48 ± 17%  sched_debug.cfs_rq:/.util_est.avg
    155612 ±  8%     -14.5%     133052 ±  3%  sched_debug.cpu.avg_idle.stddev
     31373           -17.6%      25848        sched_debug.cpu.nr_switches.avg
     24963 ±  5%     -16.7%      20782 ±  6%  sched_debug.cpu.nr_switches.min
    141.42 ±  6%     -57.2%      60.50 ± 21%  sched_debug.cpu.nr_uninterruptible.max
     -1239           -81.4%    -230.33        sched_debug.cpu.nr_uninterruptible.min
     81.43 ±  8%     -77.5%      18.29 ± 11%  sched_debug.cpu.nr_uninterruptible.stddev
      0.96            -4.7%       0.91 ±  2%  perf-stat.i.MPKI
 8.823e+09           -20.0%   7.06e+09        perf-stat.i.branch-instructions
      1.59            +0.0        1.64        perf-stat.i.branch-miss-rate%
 1.394e+08           -16.9%  1.158e+08        perf-stat.i.branch-misses
      9.60            -0.8        8.76 ±  2%  perf-stat.i.cache-miss-rate%
  42966024           -23.0%   33093243 ±  2%  perf-stat.i.cache-misses
 4.653e+08           -16.0%  3.908e+08        perf-stat.i.cache-references
    261124           -18.2%     213609        perf-stat.i.context-switches
      4.44            -3.0%       4.31        perf-stat.i.cpi
 2.038e+11           -22.2%  1.585e+11        perf-stat.i.cpu-cycles
     41405           -34.6%      27075        perf-stat.i.cpu-migrations
 4.563e+10           -19.5%  3.672e+10        perf-stat.i.instructions
      0.23            +2.8%       0.23        perf-stat.i.ipc
      0.94           -98.3%       0.02 ± 44%  perf-stat.i.metric.K/sec
     15289 ±  2%      -8.2%      14035 ±  2%  perf-stat.i.minor-faults
     15290 ±  2%      -8.2%      14036 ±  2%  perf-stat.i.page-faults
      1.56            +0.1        1.63        perf-stat.overall.branch-miss-rate%
      9.04            -0.7        8.36 ±  2%  perf-stat.overall.cache-miss-rate%
      4.46            -3.2%       4.32        perf-stat.overall.cpi
      0.22            +3.4%       0.23        perf-stat.overall.ipc
 8.733e+09           -20.3%  6.963e+09        perf-stat.ps.branch-instructions
 1.364e+08           -17.0%  1.132e+08        perf-stat.ps.branch-misses
  42281735           -23.1%   32530958 ±  2%  perf-stat.ps.cache-misses
 4.676e+08           -16.8%   3.89e+08        perf-stat.ps.cache-references
    257335           -18.3%     210344        perf-stat.ps.context-switches
 2.016e+11           -22.4%  1.564e+11        perf-stat.ps.cpu-cycles
     40985           -34.9%      26692        perf-stat.ps.cpu-migrations
 4.517e+10           -19.8%  3.622e+10        perf-stat.ps.instructions
     14314 ±  2%      -6.2%      13432 ±  2%  perf-stat.ps.minor-faults
     14315 ±  2%      -6.2%      13432        perf-stat.ps.page-faults
 2.764e+12           -19.5%  2.225e+12        perf-stat.total.instructions
     35.46           -10.8       24.66        perf-profile.calltrace.cycles-pp.__percpu_counter_sum.msgctl_info.ksys_msgctl.do_syscall_64.entry_SYSCALL_64_after_hwframe
     33.85 ±  2%     -10.7       23.11        perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__percpu_counter_sum.msgctl_info.ksys_msgctl.do_syscall_64
     33.74 ±  2%     -10.7       23.01        perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.__percpu_counter_sum.msgctl_info.ksys_msgctl
     38.54            -9.0       29.55        perf-profile.calltrace.cycles-pp.msgctl_info.ksys_msgctl.do_syscall_64.entry_SYSCALL_64_after_hwframe.msgctl
      2.99            -0.8        2.23 ±  8%  perf-profile.calltrace.cycles-pp.percpu_counter_add_batch.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgrcv
      2.19 ±  6%      -0.7        1.44 ± 11%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe
      2.74 ±  7%      -0.5        2.24        perf-profile.calltrace.cycles-pp.percpu_counter_add_batch.do_msgsnd.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgsnd
      3.26            -0.3        2.97        perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock.do_msgsnd.do_syscall_64.entry_SYSCALL_64_after_hwframe
      3.90            -0.3        3.63        perf-profile.calltrace.cycles-pp._raw_spin_lock.do_msgsnd.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgsnd
      0.68 ±  6%      +0.1        0.74        perf-profile.calltrace.cycles-pp._copy_from_user.load_msg.do_msgsnd.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.90            +0.1        1.03        perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
      0.55 ± 44%      +0.2        0.70        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__libc_msgsnd.stress_msg
      0.96 ±  5%      +0.2        1.14 ±  3%  perf-profile.calltrace.cycles-pp._copy_to_user.store_msg.do_msg_fill.do_msgrcv.do_syscall_64
      1.32            +0.2        1.55        perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry
      1.36            +0.2        1.59        perf-profile.calltrace.cycles-pp.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary
      0.42 ± 44%      +0.2        0.66        perf-profile.calltrace.cycles-pp.rwsem_wake.up_write.msgctl_down.ksys_msgctl.do_syscall_64
      0.46 ± 44%      +0.3        0.72        perf-profile.calltrace.cycles-pp.up_write.msgctl_down.ksys_msgctl.do_syscall_64.entry_SYSCALL_64_after_hwframe
      2.12            +0.3        2.40        perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.common_startup_64
      3.85            +0.3        4.19        perf-profile.calltrace.cycles-pp.common_startup_64
      3.59            +0.3        3.94        perf-profile.calltrace.cycles-pp.start_secondary.common_startup_64
      3.58            +0.3        3.93        perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.common_startup_64
      3.59            +0.3        3.94        perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.common_startup_64
      2.60 ±  6%      +0.4        3.00 ±  2%  perf-profile.calltrace.cycles-pp.store_msg.do_msg_fill.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe
      3.37            +0.4        3.81 ±  2%  perf-profile.calltrace.cycles-pp.do_msg_fill.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgrcv
      0.43 ± 44%      +0.5        0.98        perf-profile.calltrace.cycles-pp._raw_spin_lock_irq.rwsem_down_write_slowpath.down_write.msgctl_down.ksys_msgctl
      0.78 ±  6%      +0.6        1.34 ±  3%  perf-profile.calltrace.cycles-pp.seq_read_iter.seq_read.vfs_read.ksys_read.do_syscall_64
      0.79 ±  6%      +0.6        1.35 ±  3%  perf-profile.calltrace.cycles-pp.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
      0.78 ±  6%      +0.6        1.34 ±  3%  perf-profile.calltrace.cycles-pp.seq_read.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.00            +0.7        0.65 ±  4%  perf-profile.calltrace.cycles-pp.rwsem_down_read_slowpath.down_read.sysvipc_proc_start.seq_read_iter.seq_read
      0.00            +0.7        0.67 ±  4%  perf-profile.calltrace.cycles-pp.down_read.sysvipc_proc_start.seq_read_iter.seq_read.vfs_read
      0.70 ± 44%      +0.7        1.37 ±  2%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.read.stress_msg
      0.68 ± 44%      +0.7        1.35 ±  3%  perf-profile.calltrace.cycles-pp.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read.stress_msg
      0.69 ± 44%      +0.7        1.37 ±  2%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.read.stress_msg
      0.70 ± 44%      +0.7        1.38 ±  2%  perf-profile.calltrace.cycles-pp.read.stress_msg
      0.00            +0.7        0.69 ±  4%  perf-profile.calltrace.cycles-pp.sysvipc_proc_start.seq_read_iter.seq_read.vfs_read.ksys_read
      0.67 ±  9%      +0.9        1.56        perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.msgctl_down.ksys_msgctl
      0.00            +0.9        0.90        perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irq.rwsem_down_write_slowpath.down_write.msgctl_down
      0.00            +1.2        1.19 ±  2%  perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.msgctl_down
      1.52 ±  8%      +1.4        2.94        perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.msgctl_down.ksys_msgctl.do_syscall_64
      1.58 ±  8%      +1.5        3.03        perf-profile.calltrace.cycles-pp.down_write.msgctl_down.ksys_msgctl.do_syscall_64.entry_SYSCALL_64_after_hwframe
      2.24 ±  7%      +1.7        3.89        perf-profile.calltrace.cycles-pp.msgctl_down.ksys_msgctl.do_syscall_64.entry_SYSCALL_64_after_hwframe.msgctl
      2.58 ±  7%      +1.8        4.34        perf-profile.calltrace.cycles-pp.rwsem_down_read_slowpath.down_read.msgctl_info.ksys_msgctl.do_syscall_64
      1.34 ±  8%      +1.8        3.12 ±  2%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irq.rwsem_down_read_slowpath.down_read.msgctl_info
      1.43 ±  8%      +1.8        3.25 ±  2%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irq.rwsem_down_read_slowpath.down_read.msgctl_info.ksys_msgctl
      2.70 ±  7%      +1.8        4.52        perf-profile.calltrace.cycles-pp.down_read.msgctl_info.ksys_msgctl.do_syscall_64.entry_SYSCALL_64_after_hwframe
     19.07 ±  3%      +2.1       21.20 ±  3%  perf-profile.calltrace.cycles-pp.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgrcv
     19.33 ±  3%      +2.2       21.49 ±  3%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgrcv
     19.39 ±  3%      +2.2       21.56 ±  3%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__libc_msgrcv
     20.22 ±  3%      +2.2       22.43 ±  3%  perf-profile.calltrace.cycles-pp.__libc_msgrcv
      8.31 ±  2%      +3.5       11.79        perf-profile.calltrace.cycles-pp.idr_find.ipc_obtain_object_check.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe
      7.97 ±  2%      +3.5       11.46        perf-profile.calltrace.cycles-pp.idr_find.ipc_obtain_object_check.do_msgsnd.do_syscall_64.entry_SYSCALL_64_after_hwframe
     10.10            +3.6       13.66        perf-profile.calltrace.cycles-pp.ipc_obtain_object_check.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgrcv
      9.71            +3.6       13.30        perf-profile.calltrace.cycles-pp.ipc_obtain_object_check.do_msgsnd.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgsnd
     18.24 ± 36%      +5.8       24.05        perf-profile.calltrace.cycles-pp.do_msgsnd.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgsnd.stress_msg
     18.98 ± 36%      +6.0       24.96        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgsnd.stress_msg
     19.08 ± 36%      +6.0       25.07        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__libc_msgsnd.stress_msg
     20.09 ± 36%      +6.2       26.33        perf-profile.calltrace.cycles-pp.__libc_msgsnd.stress_msg
     35.49           -10.8       24.68        perf-profile.children.cycles-pp.__percpu_counter_sum
     34.24 ±  2%     -10.6       23.68        perf-profile.children.cycles-pp._raw_spin_lock_irqsave
     38.54            -9.0       29.55        perf-profile.children.cycles-pp.msgctl_info
     41.48            -8.8       32.65        perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
     41.12            -7.4       33.76        perf-profile.children.cycles-pp.ksys_msgctl
     41.44            -7.3       34.09        perf-profile.children.cycles-pp.msgctl
      5.82            -1.3        4.56        perf-profile.children.cycles-pp.percpu_counter_add_batch
     89.95            -0.8       89.17        perf-profile.children.cycles-pp.do_syscall_64
     90.24            -0.8       89.48        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
      7.07            -0.7        6.36        perf-profile.children.cycles-pp._raw_spin_lock
      1.54 ±  2%      -0.4        1.16        perf-profile.children.cycles-pp.newidle_balance
      1.58            -0.4        1.21        perf-profile.children.cycles-pp.pick_next_task_fair
      2.58            -0.4        2.23        perf-profile.children.cycles-pp.__schedule
      1.48 ±  2%      -0.4        1.12        perf-profile.children.cycles-pp.load_balance
      2.01            -0.4        1.66        perf-profile.children.cycles-pp.schedule
      1.27            -0.3        1.00 ±  2%  perf-profile.children.cycles-pp.find_busiest_group
      1.27            -0.3        0.99 ±  2%  perf-profile.children.cycles-pp.update_sd_lb_stats
      1.22            -0.3        0.96 ±  2%  perf-profile.children.cycles-pp.update_sg_lb_stats
      1.19            -0.2        0.98        perf-profile.children.cycles-pp.schedule_preempt_disabled
      0.13 ±  6%      -0.0        0.08 ±  5%  perf-profile.children.cycles-pp.raw_spin_rq_lock_nested
      0.32 ±  2%      -0.0        0.29 ±  3%  perf-profile.children.cycles-pp.asm_sysvec_call_function_single
      0.32            -0.0        0.29 ±  2%  perf-profile.children.cycles-pp.idle_cpu
      0.32            -0.0        0.29 ±  4%  perf-profile.children.cycles-pp.sysvec_call_function_single
      0.09 ±  4%      -0.0        0.06 ±  6%  perf-profile.children.cycles-pp.find_busiest_queue
      0.19 ±  3%      -0.0        0.17 ±  3%  perf-profile.children.cycles-pp.update_blocked_averages
      0.14 ±  3%      -0.0        0.12 ±  4%  perf-profile.children.cycles-pp.ss_wakeup
      0.08 ±  4%      -0.0        0.06        perf-profile.children.cycles-pp.cmd_record
      0.35            -0.0        0.33 ±  2%  perf-profile.children.cycles-pp._nohz_idle_balance
      0.08 ±  4%      -0.0        0.06 ±  6%  perf-profile.children.cycles-pp.main
      0.08 ±  4%      -0.0        0.06 ±  6%  perf-profile.children.cycles-pp.run_builtin
      0.10 ±  4%      +0.0        0.12 ±  4%  perf-profile.children.cycles-pp.tick_nohz_next_event
      0.31            +0.0        0.32        perf-profile.children.cycles-pp.tick_nohz_stop_tick
      0.12            +0.0        0.13 ±  3%  perf-profile.children.cycles-pp.rwsem_mark_wake
      0.26            +0.0        0.27        perf-profile.children.cycles-pp.sysvipc_msg_proc_show
      0.13 ±  3%      +0.0        0.15 ±  2%  perf-profile.children.cycles-pp.hrtimer_start_range_ns
      0.14 ±  3%      +0.0        0.16        perf-profile.children.cycles-pp.quiet_vmstat
      0.10 ±  4%      +0.0        0.12 ±  3%  perf-profile.children.cycles-pp.rebalance_domains
      0.14 ±  2%      +0.0        0.16 ±  2%  perf-profile.children.cycles-pp.tick_nohz_get_sleep_length
      0.05            +0.0        0.07        perf-profile.children.cycles-pp.osq_unlock
      0.31            +0.0        0.33        perf-profile.children.cycles-pp.tick_nohz_idle_stop_tick
      0.38            +0.0        0.40        perf-profile.children.cycles-pp.__get_user_8
      0.38            +0.0        0.40        perf-profile.children.cycles-pp.__x64_sys_msgsnd
      0.61            +0.0        0.63        perf-profile.children.cycles-pp.flush_smp_call_function_queue
      0.09 ±  4%      +0.0        0.11 ± 19%  perf-profile.children.cycles-pp._find_next_and_bit
      0.30            +0.0        0.32 ±  2%  perf-profile.children.cycles-pp.ipcperms
      0.17 ±  2%      +0.0        0.20 ±  2%  perf-profile.children.cycles-pp.tick_nohz_restart_sched_tick
      0.12 ±  3%      +0.0        0.15 ±  3%  perf-profile.children.cycles-pp.therm_throt_process
      0.20 ±  2%      +0.0        0.24 ±  2%  perf-profile.children.cycles-pp.msgctl_stat
      0.38            +0.0        0.42        perf-profile.children.cycles-pp.__put_user_8
      0.27            +0.0        0.30 ±  2%  perf-profile.children.cycles-pp.tick_nohz_idle_exit
      0.31 ±  3%      +0.0        0.35 ±  7%  perf-profile.children.cycles-pp.nohz_balance_exit_idle
      0.28            +0.0        0.32 ±  2%  perf-profile.children.cycles-pp.menu_select
      0.72            +0.0        0.77        perf-profile.children.cycles-pp._copy_from_user
      0.00            +0.1        0.05        perf-profile.children.cycles-pp.ipcctl_obtain_check
      0.21 ±  3%      +0.1        0.27        perf-profile.children.cycles-pp.task_work_run
      0.46            +0.1        0.52 ±  2%  perf-profile.children.cycles-pp.__radix_tree_lookup
      0.20 ±  2%      +0.1        0.26        perf-profile.children.cycles-pp.task_mm_cid_work
      0.48            +0.1        0.54        perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      1.32 ±  2%      +0.1        1.38 ±  2%  perf-profile.children.cycles-pp.stress_msg_receiver
      0.26            +0.1        0.32 ±  2%  perf-profile.children.cycles-pp.intel_thermal_interrupt
      0.17 ±  4%      +0.1        0.23        perf-profile.children.cycles-pp.rwsem_spin_on_owner
      0.26 ±  2%      +0.1        0.32 ±  2%  perf-profile.children.cycles-pp.__sysvec_thermal
      0.19 ±  2%      +0.1        0.26 ±  9%  perf-profile.children.cycles-pp.kick_ilb
      1.11            +0.1        1.19        perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.24 ±  3%      +0.1        0.33 ±  2%  perf-profile.children.cycles-pp.up_read
      1.35            +0.1        1.44        perf-profile.children.cycles-pp.entry_SYSCALL_64
      0.34 ±  2%      +0.1        0.43 ±  2%  perf-profile.children.cycles-pp.sysvec_thermal
      1.10 ±  2%      +0.1        1.19 ±  4%  perf-profile.children.cycles-pp.update_process_times
      1.15 ±  2%      +0.1        1.25 ±  4%  perf-profile.children.cycles-pp.__hrtimer_run_queues
      1.14 ±  2%      +0.1        1.24 ±  4%  perf-profile.children.cycles-pp.tick_nohz_handler
      0.38 ±  2%      +0.1        0.48        perf-profile.children.cycles-pp.asm_sysvec_thermal
      1.20            +0.1        1.31 ±  3%  perf-profile.children.cycles-pp.hrtimer_interrupt
      1.21 ±  2%      +0.1        1.32 ±  3%  perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt
      1.36            +0.1        1.49 ±  3%  perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
      1.41            +0.1        1.54 ±  2%  perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
      0.13 ±  3%      +0.1        0.26 ±  4%  perf-profile.children.cycles-pp.idr_get_next
      0.12 ±  4%      +0.1        0.25 ±  3%  perf-profile.children.cycles-pp.idr_get_next_ul
      0.21 ±  2%      +0.1        0.34 ±  4%  perf-profile.children.cycles-pp.sysvipc_proc_next
      0.92            +0.1        1.05        perf-profile.children.cycles-pp.intel_idle
      0.61            +0.2        0.78        perf-profile.children.cycles-pp.rwsem_wake
      0.56            +0.2        0.72        perf-profile.children.cycles-pp.up_write
      1.49            +0.2        1.71        perf-profile.children.cycles-pp.cpuidle_enter_state
      1.50            +0.2        1.72        perf-profile.children.cycles-pp.cpuidle_enter
      1.36            +0.3        1.62        perf-profile.children.cycles-pp._copy_to_user
      2.27            +0.3        2.54        perf-profile.children.cycles-pp.cpuidle_idle_call
      3.85            +0.3        4.18        perf-profile.children.cycles-pp.do_idle
      3.85            +0.3        4.19        perf-profile.children.cycles-pp.common_startup_64
      3.85            +0.3        4.19        perf-profile.children.cycles-pp.cpu_startup_entry
      3.59            +0.3        3.94        perf-profile.children.cycles-pp.start_secondary
      0.30 ±  2%      +0.4        0.69 ±  4%  perf-profile.children.cycles-pp.sysvipc_proc_start
      2.97            +0.4        3.38 ±  2%  perf-profile.children.cycles-pp.store_msg
      3.39            +0.4        3.83 ±  2%  perf-profile.children.cycles-pp.do_msg_fill
      0.81            +0.5        1.35 ±  3%  perf-profile.children.cycles-pp.seq_read_iter
      0.80            +0.5        1.34 ±  3%  perf-profile.children.cycles-pp.seq_read
      0.82            +0.5        1.37 ±  2%  perf-profile.children.cycles-pp.ksys_read
      0.82            +0.5        1.36 ±  3%  perf-profile.children.cycles-pp.vfs_read
      0.85            +0.6        1.40 ±  2%  perf-profile.children.cycles-pp.read
      0.41 ±  6%      +0.8        1.19 ±  2%  perf-profile.children.cycles-pp.osq_lock
      0.70 ±  4%      +0.9        1.57        perf-profile.children.cycles-pp.rwsem_optimistic_spin
      1.56 ±  2%      +1.4        2.94        perf-profile.children.cycles-pp.rwsem_down_write_slowpath
      1.63 ±  2%      +1.4        3.03        perf-profile.children.cycles-pp.down_write
      2.30 ±  2%      +1.6        3.89        perf-profile.children.cycles-pp.msgctl_down
      2.93 ±  2%      +2.1        5.00        perf-profile.children.cycles-pp.rwsem_down_read_slowpath
      3.06 ±  2%      +2.1        5.19        perf-profile.children.cycles-pp.down_read
      2.13 ±  3%      +2.6        4.74        perf-profile.children.cycles-pp._raw_spin_lock_irq
     21.15            +2.9       24.08        perf-profile.children.cycles-pp.do_msgsnd
     25.40            +3.0       28.43        perf-profile.children.cycles-pp.do_msgrcv
     23.62            +3.1       26.71        perf-profile.children.cycles-pp.__libc_msgsnd
     27.24            +3.2       30.40        perf-profile.children.cycles-pp.__libc_msgrcv
     16.41 ±  2%      +7.0       23.42        perf-profile.children.cycles-pp.idr_find
     19.91            +7.2       27.10        perf-profile.children.cycles-pp.ipc_obtain_object_check
     41.36            -8.8       32.54        perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
      5.70            -1.2        4.46        perf-profile.self.cycles-pp.percpu_counter_add_batch
      0.93            -0.2        0.72 ±  2%  perf-profile.self.cycles-pp.update_sg_lb_stats
      0.32            -0.0        0.29        perf-profile.self.cycles-pp.idle_cpu
      0.08 ±  5%      -0.0        0.06        perf-profile.self.cycles-pp.find_busiest_queue
      0.13 ±  3%      -0.0        0.12 ±  4%  perf-profile.self.cycles-pp.ss_wakeup
      0.09            +0.0        0.10        perf-profile.self.cycles-pp.prepare_task_switch
      0.06            +0.0        0.07        perf-profile.self.cycles-pp.switch_mm_irqs_off
      0.06 ±  6%      +0.0        0.07        perf-profile.self.cycles-pp.cpuidle_enter_state
      0.10 ±  4%      +0.0        0.12 ±  3%  perf-profile.self.cycles-pp.menu_select
      0.28            +0.0        0.30 ±  2%  perf-profile.self.cycles-pp.ipcperms
      0.05            +0.0        0.07 ±  5%  perf-profile.self.cycles-pp.osq_unlock
      0.07 ±  5%      +0.0        0.09        perf-profile.self.cycles-pp.rwsem_optimistic_spin
      0.10 ±  5%      +0.0        0.12 ±  6%  perf-profile.self.cycles-pp.kick_ilb
      0.38            +0.0        0.40        perf-profile.self.cycles-pp.__get_user_8
      0.36            +0.0        0.38        perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.46            +0.0        0.49        perf-profile.self.cycles-pp.__libc_msgsnd
      0.38            +0.0        0.41        perf-profile.self.cycles-pp.__put_user_8
      0.12 ±  3%      +0.0        0.15 ±  3%  perf-profile.self.cycles-pp.therm_throt_process
      0.07 ±  7%      +0.0        0.10 ±  4%  perf-profile.self.cycles-pp.down_write
      0.47            +0.0        0.50        perf-profile.self.cycles-pp.__libc_msgrcv
      0.13 ±  3%      +0.0        0.17 ±  4%  perf-profile.self.cycles-pp.intel_thermal_interrupt
      0.35 ±  2%      +0.0        0.38        perf-profile.self.cycles-pp._raw_spin_lock_irqsave
      0.31 ±  3%      +0.0        0.35 ±  7%  perf-profile.self.cycles-pp.nohz_balance_exit_idle
      0.67            +0.0        0.71        perf-profile.self.cycles-pp.entry_SYSCALL_64
      0.08 ±  5%      +0.0        0.13 ±  2%  perf-profile.self.cycles-pp.rwsem_down_write_slowpath
      0.23 ±  3%      +0.0        0.28 ±  4%  perf-profile.self.cycles-pp.__check_object_size
      0.71            +0.1        0.76        perf-profile.self.cycles-pp._copy_from_user
      0.46            +0.1        0.51        perf-profile.self.cycles-pp.__radix_tree_lookup
      0.13 ±  5%      +0.1        0.19        perf-profile.self.cycles-pp.down_read
      0.20 ±  2%      +0.1        0.26        perf-profile.self.cycles-pp.task_mm_cid_work
      1.27 ±  2%      +0.1        1.34 ±  3%  perf-profile.self.cycles-pp.stress_msg_receiver
      1.45            +0.1        1.51        perf-profile.self.cycles-pp._raw_spin_lock
      0.16 ±  2%      +0.1        0.22 ±  2%  perf-profile.self.cycles-pp.rwsem_spin_on_owner
      0.15 ±  3%      +0.1        0.22 ±  2%  perf-profile.self.cycles-pp._raw_spin_lock_irq
      0.14 ±  2%      +0.1        0.21        perf-profile.self.cycles-pp.up_read
      0.27 ±  2%      +0.1        0.34        perf-profile.self.cycles-pp.rwsem_down_read_slowpath
      1.11            +0.1        1.19        perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.38            +0.1        0.49 ±  4%  perf-profile.self.cycles-pp.store_msg
      1.05            +0.1        1.17 ±  2%  perf-profile.self.cycles-pp.do_msgsnd
      0.11 ±  3%      +0.1        0.23 ±  4%  perf-profile.self.cycles-pp.idr_get_next_ul
      1.65            +0.1        1.78 ±  2%  perf-profile.self.cycles-pp.do_msgrcv
      3.05            +0.1        3.18 ±  2%  perf-profile.self.cycles-pp.ipc_obtain_object_check
      0.92            +0.1        1.05        perf-profile.self.cycles-pp.intel_idle
      1.34            +0.3        1.59        perf-profile.self.cycles-pp._copy_to_user
      0.40 ±  6%      +0.8        1.17 ±  2%  perf-profile.self.cycles-pp.osq_lock
     16.08 ±  2%      +6.9       22.94        perf-profile.self.cycles-pp.idr_find




Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki


Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ