Message-ID: <202503102156.d70c4800-lkp@intel.com>
Date: Mon, 10 Mar 2025 22:06:01 +0800
From: kernel test robot <oliver.sang@...el.com>
To: James Morse <james.morse@....com>
CC: <oe-lkp@...ts.linux.dev>, <lkp@...el.com>, Shaopeng Tan
	<tan.shaopeng@...fujitsu.com>, Reinette Chatre <reinette.chatre@...el.com>,
	Tony Luck <tony.luck@...el.com>, <linux-kernel@...r.kernel.org>,
	<oliver.sang@...el.com>
Subject: [morse:mpam/move_to_fs/v7_bare] [x86/resctrl]  0021800a46:
 will-it-scale.per_process_ops 18.4% improvement



Hello,

kernel test robot noticed an 18.4% improvement of will-it-scale.per_process_ops on:


commit: 0021800a465d495a536265c52f8a031da43948ed ("x86/resctrl: Use schema type to determine the schema format string")
https://git.kernel.org/cgit/linux/kernel/git/morse/linux.git mpam/move_to_fs/v7_bare

testcase: will-it-scale
config: x86_64-rhel-9.4
compiler: gcc-12
test machine: 224 threads 2 sockets Intel(R) Xeon(R) Platinum 8480CTDX (Sapphire Rapids) with 512G memory
parameters:

	nr_task: 100%
	mode: process
	test: signal1
	cpufreq_governor: performance


In addition, the commit also has a significant impact on the following tests:

+------------------+---------------------------------------------------------------------------------------------+
| testcase: change | stress-ng: stress-ng.usersyscall.ops_per_sec 18.0% improvement                              |
| test machine     | 224 threads 2 sockets Intel(R) Xeon(R) Platinum 8480CTDX (Sapphire Rapids) with 512G memory |
| test parameters  | cpufreq_governor=performance                                                                |
|                  | nr_threads=100%                                                                             |
|                  | test=usersyscall                                                                            |
|                  | testtime=60s                                                                                |
+------------------+---------------------------------------------------------------------------------------------+




Details are below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20250310/202503102156.d70c4800-lkp@intel.com

=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase:
  gcc-12/performance/x86_64-rhel-9.4/process/100%/debian-12-x86_64-20240206.cgz/lkp-spr-2sp4/signal1/will-it-scale

commit: 
  a13ae432a6 ("x86/resctrl: Use schema type to determine how to parse schema values")
  0021800a46 ("x86/resctrl: Use schema type to determine the schema format string")

a13ae432a665d1d0 0021800a465d495a536265c52f8 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
    371124 ± 13%     +23.2%     457121 ±  5%  meminfo.Mapped
      0.02 ±  5%      -0.0        0.02 ±  2%  mpstat.cpu.all.soft%
     93276 ± 13%     +22.9%     114661 ±  5%  proc-vmstat.nr_mapped
    290918 ± 16%     +38.9%     404183 ±  7%  numa-meminfo.node1.Mapped
    388.17 ±119%  +3.2e+05%    1237125 ±124%  numa-meminfo.node1.Unevictable
     72842 ± 16%     +38.7%     101060 ±  7%  numa-vmstat.node1.nr_mapped
     97.04 ±119%  +3.2e+05%     309281 ±124%  numa-vmstat.node1.nr_unevictable
     97.04 ±119%  +3.2e+05%     309281 ±124%  numa-vmstat.node1.nr_zone_unevictable
   6588847           +18.4%    7802117        will-it-scale.224.processes
     29414           +18.4%      34830        will-it-scale.per_process_ops
   6588847           +18.4%    7802117        will-it-scale.workload
      5736 ± 18%     -30.2%       4005 ± 16%  perf-c2c.DRAM.remote
     12227 ± 16%     +55.7%      19033 ± 17%  perf-c2c.HITM.local
      5053 ± 18%     -34.5%       3308 ± 16%  perf-c2c.HITM.remote
     17281 ± 16%     +29.3%      22341 ± 17%  perf-c2c.HITM.total
      1.14 ± 33%     -97.7%       0.03 ±108%  perf-sched.sch_delay.avg.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.alloc_pages_noprof.__pud_alloc
      3.13 ±  4%     -10.6%       2.80 ±  6%  perf-sched.sch_delay.avg.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.folio_alloc_mpol_noprof.vma_alloc_folio_noprof
      0.38 ±  7%     -18.5%       0.31 ±  5%  perf-sched.sch_delay.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
      0.02 ± 22%     -44.0%       0.01 ± 18%  perf-sched.sch_delay.avg.ms.schedule_timeout.msleep.ast_astdp_connector_helper_detect_ctx.drm_helper_probe_detect_ctx
      0.04 ± 17%     -18.5%       0.03 ±  9%  perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      1.77 ± 28%     -97.4%       0.05 ±117%  perf-sched.sch_delay.max.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.alloc_pages_noprof.__pud_alloc
      3.07 ± 24%     -47.7%       1.60 ± 50%  perf-sched.sch_delay.max.ms.__cond_resched.kmem_cache_alloc_noprof.vm_area_dup.__split_vma.vms_gather_munmap_vmas
      0.04 ± 48%     -60.4%       0.02 ± 35%  perf-sched.sch_delay.max.ms.schedule_timeout.msleep.ast_astdp_connector_helper_detect_ctx.drm_helper_probe_detect_ctx
      4698 ± 96%     -78.5%       1007        perf-sched.wait_and_delay.max.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
      0.97 ± 35%     -98.6%       0.01 ±169%  perf-sched.wait_time.avg.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.alloc_pages_noprof.__pud_alloc
      1.90 ± 43%     -59.9%       0.76 ± 82%  perf-sched.wait_time.avg.ms.__cond_resched.__anon_vma_prepare.__vmf_anon_prepare.do_pte_missing.__handle_mm_fault
      2.31 ± 15%     -42.7%       1.32 ± 56%  perf-sched.wait_time.avg.ms.__cond_resched.down_read.walk_component.link_path_walk.part
      1.74 ± 25%     -98.5%       0.03 ±170%  perf-sched.wait_time.max.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.alloc_pages_noprof.__pud_alloc
      3.07 ± 24%     -47.7%       1.60 ± 50%  perf-sched.wait_time.max.ms.__cond_resched.kmem_cache_alloc_noprof.vm_area_dup.__split_vma.vms_gather_munmap_vmas
      4436 ±104%     -77.3%       1007        perf-sched.wait_time.max.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
      0.16           -42.0%       0.09        perf-stat.i.MPKI
 5.316e+09           +17.2%  6.231e+09        perf-stat.i.branch-instructions
      0.40            -0.0        0.38        perf-stat.i.branch-miss-rate%
  21246147           +11.4%   23662917        perf-stat.i.branch-misses
     22.12 ±  2%      -9.2       12.91        perf-stat.i.cache-miss-rate%
   4407081           -33.2%    2945886        perf-stat.i.cache-misses
  19702928 ±  2%     +12.0%   22067992        perf-stat.i.cache-references
     22.77           -14.7%      19.42        perf-stat.i.cpi
    153602           +54.7%     237617        perf-stat.i.cycles-between-cache-misses
 2.842e+10           +17.3%  3.332e+10        perf-stat.i.instructions
      0.05           +15.3%       0.06        perf-stat.i.ipc
      0.15           -43.1%       0.09        perf-stat.overall.MPKI
      0.40            -0.0        0.38        perf-stat.overall.branch-miss-rate%
     22.36 ±  2%      -9.0       13.32        perf-stat.overall.cache-miss-rate%
     22.77           -14.7%      19.43        perf-stat.overall.cpi
    147278           +49.8%     220671        perf-stat.overall.cycles-between-cache-misses
      0.04           +17.2%       0.05        perf-stat.overall.ipc
   1302291            -1.0%    1289647        perf-stat.overall.path-length
 5.297e+09           +17.2%  6.205e+09        perf-stat.ps.branch-instructions
  21150909           +11.4%   23557858        perf-stat.ps.branch-misses
   4378997           -33.3%    2922005        perf-stat.ps.cache-misses
  19589543 ±  2%     +12.0%   21940439        perf-stat.ps.cache-references
 2.831e+10           +17.2%  3.318e+10        perf-stat.ps.instructions
 8.581e+12           +17.3%  1.006e+13        perf-stat.total.instructions
     39.12            -1.2       37.93        perf-profile.calltrace.cycles-pp.inc_rlimit_get_ucounts.sig_get_ucounts.__send_signal_locked.do_send_sig_info.do_send_specific
     39.14            -1.2       37.95        perf-profile.calltrace.cycles-pp.sig_get_ucounts.__send_signal_locked.do_send_sig_info.do_send_specific.__x64_sys_tgkill
     39.54            -0.8       38.74        perf-profile.calltrace.cycles-pp.do_dec_rlimit_put_ucounts.__sigqueue_free.dequeue_signal.get_signal.arch_do_signal_or_restart
     39.55            -0.8       38.75        perf-profile.calltrace.cycles-pp.__sigqueue_free.dequeue_signal.get_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode
      0.54            +0.1        0.62        perf-profile.calltrace.cycles-pp.copy_fpstate_to_sigframe.get_sigframe.x64_setup_rt_frame.handle_signal.arch_do_signal_or_restart
      0.60            +0.1        0.68        perf-profile.calltrace.cycles-pp.fpu__restore_sig.restore_sigcontext.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.72            +0.1        0.82        perf-profile.calltrace.cycles-pp.restore_sigcontext.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.86            +0.1        0.99        perf-profile.calltrace.cycles-pp.get_sigframe.x64_setup_rt_frame.handle_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode
      1.10            +0.1        1.23        perf-profile.calltrace.cycles-pp.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.99            +0.1        1.13        perf-profile.calltrace.cycles-pp.x64_setup_rt_frame.handle_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode.do_syscall_64
      1.19 ± 22%      +0.2        1.43 ±  6%  perf-profile.calltrace.cycles-pp.idr_find.find_task_by_vpid.do_send_specific.__x64_sys_tgkill.do_syscall_64
      1.48 ±  4%      +0.3        1.75        perf-profile.calltrace.cycles-pp.obj_cgroup_charge.__memcg_slab_post_alloc_hook.kmem_cache_alloc_noprof.__send_signal_locked.do_send_sig_info
      1.75 ± 13%      +0.3        2.03 ±  2%  perf-profile.calltrace.cycles-pp.handle_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.57 ±  4%      +0.3        1.85        perf-profile.calltrace.cycles-pp.__memcg_slab_post_alloc_hook.kmem_cache_alloc_noprof.__send_signal_locked.do_send_sig_info.do_send_specific
      1.46 ±  5%      +0.3        1.75        perf-profile.calltrace.cycles-pp.refill_obj_stock.__memcg_slab_free_hook.kmem_cache_free.dequeue_signal.get_signal
      1.63 ±  4%      +0.3        1.93        perf-profile.calltrace.cycles-pp.kmem_cache_alloc_noprof.__send_signal_locked.do_send_sig_info.do_send_specific.__x64_sys_tgkill
      1.55 ±  5%      +0.3        1.85        perf-profile.calltrace.cycles-pp.__memcg_slab_free_hook.kmem_cache_free.dequeue_signal.get_signal.arch_do_signal_or_restart
      1.62 ±  5%      +0.3        1.93        perf-profile.calltrace.cycles-pp.kmem_cache_free.dequeue_signal.get_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode
      4.87 ±  6%      +0.4        5.28 ±  3%  perf-profile.calltrace.cycles-pp.handler
      2.09            +0.5        2.60        perf-profile.calltrace.cycles-pp.recalc_sigpending.dequeue_signal.get_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode
      2.48            +0.7        3.13        perf-profile.calltrace.cycles-pp.complete_signal.__send_signal_locked.do_send_sig_info.do_send_specific.__x64_sys_tgkill
     39.12            -1.2       37.93        perf-profile.children.cycles-pp.inc_rlimit_get_ucounts
     39.14            -1.2       37.95        perf-profile.children.cycles-pp.sig_get_ucounts
     39.55            -0.8       38.74        perf-profile.children.cycles-pp.do_dec_rlimit_put_ucounts
     39.56            -0.8       38.76        perf-profile.children.cycles-pp.__sigqueue_free
      0.05            +0.0        0.06        perf-profile.children.cycles-pp.__get_user_8
      0.05            +0.0        0.06        perf-profile.children.cycles-pp.__put_user_8
      0.07 ±  5%      +0.0        0.08        perf-profile.children.cycles-pp.mod_objcg_state
      0.13 ±  3%      +0.0        0.14        perf-profile.children.cycles-pp.restore_altstack
      0.13 ±  2%      +0.0        0.15 ±  3%  perf-profile.children.cycles-pp.__task_pid_nr_ns
      0.15            +0.0        0.17 ±  2%  perf-profile.children.cycles-pp.check_xstate_in_sigframe
      0.11            +0.0        0.13 ±  3%  perf-profile.children.cycles-pp.__get_user_nocheck_4
      0.08 ±  4%      +0.0        0.10 ±  3%  perf-profile.children.cycles-pp.rseq_update_cpu_node_id
      0.14 ±  2%      +0.0        0.16 ±  2%  perf-profile.children.cycles-pp.__put_user_nocheck_4
      0.14            +0.0        0.16        perf-profile.children.cycles-pp.rseq_ip_fixup
      0.12            +0.0        0.14        perf-profile.children.cycles-pp.__get_user_nocheck_8
      0.17            +0.0        0.19        perf-profile.children.cycles-pp._copy_from_user
      0.16            +0.0        0.19 ±  2%  perf-profile.children.cycles-pp.__fpu_restore_sig
      0.16 ±  2%      +0.0        0.19        perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.23 ±  2%      +0.0        0.26        perf-profile.children.cycles-pp.save_xstate_epilog
      0.23            +0.0        0.27        perf-profile.children.cycles-pp.__rseq_handle_notify_resume
      0.00            +0.1        0.05        perf-profile.children.cycles-pp.__local_bh_enable_ip
      0.00            +0.1        0.05 ±  7%  perf-profile.children.cycles-pp.shmem_write_end
      0.35            +0.1        0.40        perf-profile.children.cycles-pp.__restore_fpregs_from_user
      0.44 ±  3%      +0.1        0.49 ±  2%  perf-profile.children.cycles-pp.__getpid
      0.43            +0.1        0.49        perf-profile.children.cycles-pp.restore_fpregs_from_user
      0.60            +0.1        0.68        perf-profile.children.cycles-pp.fpu__restore_sig
      0.55            +0.1        0.63        perf-profile.children.cycles-pp.copy_fpstate_to_sigframe
      0.91 ±  7%      +0.1        1.01 ±  5%  perf-profile.children.cycles-pp.find_task_by_vpid
      0.72            +0.1        0.82        perf-profile.children.cycles-pp.restore_sigcontext
      0.88            +0.1        1.00        perf-profile.children.cycles-pp.get_sigframe
      1.12            +0.1        1.25        perf-profile.children.cycles-pp.__x64_sys_rt_sigreturn
      1.00            +0.1        1.14        perf-profile.children.cycles-pp.x64_setup_rt_frame
      1.84 ±  2%      +0.2        2.04 ±  2%  perf-profile.children.cycles-pp.handle_signal
      1.48 ±  4%      +0.3        1.75        perf-profile.children.cycles-pp.obj_cgroup_charge
      1.58 ±  4%      +0.3        1.86        perf-profile.children.cycles-pp.__memcg_slab_post_alloc_hook
      1.46 ±  5%      +0.3        1.75        perf-profile.children.cycles-pp.refill_obj_stock
      1.64 ±  4%      +0.3        1.93        perf-profile.children.cycles-pp.kmem_cache_alloc_noprof
      1.56 ±  5%      +0.3        1.86        perf-profile.children.cycles-pp.__memcg_slab_free_hook
      1.62 ±  4%      +0.3        1.94        perf-profile.children.cycles-pp.kmem_cache_free
      2.17            +0.5        2.68        perf-profile.children.cycles-pp.recalc_sigpending
      2.48            +0.6        3.13        perf-profile.children.cycles-pp.complete_signal
     39.12            -1.2       37.93        perf-profile.self.cycles-pp.inc_rlimit_get_ucounts
     39.55            -0.8       38.74        perf-profile.self.cycles-pp.do_dec_rlimit_put_ucounts
      0.35 ± 17%      -0.2        0.13 ± 10%  perf-profile.self.cycles-pp.handler
      0.05            +0.0        0.06        perf-profile.self.cycles-pp.__put_user_8
      0.05            +0.0        0.06        perf-profile.self.cycles-pp.x64_sys_call
      0.07            +0.0        0.08        perf-profile.self.cycles-pp.__memcg_slab_post_alloc_hook
      0.09            +0.0        0.10        perf-profile.self.cycles-pp.check_xstate_in_sigframe
      0.06            +0.0        0.07        perf-profile.self.cycles-pp.mod_objcg_state
      0.08            +0.0        0.09        perf-profile.self.cycles-pp.rseq_update_cpu_node_id
      0.11 ±  3%      +0.0        0.12        perf-profile.self.cycles-pp.__get_user_nocheck_4
      0.12 ±  3%      +0.0        0.14 ±  3%  perf-profile.self.cycles-pp.__task_pid_nr_ns
      0.08 ±  4%      +0.0        0.10 ±  5%  perf-profile.self.cycles-pp.get_sigframe
      0.09 ±  5%      +0.0        0.11 ±  3%  perf-profile.self.cycles-pp.save_xstate_epilog
      0.11 ±  3%      +0.0        0.13        perf-profile.self.cycles-pp.__get_user_nocheck_8
      0.04 ± 44%      +0.0        0.06        perf-profile.self.cycles-pp.__memcg_slab_free_hook
      0.12 ±  4%      +0.0        0.14 ±  5%  perf-profile.self.cycles-pp.do_syscall_64
      0.13            +0.0        0.15        perf-profile.self.cycles-pp.__put_user_nocheck_4
      0.13            +0.0        0.15        perf-profile.self.cycles-pp.x64_setup_rt_frame
      0.17            +0.0        0.19        perf-profile.self.cycles-pp._copy_from_user
      0.15            +0.0        0.18 ±  2%  perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.27            +0.0        0.30        perf-profile.self.cycles-pp.fpu__clear_user_states
      0.25 ±  2%      +0.0        0.29        perf-profile.self.cycles-pp.syscall_exit_to_user_mode
      0.35            +0.0        0.40        perf-profile.self.cycles-pp.__restore_fpregs_from_user
      0.00            +0.1        0.05        perf-profile.self.cycles-pp.shmem_write_end
      0.42            +0.1        0.49        perf-profile.self.cycles-pp.copy_fpstate_to_sigframe
      0.62 ±  7%      +0.1        0.72 ±  5%  perf-profile.self.cycles-pp.find_task_by_vpid
      1.48 ±  4%      +0.3        1.75        perf-profile.self.cycles-pp.obj_cgroup_charge
      1.46 ±  5%      +0.3        1.75        perf-profile.self.cycles-pp.refill_obj_stock
      2.16            +0.5        2.67        perf-profile.self.cycles-pp.recalc_sigpending
      2.42            +0.6        3.06        perf-profile.self.cycles-pp.complete_signal
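
For readers who want to check the headline figure, here is a minimal sketch that recomputes the %change column from the will-it-scale.workload row above. The helper name pct_change is hypothetical, and the relation per_process_ops = workload / number of processes is an assumption inferred from the reported numbers, not something the report states.

```python
# Values copied from the will-it-scale rows of the comparison table.
NR_PROCESSES = 224            # from the will-it-scale.224.processes row name

base_workload = 6588847       # parent commit a13ae432a6
patched_workload = 7802117    # commit 0021800a46


def pct_change(base, new):
    """Relative change of `new` versus `base`, in percent (hypothetical helper)."""
    return (new - base) / base * 100.0


change = pct_change(base_workload, patched_workload)

# Assumption: per_process_ops is the workload divided evenly over all processes.
base_per_process = base_workload / NR_PROCESSES
patched_per_process = patched_workload / NR_PROCESSES

print(f"workload change:  {change:+.1f}%")
print(f"per-process ops:  {base_per_process:.1f} -> {patched_per_process:.1f}")
```

Under that assumption the per-process values land within one operation of the reported 29414 and 34830; the small remainder is probably a matter of how the robot averages and rounds across runs.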


***************************************************************************************************
lkp-spr-2sp4: 224 threads 2 sockets Intel(R) Xeon(R) Platinum 8480CTDX (Sapphire Rapids) with 512G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-spr-2sp4/usersyscall/stress-ng/60s

commit: 
  a13ae432a6 ("x86/resctrl: Use schema type to determine how to parse schema values")
  0021800a46 ("x86/resctrl: Use schema type to determine the schema format string")

a13ae432a665d1d0 0021800a465d495a536265c52f8 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
      0.02 ±  5%      -0.0        0.02 ±  3%  mpstat.cpu.all.soft%
    695763 ± 21%     -28.4%     498121 ± 16%  numa-numastat.node1.local_node
      7622 ± 17%     -51.6%       3685 ± 20%  perf-c2c.DRAM.remote
      6750 ± 17%     -55.3%       3017 ± 20%  perf-c2c.HITM.remote
     32475           -15.5%      27453        stress-ng.usersyscall.nanosecs_per_syscall
 1.974e+08           +18.0%  2.329e+08        stress-ng.usersyscall.ops
   3290068           +18.0%    3881749        stress-ng.usersyscall.ops_per_sec
      0.17 ±  5%     -35.2%       0.11 ±  4%  perf-stat.i.MPKI
 6.367e+09           +16.5%  7.419e+09        perf-stat.i.branch-instructions
  53883753 ±  3%     +14.1%   61459248 ±  3%  perf-stat.i.branch-misses
     23.59 ±  4%      -8.8       14.78 ±  3%  perf-stat.i.cache-miss-rate%
   5618447 ±  5%     -27.6%    4069538 ±  5%  perf-stat.i.cache-misses
  22477865 ±  2%      +8.1%   24292780        perf-stat.i.cache-references
     19.10           -14.2%      16.40        perf-stat.i.cpi
    136147 ±  4%     +51.5%     206276 ±  4%  perf-stat.i.cycles-between-cache-misses
 3.331e+10           +16.7%  3.886e+10        perf-stat.i.instructions
      0.06           +15.8%       0.06        perf-stat.i.ipc
      0.17 ±  5%     -38.0%       0.10 ±  4%  perf-stat.overall.MPKI
     24.72 ±  4%      -8.2       16.49 ±  3%  perf-stat.overall.cache-miss-rate%
     19.15           -14.0%      16.47        perf-stat.overall.cpi
    116330 ±  5%     +38.6%     161261 ±  4%  perf-stat.overall.cycles-between-cache-misses
      0.05           +16.3%       0.06        perf-stat.overall.ipc
 6.243e+09           +16.3%   7.26e+09        perf-stat.ps.branch-instructions
  52388076 ±  3%     +14.0%   59726018 ±  3%  perf-stat.ps.branch-misses
   5393246 ±  5%     -27.8%    3891345 ±  4%  perf-stat.ps.cache-misses
  21817868 ±  2%      +8.2%   23596167        perf-stat.ps.cache-references
 3.267e+10           +16.4%  3.804e+10        perf-stat.ps.instructions
 2.007e+12           +16.5%  2.339e+12        perf-stat.total.instructions
      1.23 ± 35%     -67.7%       0.40 ±101%  perf-sched.sch_delay.avg.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.alloc_pages_noprof.__pud_alloc
      1.78 ± 14%     -70.7%       0.52 ±138%  perf-sched.sch_delay.avg.ms.__cond_resched.__anon_vma_prepare.__vmf_anon_prepare.do_pte_missing.__handle_mm_fault
      0.33 ± 80%     -69.4%       0.10 ±131%  perf-sched.sch_delay.avg.ms.__cond_resched.__get_user_pages.get_user_pages_remote.get_arg_page.copy_strings
      2.56 ± 24%     -69.6%       0.78 ± 80%  perf-sched.sch_delay.avg.ms.__cond_resched.__kmalloc_cache_noprof.vmstat_start.seq_read_iter.proc_reg_read_iter
      0.57 ± 58%     -56.2%       0.25 ± 14%  perf-sched.sch_delay.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
      2.53 ± 12%     -51.6%       1.23 ± 51%  perf-sched.sch_delay.avg.ms.__cond_resched.down_read.walk_component.link_path_walk.part
      1.67 ± 21%     +53.0%       2.56 ± 19%  perf-sched.sch_delay.avg.ms.__cond_resched.get_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode.do_syscall_64
      1.67 ±167%     -99.2%       0.01 ±223%  perf-sched.sch_delay.avg.ms.__cond_resched.mutex_lock.perf_event_ctx_lock_nested.constprop.0
      0.29 ± 38%     -85.7%       0.04 ± 92%  perf-sched.sch_delay.avg.ms.__cond_resched.mutex_lock_killable.pcpu_alloc_noprof.__percpu_counter_init_many.mm_init
      0.02 ± 16%     +45.6%       0.03 ± 26%  perf-sched.sch_delay.avg.ms.irq_thread.kthread.ret_from_fork.ret_from_fork_asm
      0.04 ± 36%     -48.0%       0.02 ± 16%  perf-sched.sch_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
      0.04 ±  4%     -19.5%       0.03 ±  7%  perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      2.83           -31.6%       1.94 ± 33%  perf-sched.sch_delay.avg.ms.wait_for_partner.fifo_open.do_dentry_open.vfs_open
      2.43 ± 29%     -54.1%       1.12 ± 76%  perf-sched.sch_delay.max.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.alloc_pages_noprof.__pmd_alloc
      1.87 ±  9%     -63.2%       0.69 ±112%  perf-sched.sch_delay.max.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.alloc_pages_noprof.__pud_alloc
      5.07 ± 12%     -62.3%       1.91 ± 85%  perf-sched.sch_delay.max.ms.__cond_resched.__kmalloc_cache_noprof.vmstat_start.seq_read_iter.proc_reg_read_iter
    367.03 ±138%     -98.5%       5.38 ± 16%  perf-sched.sch_delay.max.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
      4.00 ±  9%     -44.5%       2.22 ± 55%  perf-sched.sch_delay.max.ms.__cond_resched.down_read.walk_component.link_path_walk.part
      3.99           -21.6%       3.13 ± 12%  perf-sched.sch_delay.max.ms.__cond_resched.kmem_cache_alloc_noprof.alloc_empty_file.path_openat.do_filp_open
      7.04 ±199%     -99.8%       0.01 ±223%  perf-sched.sch_delay.max.ms.__cond_resched.mutex_lock.perf_event_ctx_lock_nested.constprop.0
      1.19 ± 40%     -84.8%       0.18 ±174%  perf-sched.sch_delay.max.ms.__cond_resched.mutex_lock_killable.pcpu_alloc_noprof.__percpu_counter_init_many.mm_init
      4.01           -74.9%       1.01 ±150%  perf-sched.sch_delay.max.ms.__cond_resched.put_files_struct.do_exit.do_group_exit.__x64_sys_exit_group
     21.67 ±141%     -96.6%       0.73 ± 54%  perf-sched.sch_delay.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
    757.83 ± 60%     -51.4%     368.37 ±127%  perf-sched.total_sch_delay.max.ms
      4.53 ± 12%     -23.3%       3.47 ± 12%  perf-sched.wait_and_delay.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
      7.40 ± 16%     -28.8%       5.27 ± 28%  perf-sched.wait_and_delay.avg.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
      3.96 ±  9%     -18.6%       3.22 ± 12%  perf-sched.wait_time.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
      2.51 ± 12%     -57.6%       1.06 ± 58%  perf-sched.wait_time.avg.ms.__cond_resched.down_read.walk_component.link_path_walk.part
      1.67 ± 21%     +53.0%       2.56 ± 19%  perf-sched.wait_time.avg.ms.__cond_resched.get_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode.do_syscall_64
      1.67 ±167%     -99.2%       0.01 ±223%  perf-sched.wait_time.avg.ms.__cond_resched.mutex_lock.perf_event_ctx_lock_nested.constprop.0
     59.25 ±208%     -98.3%       1.00 ±152%  perf-sched.wait_time.avg.ms.__cond_resched.put_files_struct.do_exit.do_group_exit.__x64_sys_exit_group
      6.89 ± 17%     -30.8%       4.76 ± 31%  perf-sched.wait_time.avg.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
    447.08 ±  4%      -7.0%     415.80 ±  5%  perf-sched.wait_time.avg.ms.irq_thread.kthread.ret_from_fork.ret_from_fork_asm
      3.45 ±  9%     -18.8%       2.80 ±  9%  perf-sched.wait_time.avg.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
      2.78 ±  2%     -33.6%       1.84 ± 34%  perf-sched.wait_time.avg.ms.wait_for_partner.fifo_open.do_dentry_open.vfs_open
      1.87 ±  9%     -63.2%       0.69 ±112%  perf-sched.wait_time.max.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.alloc_pages_noprof.__pud_alloc
      4.00 ±  9%     -45.6%       2.18 ± 57%  perf-sched.wait_time.max.ms.__cond_resched.down_read.walk_component.link_path_walk.part
      3.99           -21.6%       3.13 ± 12%  perf-sched.wait_time.max.ms.__cond_resched.kmem_cache_alloc_noprof.alloc_empty_file.path_openat.do_filp_open
      7.04 ±199%     -99.8%       0.01 ±223%  perf-sched.wait_time.max.ms.__cond_resched.mutex_lock.perf_event_ctx_lock_nested.constprop.0
    170.02 ±218%     -99.4%       1.00 ±152%  perf-sched.wait_time.max.ms.__cond_resched.put_files_struct.do_exit.do_group_exit.__x64_sys_exit_group
     37.94            -1.1       36.80        perf-profile.calltrace.cycles-pp.__sigqueue_free.get_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode.do_syscall_64
     37.92            -1.1       36.79        perf-profile.calltrace.cycles-pp.do_dec_rlimit_put_ucounts.__sigqueue_free.get_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode
     40.38            -0.8       39.53        perf-profile.calltrace.cycles-pp.get_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.48 ±  3%      +0.2        1.66        perf-profile.calltrace.cycles-pp.refill_obj_stock.__memcg_slab_free_hook.kmem_cache_free.get_signal.arch_do_signal_or_restart
      1.41 ±  4%      +0.2        1.60 ±  3%  perf-profile.calltrace.cycles-pp.obj_cgroup_charge.__memcg_slab_post_alloc_hook.kmem_cache_alloc_noprof.__send_signal_locked.force_sig_info_to_task
      1.55 ±  3%      +0.2        1.75 ±  2%  perf-profile.calltrace.cycles-pp.__memcg_slab_post_alloc_hook.kmem_cache_alloc_noprof.__send_signal_locked.force_sig_info_to_task.syscall_user_dispatch
      1.58 ±  3%      +0.2        1.78        perf-profile.calltrace.cycles-pp.__memcg_slab_free_hook.kmem_cache_free.get_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode
      1.47 ±  3%      +0.2        1.67 ±  4%  perf-profile.calltrace.cycles-pp.x64_setup_rt_frame.handle_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode.do_syscall_64
      1.66 ±  3%      +0.2        1.87        perf-profile.calltrace.cycles-pp.kmem_cache_free.get_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode.do_syscall_64
      1.62 ±  3%      +0.2        1.84        perf-profile.calltrace.cycles-pp.kmem_cache_alloc_noprof.__send_signal_locked.force_sig_info_to_task.syscall_user_dispatch.syscall_trace_enter
      1.66 ±  4%      +0.4        2.09 ±  6%  perf-profile.calltrace.cycles-pp.fpregs_mark_activate.fpu__clear_user_states.handle_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode
      2.39 ±  3%      +0.5        2.84 ±  3%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.stress_sigsys_handler
      2.42 ±  4%      +0.5        2.87 ±  2%  perf-profile.calltrace.cycles-pp.complete_signal.__send_signal_locked.force_sig_info_to_task.syscall_user_dispatch.syscall_trace_enter
      1.96 ±  4%      +0.5        2.42 ±  5%  perf-profile.calltrace.cycles-pp.fpu__clear_user_states.handle_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode.do_syscall_64
      2.39 ±  3%      +0.5        2.85 ±  3%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.stress_sigsys_handler
      2.39 ±  3%      +0.5        2.85 ±  3%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.stress_sigsys_handler
      2.20 ±  3%      +0.5        2.67 ±  3%  perf-profile.calltrace.cycles-pp.arch_do_signal_or_restart.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.stress_sigsys_handler
      6.00 ±  5%      +0.6        6.62        perf-profile.calltrace.cycles-pp.stress_sigsys_handler
      3.82 ±  3%      +0.7        4.53        perf-profile.calltrace.cycles-pp.handle_signal.arch_do_signal_or_restart.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
     37.93            -1.1       36.78        perf-profile.children.cycles-pp.do_dec_rlimit_put_ucounts
     37.94            -1.1       36.81        perf-profile.children.cycles-pp.__sigqueue_free
     40.40            -0.8       39.56        perf-profile.children.cycles-pp.get_signal
      0.29 ±  6%      -0.0        0.24 ±  7%  perf-profile.children.cycles-pp.task_work_run
      0.28 ±  6%      -0.0        0.23 ±  7%  perf-profile.children.cycles-pp.task_mm_cid_work
      0.41 ±  5%      -0.0        0.38 ±  3%  perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
      0.13 ±  3%      +0.0        0.15 ±  4%  perf-profile.children.cycles-pp.__get_user_nocheck_8
      0.18 ±  4%      +0.0        0.20 ±  3%  perf-profile.children.cycles-pp._copy_from_user
      0.18 ±  2%      +0.0        0.20 ±  3%  perf-profile.children.cycles-pp.check_xstate_in_sigframe
      0.16 ±  3%      +0.0        0.19        perf-profile.children.cycles-pp.__put_user_nocheck_4
      0.18 ±  2%      +0.0        0.21 ±  4%  perf-profile.children.cycles-pp.copy_siginfo_to_user
      0.31 ±  3%      +0.0        0.36 ±  2%  perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.37 ±  2%      +0.1        0.42 ±  3%  perf-profile.children.cycles-pp.__restore_fpregs_from_user
      0.44 ±  2%      +0.1        0.50        perf-profile.children.cycles-pp.entry_SYSCALL_64
      0.47 ±  3%      +0.1        0.53 ±  3%  perf-profile.children.cycles-pp.restore_fpregs_from_user
      0.70 ±  3%      +0.1        0.79 ±  4%  perf-profile.children.cycles-pp.copy_fpstate_to_sigframe
      0.69 ±  4%      +0.1        0.78 ±  4%  perf-profile.children.cycles-pp.fpu__restore_sig
      0.85 ±  5%      +0.1        0.95 ±  4%  perf-profile.children.cycles-pp.restore_sigcontext
      1.48 ±  3%      +0.2        1.66        perf-profile.children.cycles-pp.refill_obj_stock
      1.41 ±  4%      +0.2        1.60 ±  3%  perf-profile.children.cycles-pp.obj_cgroup_charge
      1.58 ±  3%      +0.2        1.78        perf-profile.children.cycles-pp.__memcg_slab_free_hook
      1.55 ±  3%      +0.2        1.76 ±  2%  perf-profile.children.cycles-pp.__memcg_slab_post_alloc_hook
      1.48 ±  3%      +0.2        1.69 ±  4%  perf-profile.children.cycles-pp.x64_setup_rt_frame
      1.67 ±  3%      +0.2        1.88        perf-profile.children.cycles-pp.kmem_cache_free
      1.63 ±  3%      +0.2        1.84        perf-profile.children.cycles-pp.kmem_cache_alloc_noprof
      1.73 ±  4%      +0.4        2.16 ±  6%  perf-profile.children.cycles-pp.fpregs_mark_activate
      2.42 ±  4%      +0.5        2.88 ±  2%  perf-profile.children.cycles-pp.complete_signal
      1.98 ±  4%      +0.5        2.44 ±  5%  perf-profile.children.cycles-pp.fpu__clear_user_states
      4.44 ±  4%      +0.5        4.93        perf-profile.children.cycles-pp.stress_sigsys_handler
      3.82 ±  3%      +0.7        4.54        perf-profile.children.cycles-pp.handle_signal
     37.92            -1.1       36.78        perf-profile.self.cycles-pp.do_dec_rlimit_put_ucounts
      0.26 ±  7%      -0.0        0.23 ±  8%  perf-profile.self.cycles-pp.task_mm_cid_work
      0.13 ±  3%      +0.0        0.14 ±  3%  perf-profile.self.cycles-pp.__get_user_nocheck_8
      0.09 ±  4%      +0.0        0.11        perf-profile.self.cycles-pp.copy_siginfo_to_user
      0.17 ±  5%      +0.0        0.19 ±  5%  perf-profile.self.cycles-pp.__get_user_nocheck_1
      0.18 ±  4%      +0.0        0.20 ±  5%  perf-profile.self.cycles-pp._copy_from_user
      0.15 ±  6%      +0.0        0.18 ±  5%  perf-profile.self.cycles-pp.do_syscall_64
      0.16 ±  3%      +0.0        0.18 ±  2%  perf-profile.self.cycles-pp.__put_user_nocheck_4
      0.22 ±  4%      +0.0        0.25 ±  2%  perf-profile.self.cycles-pp.prctl
      0.16 ±  2%      +0.0        0.19 ±  4%  perf-profile.self.cycles-pp.x64_setup_rt_frame
      0.29 ±  3%      +0.0        0.32 ±  5%  perf-profile.self.cycles-pp.syscall_exit_to_user_mode
      0.02 ±141%      +0.0        0.06 ±  9%  perf-profile.self.cycles-pp.__local_bh_enable_ip
      0.30 ±  3%      +0.0        0.35        perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.36 ±  2%      +0.1        0.42 ±  2%  perf-profile.self.cycles-pp.__restore_fpregs_from_user
      0.53 ±  2%      +0.1        0.60 ±  4%  perf-profile.self.cycles-pp.copy_fpstate_to_sigframe
      1.40 ±  4%      +0.2        1.59 ±  3%  perf-profile.self.cycles-pp.obj_cgroup_charge
      1.48 ±  3%      +0.2        1.66        perf-profile.self.cycles-pp.refill_obj_stock
      1.70 ±  4%      +0.4        2.13 ±  6%  perf-profile.self.cycles-pp.fpregs_mark_activate
      2.34 ±  4%      +0.5        2.79 ±  3%  perf-profile.self.cycles-pp.complete_signal
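
The perf-profile rows above appear to share one layout: base value, optional "± N%" stddev, delta, patched value, optional "± N%" stddev, then the metric name. A minimal sketch for ranking such rows by absolute delta follows. The layout is inferred from the rows themselves rather than taken from any LKP specification, and the names `ROW` and `parse_row` are hypothetical helpers, not part of lkp-tests.

```python
import re

# Assumed row layout (inferred from the report, not from an LKP spec):
#   base [± N%]   delta   patched [± N%]   metric
ROW = re.compile(
    r"^\s*(?P<base>\d+\.\d+)(?:\s*±\s*(?P<base_sd>\d+)%)?"
    r"\s+(?P<delta>[+-]\d+\.\d+)"
    r"\s+(?P<new>\d+\.\d+)(?:\s*±\s*(?P<new_sd>\d+)%)?"
    r"\s+(?P<metric>\S+)\s*$"
)

def parse_row(line):
    """Return a dict for one comparison row, or None if the line does not match."""
    m = ROW.match(line)
    if m is None:
        return None
    return {
        "base": float(m["base"]),
        "delta": float(m["delta"]),
        "new": float(m["new"]),
        "metric": m["metric"],
    }

# Two rows copied from the report above.
sample = [
    "     37.92            -1.1       36.78        perf-profile.self.cycles-pp.do_dec_rlimit_put_ucounts",
    "      2.34 ±  4%      +0.5        2.79 ±  3%  perf-profile.self.cycles-pp.complete_signal",
]

rows = [r for r in map(parse_row, sample) if r is not None]

# Largest absolute change first; print only the trailing symbol name.
for r in sorted(rows, key=lambda r: abs(r["delta"]), reverse=True):
    print(f'{r["delta"]:+.1f}  {r["metric"].rsplit(".", 1)[-1]}')
```

Lines that do not fit the assumed layout (headers, the disclaimer, blank lines) return None and are skipped, so the same loop can be run over the whole message body.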





Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki

