lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Message-ID: <202310261659.18c0c7e-oliver.sang@intel.com>
Date:   Thu, 26 Oct 2023 16:49:43 +0800
From:   kernel test robot <oliver.sang@...el.com>
To:     "Liam R. Howlett" <Liam.Howlett@...cle.com>
CC:     <oe-lkp@...ts.linux.dev>, <lkp@...el.com>,
        <linux-kernel@...r.kernel.org>,
        Andrew Morton <akpm@...ux-foundation.org>,
        Peng Zhang <zhangpeng.00@...edance.com>,
        Suren Baghdasaryan <surenb@...gle.com>, <linux-mm@...ck.org>,
        <ying.huang@...el.com>, <feng.tang@...el.com>,
        <fengwei.yin@...el.com>, <oliver.sang@...el.com>
Subject: [linus:master] [mm]  fd892593d4:  will-it-scale.per_process_ops 3.9%
 improvement



Hello,

kernel test robot noticed a 3.9% improvement of will-it-scale.per_process_ops on:


commit: fd892593d44d8b649caf30a67f0c7696d976d901 ("mm: change do_vmi_align_munmap() tracking of VMAs to remove")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

testcase: will-it-scale
test machine: 104 threads 2 sockets (Skylake) with 192G memory
parameters:

	nr_task: 100%
	mode: process
	test: brk2
	cpufreq_governor: performance



Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20231026/202310261659.18c0c7e-oliver.sang@intel.com

=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase:
  gcc-12/performance/x86_64-rhel-8.3/process/100%/debian-11.1-x86_64-20220510.cgz/lkp-skl-fpga01/brk2/will-it-scale

commit: 
  8c314f3b55 ("maple_tree: add benchmarking for mas_prev()")
  fd892593d4 ("mm: change do_vmi_align_munmap() tracking of VMAs to remove")

8c314f3b55fbc422 fd892593d44d8b649caf30a67f0 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
     44.35            +1.3%      44.94        turbostat.RAMWatt
      7472            +2.2%       7638        vmstat.system.cs
  13163009            +3.9%   13672076        will-it-scale.104.processes
    126566            +3.9%     131461        will-it-scale.per_process_ops
  13163009            +3.9%   13672076        will-it-scale.workload
 3.703e+08            +2.3%  3.789e+08        proc-vmstat.numa_hit
 3.702e+08            +2.3%  3.788e+08        proc-vmstat.numa_local
  7.24e+08            +2.3%  7.407e+08        proc-vmstat.pgalloc_normal
  7.23e+08            +2.3%  7.397e+08        proc-vmstat.pgfree
      5.72 ±  6%    -100.0%       0.00        perf-sched.wait_and_delay.avg.ms.__cond_resched.unmap_vmas.unmap_region.do_vmi_align_munmap.__do_sys_brk
    442.80 ±  5%      +9.9%     486.50 ±  5%  perf-sched.wait_and_delay.count.__cond_resched.kmem_cache_alloc.mas_alloc_nodes.mas_preallocate.do_brk_flags
    407.80 ±  6%     +12.5%     458.90 ±  4%  perf-sched.wait_and_delay.count.__cond_resched.kmem_cache_alloc.vm_area_dup.__split_vma.do_vmi_align_munmap
    311.30 ± 11%     -20.7%     246.80 ±  5%  perf-sched.wait_and_delay.count.__cond_resched.remove_vma.do_vmi_align_munmap.__do_sys_brk.do_syscall_64
      1114 ±  3%    -100.0%       0.00        perf-sched.wait_and_delay.count.__cond_resched.unmap_vmas.unmap_region.do_vmi_align_munmap.__do_sys_brk
     28.01 ± 36%    -100.0%       0.00        perf-sched.wait_and_delay.max.ms.__cond_resched.unmap_vmas.unmap_region.do_vmi_align_munmap.__do_sys_brk
      6.22 ± 19%    -100.0%       0.00        perf-sched.wait_time.avg.ms.__cond_resched.unmap_page_range.unmap_vmas.unmap_region.do_vmi_align_munmap
      5.72 ±  6%    -100.0%       0.00        perf-sched.wait_time.avg.ms.__cond_resched.unmap_vmas.unmap_region.do_vmi_align_munmap.__do_sys_brk
     14.25 ± 10%    -100.0%       0.00        perf-sched.wait_time.max.ms.__cond_resched.unmap_page_range.unmap_vmas.unmap_region.do_vmi_align_munmap
     28.01 ± 36%    -100.0%       0.00        perf-sched.wait_time.max.ms.__cond_resched.unmap_vmas.unmap_region.do_vmi_align_munmap.__do_sys_brk
      2.49            +3.5%       2.58        perf-stat.i.MPKI
 3.407e+08            +3.5%  3.527e+08        perf-stat.i.cache-misses
 5.063e+08            +3.7%   5.25e+08        perf-stat.i.cache-references
      7446            +2.3%       7616        perf-stat.i.context-switches
    248.21            +2.2%     253.65        perf-stat.i.cpu-migrations
    817.56            -3.4%     789.61        perf-stat.i.cycles-between-cache-misses
      0.04            +0.0        0.04        perf-stat.i.dTLB-load-miss-rate%
  14854767            +3.6%   15392456        perf-stat.i.dTLB-load-misses
  13577896            +4.9%   14238965        perf-stat.i.iTLB-load-misses
  24153428            +3.4%   24977848        perf-stat.i.iTLB-loads
     10082            -4.6%       9618        perf-stat.i.instructions-per-iTLB-miss
      1532            +3.0%       1578        perf-stat.i.metric.K/sec
  43693423            +2.7%   44860627        perf-stat.i.node-loads
  86115914            +2.6%   88355886        perf-stat.i.node-stores
      2.49            +3.5%       2.58        perf-stat.overall.MPKI
    817.24            -3.4%     789.30        perf-stat.overall.cycles-between-cache-misses
      0.04            +0.0        0.04        perf-stat.overall.dTLB-load-miss-rate%
     10079            -4.6%       9614        perf-stat.overall.instructions-per-iTLB-miss
   3134760            -3.7%    3019810        perf-stat.overall.path-length
 3.396e+08            +3.5%  3.515e+08        perf-stat.ps.cache-misses
 5.046e+08            +3.7%  5.232e+08        perf-stat.ps.cache-references
      7420            +2.3%       7589        perf-stat.ps.context-switches
    247.36            +2.2%     252.76        perf-stat.ps.cpu-migrations
  14810795            +3.6%   15350854        perf-stat.ps.dTLB-load-misses
  13531716            +4.9%   14190673        perf-stat.ps.iTLB-load-misses
  24072354            +3.4%   24894366        perf-stat.ps.iTLB-loads
  43550846            +2.7%   44720688        perf-stat.ps.node-loads
  85827526            +2.6%   88063334        perf-stat.ps.node-stores
     36.62            -1.0       35.60        perf-profile.calltrace.cycles-pp.do_vmi_align_munmap.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
      1.94            -0.8        1.18        perf-profile.calltrace.cycles-pp.mas_wr_store_entry.mas_store_gfp.do_vmi_align_munmap.__do_sys_brk.do_syscall_64
      5.74            -0.7        5.01        perf-profile.calltrace.cycles-pp.mas_store_gfp.do_vmi_align_munmap.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
      2.89 ±  3%      -0.5        2.41 ±  5%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__slab_free.rcu_do_batch.rcu_core.__do_softirq
      2.84 ±  3%      -0.5        2.36 ±  5%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.__slab_free.rcu_do_batch.rcu_core
      5.17 ±  2%      -0.5        4.70 ±  3%  perf-profile.calltrace.cycles-pp.__slab_free.rcu_do_batch.rcu_core.__do_softirq.run_ksoftirqd
      4.47            -0.4        4.04        perf-profile.calltrace.cycles-pp.mas_destroy.mas_store_prealloc.do_brk_flags.__do_sys_brk.do_syscall_64
      3.42            -0.4        3.06        perf-profile.calltrace.cycles-pp.unmap_region.do_vmi_align_munmap.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.49            -0.4        1.14 ±  3%  perf-profile.calltrace.cycles-pp.__unfreeze_partials.kmem_cache_free_bulk.mas_destroy.mas_store_prealloc.do_brk_flags
      2.54            -0.3        2.21        perf-profile.calltrace.cycles-pp.kmem_cache_free_bulk.mas_destroy.mas_store_prealloc.do_brk_flags.__do_sys_brk
      1.10            -0.3        0.82 ±  5%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__unfreeze_partials.kmem_cache_free_bulk.mas_destroy.mas_store_prealloc
      1.09            -0.3        0.82 ±  4%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.__unfreeze_partials.kmem_cache_free_bulk.mas_destroy
      4.53            -0.3        4.28 ±  2%  perf-profile.calltrace.cycles-pp.__kmem_cache_alloc_bulk.kmem_cache_alloc_bulk.mas_alloc_nodes.mas_preallocate.__split_vma
      4.88            -0.3        4.63        perf-profile.calltrace.cycles-pp.kmem_cache_alloc_bulk.mas_alloc_nodes.mas_preallocate.__split_vma.do_vmi_align_munmap
      5.53            -0.2        5.31        perf-profile.calltrace.cycles-pp.mas_alloc_nodes.mas_preallocate.__split_vma.do_vmi_align_munmap.__do_sys_brk
      5.64            -0.2        5.42        perf-profile.calltrace.cycles-pp.mas_preallocate.__split_vma.do_vmi_align_munmap.__do_sys_brk.do_syscall_64
      0.60            -0.1        0.47 ± 33%  perf-profile.calltrace.cycles-pp.mas_alloc_nodes.mas_wr_bnode.mas_store_gfp.do_vmi_align_munmap.__do_sys_brk
      0.85            -0.1        0.72        perf-profile.calltrace.cycles-pp.mas_next_slot.do_vmi_align_munmap.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.10            -0.1        1.00 ±  3%  perf-profile.calltrace.cycles-pp.__unfreeze_partials.mas_destroy.mas_store_prealloc.do_brk_flags.__do_sys_brk
      0.81 ±  2%      -0.1        0.72 ±  5%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__unfreeze_partials.mas_destroy.mas_store_prealloc.do_brk_flags
      1.09            -0.1        1.01        perf-profile.calltrace.cycles-pp.kmem_cache_free_bulk.mas_destroy.mas_store_prealloc.vma_complete.__split_vma
      0.53            +0.0        0.56        perf-profile.calltrace.cycles-pp.mas_store_b_node.mas_wr_spanning_store.mas_store_prealloc.do_brk_flags.__do_sys_brk
      0.58            +0.0        0.61        perf-profile.calltrace.cycles-pp.mab_mas_cp.mas_split_final_node.mas_split.mas_wr_bnode.mas_store_prealloc
      0.75            +0.0        0.78        perf-profile.calltrace.cycles-pp.mas_walk.mas_find.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.61            +0.0        0.65 ±  2%  perf-profile.calltrace.cycles-pp.get_unmapped_area.check_brk_limits.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.71            +0.0        0.75        perf-profile.calltrace.cycles-pp.check_brk_limits.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
      0.71            +0.0        0.75        perf-profile.calltrace.cycles-pp.memcg_slab_post_alloc_hook.kmem_cache_alloc.vm_area_dup.__split_vma.do_vmi_align_munmap
      1.10            +0.0        1.14        perf-profile.calltrace.cycles-pp.perf_event_mmap_output.perf_iterate_sb.perf_event_mmap_event.perf_event_mmap.do_brk_flags
      1.08            +0.0        1.12        perf-profile.calltrace.cycles-pp.mas_descend_adopt.mas_wmb_replace.mas_spanning_rebalance.mas_wr_spanning_store.mas_store_prealloc
      0.71 ±  2%      +0.1        0.76 ±  2%  perf-profile.calltrace.cycles-pp.mas_pop_node.mas_split.mas_wr_bnode.mas_store_prealloc.vma_complete
      1.00            +0.1        1.05        perf-profile.calltrace.cycles-pp.__call_rcu_common.mas_wmb_replace.mas_spanning_rebalance.mas_wr_spanning_store.mas_store_prealloc
      0.76            +0.1        0.81        perf-profile.calltrace.cycles-pp.mas_mab_cp.mas_store_b_node.mas_wr_bnode.mas_store_prealloc.vma_complete
      0.74 ±  2%      +0.1        0.80        perf-profile.calltrace.cycles-pp.zap_pmd_range.unmap_page_range.unmap_vmas.unmap_region.do_vmi_align_munmap
      1.48            +0.1        1.55        perf-profile.calltrace.cycles-pp.__entry_text_start.brk
      1.02            +0.1        1.08        perf-profile.calltrace.cycles-pp.mas_split_final_node.mas_split.mas_wr_bnode.mas_store_prealloc.vma_complete
      0.91            +0.1        0.98 ±  2%  perf-profile.calltrace.cycles-pp.mas_store_b_node.mas_wr_bnode.mas_store_prealloc.vma_complete.__split_vma
      0.62            +0.1        0.69        perf-profile.calltrace.cycles-pp.mas_wr_walk_index.mas_wr_spanning_store.mas_store_prealloc.do_brk_flags.__do_sys_brk
      1.55            +0.1        1.62        perf-profile.calltrace.cycles-pp.perf_iterate_sb.perf_event_mmap_event.perf_event_mmap.do_brk_flags.__do_sys_brk
      1.18            +0.1        1.25        perf-profile.calltrace.cycles-pp.mas_descend_adopt.mas_wmb_replace.mas_split.mas_wr_bnode.mas_store_prealloc
      1.65            +0.1        1.74        perf-profile.calltrace.cycles-pp.mast_fill_bnode.mas_split.mas_wr_bnode.mas_store_prealloc.vma_complete
      0.72            +0.1        0.82        perf-profile.calltrace.cycles-pp.mas_push_data.mas_split.mas_wr_bnode.mas_store_prealloc.vma_complete
      2.28            +0.1        2.38        perf-profile.calltrace.cycles-pp.mas_wmb_replace.mas_spanning_rebalance.mas_wr_spanning_store.mas_store_prealloc.do_brk_flags
      2.09            +0.1        2.20        perf-profile.calltrace.cycles-pp.mas_wmb_replace.mas_split.mas_wr_bnode.mas_store_prealloc.vma_complete
      2.49            +0.1        2.63        perf-profile.calltrace.cycles-pp.kmem_cache_alloc.vm_area_dup.__split_vma.do_vmi_align_munmap.__do_sys_brk
      3.36            +0.1        3.49        perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
     14.95            +0.1       15.08        perf-profile.calltrace.cycles-pp.mas_store_prealloc.do_brk_flags.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
      2.89            +0.1        3.03        perf-profile.calltrace.cycles-pp.vm_area_dup.__split_vma.do_vmi_align_munmap.__do_sys_brk.do_syscall_64
      3.85            +0.2        4.01        perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.brk
      6.96            +0.3        7.29        perf-profile.calltrace.cycles-pp.mas_spanning_rebalance.mas_wr_spanning_store.mas_store_prealloc.do_brk_flags.__do_sys_brk
      6.29            +0.5        6.78 ±  2%  perf-profile.calltrace.cycles-pp.__unfreeze_partials.rcu_do_batch.rcu_core.__do_softirq.run_ksoftirqd
      9.84            +0.5       10.35        perf-profile.calltrace.cycles-pp.mas_wr_spanning_store.mas_store_prealloc.do_brk_flags.__do_sys_brk.do_syscall_64
      8.18            +0.5        8.69        perf-profile.calltrace.cycles-pp.mas_split.mas_wr_bnode.mas_store_prealloc.vma_complete.__split_vma
      4.09 ±  2%      +0.6        4.67 ±  4%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.free_pcppages_bulk.free_unref_page.__unfreeze_partials
      4.11 ±  2%      +0.6        4.69 ±  4%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.free_pcppages_bulk.free_unref_page.__unfreeze_partials.rcu_do_batch
      4.22 ±  2%      +0.6        4.81 ±  4%  perf-profile.calltrace.cycles-pp.free_pcppages_bulk.free_unref_page.__unfreeze_partials.rcu_do_batch.rcu_core
      4.25 ±  2%      +0.6        4.84 ±  4%  perf-profile.calltrace.cycles-pp.free_unref_page.__unfreeze_partials.rcu_do_batch.rcu_core.__do_softirq
      9.74            +0.6       10.34        perf-profile.calltrace.cycles-pp.mas_wr_bnode.mas_store_prealloc.vma_complete.__split_vma.do_vmi_align_munmap
     13.31            +0.6       13.92        perf-profile.calltrace.cycles-pp.mas_store_prealloc.vma_complete.__split_vma.do_vmi_align_munmap.__do_sys_brk
      0.00            +0.6        0.63        perf-profile.calltrace.cycles-pp.kmem_cache_alloc.mas_alloc_nodes.mas_preallocate.do_brk_flags.__do_sys_brk
     13.46            +0.6       14.10        perf-profile.calltrace.cycles-pp.vma_complete.__split_vma.do_vmi_align_munmap.__do_sys_brk.do_syscall_64
     22.96            +0.7       23.64        perf-profile.calltrace.cycles-pp.__split_vma.do_vmi_align_munmap.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
     36.73            -1.0       35.71        perf-profile.children.cycles-pp.do_vmi_align_munmap
     27.38            -1.0       26.40        perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
     27.64            -1.0       26.67        perf-profile.children.cycles-pp._raw_spin_lock_irqsave
      2.68            -0.7        1.94        perf-profile.children.cycles-pp.mas_wr_store_entry
      5.80            -0.7        5.06        perf-profile.children.cycles-pp.mas_store_gfp
     14.46            -0.7       13.77        perf-profile.children.cycles-pp.mas_alloc_nodes
      7.52            -0.5        7.06        perf-profile.children.cycles-pp.mas_destroy
      3.66            -0.4        3.25        perf-profile.children.cycles-pp.kmem_cache_free_bulk
      0.80            -0.4        0.43 ±  2%  perf-profile.children.cycles-pp.free_pgtables
      3.45            -0.4        3.09        perf-profile.children.cycles-pp.unmap_region
      1.85            -0.3        1.57        perf-profile.children.cycles-pp.mas_find
      1.48            -0.3        1.22        perf-profile.children.cycles-pp.mas_walk
      1.49            -0.2        1.25        perf-profile.children.cycles-pp.mtree_range_walk
      1.09            -0.2        0.85        perf-profile.children.cycles-pp.mas_next_slot
      2.68            -0.1        2.55        perf-profile.children.cycles-pp.kmem_cache_free
      4.33            -0.1        4.20        perf-profile.children.cycles-pp.kmem_cache_alloc
      1.90            -0.1        1.79 ±  2%  perf-profile.children.cycles-pp.mas_pop_node
      0.22 ±  2%      -0.1        0.17 ±  3%  perf-profile.children.cycles-pp.remove_vma
      0.22 ±  2%      -0.0        0.17 ±  3%  perf-profile.children.cycles-pp.tlb_gather_mmu
      0.30 ±  2%      -0.0        0.26 ±  3%  perf-profile.children.cycles-pp.__pte_offset_map_lock
      0.13 ±  3%      -0.0        0.11 ±  4%  perf-profile.children.cycles-pp.downgrade_write
      0.20 ±  3%      -0.0        0.18 ±  3%  perf-profile.children.cycles-pp.mas_nomem
      0.08 ±  3%      -0.0        0.06 ±  5%  perf-profile.children.cycles-pp.mas_node_count_gfp
      0.14 ±  2%      +0.0        0.16 ±  4%  perf-profile.children.cycles-pp.discard_slab
      0.48            +0.0        0.51        perf-profile.children.cycles-pp.rcu_nocb_try_bypass
      0.58            +0.0        0.61        perf-profile.children.cycles-pp.rcu_segcblist_enqueue
      0.10 ±  6%      +0.0        0.12 ±  3%  perf-profile.children.cycles-pp.unmap_single_vma
      0.74            +0.0        0.78        perf-profile.children.cycles-pp.mas_wr_walk
      0.66            +0.0        0.70        perf-profile.children.cycles-pp.vma_prepare
      0.86            +0.0        0.89 ±  2%  perf-profile.children.cycles-pp.mas_prev_slot
      1.11            +0.0        1.15        perf-profile.children.cycles-pp.perf_event_mmap_output
      0.11 ±  3%      +0.0        0.15 ±  4%  perf-profile.children.cycles-pp.anon_vma_clone
      0.62            +0.0        0.67        perf-profile.children.cycles-pp.get_unmapped_area
      1.22            +0.0        1.26        perf-profile.children.cycles-pp.vm_area_free_rcu_cb
      0.71            +0.0        0.76        perf-profile.children.cycles-pp.check_brk_limits
      0.94            +0.1        1.00        perf-profile.children.cycles-pp.__cond_resched
      0.76            +0.1        0.82        perf-profile.children.cycles-pp.zap_pmd_range
      1.02            +0.1        1.09        perf-profile.children.cycles-pp.mas_split_final_node
      1.02            +0.1        1.08        perf-profile.children.cycles-pp.memcpy_orig
      0.63            +0.1        0.69        perf-profile.children.cycles-pp.mas_wr_walk_index
      0.29 ±  2%      +0.1        0.36 ±  3%  perf-profile.children.cycles-pp.mas_next_sibling
      1.56            +0.1        1.63        perf-profile.children.cycles-pp.perf_iterate_sb
      1.87            +0.1        1.94        perf-profile.children.cycles-pp.__entry_text_start
      1.66            +0.1        1.75        perf-profile.children.cycles-pp.mast_fill_bnode
      0.73            +0.1        0.83        perf-profile.children.cycles-pp.mas_push_data
      1.63            +0.1        1.74        perf-profile.children.cycles-pp.rcu_cblist_dequeue
      2.27            +0.1        2.38        perf-profile.children.cycles-pp.mas_descend_adopt
      3.42            +0.1        3.53        perf-profile.children.cycles-pp.mab_mas_cp
      1.85            +0.1        1.97        perf-profile.children.cycles-pp.mas_store_b_node
      2.51            +0.1        2.63        perf-profile.children.cycles-pp.__call_rcu_common
      3.38            +0.1        3.52        perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      2.90            +0.1        3.04        perf-profile.children.cycles-pp.vm_area_dup
      2.43            +0.2        2.58        perf-profile.children.cycles-pp.mas_mab_cp
      3.88            +0.2        4.05        perf-profile.children.cycles-pp.syscall_return_via_sysret
      4.40            +0.2        4.62        perf-profile.children.cycles-pp.mas_wmb_replace
     13.50            +0.3       13.77        perf-profile.children.cycles-pp.__unfreeze_partials
      7.00            +0.3        7.32        perf-profile.children.cycles-pp.mas_spanning_rebalance
     28.03            +0.5       28.49        perf-profile.children.cycles-pp.rcu_core
      9.86            +0.5       10.37        perf-profile.children.cycles-pp.mas_wr_spanning_store
      8.26            +0.5        8.77        perf-profile.children.cycles-pp.mas_split
     12.98            +0.6       13.62        perf-profile.children.cycles-pp.mas_wr_bnode
     13.58            +0.7       14.23        perf-profile.children.cycles-pp.vma_complete
     23.02            +0.7       23.70        perf-profile.children.cycles-pp.__split_vma
     28.32            +0.7       29.06        perf-profile.children.cycles-pp.mas_store_prealloc
     27.38            -1.0       26.40        perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
      0.76            -0.2        0.53        perf-profile.self.cycles-pp.mas_next_slot
      1.31            -0.2        1.09        perf-profile.self.cycles-pp.mtree_range_walk
      1.54            -0.1        1.40        perf-profile.self.cycles-pp.kmem_cache_alloc
      2.27            -0.1        2.13        perf-profile.self.cycles-pp.kmem_cache_free
      1.70            -0.1        1.58        perf-profile.self.cycles-pp.mas_pop_node
      0.48            -0.1        0.40 ±  2%  perf-profile.self.cycles-pp.unmap_page_range
      0.52            -0.1        0.45        perf-profile.self.cycles-pp.mas_alloc_nodes
      0.16 ±  3%      -0.1        0.11 ±  3%  perf-profile.self.cycles-pp.free_pgtables
      0.31 ±  2%      -0.0        0.26 ±  2%  perf-profile.self.cycles-pp.mas_destroy
      0.11 ±  4%      -0.0        0.07 ±  7%  perf-profile.self.cycles-pp.remove_vma
      0.19 ±  2%      -0.0        0.15 ±  2%  perf-profile.self.cycles-pp.tlb_gather_mmu
      0.75            -0.0        0.71        perf-profile.self.cycles-pp.mas_wr_store_entry
      0.82            -0.0        0.78        perf-profile.self.cycles-pp.___slab_alloc
      0.12 ±  4%      -0.0        0.09 ±  3%  perf-profile.self.cycles-pp.__pte_offset_map_lock
      0.12 ±  3%      -0.0        0.10 ±  5%  perf-profile.self.cycles-pp.downgrade_write
      0.17 ±  2%      -0.0        0.15 ±  2%  perf-profile.self.cycles-pp.unmap_vmas
      0.24            -0.0        0.22 ±  2%  perf-profile.self.cycles-pp.unmap_region
      0.08 ±  6%      -0.0        0.06 ±  7%  perf-profile.self.cycles-pp.can_vma_merge_after
      0.18 ±  2%      +0.0        0.20 ±  3%  perf-profile.self.cycles-pp.percpu_counter_add_batch
      0.13 ±  3%      +0.0        0.15 ±  3%  perf-profile.self.cycles-pp.get_unmapped_area
      0.08 ±  5%      +0.0        0.11 ±  4%  perf-profile.self.cycles-pp.unmap_single_vma
      0.17 ±  2%      +0.0        0.19 ±  3%  perf-profile.self.cycles-pp.zap_pmd_range
      0.36 ±  2%      +0.0        0.39 ±  2%  perf-profile.self.cycles-pp.mas_push_data
      0.79            +0.0        0.82        perf-profile.self.cycles-pp.do_vmi_align_munmap
      0.99 ±  2%      +0.0        1.03        perf-profile.self.cycles-pp.perf_event_mmap_output
      0.08 ±  5%      +0.0        0.12 ±  4%  perf-profile.self.cycles-pp.anon_vma_clone
      0.61            +0.0        0.65        perf-profile.self.cycles-pp.__cond_resched
      0.83            +0.0        0.87        perf-profile.self.cycles-pp.mast_fill_bnode
      0.20 ±  4%      +0.0        0.24 ±  2%  perf-profile.self.cycles-pp.vma_complete
      1.22            +0.0        1.26        perf-profile.self.cycles-pp.mas_wr_spanning_store
      0.88            +0.0        0.93        perf-profile.self.cycles-pp.memcpy_orig
      1.26            +0.0        1.30        perf-profile.self.cycles-pp.__call_rcu_common
      0.21 ±  2%      +0.0        0.26 ±  2%  perf-profile.self.cycles-pp.mas_next_sibling
      1.12            +0.1        1.18        perf-profile.self.cycles-pp.kmem_cache_free_bulk
      0.78            +0.1        0.83        perf-profile.self.cycles-pp.__kmem_cache_alloc_bulk
      0.79            +0.1        0.85        perf-profile.self.cycles-pp.__do_sys_brk
      0.56            +0.1        0.61        perf-profile.self.cycles-pp.mas_wr_walk_index
      0.51 ±  2%      +0.1        0.57        perf-profile.self.cycles-pp.__split_vma
      1.65            +0.1        1.72        perf-profile.self.cycles-pp.__entry_text_start
      1.32            +0.1        1.39        perf-profile.self.cycles-pp.mas_spanning_rebalance
      2.30            +0.1        2.37        perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      2.04            +0.1        2.12        perf-profile.self.cycles-pp.mas_descend_adopt
      1.77            +0.1        1.86        perf-profile.self.cycles-pp.mas_mab_cp
      1.61            +0.1        1.71        perf-profile.self.cycles-pp.rcu_cblist_dequeue
      3.13            +0.1        3.27        perf-profile.self.cycles-pp.syscall_exit_to_user_mode
      3.88            +0.2        4.04        perf-profile.self.cycles-pp.syscall_return_via_sysret
      4.75            +0.2        4.94        perf-profile.self.cycles-pp.__slab_free



Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ