lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Message-ID: <20201004132237.GQ393@shao2-debian>
Date:   Sun, 4 Oct 2020 21:22:37 +0800
From:   kernel test robot <rong.a.chen@...el.com>
To:     Peter Xu <peterx@...hat.com>
Cc:     Linus Torvalds <torvalds@...ux-foundation.org>,
        LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
        lkp@...el.com, ying.huang@...el.com, feng.tang@...el.com,
        zhengjun.xing@...el.com
Subject: [mm] 70e806e4e6: will-it-scale.per_process_ops 2.7% improvement

Greeting,

FYI, we noticed a 2.7% improvement of will-it-scale.per_process_ops due to commit:


commit: 70e806e4e645019102d0e09d4933654fb5fb58ce ("mm: Do early cow for pinned pages during fork() for ptes")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master


in testcase: will-it-scale
on test machine: 104 threads Skylake with 192G memory
with following parameters:

	nr_task: 100%
	mode: process
	test: mmap2
	cpufreq_governor: performance
	ucode: 0x2006906

test-description: Will It Scale takes a testcase and runs it from 1 through to n parallel copies to see if the testcase will scale. It builds both a process and threads based test in order to see any differences between the two.
test-url: https://github.com/antonblanchard/will-it-scale

In addition to that, the commit also has significant impact on the following tests:

+------------------+---------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_process_ops 2.0% improvement |
| test machine     | 104 threads Skylake with 192G memory                          |
| test parameters  | cpufreq_governor=performance                                  |
|                  | mode=process                                                  |
|                  | nr_task=100%                                                  |
|                  | test=mmap1                                                    |
|                  | ucode=0x2006906                                               |
+------------------+---------------------------------------------------------------+




Details are as below:
-------------------------------------------------------------------------------------------------->


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        bin/lkp install job.yaml  # job file is attached in this email
        bin/lkp run     job.yaml

=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
  gcc-9/performance/x86_64-rhel-8.3/process/100%/debian-10.4-x86_64-20200603.cgz/lkp-skl-fpga01/mmap2/will-it-scale/0x2006906

commit: 
  7a4830c380 ("mm/fork: Pass new vma pointer into copy_page_range()")
  70e806e4e6 ("mm: Do early cow for pinned pages during fork() for ptes")

7a4830c380f3a8b3 70e806e4e645019102d0e09d493 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
    215537            +2.7%     221349        will-it-scale.per_process_ops
  22415915            +2.7%   23020376        will-it-scale.workload
    839733 ±  3%     -35.4%     542743 ± 17%  cpuidle.C1.time
    282931 ±  8%     +13.9%     322181 ±  9%  numa-numastat.node0.local_node
    306399 ±  5%      +8.4%     332275 ±  5%  numa-numastat.node0.numa_hit
    741.62 ±  7%      +9.1%     809.08 ±  6%  sched_debug.cfs_rq:/.util_avg.min
      0.14 ±  7%     +12.2%       0.16 ±  5%  sched_debug.cpu.nr_running.stddev
    795.75 ±  7%     +35.8%       1080 ±  7%  numa-vmstat.node0.nr_page_table_pages
     11758 ±  8%     +21.3%      14260 ±  4%  numa-vmstat.node0.nr_slab_reclaimable
      1282 ±  4%     -21.9%       1002 ±  7%  numa-vmstat.node1.nr_page_table_pages
     12482 ±  7%     -20.9%       9868 ±  8%  numa-vmstat.node1.nr_slab_reclaimable
    789100 ± 11%     -19.3%     636572 ± 14%  numa-vmstat.node1.numa_local
     47033 ±  8%     +21.3%      57044 ±  4%  numa-meminfo.node0.KReclaimable
      3183 ±  7%     +35.8%       4323 ±  7%  numa-meminfo.node0.PageTables
     47033 ±  8%     +21.3%      57044 ±  4%  numa-meminfo.node0.SReclaimable
    144339 ±  8%     +12.7%     162692 ±  6%  numa-meminfo.node0.Slab
     49936 ±  7%     -20.9%      39478 ±  8%  numa-meminfo.node1.KReclaimable
      5130 ±  4%     -21.8%       4013 ±  7%  numa-meminfo.node1.PageTables
     49936 ±  7%     -20.9%      39478 ±  8%  numa-meminfo.node1.SReclaimable
    567.00 ±  9%    +181.8%       1597 ± 33%  interrupts.CPU3.CAL:Function_call_interrupts
    483.50 ±  8%     +20.2%     581.25 ± 12%  interrupts.CPU59.CAL:Function_call_interrupts
    351.00 ± 12%     +43.7%     504.25 ± 20%  interrupts.CPU59.RES:Rescheduling_interrupts
    322.00 ±  2%     +30.7%     421.00 ± 16%  interrupts.CPU60.RES:Rescheduling_interrupts
    455.75           +17.8%     536.75 ± 17%  interrupts.CPU61.CAL:Function_call_interrupts
    316.50           +25.0%     395.75 ± 16%  interrupts.CPU62.RES:Rescheduling_interrupts
    462.75 ±  4%      +4.9%     485.25 ±  5%  interrupts.CPU72.CAL:Function_call_interrupts
    994.00 ± 95%     -67.9%     318.75 ±  3%  interrupts.CPU98.RES:Rescheduling_interrupts
 5.651e+10            +2.8%  5.809e+10        perf-stat.i.branch-instructions
      0.48            +0.0        0.48        perf-stat.i.branch-miss-rate%
 2.596e+08            +4.0%  2.699e+08        perf-stat.i.branch-misses
     10.93 ±  6%      -2.2        8.74 ±  4%  perf-stat.i.cache-miss-rate%
      1.18            -2.7%       1.15        perf-stat.i.cpi
  44666128            +3.6%   46270200        perf-stat.i.dTLB-load-misses
 6.013e+10            +2.8%  6.179e+10        perf-stat.i.dTLB-loads
     42553           +14.9%      48901 ± 15%  perf-stat.i.dTLB-store-misses
 2.718e+10            +2.8%  2.793e+10        perf-stat.i.dTLB-stores
  44533758           -36.9%   28097154        perf-stat.i.iTLB-load-misses
 2.356e+11            +2.8%  2.422e+11        perf-stat.i.instructions
      5549           +60.7%       8920        perf-stat.i.instructions-per-iTLB-miss
      0.85            +2.8%       0.87        perf-stat.i.ipc
      1382            +2.8%       1421        perf-stat.i.metric.M/sec
      0.46            +0.0        0.46        perf-stat.overall.branch-miss-rate%
     11.14 ± 10%      -2.4        8.75 ±  4%  perf-stat.overall.cache-miss-rate%
      1.18            -2.7%       1.15        perf-stat.overall.cpi
      0.00            +0.0        0.00 ± 15%  perf-stat.overall.dTLB-store-miss-rate%
      5292           +63.0%       8627        perf-stat.overall.instructions-per-iTLB-miss
      0.85            +2.8%       0.87        perf-stat.overall.ipc
 5.632e+10            +2.8%  5.789e+10        perf-stat.ps.branch-instructions
 2.588e+08            +3.9%   2.69e+08        perf-stat.ps.branch-misses
  44511150            +3.6%   46103111        perf-stat.ps.dTLB-load-misses
 5.993e+10            +2.8%  6.158e+10        perf-stat.ps.dTLB-loads
     42650           +14.6%      48890 ± 15%  perf-stat.ps.dTLB-store-misses
 2.709e+10            +2.8%  2.783e+10        perf-stat.ps.dTLB-stores
  44371490           -36.9%   27977958        perf-stat.ps.iTLB-load-misses
 2.348e+11            +2.8%  2.414e+11        perf-stat.ps.instructions
 7.106e+13            +2.6%   7.29e+13        perf-stat.total.instructions
     48.23            -0.9       47.34        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
     39.18            -0.9       38.30        perf-profile.calltrace.cycles-pp.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
     47.45            -0.9       46.59        perf-profile.calltrace.cycles-pp.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
     47.90            -0.9       47.05        perf-profile.calltrace.cycles-pp.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
     46.34            -0.8       45.51        perf-profile.calltrace.cycles-pp.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
     52.66            -0.8       51.86        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__munmap
     15.57            -0.7       14.88        perf-profile.calltrace.cycles-pp.___might_sleep.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
     29.18            -0.7       28.50        perf-profile.calltrace.cycles-pp.unmap_vmas.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
     58.71            -0.6       58.12        perf-profile.calltrace.cycles-pp.__munmap
     26.14            -0.6       25.57        perf-profile.calltrace.cycles-pp.unmap_page_range.unmap_vmas.unmap_region.__do_munmap.__vm_munmap
      2.89            -0.3        2.54        perf-profile.calltrace.cycles-pp.vma_link.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      7.01            -0.2        6.77        perf-profile.calltrace.cycles-pp.free_pgd_range.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
      0.76            -0.1        0.65        perf-profile.calltrace.cycles-pp.__vma_rb_erase.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
      0.66            -0.1        0.56        perf-profile.calltrace.cycles-pp.down_write.unlink_file_vma.free_pgtables.unmap_region.__do_munmap
      3.21            -0.1        3.10        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__mmap
      1.01            -0.1        0.94        perf-profile.calltrace.cycles-pp.memcpy_erms.prepend.d_path.perf_event_mmap.mmap_region
      1.63            -0.1        1.58        perf-profile.calltrace.cycles-pp.prepend_path.d_path.perf_event_mmap.mmap_region.do_mmap
      1.04            -0.0        0.99        perf-profile.calltrace.cycles-pp.unlink_file_vma.free_pgtables.unmap_region.__do_munmap.__vm_munmap
      0.89            -0.0        0.85        perf-profile.calltrace.cycles-pp.prepend_name.prepend_path.d_path.perf_event_mmap.mmap_region
      0.56            -0.0        0.53 ±  2%  perf-profile.calltrace.cycles-pp.down_write.vma_link.mmap_region.do_mmap.vm_mmap_pgoff
      1.21            -0.0        1.18        perf-profile.calltrace.cycles-pp.__vma_link_rb.vma_link.mmap_region.do_mmap.vm_mmap_pgoff
      1.25            +0.0        1.28        perf-profile.calltrace.cycles-pp.prepend.d_path.perf_event_mmap.mmap_region.do_mmap
      0.93            +0.0        0.98        perf-profile.calltrace.cycles-pp.strlen.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
      1.32            +0.1        1.38        perf-profile.calltrace.cycles-pp.find_vma.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
      0.57 ±  2%      +0.1        0.65        perf-profile.calltrace.cycles-pp.common_file_perm.security_mmap_file.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
      0.67            +0.1        0.76        perf-profile.calltrace.cycles-pp.kfree.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
      2.24            +0.1        2.35        perf-profile.calltrace.cycles-pp.rcu_all_qs._cond_resched.unmap_page_range.unmap_vmas.unmap_region
      1.98            +0.1        2.10        perf-profile.calltrace.cycles-pp.arch_get_unmapped_area_topdown.shmem_get_unmapped_area.get_unmapped_area.do_mmap.vm_mmap_pgoff
     25.35            +0.1       25.48        perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.39 ± 57%      +0.1        0.53        perf-profile.calltrace.cycles-pp.cap_vm_enough_memory.security_vm_enough_memory_mm.mmap_region.do_mmap.vm_mmap_pgoff
      9.39            +0.2        9.54        perf-profile.calltrace.cycles-pp.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      1.72            +0.2        1.88        perf-profile.calltrace.cycles-pp.security_mmap_file.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.38 ± 57%      +0.2        0.53 ±  2%  perf-profile.calltrace.cycles-pp.down_write_killable.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
      2.38 ±  6%      +0.2        2.56 ±  2%  perf-profile.calltrace.cycles-pp.kmem_cache_free.remove_vma.__do_munmap.__vm_munmap.__x64_sys_munmap
      2.67            +0.2        2.86        perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.__mmap
      2.58            +0.2        2.79        perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.__munmap
      2.88 ±  4%      +0.2        3.09        perf-profile.calltrace.cycles-pp.kmem_cache_alloc.vm_area_alloc.mmap_region.do_mmap.vm_mmap_pgoff
      0.82            +0.2        1.04        perf-profile.calltrace.cycles-pp.vm_unmapped_area.arch_get_unmapped_area_topdown.shmem_get_unmapped_area.get_unmapped_area.do_mmap
      4.18 ±  5%      +0.2        4.40        perf-profile.calltrace.cycles-pp._cond_resched.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
      3.49 ±  3%      +0.2        3.73        perf-profile.calltrace.cycles-pp.vm_area_alloc.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
     20.14            +0.3       20.46        perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
     28.30            +0.4       28.71        perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
     29.38            +0.4       29.82        perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
     29.88            +0.5       30.34        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
      2.16            +0.5        2.64        perf-profile.calltrace.cycles-pp.zap_pte_range.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
     34.46            +0.5       34.99        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__mmap
     41.05            +0.6       41.63        perf-profile.calltrace.cycles-pp.__mmap
     39.24            -0.9       38.36        perf-profile.children.cycles-pp.unmap_region
     47.93            -0.9       47.07        perf-profile.children.cycles-pp.__x64_sys_munmap
     47.48            -0.9       46.62        perf-profile.children.cycles-pp.__vm_munmap
     46.43            -0.8       45.59        perf-profile.children.cycles-pp.__do_munmap
     28.42            -0.7       27.68        perf-profile.children.cycles-pp.unmap_page_range
     29.20            -0.7       28.52        perf-profile.children.cycles-pp.unmap_vmas
     59.12            -0.6       58.54        perf-profile.children.cycles-pp.__munmap
     14.57            -0.5       14.04        perf-profile.children.cycles-pp.___might_sleep
     78.17            -0.4       77.73        perf-profile.children.cycles-pp.do_syscall_64
      2.94            -0.3        2.59        perf-profile.children.cycles-pp.vma_link
     87.19            -0.3       86.92        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
      7.03            -0.2        6.79        perf-profile.children.cycles-pp.free_pgd_range
      0.32 ±  2%      -0.2        0.15 ±  2%  perf-profile.children.cycles-pp.__rb_insert_augmented
      1.26            -0.1        1.13        perf-profile.children.cycles-pp.down_write
      0.77            -0.1        0.65        perf-profile.children.cycles-pp.__vma_rb_erase
      0.08 ±  5%      -0.1        0.03 ±100%  perf-profile.children.cycles-pp.memcpy
      1.66            -0.1        1.60        perf-profile.children.cycles-pp.prepend_path
      0.17 ±  5%      -0.0        0.12 ±  3%  perf-profile.children.cycles-pp.__vma_link_file
      1.05            -0.0        1.00        perf-profile.children.cycles-pp.unlink_file_vma
      0.92            -0.0        0.87        perf-profile.children.cycles-pp.prepend_name
      0.36            -0.0        0.32 ±  2%  perf-profile.children.cycles-pp.__x86_retpoline_r9
      0.12            -0.0        0.08 ±  5%  perf-profile.children.cycles-pp.get_align_mask
      1.02            -0.0        0.98        perf-profile.children.cycles-pp.__might_sleep
      0.17 ±  3%      -0.0        0.13 ±  3%  perf-profile.children.cycles-pp.timestamp_truncate
      0.38 ±  2%      -0.0        0.35 ±  3%  perf-profile.children.cycles-pp.current_time
      0.46            -0.0        0.43 ±  2%  perf-profile.children.cycles-pp.syscall_enter_from_user_mode
      1.21            -0.0        1.19        perf-profile.children.cycles-pp.__vma_link_rb
      0.10 ±  4%      -0.0        0.08        perf-profile.children.cycles-pp.get_mmap_base
      0.18 ±  2%      -0.0        0.17        perf-profile.children.cycles-pp.tlb_flush_mmu
      0.07            -0.0        0.06        perf-profile.children.cycles-pp.munmap@plt
      0.18            +0.0        0.20 ±  2%  perf-profile.children.cycles-pp.testcase
      0.35            +0.0        0.36 ±  2%  perf-profile.children.cycles-pp.obj_cgroup_charge
      0.29 ±  2%      +0.0        0.31 ±  2%  perf-profile.children.cycles-pp.lru_add_drain_cpu
      0.09            +0.0        0.11 ±  3%  perf-profile.children.cycles-pp.aa_file_perm
      0.44 ±  2%      +0.0        0.46        perf-profile.children.cycles-pp.refill_obj_stock
      0.27 ±  3%      +0.0        0.30 ±  2%  perf-profile.children.cycles-pp.vma_merge
      1.28            +0.0        1.31        perf-profile.children.cycles-pp.prepend
      0.76            +0.0        0.79        perf-profile.children.cycles-pp.up_write
      0.32 ±  2%      +0.0        0.35        perf-profile.children.cycles-pp.downgrade_write
      0.27 ±  3%      +0.0        0.30 ±  2%  perf-profile.children.cycles-pp.cap_mmap_file
      0.07 ±  6%      +0.0        0.11 ±  4%  perf-profile.children.cycles-pp.ima_file_mmap
      0.28 ±  2%      +0.0        0.32 ±  2%  perf-profile.children.cycles-pp.__vm_enough_memory
      0.03 ±100%      +0.0        0.07        perf-profile.children.cycles-pp.fput
      0.93            +0.0        0.98        perf-profile.children.cycles-pp.strlen
      0.34 ±  2%      +0.0        0.39        perf-profile.children.cycles-pp.apparmor_mmap_file
      0.26            +0.0        0.31 ±  2%  perf-profile.children.cycles-pp.sync_mm_rss
      0.22 ±  3%      +0.1        0.27 ±  2%  perf-profile.children.cycles-pp.vma_interval_tree_remove
      0.25            +0.1        0.30 ±  6%  perf-profile.children.cycles-pp.percpu_counter_add_batch
      1.40            +0.1        1.45        perf-profile.children.cycles-pp.find_vma
      0.14 ±  7%      +0.1        0.20 ±  9%  perf-profile.children.cycles-pp.vm_pgprot_modify
      0.16 ±  5%      +0.1        0.21 ±  2%  perf-profile.children.cycles-pp.uprobe_mmap
      1.13            +0.1        1.19        perf-profile.children.cycles-pp.memcpy_erms
      0.59 ±  2%      +0.1        0.66        perf-profile.children.cycles-pp.common_file_perm
      4.85            +0.1        4.93        perf-profile.children.cycles-pp._cond_resched
      0.67 ±  2%      +0.1        0.77        perf-profile.children.cycles-pp.kfree
      0.33 ±  3%      +0.1        0.43        perf-profile.children.cycles-pp.vma_set_page_prot
      2.01            +0.1        2.13        perf-profile.children.cycles-pp.arch_get_unmapped_area_topdown
     25.40            +0.1       25.53        perf-profile.children.cycles-pp.do_mmap
      9.46            +0.2        9.61        perf-profile.children.cycles-pp.perf_event_mmap
      8.57            +0.2        8.73        perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      1.74            +0.2        1.90        perf-profile.children.cycles-pp.security_mmap_file
      2.39 ±  5%      +0.2        2.58 ±  2%  perf-profile.children.cycles-pp.kmem_cache_free
      0.83            +0.2        1.04        perf-profile.children.cycles-pp.vm_unmapped_area
      2.96 ±  4%      +0.2        3.18        perf-profile.children.cycles-pp.kmem_cache_alloc
      3.49 ±  3%      +0.2        3.73        perf-profile.children.cycles-pp.vm_area_alloc
     20.27            +0.3       20.59        perf-profile.children.cycles-pp.mmap_region
      5.89            +0.4        6.30        perf-profile.children.cycles-pp.syscall_return_via_sysret
     28.34            +0.4       28.76        perf-profile.children.cycles-pp.vm_mmap_pgoff
     29.42            +0.4       29.86        perf-profile.children.cycles-pp.ksys_mmap_pgoff
      2.20            +0.5        2.66        perf-profile.children.cycles-pp.zap_pte_range
     41.40            +0.6       41.99        perf-profile.children.cycles-pp.__mmap
     11.45            -1.0       10.48        perf-profile.self.cycles-pp.unmap_page_range
     12.18            -0.4       11.82        perf-profile.self.cycles-pp.___might_sleep
      6.98            -0.2        6.74        perf-profile.self.cycles-pp.free_pgd_range
      0.31 ±  2%      -0.2        0.14        perf-profile.self.cycles-pp.__rb_insert_augmented
      5.52            -0.1        5.40        perf-profile.self.cycles-pp.entry_SYSCALL_64
      1.04            -0.1        0.93 ±  2%  perf-profile.self.cycles-pp.do_mmap
      0.75            -0.1        0.64        perf-profile.self.cycles-pp.__vma_rb_erase
      0.31 ±  4%      -0.1        0.24 ±  6%  perf-profile.self.cycles-pp.get_unmapped_area
      0.15 ±  2%      -0.1        0.10 ±  4%  perf-profile.self.cycles-pp.__vma_link_file
      0.90            -0.0        0.85        perf-profile.self.cycles-pp.prepend_name
      0.93 ±  2%      -0.0        0.88 ±  3%  perf-profile.self.cycles-pp.shmem_get_unmapped_area
      0.54 ±  2%      -0.0        0.50        perf-profile.self.cycles-pp.down_write
      0.66            -0.0        0.61 ±  2%  perf-profile.self.cycles-pp.__mmap
      0.94            -0.0        0.90        perf-profile.self.cycles-pp.arch_get_unmapped_area_topdown
      0.10 ±  4%      -0.0        0.06        perf-profile.self.cycles-pp.get_align_mask
      0.33            -0.0        0.29        perf-profile.self.cycles-pp.__x86_retpoline_r9
      0.16 ±  2%      -0.0        0.14 ±  3%  perf-profile.self.cycles-pp.current_time
      0.92            -0.0        0.89        perf-profile.self.cycles-pp.__might_sleep
      0.15 ±  2%      -0.0        0.12 ±  3%  perf-profile.self.cycles-pp.timestamp_truncate
      0.42            -0.0        0.39 ±  2%  perf-profile.self.cycles-pp.syscall_enter_from_user_mode
      0.14 ±  3%      -0.0        0.12        perf-profile.self.cycles-pp.prepend
      0.10 ±  4%      -0.0        0.08 ±  6%  perf-profile.self.cycles-pp.get_mmap_base
      0.35 ±  2%      -0.0        0.33        perf-profile.self.cycles-pp.security_mmap_file
      0.15 ±  3%      -0.0        0.14 ±  3%  perf-profile.self.cycles-pp.vma_interval_tree_insert
      0.13 ±  3%      -0.0        0.12        perf-profile.self.cycles-pp.tlb_flush_mmu
      0.06 ±  6%      -0.0        0.05        perf-profile.self.cycles-pp.munmap@plt
      0.05            +0.0        0.06 ±  6%  perf-profile.self.cycles-pp.should_failslab
      0.11 ±  6%      +0.0        0.12 ±  4%  perf-profile.self.cycles-pp.security_vm_enough_memory_mm
      0.33            +0.0        0.35 ±  2%  perf-profile.self.cycles-pp.obj_cgroup_charge
      0.50            +0.0        0.52        perf-profile.self.cycles-pp.vm_area_alloc
      0.30            +0.0        0.32 ±  3%  perf-profile.self.cycles-pp.unmap_vmas
      0.24            +0.0        0.26        perf-profile.self.cycles-pp.unmap_region
      0.08 ±  5%      +0.0        0.10 ±  4%  perf-profile.self.cycles-pp.aa_file_perm
      0.38            +0.0        0.40        perf-profile.self.cycles-pp.vmacache_find
      0.42 ±  2%      +0.0        0.45        perf-profile.self.cycles-pp.refill_obj_stock
      0.25            +0.0        0.27 ±  2%  perf-profile.self.cycles-pp.atime_needs_update
      0.28            +0.0        0.30 ±  2%  perf-profile.self.cycles-pp.vm_mmap_pgoff
      0.26            +0.0        0.28 ±  2%  perf-profile.self.cycles-pp.vma_merge
      0.11 ±  7%      +0.0        0.15 ±  7%  perf-profile.self.cycles-pp.vma_set_page_prot
      0.31 ±  2%      +0.0        0.34        perf-profile.self.cycles-pp.downgrade_write
      0.74            +0.0        0.77        perf-profile.self.cycles-pp.up_write
      0.83            +0.0        0.87        perf-profile.self.cycles-pp.find_vma
      0.07 ±  7%      +0.0        0.10 ±  4%  perf-profile.self.cycles-pp.ima_file_mmap
      0.22 ±  3%      +0.0        0.26        perf-profile.self.cycles-pp.cap_mmap_file
      0.30 ±  2%      +0.0        0.34        perf-profile.self.cycles-pp.apparmor_mmap_file
      0.21 ±  2%      +0.0        0.26        perf-profile.self.cycles-pp.__x64_sys_munmap
      0.93            +0.0        0.97        perf-profile.self.cycles-pp.strlen
      0.12 ±  4%      +0.0        0.17        perf-profile.self.cycles-pp.free_pgtables
      0.24            +0.0        0.29 ±  5%  perf-profile.self.cycles-pp.percpu_counter_add_batch
      0.26            +0.0        0.31 ±  2%  perf-profile.self.cycles-pp.sync_mm_rss
      0.17 ±  4%      +0.1        0.22 ±  4%  perf-profile.self.cycles-pp.vma_interval_tree_remove
      0.13 ±  8%      +0.1        0.18 ± 11%  perf-profile.self.cycles-pp.vm_pgprot_modify
      0.50 ±  2%      +0.1        0.55        perf-profile.self.cycles-pp.common_file_perm
      1.31            +0.1        1.36        perf-profile.self.cycles-pp.mmap_region
      0.15 ±  3%      +0.1        0.20 ±  2%  perf-profile.self.cycles-pp.uprobe_mmap
      2.22            +0.1        2.30        perf-profile.self.cycles-pp._cond_resched
      1.06            +0.1        1.15        perf-profile.self.cycles-pp.memcpy_erms
      0.66 ±  2%      +0.1        0.76        perf-profile.self.cycles-pp.kfree
      8.18            +0.2        8.36        perf-profile.self.cycles-pp.syscall_exit_to_user_mode
      0.82            +0.2        1.04        perf-profile.self.cycles-pp.vm_unmapped_area
      1.59            +0.4        2.00        perf-profile.self.cycles-pp.zap_pte_range
      5.87            +0.4        6.29        perf-profile.self.cycles-pp.syscall_return_via_sysret


                                                                                
                            will-it-scale.per_process_ops                       
                                                                                
  223000 +------------------------------------------------------------------+   
         |     O                                                            |   
  222000 |-+                                    O                O          |   
  221000 |-+                                          O    O                |   
         |          O     O    O                                            |   
  220000 |-+                               O                                |   
  219000 |-+                         O                                      |   
         |                                                                  |   
  218000 |-+              +..                                               |   
  217000 |-+            ..   ..                                             |   
         |.....        .                                             .+..   |   
  216000 |-+   +.... ..        +.....   ...+....                   ..    .. |   
  215000 |-+        +                +..        +.....   ..+.....+.        .|   
         |                                            +..                   |   
  214000 +------------------------------------------------------------------+   
                                                                                
                                                                                
[*] bisect-good sample
[O] bisect-bad  sample

***************************************************************************************************
lkp-skl-fpga01: 104 threads Skylake with 192G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
  gcc-9/performance/x86_64-rhel-8.3/process/100%/debian-10.4-x86_64-20200603.cgz/lkp-skl-fpga01/mmap1/will-it-scale/0x2006906

commit: 
  7a4830c380 ("mm/fork: Pass new vma pointer into copy_page_range()")
  70e806e4e6 ("mm: Do early cow for pinned pages during fork() for ptes")

7a4830c380f3a8b3 70e806e4e645019102d0e09d493 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
    238782            +2.0%     243561        will-it-scale.per_process_ops
      9565            -0.9%       9477        will-it-scale.time.maximum_resident_set_size
  24833379            +2.0%   25330430        will-it-scale.workload
     18752 ±  9%     -14.9%      15950 ± 12%  numa-meminfo.node0.Mapped
      4793 ±  9%     -15.7%       4041 ± 13%  numa-vmstat.node0.nr_mapped
     12863 ± 12%     +18.5%      15248 ± 11%  sched_debug.cpu.sched_count.max
      1740 ±  6%     -12.1%       1530        slabinfo.kmalloc-rcl-512.active_objs
      1740 ±  6%     -12.1%       1530        slabinfo.kmalloc-rcl-512.num_objs
     18707            +9.1%      20407 ±  3%  softirqs.CPU16.RCU
     17047 ±  2%      +8.1%      18429 ±  4%  softirqs.CPU40.RCU
     82.25            -1.5%      81.00        vmstat.cpu.sy
     16.00            +6.2%      17.00        vmstat.cpu.us
      8495 ± 38%     -96.9%     260.75 ± 82%  proc-vmstat.numa_hint_faults
      9348 ± 56%     -93.2%     632.75 ±118%  proc-vmstat.numa_pages_migrated
     41007 ± 55%     -86.3%       5613 ± 69%  proc-vmstat.numa_pte_updates
      9348 ± 56%     -93.2%     632.75 ±118%  proc-vmstat.pgmigrate_success
      7387           -37.7%       4602 ± 34%  interrupts.CPU10.NMI:Non-maskable_interrupts
      7387           -37.7%       4602 ± 34%  interrupts.CPU10.PMI:Performance_monitoring_interrupts
    313.00 ±  2%     +23.9%     387.75 ± 20%  interrupts.CPU24.RES:Rescheduling_interrupts
    353.75 ±  8%      -9.7%     319.50 ±  5%  interrupts.CPU37.RES:Rescheduling_interrupts
      5248 ± 22%     -37.7%       3269 ± 27%  interrupts.CPU53.CAL:Function_call_interrupts
      6467 ± 24%     -28.8%       4602 ± 34%  interrupts.CPU6.NMI:Non-maskable_interrupts
      6467 ± 24%     -28.8%       4602 ± 34%  interrupts.CPU6.PMI:Performance_monitoring_interrupts
    610.00 ± 23%     -24.1%     463.00        interrupts.CPU62.CAL:Function_call_interrupts
      6466 ± 24%     -28.8%       4603 ± 34%  interrupts.CPU7.NMI:Non-maskable_interrupts
      6466 ± 24%     -28.8%       4603 ± 34%  interrupts.CPU7.PMI:Performance_monitoring_interrupts
    453.50           +80.8%     819.75 ± 55%  interrupts.CPU8.CAL:Function_call_interrupts
      6467 ± 24%     -28.8%       4602 ± 34%  interrupts.CPU8.NMI:Non-maskable_interrupts
      6467 ± 24%     -28.8%       4602 ± 34%  interrupts.CPU8.PMI:Performance_monitoring_interrupts
      7388           -37.7%       4603 ± 34%  interrupts.CPU9.NMI:Non-maskable_interrupts
      7388           -37.7%       4603 ± 34%  interrupts.CPU9.PMI:Performance_monitoring_interrupts
 5.672e+10            +2.0%  5.784e+10        perf-stat.i.branch-instructions
 2.382e+08            +2.7%  2.446e+08        perf-stat.i.branch-misses
      1.18            -2.0%       1.16        perf-stat.i.cpi
  49520439            +1.9%   50482111        perf-stat.i.dTLB-load-misses
 5.872e+10            +2.0%  5.987e+10        perf-stat.i.dTLB-loads
     44598            +4.4%      46565        perf-stat.i.dTLB-store-misses
 2.605e+10            +2.0%  2.656e+10        perf-stat.i.dTLB-stores
     96.00            -7.9       88.06 ±  6%  perf-stat.i.iTLB-load-miss-rate%
  49281952            +2.2%   50384621        perf-stat.i.iTLB-load-misses
   1928684 ± 46%    +259.5%    6934327 ± 58%  perf-stat.i.iTLB-loads
 2.354e+11            +2.0%    2.4e+11        perf-stat.i.instructions
      0.85            +2.0%       0.86        perf-stat.i.ipc
      1360            +2.0%       1387        perf-stat.i.metric.M/sec
      2838            -0.9%       2811        perf-stat.i.minor-faults
      2838            -0.9%       2812        perf-stat.i.page-faults
      1.18            -2.0%       1.16        perf-stat.overall.cpi
      0.00            +0.0        0.00        perf-stat.overall.dTLB-store-miss-rate%
     96.28            -8.0       88.30 ±  6%  perf-stat.overall.iTLB-load-miss-rate%
      0.85            +2.0%       0.86        perf-stat.overall.ipc
 5.653e+10            +2.0%  5.765e+10        perf-stat.ps.branch-instructions
 2.375e+08            +2.7%  2.438e+08        perf-stat.ps.branch-misses
  49350510            +1.9%   50309290        perf-stat.ps.dTLB-load-misses
 5.852e+10            +2.0%  5.967e+10        perf-stat.ps.dTLB-loads
     44555            +4.3%      46476        perf-stat.ps.dTLB-store-misses
 2.596e+10            +2.0%  2.647e+10        perf-stat.ps.dTLB-stores
  49109500            +2.2%   50209811        perf-stat.ps.iTLB-load-misses
   1915468 ± 45%    +260.7%    6909154 ± 58%  perf-stat.ps.iTLB-loads
 2.346e+11            +2.0%  2.392e+11        perf-stat.ps.instructions
      2834            -1.0%       2805        perf-stat.ps.minor-faults
      2834            -1.0%       2806        perf-stat.ps.page-faults
 7.095e+13            +1.9%   7.23e+13        perf-stat.total.instructions
     34.83            -1.1       33.77        perf-profile.calltrace.cycles-pp.unmap_vmas.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
     44.65            -1.0       43.62        perf-profile.calltrace.cycles-pp.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
     18.45            -0.9       17.52        perf-profile.calltrace.cycles-pp.___might_sleep.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
     52.42            -0.7       51.69        perf-profile.calltrace.cycles-pp.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
     53.69            -0.7       52.96        perf-profile.calltrace.cycles-pp.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
     31.25            -0.7       30.52        perf-profile.calltrace.cycles-pp.unmap_page_range.unmap_vmas.unmap_region.__do_munmap.__vm_munmap
     54.16            -0.7       53.47        perf-profile.calltrace.cycles-pp.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
     54.49            -0.7       53.84        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
      2.31            -0.5        1.77        perf-profile.calltrace.cycles-pp.vma_link.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
     59.40            -0.4       58.98        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__munmap
      1.62            -0.2        1.38        perf-profile.calltrace.cycles-pp.__vma_link_rb.vma_link.mmap_region.do_mmap.vm_mmap_pgoff
     18.16            -0.2       17.93        perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
     22.17            -0.2       21.94        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
     21.60            -0.2       21.38        perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
     21.31            -0.2       21.11        perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
      1.68 ±  2%      -0.1        1.54        perf-profile.calltrace.cycles-pp.find_vma.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
      0.62 ±  2%      -0.0        0.59        perf-profile.calltrace.cycles-pp.security_mmap_addr.get_unmapped_area.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      0.56            +0.0        0.58        perf-profile.calltrace.cycles-pp.tlb_finish_mmu.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
      0.62            +0.0        0.65        perf-profile.calltrace.cycles-pp.cap_vm_enough_memory.security_vm_enough_memory_mm.mmap_region.do_mmap.vm_mmap_pgoff
      2.55            +0.1        2.65        perf-profile.calltrace.cycles-pp.rcu_all_qs._cond_resched.unmap_page_range.unmap_vmas.unmap_region
      3.15            +0.1        3.26        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__munmap
      0.89            +0.1        1.01 ±  2%  perf-profile.calltrace.cycles-pp.perf_event_mmap_output.perf_iterate_sb.perf_event_mmap.mmap_region.do_mmap
      3.87            +0.2        4.03        perf-profile.calltrace.cycles-pp.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      4.65            +0.2        4.87        perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.__munmap
      2.72 ±  2%      +0.2        2.95        perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.__mmap
      1.94            +0.2        2.19        perf-profile.calltrace.cycles-pp.perf_iterate_sb.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
      2.68 ±  2%      +0.3        2.93        perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.__munmap
      1.00            +0.3        1.26        perf-profile.calltrace.cycles-pp.vm_unmapped_area.arch_get_unmapped_area_topdown.get_unmapped_area.do_mmap.vm_mmap_pgoff
      2.05            +0.3        2.32        perf-profile.calltrace.cycles-pp.arch_get_unmapped_area_topdown.get_unmapped_area.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      5.05            +0.3        5.32        perf-profile.calltrace.cycles-pp._cond_resched.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
      2.80 ±  2%      +0.4        3.15        perf-profile.calltrace.cycles-pp.zap_pte_range.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
      2.93 ±  7%      +0.4        3.33 ±  3%  perf-profile.calltrace.cycles-pp.remove_vma.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
      2.27 ±  9%      +0.5        2.73 ±  3%  perf-profile.calltrace.cycles-pp.kmem_cache_free.remove_vma.__do_munmap.__vm_munmap.__x64_sys_munmap
      0.00            +0.6        0.58        perf-profile.calltrace.cycles-pp.free_pgtables.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
     34.85            -1.1       33.80        perf-profile.children.cycles-pp.unmap_vmas
     44.73            -1.0       43.70        perf-profile.children.cycles-pp.unmap_region
     33.89            -1.0       32.89        perf-profile.children.cycles-pp.unmap_page_range
     76.71            -0.9       75.83        perf-profile.children.cycles-pp.do_syscall_64
     52.54            -0.7       51.79        perf-profile.children.cycles-pp.__do_munmap
     53.73            -0.7       53.00        perf-profile.children.cycles-pp.__vm_munmap
     16.84            -0.7       16.13        perf-profile.children.cycles-pp.___might_sleep
     54.19            -0.7       53.49        perf-profile.children.cycles-pp.__x64_sys_munmap
     86.59            -0.6       86.02        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
      2.36            -0.5        1.82        perf-profile.children.cycles-pp.vma_link
      1.64            -0.3        1.38        perf-profile.children.cycles-pp.__vma_link_rb
     18.21            -0.2       17.98        perf-profile.children.cycles-pp.do_mmap
     21.63            -0.2       21.41        perf-profile.children.cycles-pp.ksys_mmap_pgoff
     21.36            -0.2       21.16        perf-profile.children.cycles-pp.vm_mmap_pgoff
      0.30 ±  2%      -0.2        0.11 ±  4%  perf-profile.children.cycles-pp.__rb_insert_augmented
      1.78 ±  2%      -0.2        1.63        perf-profile.children.cycles-pp.find_vma
      0.15 ±  2%      -0.1        0.07 ±  6%  perf-profile.children.cycles-pp.__vma_link_file
      0.41 ±  3%      -0.1        0.35        perf-profile.children.cycles-pp.cap_mmap_addr
      0.76            -0.1        0.71        perf-profile.children.cycles-pp.__might_sleep
      0.47            -0.0        0.42        perf-profile.children.cycles-pp.obj_cgroup_charge
      0.43 ±  3%      -0.0        0.39 ±  3%  perf-profile.children.cycles-pp.tlb_gather_mmu
      0.64 ±  2%      -0.0        0.60        perf-profile.children.cycles-pp.security_mmap_addr
      0.23 ±  3%      -0.0        0.20 ±  2%  perf-profile.children.cycles-pp.vmacache_update
      0.25            -0.0        0.22        perf-profile.children.cycles-pp.strlen
      0.45            -0.0        0.42 ±  2%  perf-profile.children.cycles-pp.apparmor_mmap_file
      0.22            -0.0        0.20 ±  2%  perf-profile.children.cycles-pp.cap_capable
      0.23            -0.0        0.22        perf-profile.children.cycles-pp.__x64_sys_mmap
      0.10            +0.0        0.11        perf-profile.children.cycles-pp.vm_area_free
      0.10 ±  4%      +0.0        0.12 ±  3%  perf-profile.children.cycles-pp.__x86_retpoline_rbp
      0.10 ±  5%      +0.0        0.11        perf-profile.children.cycles-pp.get_mmap_base
      0.18 ±  2%      +0.0        0.20        perf-profile.children.cycles-pp.testcase
      0.08 ±  5%      +0.0        0.10        perf-profile.children.cycles-pp.unlink_file_vma
      0.58            +0.0        0.60        perf-profile.children.cycles-pp.tlb_finish_mmu
      0.47 ±  2%      +0.0        0.49        perf-profile.children.cycles-pp.__x86_retpoline_rax
      0.29 ±  2%      +0.0        0.32        perf-profile.children.cycles-pp.__x86_retpoline_r9
      0.47            +0.0        0.50 ±  2%  perf-profile.children.cycles-pp.syscall_enter_from_user_mode
      0.38            +0.0        0.41        perf-profile.children.cycles-pp.downgrade_write
      0.48 ±  3%      +0.0        0.52        perf-profile.children.cycles-pp.vma_merge
      0.22 ±  3%      +0.0        0.26        perf-profile.children.cycles-pp.unlink_anon_vmas
      0.16 ±  5%      +0.0        0.20 ±  2%  perf-profile.children.cycles-pp.blocking_notifier_call_chain
      0.35 ±  2%      +0.0        0.39 ±  2%  perf-profile.children.cycles-pp.percpu_counter_add_batch
      0.40 ±  2%      +0.1        0.46        perf-profile.children.cycles-pp.refill_obj_stock
      0.33 ±  2%      +0.1        0.39        perf-profile.children.cycles-pp.__vm_enough_memory
      2.90            +0.1        2.98        perf-profile.children.cycles-pp.rcu_all_qs
      0.51            +0.1        0.61        perf-profile.children.cycles-pp.free_pgtables
      0.33            +0.1        0.44        perf-profile.children.cycles-pp.cap_mmap_file
      0.92            +0.1        1.03 ±  2%  perf-profile.children.cycles-pp.perf_event_mmap_output
      3.94            +0.2        4.09        perf-profile.children.cycles-pp.perf_event_mmap
      5.32            +0.2        5.57        perf-profile.children.cycles-pp._cond_resched
      1.97            +0.3        2.22        perf-profile.children.cycles-pp.perf_iterate_sb
      1.01            +0.3        1.27        perf-profile.children.cycles-pp.vm_unmapped_area
      2.09            +0.3        2.36        perf-profile.children.cycles-pp.arch_get_unmapped_area_topdown
      9.39            +0.3        9.68        perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      2.83 ±  2%      +0.4        3.19        perf-profile.children.cycles-pp.zap_pte_range
      2.97 ±  6%      +0.4        3.37 ±  3%  perf-profile.children.cycles-pp.remove_vma
      2.29 ±  9%      +0.5        2.75 ±  3%  perf-profile.children.cycles-pp.kmem_cache_free
      6.16            +0.5        6.65        perf-profile.children.cycles-pp.syscall_return_via_sysret
     13.46            -1.2       12.24        perf-profile.self.cycles-pp.unmap_page_range
     14.10            -0.4       13.67        perf-profile.self.cycles-pp.___might_sleep
      1.62            -0.2        1.37        perf-profile.self.cycles-pp.__vma_link_rb
      0.59 ±  4%      -0.2        0.40        perf-profile.self.cycles-pp.get_unmapped_area
      0.86 ±  2%      -0.2        0.67 ±  2%  perf-profile.self.cycles-pp.do_mmap
      0.29 ±  2%      -0.2        0.11 ±  3%  perf-profile.self.cycles-pp.__rb_insert_augmented
      1.04 ±  2%      -0.1        0.94        perf-profile.self.cycles-pp.find_vma
      0.13            -0.1        0.04 ± 57%  perf-profile.self.cycles-pp.__vma_link_file
      0.63            -0.1        0.54 ±  3%  perf-profile.self.cycles-pp.security_mmap_file
      1.63            -0.1        1.56        perf-profile.self.cycles-pp.perf_event_mmap
      0.36 ±  3%      -0.1        0.29 ±  2%  perf-profile.self.cycles-pp.cap_mmap_addr
      0.70            -0.1        0.65        perf-profile.self.cycles-pp.__might_sleep
      0.45            -0.0        0.41        perf-profile.self.cycles-pp.obj_cgroup_charge
      0.42 ±  3%      -0.0        0.38 ±  2%  perf-profile.self.cycles-pp.tlb_gather_mmu
      0.40            -0.0        0.37 ±  2%  perf-profile.self.cycles-pp.apparmor_mmap_file
      0.75            -0.0        0.71        perf-profile.self.cycles-pp.__mmap
      0.21 ±  3%      -0.0        0.18 ±  3%  perf-profile.self.cycles-pp.vmacache_update
      0.23            -0.0        0.20 ±  2%  perf-profile.self.cycles-pp.vma_link
      0.23            -0.0        0.21 ±  2%  perf-profile.self.cycles-pp.strlen
      0.13 ±  3%      -0.0        0.11 ±  3%  perf-profile.self.cycles-pp.security_mmap_addr
      0.08            +0.0        0.09        perf-profile.self.cycles-pp.vm_area_free
      0.09            +0.0        0.10        perf-profile.self.cycles-pp.get_mmap_base
      0.07            +0.0        0.08 ±  5%  perf-profile.self.cycles-pp.unlink_file_vma
      0.09 ±  4%      +0.0        0.10        perf-profile.self.cycles-pp.can_vma_merge_before
      0.11 ±  3%      +0.0        0.13        perf-profile.self.cycles-pp.testcase
      0.22 ±  3%      +0.0        0.25        perf-profile.self.cycles-pp.userfaultfd_unmap_prep
      0.27            +0.0        0.29        perf-profile.self.cycles-pp.__x86_retpoline_r9
      0.43            +0.0        0.46 ±  2%  perf-profile.self.cycles-pp.syscall_enter_from_user_mode
      0.21 ±  3%      +0.0        0.24        perf-profile.self.cycles-pp.unlink_anon_vmas
      0.15 ±  3%      +0.0        0.19 ±  3%  perf-profile.self.cycles-pp.blocking_notifier_call_chain
      0.36            +0.0        0.40        perf-profile.self.cycles-pp.vm_mmap_pgoff
      0.35            +0.0        0.39        perf-profile.self.cycles-pp.tlb_finish_mmu
      0.34 ±  2%      +0.0        0.37 ±  2%  perf-profile.self.cycles-pp.percpu_counter_add_batch
      0.51            +0.0        0.55 ±  2%  perf-profile.self.cycles-pp.__munmap
      0.36            +0.0        0.40        perf-profile.self.cycles-pp.downgrade_write
      2.20            +0.0        2.24        perf-profile.self.cycles-pp.rcu_all_qs
      0.21 ±  3%      +0.1        0.27        perf-profile.self.cycles-pp.free_pgtables
      0.38            +0.1        0.44        perf-profile.self.cycles-pp.refill_obj_stock
      0.62            +0.1        0.69        perf-profile.self.cycles-pp.vm_area_alloc
      0.26            +0.1        0.36        perf-profile.self.cycles-pp.cap_mmap_file
      0.86            +0.1        0.97 ±  2%  perf-profile.self.cycles-pp.perf_event_mmap_output
      0.99            +0.1        1.10 ±  4%  perf-profile.self.cycles-pp.perf_iterate_sb
      2.45            +0.2        2.63        perf-profile.self.cycles-pp._cond_resched
      1.71 ±  8%      +0.2        1.94 ±  5%  perf-profile.self.cycles-pp.kmem_cache_alloc
      0.98            +0.3        1.24        perf-profile.self.cycles-pp.vm_unmapped_area
      8.99            +0.3        9.27        perf-profile.self.cycles-pp.syscall_exit_to_user_mode
      2.04 ±  2%      +0.4        2.39        perf-profile.self.cycles-pp.zap_pte_range
      1.65 ± 12%      +0.4        2.02 ±  4%  perf-profile.self.cycles-pp.kmem_cache_free
      6.13            +0.5        6.63        perf-profile.self.cycles-pp.syscall_return_via_sysret





Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


Thanks,
Rong Chen


View attachment "config-5.9.0-rc6-00341-g70e806e4e64501" of type "text/plain" (170148 bytes)

View attachment "job-script" of type "text/plain" (7548 bytes)

View attachment "job.yaml" of type "text/plain" (5121 bytes)

View attachment "reproduce" of type "text/plain" (338 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ