lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Date:   Sun, 4 Oct 2020 21:27:59 +0800
From:   kernel test robot <rong.a.chen@...el.com>
To:     "Jason A. Donenfeld" <Jason@...c4.com>
Cc:     Linus Torvalds <torvalds@...ux-foundation.org>,
        Chris Wilson <chris@...is-wilson.co.uk>,
        Harald Arnesen <harald@...gtun.org>,
        Jason Gunthorpe <jgg@...dia.com>, Peter Xu <peterx@...hat.com>,
        LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
        lkp@...el.com, ying.huang@...el.com, feng.tang@...el.com,
        zhengjun.xing@...el.com
Subject: [mm] a4d63c3732: will-it-scale.per_process_ops -4.2% regression

Greeting,

FYI, we noticed a -4.2% regression of will-it-scale.per_process_ops due to commit:


commit: a4d63c3732f1a0c91abcf5b7f32b4ef7dcd82025 ("mm: do not rely on mm == current->mm in __get_user_pages_locked")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master


in testcase: will-it-scale
on test machine: 104 threads Skylake with 192G memory
with following parameters:

	nr_task: 100%
	mode: process
	test: mmap2
	cpufreq_governor: performance
	ucode: 0x2006906

test-description: Will It Scale takes a testcase and runs it from 1 through to n parallel copies to see if the testcase will scale. It builds both a process and threads based test in order to see any differences between the two.
test-url: https://github.com/antonblanchard/will-it-scale



If you fix the issue, kindly add following tag
Reported-by: kernel test robot <rong.a.chen@...el.com>


Details are as below:
-------------------------------------------------------------------------------------------------->


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        bin/lkp install job.yaml  # job file is attached in this email
        bin/lkp run     job.yaml

=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
  gcc-9/performance/x86_64-rhel-8.3/process/100%/debian-10.4-x86_64-20200603.cgz/lkp-skl-fpga01/mmap2/will-it-scale/0x2006906

commit: 
  v5.9-rc7
  a4d63c3732 ("mm: do not rely on mm == current->mm in __get_user_pages_locked")

        v5.9-rc7 a4d63c3732f1a0c91abcf5b7f32 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
    220777            -4.2%     211546        will-it-scale.per_process_ops
  22960865            -4.2%   22000902        will-it-scale.workload
  83839135 ± 55%     +55.2%  1.301e+08 ± 26%  cpuidle.C1E.time
    211210 ± 30%     +33.2%     281375 ± 21%  cpuidle.C1E.usage
     18140 ±  2%     +39.2%      25254 ± 23%  sched_debug.cpu.nr_switches.max
     13463 ±  4%     +14.0%      15345 ±  2%  sched_debug.cpu.sched_count.max
     81065            +1.1%      81959        proc-vmstat.nr_anon_pages
     96.80            +3.0%      99.75        proc-vmstat.nr_anon_transparent_hugepages
     83789            +1.1%      84672        proc-vmstat.nr_inactive_anon
     83789            +1.1%      84672        proc-vmstat.nr_zone_inactive_anon
      6517 ±144%     -99.0%      64.50 ± 54%  proc-vmstat.numa_hint_faults
      6473 ± 65%     -99.6%      26.50 ±121%  proc-vmstat.numa_pages_migrated
      6473 ± 65%     -99.6%      26.50 ±121%  proc-vmstat.pgmigrate_success
     48103 ±  6%      +8.8%      52321 ±  3%  numa-meminfo.node0.KReclaimable
     48103 ±  6%      +8.8%      52321 ±  3%  numa-meminfo.node0.SReclaimable
     94453 ±  4%     +18.1%     111554 ±  4%  numa-meminfo.node0.SUnreclaim
    142557           +15.0%     163876 ±  3%  numa-meminfo.node0.Slab
     50072 ±  4%     -10.0%      45056 ±  3%  numa-meminfo.node1.KReclaimable
     50072 ±  4%     -10.0%      45056 ±  3%  numa-meminfo.node1.SReclaimable
     98485 ±  4%     -17.9%      80860 ±  4%  numa-meminfo.node1.SUnreclaim
    148557           -15.2%     125917 ±  3%  numa-meminfo.node1.Slab
     12025 ±  6%      +8.8%      13079 ±  3%  numa-vmstat.node0.nr_slab_reclaimable
     23613 ±  4%     +18.1%      27888 ±  4%  numa-vmstat.node0.nr_slab_unreclaimable
    828938 ±  7%     +17.8%     976901 ±  7%  numa-vmstat.node0.numa_hit
    804550 ±  7%     +18.8%     956028 ±  6%  numa-vmstat.node0.numa_local
     12516 ±  4%     -10.0%      11263 ±  3%  numa-vmstat.node1.nr_slab_reclaimable
     24621 ±  4%     -17.9%      20214 ±  4%  numa-vmstat.node1.nr_slab_unreclaimable
    940544 ±  7%     -16.2%     788395 ±  9%  numa-vmstat.node1.numa_hit
    755086 ±  7%     -20.6%     599240 ± 10%  numa-vmstat.node1.numa_local
 5.792e+10            -4.2%   5.55e+10        perf-stat.i.branch-instructions
      0.48            -0.0        0.48        perf-stat.i.branch-miss-rate%
 2.703e+08            -5.6%  2.552e+08        perf-stat.i.branch-misses
      1.15            +4.5%       1.20        perf-stat.i.cpi
    161.86            -1.2%     159.93        perf-stat.i.cpu-migrations
  45949319            -4.5%   43875433        perf-stat.i.dTLB-load-misses
 6.167e+10            -4.2%  5.906e+10        perf-stat.i.dTLB-loads
     47578 ± 15%      -8.9%      43356        perf-stat.i.dTLB-store-misses
 2.787e+10            -4.2%  2.669e+10        perf-stat.i.dTLB-stores
     82.07 ±  3%     +13.8       95.89 ±  2%  perf-stat.i.iTLB-load-miss-rate%
  28493832           +51.9%   43294073        perf-stat.i.iTLB-load-misses
   6078707 ± 18%     -71.6%    1725940 ± 52%  perf-stat.i.iTLB-loads
 2.416e+11            -4.2%  2.314e+11        perf-stat.i.instructions
      8812           -37.2%       5534        perf-stat.i.instructions-per-iTLB-miss
      0.87            -4.3%       0.83        perf-stat.i.ipc
      1418            -4.2%       1358        perf-stat.i.metric.M/sec
      0.47            -0.0        0.46        perf-stat.overall.branch-miss-rate%
      1.15            +4.6%       1.20        perf-stat.overall.cpi
     82.47 ±  3%     +13.7       96.20 ±  2%  perf-stat.overall.iTLB-load-miss-rate%
      8484           -37.0%       5347        perf-stat.overall.instructions-per-iTLB-miss
      0.87            -4.4%       0.83        perf-stat.overall.ipc
 5.773e+10            -4.2%  5.531e+10        perf-stat.ps.branch-instructions
 2.693e+08            -5.6%  2.544e+08        perf-stat.ps.branch-misses
    161.32            -1.2%     159.42        perf-stat.ps.cpu-migrations
  45785331            -4.5%   43716262        perf-stat.ps.dTLB-load-misses
 6.146e+10            -4.2%  5.886e+10        perf-stat.ps.dTLB-loads
     47549 ± 15%      -8.9%      43311        perf-stat.ps.dTLB-store-misses
 2.778e+10            -4.3%   2.66e+10        perf-stat.ps.dTLB-stores
  28376216           +52.0%   43133756        perf-stat.ps.iTLB-load-misses
   6071510 ± 18%     -71.6%    1726775 ± 52%  perf-stat.ps.iTLB-loads
 2.407e+11            -4.2%  2.306e+11        perf-stat.ps.instructions
 7.276e+13            -4.2%  6.968e+13        perf-stat.total.instructions
     89619            +2.5%      91832        interrupts.CAL:Function_call_interrupts
    311.60           +22.6%     382.00 ± 22%  interrupts.CPU17.RES:Rescheduling_interrupts
      6803 ±  6%      +5.3%       7164 ±  5%  interrupts.CPU19.NMI:Non-maskable_interrupts
      6803 ±  6%      +5.3%       7164 ±  5%  interrupts.CPU19.PMI:Performance_monitoring_interrupts
      6070 ± 20%     +18.0%       7163 ±  5%  interrupts.CPU2.NMI:Non-maskable_interrupts
      6070 ± 20%     +18.0%       7163 ±  5%  interrupts.CPU2.PMI:Performance_monitoring_interrupts
      6802 ±  6%      +5.3%       7164 ±  5%  interrupts.CPU20.NMI:Non-maskable_interrupts
      6802 ±  6%      +5.3%       7164 ±  5%  interrupts.CPU20.PMI:Performance_monitoring_interrupts
      6801 ±  6%      +5.2%       7158 ±  5%  interrupts.CPU24.NMI:Non-maskable_interrupts
      6801 ±  6%      +5.2%       7158 ±  5%  interrupts.CPU24.PMI:Performance_monitoring_interrupts
      7216 ±  6%      +4.9%       7572 ±  5%  interrupts.CPU26.NMI:Non-maskable_interrupts
      7216 ±  6%      +4.9%       7572 ±  5%  interrupts.CPU26.PMI:Performance_monitoring_interrupts
      7216 ±  6%      +4.9%       7573 ±  5%  interrupts.CPU27.NMI:Non-maskable_interrupts
      7216 ±  6%      +4.9%       7573 ±  5%  interrupts.CPU27.PMI:Performance_monitoring_interrupts
      7217 ±  6%      +4.9%       7573 ±  5%  interrupts.CPU28.NMI:Non-maskable_interrupts
      7217 ±  6%      +4.9%       7573 ±  5%  interrupts.CPU28.PMI:Performance_monitoring_interrupts
      7218 ±  6%      +4.9%       7573 ±  5%  interrupts.CPU29.NMI:Non-maskable_interrupts
      7218 ±  6%      +4.9%       7573 ±  5%  interrupts.CPU29.PMI:Performance_monitoring_interrupts
      6071 ± 20%     +18.0%       7163 ±  5%  interrupts.CPU3.NMI:Non-maskable_interrupts
      6071 ± 20%     +18.0%       7163 ±  5%  interrupts.CPU3.PMI:Performance_monitoring_interrupts
      7219 ±  6%      +4.9%       7573 ±  5%  interrupts.CPU30.NMI:Non-maskable_interrupts
      7219 ±  6%      +4.9%       7573 ±  5%  interrupts.CPU30.PMI:Performance_monitoring_interrupts
      7218 ±  6%      +4.9%       7574 ±  5%  interrupts.CPU31.NMI:Non-maskable_interrupts
      7218 ±  6%      +4.9%       7574 ±  5%  interrupts.CPU31.PMI:Performance_monitoring_interrupts
      7217 ±  6%      +4.9%       7573 ±  5%  interrupts.CPU37.NMI:Non-maskable_interrupts
      7217 ±  6%      +4.9%       7573 ±  5%  interrupts.CPU37.PMI:Performance_monitoring_interrupts
      6071 ± 20%     +18.0%       7163 ±  5%  interrupts.CPU4.NMI:Non-maskable_interrupts
      6071 ± 20%     +18.0%       7163 ±  5%  interrupts.CPU4.PMI:Performance_monitoring_interrupts
    308.60           +35.9%     419.25 ± 31%  interrupts.CPU40.RES:Rescheduling_interrupts
      6436 ± 20%     +17.7%       7573 ±  5%  interrupts.CPU46.NMI:Non-maskable_interrupts
      6436 ± 20%     +17.7%       7573 ±  5%  interrupts.CPU46.PMI:Performance_monitoring_interrupts
      6437 ± 20%     +17.7%       7573 ±  5%  interrupts.CPU47.NMI:Non-maskable_interrupts
      6437 ± 20%     +17.7%       7573 ±  5%  interrupts.CPU47.PMI:Performance_monitoring_interrupts
      6436 ± 20%     +17.7%       7573 ±  5%  interrupts.CPU48.NMI:Non-maskable_interrupts
      6436 ± 20%     +17.7%       7573 ±  5%  interrupts.CPU48.PMI:Performance_monitoring_interrupts
      6436 ± 20%     +17.7%       7574 ±  5%  interrupts.CPU49.NMI:Non-maskable_interrupts
      6436 ± 20%     +17.7%       7574 ±  5%  interrupts.CPU49.PMI:Performance_monitoring_interrupts
      6071 ± 20%     +18.0%       7164 ±  5%  interrupts.CPU5.NMI:Non-maskable_interrupts
      6071 ± 20%     +18.0%       7164 ±  5%  interrupts.CPU5.PMI:Performance_monitoring_interrupts
      6436 ± 20%     +17.7%       7574 ±  5%  interrupts.CPU50.NMI:Non-maskable_interrupts
      6436 ± 20%     +17.7%       7574 ±  5%  interrupts.CPU50.PMI:Performance_monitoring_interrupts
      6435 ± 20%     +17.7%       7574 ±  5%  interrupts.CPU51.NMI:Non-maskable_interrupts
      6435 ± 20%     +17.7%       7574 ±  5%  interrupts.CPU51.PMI:Performance_monitoring_interrupts
      6052 ± 20%     +18.0%       7139 ±  5%  interrupts.CPU52.NMI:Non-maskable_interrupts
      6052 ± 20%     +18.0%       7139 ±  5%  interrupts.CPU52.PMI:Performance_monitoring_interrupts
      6790 ±  6%      +5.2%       7142 ±  5%  interrupts.CPU54.NMI:Non-maskable_interrupts
      6790 ±  6%      +5.2%       7142 ±  5%  interrupts.CPU54.PMI:Performance_monitoring_interrupts
    965.80 ± 62%     +59.0%       1535 ± 48%  interrupts.CPU54.RES:Rescheduling_interrupts
      6057 ± 20%     +17.9%       7142 ±  5%  interrupts.CPU55.NMI:Non-maskable_interrupts
      6057 ± 20%     +17.9%       7142 ±  5%  interrupts.CPU55.PMI:Performance_monitoring_interrupts
    566.20 ± 24%    +100.8%       1136 ± 50%  interrupts.CPU58.CAL:Function_call_interrupts
      6056 ± 20%     +17.9%       7138 ±  5%  interrupts.CPU58.NMI:Non-maskable_interrupts
      6056 ± 20%     +17.9%       7138 ±  5%  interrupts.CPU58.PMI:Performance_monitoring_interrupts
      6055 ± 20%     +17.9%       7139 ±  5%  interrupts.CPU59.NMI:Non-maskable_interrupts
      6055 ± 20%     +17.9%       7139 ±  5%  interrupts.CPU59.PMI:Performance_monitoring_interrupts
      6789 ±  6%      +5.1%       7138 ±  5%  interrupts.CPU60.NMI:Non-maskable_interrupts
      6789 ±  6%      +5.1%       7138 ±  5%  interrupts.CPU60.PMI:Performance_monitoring_interrupts
      6786 ±  6%      +5.2%       7138 ±  5%  interrupts.CPU61.NMI:Non-maskable_interrupts
      6786 ±  6%      +5.2%       7138 ±  5%  interrupts.CPU61.PMI:Performance_monitoring_interrupts
      6787 ±  6%      +5.2%       7139 ±  5%  interrupts.CPU63.NMI:Non-maskable_interrupts
      6787 ±  6%      +5.2%       7139 ±  5%  interrupts.CPU63.PMI:Performance_monitoring_interrupts
      6053 ± 20%     +17.9%       7136 ±  5%  interrupts.CPU65.NMI:Non-maskable_interrupts
      6053 ± 20%     +17.9%       7136 ±  5%  interrupts.CPU65.PMI:Performance_monitoring_interrupts
    314.40 ±  3%      +6.1%     333.50 ±  5%  interrupts.CPU67.RES:Rescheduling_interrupts
      5316 ± 25%     +34.2%       7133 ±  5%  interrupts.CPU76.NMI:Non-maskable_interrupts
      5316 ± 25%     +34.2%       7133 ±  5%  interrupts.CPU76.PMI:Performance_monitoring_interrupts
      6434 ± 20%     +17.6%       7570 ±  5%  interrupts.CPU78.NMI:Non-maskable_interrupts
      6434 ± 20%     +17.6%       7570 ±  5%  interrupts.CPU78.PMI:Performance_monitoring_interrupts
      6433 ± 20%     +17.7%       7570 ±  5%  interrupts.CPU79.NMI:Non-maskable_interrupts
      6433 ± 20%     +17.7%       7570 ±  5%  interrupts.CPU79.PMI:Performance_monitoring_interrupts
    453.80          +142.8%       1102 ± 58%  interrupts.CPU8.CAL:Function_call_interrupts
      6435 ± 20%     +17.6%       7570 ±  5%  interrupts.CPU80.NMI:Non-maskable_interrupts
      6435 ± 20%     +17.6%       7570 ±  5%  interrupts.CPU80.PMI:Performance_monitoring_interrupts
      6436 ± 20%     +17.6%       7571 ±  5%  interrupts.CPU81.NMI:Non-maskable_interrupts
      6436 ± 20%     +17.6%       7571 ±  5%  interrupts.CPU81.PMI:Performance_monitoring_interrupts
      6436 ± 20%     +17.6%       7571 ±  5%  interrupts.CPU82.NMI:Non-maskable_interrupts
      6436 ± 20%     +17.6%       7571 ±  5%  interrupts.CPU82.PMI:Performance_monitoring_interrupts
      7217 ±  6%      +4.9%       7573 ±  5%  interrupts.CPU88.NMI:Non-maskable_interrupts
      7217 ±  6%      +4.9%       7573 ±  5%  interrupts.CPU88.PMI:Performance_monitoring_interrupts
      7217 ±  6%      +4.9%       7571 ±  5%  interrupts.CPU89.NMI:Non-maskable_interrupts
      7217 ±  6%      +4.9%       7571 ±  5%  interrupts.CPU89.PMI:Performance_monitoring_interrupts
      7218 ±  6%      +4.9%       7571 ±  5%  interrupts.CPU90.NMI:Non-maskable_interrupts
      7218 ±  6%      +4.9%       7571 ±  5%  interrupts.CPU90.PMI:Performance_monitoring_interrupts
      7218 ±  6%      +4.9%       7571 ±  5%  interrupts.CPU91.NMI:Non-maskable_interrupts
      7218 ±  6%      +4.9%       7571 ±  5%  interrupts.CPU91.PMI:Performance_monitoring_interrupts
      7219 ±  6%      +4.9%       7571 ±  5%  interrupts.CPU92.NMI:Non-maskable_interrupts
      7219 ±  6%      +4.9%       7571 ±  5%  interrupts.CPU92.PMI:Performance_monitoring_interrupts
      7218 ±  6%      +4.9%       7571 ±  5%  interrupts.CPU93.NMI:Non-maskable_interrupts
      7218 ±  6%      +4.9%       7571 ±  5%  interrupts.CPU93.PMI:Performance_monitoring_interrupts
     25.63            -0.9       24.69        perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
     20.44            -0.8       19.59        perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
     30.38            -0.8       29.63        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
      4.49            -0.7        3.75        perf-profile.calltrace.cycles-pp._cond_resched.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
     29.85            -0.7       29.14        perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
     28.79            -0.7       28.09        perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
     34.89            -0.6       34.25        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__mmap
     41.44            -0.6       40.82        perf-profile.calltrace.cycles-pp.__mmap
      3.77 ±  4%      -0.5        3.26 ±  3%  perf-profile.calltrace.cycles-pp.vm_area_alloc.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      3.13 ±  5%      -0.4        2.68 ±  3%  perf-profile.calltrace.cycles-pp.kmem_cache_alloc.vm_area_alloc.mmap_region.do_mmap.vm_mmap_pgoff
      0.53 ±  3%      -0.3        0.25 ±100%  perf-profile.calltrace.cycles-pp.cap_vm_enough_memory.security_vm_enough_memory_mm.mmap_region.do_mmap.vm_mmap_pgoff
      2.64            -0.2        2.41        perf-profile.calltrace.cycles-pp.vma_link.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      1.26            -0.2        1.07        perf-profile.calltrace.cycles-pp.__vma_link_rb.vma_link.mmap_region.do_mmap.vm_mmap_pgoff
      2.36            -0.2        2.18        perf-profile.calltrace.cycles-pp.rcu_all_qs._cond_resched.unmap_page_range.unmap_vmas.unmap_region
      3.37            -0.2        3.20        perf-profile.calltrace.cycles-pp.d_path.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
      0.55 ±  3%      -0.2        0.38 ± 57%  perf-profile.calltrace.cycles-pp.down_write.vma_link.mmap_region.do_mmap.vm_mmap_pgoff
      2.30            -0.2        2.14        perf-profile.calltrace.cycles-pp.zap_pte_range.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
      1.57            -0.2        1.42        perf-profile.calltrace.cycles-pp.free_pgtables.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
      3.03            -0.1        2.89        perf-profile.calltrace.cycles-pp.shmem_get_unmapped_area.get_unmapped_area.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      1.75            -0.1        1.61        perf-profile.calltrace.cycles-pp.prepend_path.d_path.perf_event_mmap.mmap_region.do_mmap
      4.31            -0.1        4.17        perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.__munmap
      1.21            -0.1        1.07        perf-profile.calltrace.cycles-pp.shmem_mmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      0.84            -0.1        0.74        perf-profile.calltrace.cycles-pp.touch_atime.shmem_mmap.mmap_region.do_mmap.vm_mmap_pgoff
      3.08            -0.1        2.99        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__mmap
      1.39            -0.1        1.30        perf-profile.calltrace.cycles-pp.find_vma.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
      0.62            -0.1        0.53        perf-profile.calltrace.cycles-pp.atime_needs_update.touch_atime.shmem_mmap.mmap_region.do_mmap
      0.98            -0.1        0.89        perf-profile.calltrace.cycles-pp.unlink_file_vma.free_pgtables.unmap_region.__do_munmap.__vm_munmap
      3.88            -0.1        3.79        perf-profile.calltrace.cycles-pp.get_unmapped_area.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
      0.99            -0.1        0.92        perf-profile.calltrace.cycles-pp.kmem_cache_alloc_trace.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
      0.73 ±  2%      -0.1        0.68        perf-profile.calltrace.cycles-pp.kfree.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
      2.97            -0.1        2.92        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__munmap
      0.90 ±  3%      -0.1        0.85 ±  2%  perf-profile.calltrace.cycles-pp.strlen.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
      0.57            -0.0        0.52 ±  2%  perf-profile.calltrace.cycles-pp.down_write.unlink_file_vma.free_pgtables.unmap_region.__do_munmap
      1.06            -0.0        1.01        perf-profile.calltrace.cycles-pp.vm_unmapped_area.arch_get_unmapped_area_topdown.shmem_get_unmapped_area.get_unmapped_area.do_mmap
      0.63            -0.0        0.59        perf-profile.calltrace.cycles-pp.common_file_perm.security_mmap_file.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
      1.09            -0.0        1.06        perf-profile.calltrace.cycles-pp.prepend.d_path.perf_event_mmap.mmap_region.do_mmap
      0.72 ±  2%      +0.0        0.76        perf-profile.calltrace.cycles-pp.security_vm_enough_memory_mm.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      2.71            +0.1        2.78        perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.__munmap
      9.31            +0.1        9.39        perf-profile.calltrace.cycles-pp.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      2.73            +0.1        2.83        perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.__mmap
      4.27            +0.1        4.42        perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.__mmap
      0.77            +0.2        0.95 ±  2%  perf-profile.calltrace.cycles-pp.perf_event_mmap_output.perf_iterate_sb.perf_event_mmap.mmap_region.do_mmap
      1.85            +0.2        2.03        perf-profile.calltrace.cycles-pp.security_mmap_file.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.83            +0.5        2.33        perf-profile.calltrace.cycles-pp.perf_iterate_sb.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
     52.13            +0.6       52.74        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__munmap
     58.31            +0.6       58.94        perf-profile.calltrace.cycles-pp.__munmap
     45.74            +0.7       46.43        perf-profile.calltrace.cycles-pp.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
     46.83            +0.7       47.54        perf-profile.calltrace.cycles-pp.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
     47.29            +0.7       48.01        perf-profile.calltrace.cycles-pp.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
     47.59            +0.7       48.34        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
     38.38            +1.0       39.38        perf-profile.calltrace.cycles-pp.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
      6.69            +1.2        7.91        perf-profile.calltrace.cycles-pp.free_pgd_range.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
     25.68            -0.9       24.73        perf-profile.children.cycles-pp.do_mmap
     20.58            -0.9       19.71        perf-profile.children.cycles-pp.mmap_region
     29.90            -0.7       29.19        perf-profile.children.cycles-pp.ksys_mmap_pgoff
     28.83            -0.7       28.13        perf-profile.children.cycles-pp.vm_mmap_pgoff
     41.80            -0.6       41.15        perf-profile.children.cycles-pp.__mmap
      3.77 ±  4%      -0.5        3.26 ±  3%  perf-profile.children.cycles-pp.vm_area_alloc
      3.21 ±  5%      -0.4        2.76 ±  3%  perf-profile.children.cycles-pp.kmem_cache_alloc
      4.97            -0.3        4.65        perf-profile.children.cycles-pp._cond_resched
      2.68            -0.2        2.46        perf-profile.children.cycles-pp.vma_link
     14.27            -0.2       14.08        perf-profile.children.cycles-pp.___might_sleep
      1.26            -0.2        1.08        perf-profile.children.cycles-pp.__vma_link_rb
      3.39            -0.2        3.22        perf-profile.children.cycles-pp.d_path
      2.33            -0.2        2.17        perf-profile.children.cycles-pp.zap_pte_range
      1.60            -0.2        1.45        perf-profile.children.cycles-pp.free_pgtables
      3.04            -0.1        2.89        perf-profile.children.cycles-pp.shmem_get_unmapped_area
      6.07            -0.1        5.92        perf-profile.children.cycles-pp.entry_SYSCALL_64
      1.78            -0.1        1.64        perf-profile.children.cycles-pp.prepend_path
      1.23            -0.1        1.10        perf-profile.children.cycles-pp.shmem_mmap
      2.71            -0.1        2.59        perf-profile.children.cycles-pp.rcu_all_qs
      1.47            -0.1        1.38        perf-profile.children.cycles-pp.find_vma
      0.41            -0.1        0.32        perf-profile.children.cycles-pp.vma_set_page_prot
      0.84            -0.1        0.74        perf-profile.children.cycles-pp.touch_atime
      1.16 ±  2%      -0.1        1.07 ±  2%  perf-profile.children.cycles-pp.down_write
      3.91            -0.1        3.82        perf-profile.children.cycles-pp.get_unmapped_area
      0.99            -0.1        0.90        perf-profile.children.cycles-pp.unlink_file_vma
      0.63            -0.1        0.54        perf-profile.children.cycles-pp.atime_needs_update
      1.05            -0.1        0.97        perf-profile.children.cycles-pp.kmem_cache_alloc_trace
      0.36            -0.1        0.29 ±  2%  perf-profile.children.cycles-pp.apparmor_mmap_file
      0.21 ±  3%      -0.1        0.15 ± 10%  perf-profile.children.cycles-pp.uprobe_mmap
      0.29 ±  2%      -0.1        0.23        perf-profile.children.cycles-pp.vma_merge
      0.74 ±  2%      -0.1        0.69        perf-profile.children.cycles-pp.kfree
      0.36            -0.1        0.31 ±  8%  perf-profile.children.cycles-pp.current_time
      0.52 ±  2%      -0.1        0.47 ±  5%  perf-profile.children.cycles-pp.security_mmap_addr
      0.31            -0.1        0.26        perf-profile.children.cycles-pp.sync_mm_rss
      0.90 ±  3%      -0.1        0.85 ±  2%  perf-profile.children.cycles-pp.strlen
      0.97            -0.0        0.92        perf-profile.children.cycles-pp.__might_sleep
      0.33 ±  2%      -0.0        0.29 ±  3%  perf-profile.children.cycles-pp.__vm_enough_memory
      1.07            -0.0        1.02        perf-profile.children.cycles-pp.vm_unmapped_area
      0.39            -0.0        0.35 ±  2%  perf-profile.children.cycles-pp.lru_add_drain
      0.35            -0.0        0.30 ±  3%  perf-profile.children.cycles-pp.cap_mmap_addr
      0.64            -0.0        0.60        perf-profile.children.cycles-pp.common_file_perm
      1.12            -0.0        1.08        perf-profile.children.cycles-pp.prepend
      0.31 ±  3%      -0.0        0.27 ±  5%  perf-profile.children.cycles-pp.percpu_counter_add_batch
      0.18 ±  2%      -0.0        0.15 ±  5%  perf-profile.children.cycles-pp.vm_pgprot_modify
      0.38            -0.0        0.34        perf-profile.children.cycles-pp.obj_cgroup_charge
      0.74            -0.0        0.71        perf-profile.children.cycles-pp.up_write
      0.26 ±  6%      -0.0        0.23 ±  8%  perf-profile.children.cycles-pp.path_noexec
      1.01            -0.0        0.98        perf-profile.children.cycles-pp.memcpy_erms
      0.37            -0.0        0.34        perf-profile.children.cycles-pp.downgrade_write
      0.31 ±  2%      -0.0        0.28 ±  2%  perf-profile.children.cycles-pp.lru_add_drain_cpu
      0.18 ±  5%      -0.0        0.15        perf-profile.children.cycles-pp.cap_capable
      0.19 ±  3%      -0.0        0.17 ±  3%  perf-profile.children.cycles-pp.__x64_sys_mmap
      0.32            -0.0        0.29        perf-profile.children.cycles-pp.tlb_gather_mmu
      0.54 ±  2%      -0.0        0.52        perf-profile.children.cycles-pp.cap_vm_enough_memory
      0.24 ±  2%      -0.0        0.21 ±  2%  perf-profile.children.cycles-pp.vma_interval_tree_remove
      0.22 ±  4%      -0.0        0.20 ±  2%  perf-profile.children.cycles-pp.unlink_anon_vmas
      1.04            -0.0        1.02        perf-profile.children.cycles-pp.down_write_killable
      0.45 ±  2%      -0.0        0.43        perf-profile.children.cycles-pp.vmacache_find
      0.14 ±  2%      -0.0        0.12 ±  5%  perf-profile.children.cycles-pp.timestamp_truncate
      0.08 ±  5%      -0.0        0.06 ±  6%  perf-profile.children.cycles-pp.munmap@plt
      0.43 ±  2%      +0.0        0.45        perf-profile.children.cycles-pp.syscall_enter_from_user_mode
      0.18 ±  3%      +0.0        0.20        perf-profile.children.cycles-pp.tlb_flush_mmu
      0.10 ±  6%      +0.0        0.12 ±  8%  perf-profile.children.cycles-pp.aa_file_perm
      0.43 ±  3%      +0.0        0.46 ±  2%  perf-profile.children.cycles-pp.fput_many
      0.05 ±  7%      +0.0        0.09 ±  5%  perf-profile.children.cycles-pp.__x86_indirect_thunk_r9
      0.29            +0.0        0.33        perf-profile.children.cycles-pp.cap_mmap_file
      0.19 ±  3%      +0.0        0.23 ±  2%  perf-profile.children.cycles-pp.userfaultfd_unmap_complete
      0.73 ±  2%      +0.0        0.78        perf-profile.children.cycles-pp.security_vm_enough_memory_mm
      0.08            +0.1        0.13 ±  5%  perf-profile.children.cycles-pp.get_align_mask
      0.16 ±  2%      +0.1        0.23 ±  3%  perf-profile.children.cycles-pp.blocking_notifier_call_chain
      0.08 ±  5%      +0.1        0.15 ±  5%  perf-profile.children.cycles-pp.__x86_retpoline_rbp
      9.37            +0.1        9.45        perf-profile.children.cycles-pp.perf_event_mmap
      0.38 ±  7%      +0.1        0.46 ±  4%  perf-profile.children.cycles-pp.exit_to_user_mode_prepare
      0.42            +0.1        0.52 ±  2%  perf-profile.children.cycles-pp.refill_obj_stock
      6.10            +0.1        6.24        perf-profile.children.cycles-pp.syscall_return_via_sysret
      0.33 ±  3%      +0.2        0.49        perf-profile.children.cycles-pp.__x86_retpoline_r9
      1.88            +0.2        2.05        perf-profile.children.cycles-pp.security_mmap_file
      0.79            +0.2        0.96 ±  2%  perf-profile.children.cycles-pp.perf_event_mmap_output
      1.85            +0.5        2.35        perf-profile.children.cycles-pp.perf_iterate_sb
     58.74            +0.6       59.34        perf-profile.children.cycles-pp.__munmap
     45.84            +0.7       46.52        perf-profile.children.cycles-pp.__do_munmap
     46.87            +0.7       47.58        perf-profile.children.cycles-pp.__vm_munmap
     47.31            +0.7       48.03        perf-profile.children.cycles-pp.__x64_sys_munmap
     38.46            +1.0       39.45        perf-profile.children.cycles-pp.unmap_region
      6.70            +1.2        7.93        perf-profile.children.cycles-pp.free_pgd_range
     12.02            -0.3       11.68        perf-profile.self.cycles-pp.___might_sleep
      2.32            -0.2        2.09        perf-profile.self.cycles-pp._cond_resched
      1.25            -0.2        1.07        perf-profile.self.cycles-pp.__vma_link_rb
      0.90 ±  3%      -0.1        0.78 ±  3%  perf-profile.self.cycles-pp.shmem_get_unmapped_area
      0.84 ±  2%      -0.1        0.72 ±  6%  perf-profile.self.cycles-pp.prepend_path
      1.65            -0.1        1.54        perf-profile.self.cycles-pp.zap_pte_range
      1.29            -0.1        1.19        perf-profile.self.cycles-pp.__do_munmap
      5.36            -0.1        5.26        perf-profile.self.cycles-pp.entry_SYSCALL_64
      0.47 ± 11%      -0.1        0.38 ±  2%  perf-profile.self.cycles-pp.get_obj_cgroup_from_current
      1.45            -0.1        1.37        perf-profile.self.cycles-pp.perf_event_mmap
      0.32            -0.1        0.25        perf-profile.self.cycles-pp.apparmor_mmap_file
      0.28 ±  2%      -0.1        0.21        perf-profile.self.cycles-pp.vma_merge
      0.20 ±  3%      -0.1        0.14 ± 11%  perf-profile.self.cycles-pp.uprobe_mmap
      0.54            -0.1        0.48        perf-profile.self.cycles-pp.common_file_perm
      0.89            -0.1        0.82        perf-profile.self.cycles-pp.find_vma
      2.04            -0.1        1.98        perf-profile.self.cycles-pp.rcu_all_qs
      0.73 ±  2%      -0.1        0.67        perf-profile.self.cycles-pp.kfree
      0.53            -0.1        0.48        perf-profile.self.cycles-pp.vm_area_alloc
      0.89 ±  3%      -0.1        0.84 ±  2%  perf-profile.self.cycles-pp.strlen
      0.61            -0.0        0.56        perf-profile.self.cycles-pp.kmem_cache_alloc_trace
      0.30            -0.0        0.26        perf-profile.self.cycles-pp.sync_mm_rss
      0.89            -0.0        0.84        perf-profile.self.cycles-pp.__might_sleep
      0.25            -0.0        0.20 ±  3%  perf-profile.self.cycles-pp.__x64_sys_munmap
      0.52 ±  3%      -0.0        0.47 ±  3%  perf-profile.self.cycles-pp.down_write
      1.06            -0.0        1.01        perf-profile.self.cycles-pp.vm_unmapped_area
      0.15 ±  2%      -0.0        0.11 ±  4%  perf-profile.self.cycles-pp.vma_set_page_prot
      0.30 ±  2%      -0.0        0.26        perf-profile.self.cycles-pp.cap_mmap_addr
      0.29 ±  3%      -0.0        0.25 ±  4%  perf-profile.self.cycles-pp.percpu_counter_add_batch
      0.17 ±  2%      -0.0        0.14 ±  6%  perf-profile.self.cycles-pp.vm_pgprot_modify
      0.36            -0.0        0.33        perf-profile.self.cycles-pp.obj_cgroup_charge
      0.72            -0.0        0.69        perf-profile.self.cycles-pp.up_write
      0.30            -0.0        0.26 ±  3%  perf-profile.self.cycles-pp.unmap_region
      0.17 ±  5%      -0.0        0.14 ±  3%  perf-profile.self.cycles-pp.cap_capable
      0.30 ±  2%      -0.0        0.27        perf-profile.self.cycles-pp.lru_add_drain_cpu
      0.35            -0.0        0.33 ±  2%  perf-profile.self.cycles-pp.downgrade_write
      0.44            -0.0        0.42        perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.46 ±  2%      -0.0        0.44        perf-profile.self.cycles-pp.down_write_killable
      0.39            -0.0        0.36 ±  2%  perf-profile.self.cycles-pp.shmem_mmap
      0.31 ±  2%      -0.0        0.29 ±  3%  perf-profile.self.cycles-pp.tlb_gather_mmu
      0.19 ±  2%      -0.0        0.17 ±  4%  perf-profile.self.cycles-pp.vma_interval_tree_remove
      0.18 ±  2%      -0.0        0.15 ±  5%  perf-profile.self.cycles-pp.__x64_sys_mmap
      0.14            -0.0        0.12 ± 13%  perf-profile.self.cycles-pp.current_time
      0.20 ±  2%      -0.0        0.18 ±  4%  perf-profile.self.cycles-pp.unlink_anon_vmas
      0.13 ±  3%      -0.0        0.11 ±  6%  perf-profile.self.cycles-pp.timestamp_truncate
      0.08            -0.0        0.07 ± 13%  perf-profile.self.cycles-pp.ktime_get_coarse_real_ts64
      0.13 ±  3%      -0.0        0.12 ±  3%  perf-profile.self.cycles-pp.unlink_file_vma
      0.08            -0.0        0.07 ±  6%  perf-profile.self.cycles-pp.lru_add_drain
      0.09            -0.0        0.08        perf-profile.self.cycles-pp.__vma_link_file
      0.33 ±  2%      +0.0        0.35 ±  2%  perf-profile.self.cycles-pp.vm_mmap_pgoff
      0.12 ±  3%      +0.0        0.14        perf-profile.self.cycles-pp.tlb_flush_mmu
      0.39 ±  2%      +0.0        0.41 ±  2%  perf-profile.self.cycles-pp.syscall_enter_from_user_mode
      0.09 ±  4%      +0.0        0.11 ±  9%  perf-profile.self.cycles-pp.aa_file_perm
      0.24 ±  2%      +0.0        0.27        perf-profile.self.cycles-pp.cap_mmap_file
      0.21 ±  2%      +0.0        0.26        perf-profile.self.cycles-pp.vma_link
      0.06            +0.0        0.10 ±  4%  perf-profile.self.cycles-pp.get_align_mask
      0.96            +0.0        1.01        perf-profile.self.cycles-pp.do_mmap
      0.17 ±  3%      +0.0        0.22        perf-profile.self.cycles-pp.userfaultfd_unmap_complete
      0.12 ±  3%      +0.1        0.18 ±  2%  perf-profile.self.cycles-pp.security_vm_enough_memory_mm
      0.00            +0.1        0.06 ±  6%  perf-profile.self.cycles-pp.__x86_indirect_thunk_r9
      0.15 ±  3%      +0.1        0.22        perf-profile.self.cycles-pp.blocking_notifier_call_chain
      0.06 ±  6%      +0.1        0.13 ±  5%  perf-profile.self.cycles-pp.__x86_retpoline_rbp
      0.36 ±  2%      +0.1        0.45        perf-profile.self.cycles-pp.security_mmap_file
      0.36 ±  8%      +0.1        0.44 ±  5%  perf-profile.self.cycles-pp.exit_to_user_mode_prepare
      0.31 ±  3%      +0.1        0.40 ±  2%  perf-profile.self.cycles-pp.get_unmapped_area
      0.40            +0.1        0.50 ±  2%  perf-profile.self.cycles-pp.refill_obj_stock
      0.30 ±  2%      +0.1        0.43        perf-profile.self.cycles-pp.__x86_retpoline_r9
      6.09            +0.1        6.22        perf-profile.self.cycles-pp.syscall_return_via_sysret
      0.75            +0.1        0.89        perf-profile.self.cycles-pp.perf_event_mmap_output
      1.40            +0.1        1.54        perf-profile.self.cycles-pp.mmap_region
      1.01 ±  2%      +0.3        1.28        perf-profile.self.cycles-pp.perf_iterate_sb
     10.66            +0.8       11.42        perf-profile.self.cycles-pp.unmap_page_range
      6.66            +1.2        7.88        perf-profile.self.cycles-pp.free_pgd_range


                                                                                
                            will-it-scale.per_process_ops                       
                                                                                
  224000 +------------------------------------------------------------------+   
         |                                                                  |   
  222000 |-+   .+.....                                                      |   
         | ....       +......+......                                        |   
  220000 |.+                        +                                       |   
         |                                                                  |   
  218000 |-+                                                                |   
         |                                                                  |   
  216000 |-+                                                                |   
         |                                                                  |   
  214000 |-+                                                                |   
         |                                                                  |   
  212000 |-+          O      O      O      O     O      O                   |   
         |      O                                              O     O      |   
  210000 +------------------------------------------------------------------+   
                                                                                
                                                                                                                                                                
                                will-it-scale.workload                          
                                                                                
  2.32e+07 +----------------------------------------------------------------+   
           |     .+....                                                     |   
   2.3e+07 |-....      .    ...+.....                                       |   
           |.           +...         +                                      |   
  2.28e+07 |-+                                                              |   
           |                                                                |   
  2.26e+07 |-+                                                              |   
           |                                                                |   
  2.24e+07 |-+                                                              |   
           |                                                                |   
  2.22e+07 |-+                                                              |   
           |            O            O                   O                  |   
   2.2e+07 |-+    O            O            O     O                         |   
           |                                                   O      O     |   
  2.18e+07 +----------------------------------------------------------------+   
                                                                                
                                                                                
[*] bisect-good sample
[O] bisect-bad  sample



Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


Thanks,
Rong Chen


View attachment "config-5.9.0-rc7-00001-ga4d63c3732f1a0" of type "text/plain" (170148 bytes)

View attachment "job-script" of type "text/plain" (7573 bytes)

View attachment "job.yaml" of type "text/plain" (5118 bytes)

View attachment "reproduce" of type "text/plain" (338 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ