[<prev] [next>] [day] [month] [year] [list]
Message-ID: <20201004132759.GT393@shao2-debian>
Date: Sun, 4 Oct 2020 21:27:59 +0800
From: kernel test robot <rong.a.chen@...el.com>
To: "Jason A. Donenfeld" <Jason@...c4.com>
Cc: Linus Torvalds <torvalds@...ux-foundation.org>,
Chris Wilson <chris@...is-wilson.co.uk>,
Harald Arnesen <harald@...gtun.org>,
Jason Gunthorpe <jgg@...dia.com>, Peter Xu <peterx@...hat.com>,
LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
lkp@...el.com, ying.huang@...el.com, feng.tang@...el.com,
zhengjun.xing@...el.com
Subject: [mm] a4d63c3732: will-it-scale.per_process_ops -4.2% regression
Greeting,
FYI, we noticed a -4.2% regression of will-it-scale.per_process_ops due to commit:
commit: a4d63c3732f1a0c91abcf5b7f32b4ef7dcd82025 ("mm: do not rely on mm == current->mm in __get_user_pages_locked")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
in testcase: will-it-scale
on test machine: 104 threads Skylake with 192G memory
with following parameters:
nr_task: 100%
mode: process
test: mmap2
cpufreq_governor: performance
ucode: 0x2006906
test-description: Will It Scale takes a testcase and runs it from 1 through to n parallel copies to see if the testcase will scale. It builds both a process and threads based test in order to see any differences between the two.
test-url: https://github.com/antonblanchard/will-it-scale
If you fix the issue, kindly add following tag
Reported-by: kernel test robot <rong.a.chen@...el.com>
Details are as below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp install job.yaml # job file is attached in this email
bin/lkp run job.yaml
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
gcc-9/performance/x86_64-rhel-8.3/process/100%/debian-10.4-x86_64-20200603.cgz/lkp-skl-fpga01/mmap2/will-it-scale/0x2006906
commit:
v5.9-rc7
a4d63c3732 ("mm: do not rely on mm == current->mm in __get_user_pages_locked")
v5.9-rc7 a4d63c3732f1a0c91abcf5b7f32
---------------- ---------------------------
%stddev %change %stddev
\ | \
220777 -4.2% 211546 will-it-scale.per_process_ops
22960865 -4.2% 22000902 will-it-scale.workload
83839135 ± 55% +55.2% 1.301e+08 ± 26% cpuidle.C1E.time
211210 ± 30% +33.2% 281375 ± 21% cpuidle.C1E.usage
18140 ± 2% +39.2% 25254 ± 23% sched_debug.cpu.nr_switches.max
13463 ± 4% +14.0% 15345 ± 2% sched_debug.cpu.sched_count.max
81065 +1.1% 81959 proc-vmstat.nr_anon_pages
96.80 +3.0% 99.75 proc-vmstat.nr_anon_transparent_hugepages
83789 +1.1% 84672 proc-vmstat.nr_inactive_anon
83789 +1.1% 84672 proc-vmstat.nr_zone_inactive_anon
6517 ±144% -99.0% 64.50 ± 54% proc-vmstat.numa_hint_faults
6473 ± 65% -99.6% 26.50 ±121% proc-vmstat.numa_pages_migrated
6473 ± 65% -99.6% 26.50 ±121% proc-vmstat.pgmigrate_success
48103 ± 6% +8.8% 52321 ± 3% numa-meminfo.node0.KReclaimable
48103 ± 6% +8.8% 52321 ± 3% numa-meminfo.node0.SReclaimable
94453 ± 4% +18.1% 111554 ± 4% numa-meminfo.node0.SUnreclaim
142557 +15.0% 163876 ± 3% numa-meminfo.node0.Slab
50072 ± 4% -10.0% 45056 ± 3% numa-meminfo.node1.KReclaimable
50072 ± 4% -10.0% 45056 ± 3% numa-meminfo.node1.SReclaimable
98485 ± 4% -17.9% 80860 ± 4% numa-meminfo.node1.SUnreclaim
148557 -15.2% 125917 ± 3% numa-meminfo.node1.Slab
12025 ± 6% +8.8% 13079 ± 3% numa-vmstat.node0.nr_slab_reclaimable
23613 ± 4% +18.1% 27888 ± 4% numa-vmstat.node0.nr_slab_unreclaimable
828938 ± 7% +17.8% 976901 ± 7% numa-vmstat.node0.numa_hit
804550 ± 7% +18.8% 956028 ± 6% numa-vmstat.node0.numa_local
12516 ± 4% -10.0% 11263 ± 3% numa-vmstat.node1.nr_slab_reclaimable
24621 ± 4% -17.9% 20214 ± 4% numa-vmstat.node1.nr_slab_unreclaimable
940544 ± 7% -16.2% 788395 ± 9% numa-vmstat.node1.numa_hit
755086 ± 7% -20.6% 599240 ± 10% numa-vmstat.node1.numa_local
5.792e+10 -4.2% 5.55e+10 perf-stat.i.branch-instructions
0.48 -0.0 0.48 perf-stat.i.branch-miss-rate%
2.703e+08 -5.6% 2.552e+08 perf-stat.i.branch-misses
1.15 +4.5% 1.20 perf-stat.i.cpi
161.86 -1.2% 159.93 perf-stat.i.cpu-migrations
45949319 -4.5% 43875433 perf-stat.i.dTLB-load-misses
6.167e+10 -4.2% 5.906e+10 perf-stat.i.dTLB-loads
47578 ± 15% -8.9% 43356 perf-stat.i.dTLB-store-misses
2.787e+10 -4.2% 2.669e+10 perf-stat.i.dTLB-stores
82.07 ± 3% +13.8 95.89 ± 2% perf-stat.i.iTLB-load-miss-rate%
28493832 +51.9% 43294073 perf-stat.i.iTLB-load-misses
6078707 ± 18% -71.6% 1725940 ± 52% perf-stat.i.iTLB-loads
2.416e+11 -4.2% 2.314e+11 perf-stat.i.instructions
8812 -37.2% 5534 perf-stat.i.instructions-per-iTLB-miss
0.87 -4.3% 0.83 perf-stat.i.ipc
1418 -4.2% 1358 perf-stat.i.metric.M/sec
0.47 -0.0 0.46 perf-stat.overall.branch-miss-rate%
1.15 +4.6% 1.20 perf-stat.overall.cpi
82.47 ± 3% +13.7 96.20 ± 2% perf-stat.overall.iTLB-load-miss-rate%
8484 -37.0% 5347 perf-stat.overall.instructions-per-iTLB-miss
0.87 -4.4% 0.83 perf-stat.overall.ipc
5.773e+10 -4.2% 5.531e+10 perf-stat.ps.branch-instructions
2.693e+08 -5.6% 2.544e+08 perf-stat.ps.branch-misses
161.32 -1.2% 159.42 perf-stat.ps.cpu-migrations
45785331 -4.5% 43716262 perf-stat.ps.dTLB-load-misses
6.146e+10 -4.2% 5.886e+10 perf-stat.ps.dTLB-loads
47549 ± 15% -8.9% 43311 perf-stat.ps.dTLB-store-misses
2.778e+10 -4.3% 2.66e+10 perf-stat.ps.dTLB-stores
28376216 +52.0% 43133756 perf-stat.ps.iTLB-load-misses
6071510 ± 18% -71.6% 1726775 ± 52% perf-stat.ps.iTLB-loads
2.407e+11 -4.2% 2.306e+11 perf-stat.ps.instructions
7.276e+13 -4.2% 6.968e+13 perf-stat.total.instructions
89619 +2.5% 91832 interrupts.CAL:Function_call_interrupts
311.60 +22.6% 382.00 ± 22% interrupts.CPU17.RES:Rescheduling_interrupts
6803 ± 6% +5.3% 7164 ± 5% interrupts.CPU19.NMI:Non-maskable_interrupts
6803 ± 6% +5.3% 7164 ± 5% interrupts.CPU19.PMI:Performance_monitoring_interrupts
6070 ± 20% +18.0% 7163 ± 5% interrupts.CPU2.NMI:Non-maskable_interrupts
6070 ± 20% +18.0% 7163 ± 5% interrupts.CPU2.PMI:Performance_monitoring_interrupts
6802 ± 6% +5.3% 7164 ± 5% interrupts.CPU20.NMI:Non-maskable_interrupts
6802 ± 6% +5.3% 7164 ± 5% interrupts.CPU20.PMI:Performance_monitoring_interrupts
6801 ± 6% +5.2% 7158 ± 5% interrupts.CPU24.NMI:Non-maskable_interrupts
6801 ± 6% +5.2% 7158 ± 5% interrupts.CPU24.PMI:Performance_monitoring_interrupts
7216 ± 6% +4.9% 7572 ± 5% interrupts.CPU26.NMI:Non-maskable_interrupts
7216 ± 6% +4.9% 7572 ± 5% interrupts.CPU26.PMI:Performance_monitoring_interrupts
7216 ± 6% +4.9% 7573 ± 5% interrupts.CPU27.NMI:Non-maskable_interrupts
7216 ± 6% +4.9% 7573 ± 5% interrupts.CPU27.PMI:Performance_monitoring_interrupts
7217 ± 6% +4.9% 7573 ± 5% interrupts.CPU28.NMI:Non-maskable_interrupts
7217 ± 6% +4.9% 7573 ± 5% interrupts.CPU28.PMI:Performance_monitoring_interrupts
7218 ± 6% +4.9% 7573 ± 5% interrupts.CPU29.NMI:Non-maskable_interrupts
7218 ± 6% +4.9% 7573 ± 5% interrupts.CPU29.PMI:Performance_monitoring_interrupts
6071 ± 20% +18.0% 7163 ± 5% interrupts.CPU3.NMI:Non-maskable_interrupts
6071 ± 20% +18.0% 7163 ± 5% interrupts.CPU3.PMI:Performance_monitoring_interrupts
7219 ± 6% +4.9% 7573 ± 5% interrupts.CPU30.NMI:Non-maskable_interrupts
7219 ± 6% +4.9% 7573 ± 5% interrupts.CPU30.PMI:Performance_monitoring_interrupts
7218 ± 6% +4.9% 7574 ± 5% interrupts.CPU31.NMI:Non-maskable_interrupts
7218 ± 6% +4.9% 7574 ± 5% interrupts.CPU31.PMI:Performance_monitoring_interrupts
7217 ± 6% +4.9% 7573 ± 5% interrupts.CPU37.NMI:Non-maskable_interrupts
7217 ± 6% +4.9% 7573 ± 5% interrupts.CPU37.PMI:Performance_monitoring_interrupts
6071 ± 20% +18.0% 7163 ± 5% interrupts.CPU4.NMI:Non-maskable_interrupts
6071 ± 20% +18.0% 7163 ± 5% interrupts.CPU4.PMI:Performance_monitoring_interrupts
308.60 +35.9% 419.25 ± 31% interrupts.CPU40.RES:Rescheduling_interrupts
6436 ± 20% +17.7% 7573 ± 5% interrupts.CPU46.NMI:Non-maskable_interrupts
6436 ± 20% +17.7% 7573 ± 5% interrupts.CPU46.PMI:Performance_monitoring_interrupts
6437 ± 20% +17.7% 7573 ± 5% interrupts.CPU47.NMI:Non-maskable_interrupts
6437 ± 20% +17.7% 7573 ± 5% interrupts.CPU47.PMI:Performance_monitoring_interrupts
6436 ± 20% +17.7% 7573 ± 5% interrupts.CPU48.NMI:Non-maskable_interrupts
6436 ± 20% +17.7% 7573 ± 5% interrupts.CPU48.PMI:Performance_monitoring_interrupts
6436 ± 20% +17.7% 7574 ± 5% interrupts.CPU49.NMI:Non-maskable_interrupts
6436 ± 20% +17.7% 7574 ± 5% interrupts.CPU49.PMI:Performance_monitoring_interrupts
6071 ± 20% +18.0% 7164 ± 5% interrupts.CPU5.NMI:Non-maskable_interrupts
6071 ± 20% +18.0% 7164 ± 5% interrupts.CPU5.PMI:Performance_monitoring_interrupts
6436 ± 20% +17.7% 7574 ± 5% interrupts.CPU50.NMI:Non-maskable_interrupts
6436 ± 20% +17.7% 7574 ± 5% interrupts.CPU50.PMI:Performance_monitoring_interrupts
6435 ± 20% +17.7% 7574 ± 5% interrupts.CPU51.NMI:Non-maskable_interrupts
6435 ± 20% +17.7% 7574 ± 5% interrupts.CPU51.PMI:Performance_monitoring_interrupts
6052 ± 20% +18.0% 7139 ± 5% interrupts.CPU52.NMI:Non-maskable_interrupts
6052 ± 20% +18.0% 7139 ± 5% interrupts.CPU52.PMI:Performance_monitoring_interrupts
6790 ± 6% +5.2% 7142 ± 5% interrupts.CPU54.NMI:Non-maskable_interrupts
6790 ± 6% +5.2% 7142 ± 5% interrupts.CPU54.PMI:Performance_monitoring_interrupts
965.80 ± 62% +59.0% 1535 ± 48% interrupts.CPU54.RES:Rescheduling_interrupts
6057 ± 20% +17.9% 7142 ± 5% interrupts.CPU55.NMI:Non-maskable_interrupts
6057 ± 20% +17.9% 7142 ± 5% interrupts.CPU55.PMI:Performance_monitoring_interrupts
566.20 ± 24% +100.8% 1136 ± 50% interrupts.CPU58.CAL:Function_call_interrupts
6056 ± 20% +17.9% 7138 ± 5% interrupts.CPU58.NMI:Non-maskable_interrupts
6056 ± 20% +17.9% 7138 ± 5% interrupts.CPU58.PMI:Performance_monitoring_interrupts
6055 ± 20% +17.9% 7139 ± 5% interrupts.CPU59.NMI:Non-maskable_interrupts
6055 ± 20% +17.9% 7139 ± 5% interrupts.CPU59.PMI:Performance_monitoring_interrupts
6789 ± 6% +5.1% 7138 ± 5% interrupts.CPU60.NMI:Non-maskable_interrupts
6789 ± 6% +5.1% 7138 ± 5% interrupts.CPU60.PMI:Performance_monitoring_interrupts
6786 ± 6% +5.2% 7138 ± 5% interrupts.CPU61.NMI:Non-maskable_interrupts
6786 ± 6% +5.2% 7138 ± 5% interrupts.CPU61.PMI:Performance_monitoring_interrupts
6787 ± 6% +5.2% 7139 ± 5% interrupts.CPU63.NMI:Non-maskable_interrupts
6787 ± 6% +5.2% 7139 ± 5% interrupts.CPU63.PMI:Performance_monitoring_interrupts
6053 ± 20% +17.9% 7136 ± 5% interrupts.CPU65.NMI:Non-maskable_interrupts
6053 ± 20% +17.9% 7136 ± 5% interrupts.CPU65.PMI:Performance_monitoring_interrupts
314.40 ± 3% +6.1% 333.50 ± 5% interrupts.CPU67.RES:Rescheduling_interrupts
5316 ± 25% +34.2% 7133 ± 5% interrupts.CPU76.NMI:Non-maskable_interrupts
5316 ± 25% +34.2% 7133 ± 5% interrupts.CPU76.PMI:Performance_monitoring_interrupts
6434 ± 20% +17.6% 7570 ± 5% interrupts.CPU78.NMI:Non-maskable_interrupts
6434 ± 20% +17.6% 7570 ± 5% interrupts.CPU78.PMI:Performance_monitoring_interrupts
6433 ± 20% +17.7% 7570 ± 5% interrupts.CPU79.NMI:Non-maskable_interrupts
6433 ± 20% +17.7% 7570 ± 5% interrupts.CPU79.PMI:Performance_monitoring_interrupts
453.80 +142.8% 1102 ± 58% interrupts.CPU8.CAL:Function_call_interrupts
6435 ± 20% +17.6% 7570 ± 5% interrupts.CPU80.NMI:Non-maskable_interrupts
6435 ± 20% +17.6% 7570 ± 5% interrupts.CPU80.PMI:Performance_monitoring_interrupts
6436 ± 20% +17.6% 7571 ± 5% interrupts.CPU81.NMI:Non-maskable_interrupts
6436 ± 20% +17.6% 7571 ± 5% interrupts.CPU81.PMI:Performance_monitoring_interrupts
6436 ± 20% +17.6% 7571 ± 5% interrupts.CPU82.NMI:Non-maskable_interrupts
6436 ± 20% +17.6% 7571 ± 5% interrupts.CPU82.PMI:Performance_monitoring_interrupts
7217 ± 6% +4.9% 7573 ± 5% interrupts.CPU88.NMI:Non-maskable_interrupts
7217 ± 6% +4.9% 7573 ± 5% interrupts.CPU88.PMI:Performance_monitoring_interrupts
7217 ± 6% +4.9% 7571 ± 5% interrupts.CPU89.NMI:Non-maskable_interrupts
7217 ± 6% +4.9% 7571 ± 5% interrupts.CPU89.PMI:Performance_monitoring_interrupts
7218 ± 6% +4.9% 7571 ± 5% interrupts.CPU90.NMI:Non-maskable_interrupts
7218 ± 6% +4.9% 7571 ± 5% interrupts.CPU90.PMI:Performance_monitoring_interrupts
7218 ± 6% +4.9% 7571 ± 5% interrupts.CPU91.NMI:Non-maskable_interrupts
7218 ± 6% +4.9% 7571 ± 5% interrupts.CPU91.PMI:Performance_monitoring_interrupts
7219 ± 6% +4.9% 7571 ± 5% interrupts.CPU92.NMI:Non-maskable_interrupts
7219 ± 6% +4.9% 7571 ± 5% interrupts.CPU92.PMI:Performance_monitoring_interrupts
7218 ± 6% +4.9% 7571 ± 5% interrupts.CPU93.NMI:Non-maskable_interrupts
7218 ± 6% +4.9% 7571 ± 5% interrupts.CPU93.PMI:Performance_monitoring_interrupts
25.63 -0.9 24.69 perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
20.44 -0.8 19.59 perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
30.38 -0.8 29.63 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
4.49 -0.7 3.75 perf-profile.calltrace.cycles-pp._cond_resched.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
29.85 -0.7 29.14 perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
28.79 -0.7 28.09 perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
34.89 -0.6 34.25 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__mmap
41.44 -0.6 40.82 perf-profile.calltrace.cycles-pp.__mmap
3.77 ± 4% -0.5 3.26 ± 3% perf-profile.calltrace.cycles-pp.vm_area_alloc.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
3.13 ± 5% -0.4 2.68 ± 3% perf-profile.calltrace.cycles-pp.kmem_cache_alloc.vm_area_alloc.mmap_region.do_mmap.vm_mmap_pgoff
0.53 ± 3% -0.3 0.25 ±100% perf-profile.calltrace.cycles-pp.cap_vm_enough_memory.security_vm_enough_memory_mm.mmap_region.do_mmap.vm_mmap_pgoff
2.64 -0.2 2.41 perf-profile.calltrace.cycles-pp.vma_link.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
1.26 -0.2 1.07 perf-profile.calltrace.cycles-pp.__vma_link_rb.vma_link.mmap_region.do_mmap.vm_mmap_pgoff
2.36 -0.2 2.18 perf-profile.calltrace.cycles-pp.rcu_all_qs._cond_resched.unmap_page_range.unmap_vmas.unmap_region
3.37 -0.2 3.20 perf-profile.calltrace.cycles-pp.d_path.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
0.55 ± 3% -0.2 0.38 ± 57% perf-profile.calltrace.cycles-pp.down_write.vma_link.mmap_region.do_mmap.vm_mmap_pgoff
2.30 -0.2 2.14 perf-profile.calltrace.cycles-pp.zap_pte_range.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
1.57 -0.2 1.42 perf-profile.calltrace.cycles-pp.free_pgtables.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
3.03 -0.1 2.89 perf-profile.calltrace.cycles-pp.shmem_get_unmapped_area.get_unmapped_area.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
1.75 -0.1 1.61 perf-profile.calltrace.cycles-pp.prepend_path.d_path.perf_event_mmap.mmap_region.do_mmap
4.31 -0.1 4.17 perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.__munmap
1.21 -0.1 1.07 perf-profile.calltrace.cycles-pp.shmem_mmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
0.84 -0.1 0.74 perf-profile.calltrace.cycles-pp.touch_atime.shmem_mmap.mmap_region.do_mmap.vm_mmap_pgoff
3.08 -0.1 2.99 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__mmap
1.39 -0.1 1.30 perf-profile.calltrace.cycles-pp.find_vma.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
0.62 -0.1 0.53 perf-profile.calltrace.cycles-pp.atime_needs_update.touch_atime.shmem_mmap.mmap_region.do_mmap
0.98 -0.1 0.89 perf-profile.calltrace.cycles-pp.unlink_file_vma.free_pgtables.unmap_region.__do_munmap.__vm_munmap
3.88 -0.1 3.79 perf-profile.calltrace.cycles-pp.get_unmapped_area.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
0.99 -0.1 0.92 perf-profile.calltrace.cycles-pp.kmem_cache_alloc_trace.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
0.73 ± 2% -0.1 0.68 perf-profile.calltrace.cycles-pp.kfree.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
2.97 -0.1 2.92 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__munmap
0.90 ± 3% -0.1 0.85 ± 2% perf-profile.calltrace.cycles-pp.strlen.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
0.57 -0.0 0.52 ± 2% perf-profile.calltrace.cycles-pp.down_write.unlink_file_vma.free_pgtables.unmap_region.__do_munmap
1.06 -0.0 1.01 perf-profile.calltrace.cycles-pp.vm_unmapped_area.arch_get_unmapped_area_topdown.shmem_get_unmapped_area.get_unmapped_area.do_mmap
0.63 -0.0 0.59 perf-profile.calltrace.cycles-pp.common_file_perm.security_mmap_file.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
1.09 -0.0 1.06 perf-profile.calltrace.cycles-pp.prepend.d_path.perf_event_mmap.mmap_region.do_mmap
0.72 ± 2% +0.0 0.76 perf-profile.calltrace.cycles-pp.security_vm_enough_memory_mm.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
2.71 +0.1 2.78 perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.__munmap
9.31 +0.1 9.39 perf-profile.calltrace.cycles-pp.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
2.73 +0.1 2.83 perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.__mmap
4.27 +0.1 4.42 perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.__mmap
0.77 +0.2 0.95 ± 2% perf-profile.calltrace.cycles-pp.perf_event_mmap_output.perf_iterate_sb.perf_event_mmap.mmap_region.do_mmap
1.85 +0.2 2.03 perf-profile.calltrace.cycles-pp.security_mmap_file.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.83 +0.5 2.33 perf-profile.calltrace.cycles-pp.perf_iterate_sb.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
52.13 +0.6 52.74 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__munmap
58.31 +0.6 58.94 perf-profile.calltrace.cycles-pp.__munmap
45.74 +0.7 46.43 perf-profile.calltrace.cycles-pp.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
46.83 +0.7 47.54 perf-profile.calltrace.cycles-pp.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
47.29 +0.7 48.01 perf-profile.calltrace.cycles-pp.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
47.59 +0.7 48.34 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
38.38 +1.0 39.38 perf-profile.calltrace.cycles-pp.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
6.69 +1.2 7.91 perf-profile.calltrace.cycles-pp.free_pgd_range.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
25.68 -0.9 24.73 perf-profile.children.cycles-pp.do_mmap
20.58 -0.9 19.71 perf-profile.children.cycles-pp.mmap_region
29.90 -0.7 29.19 perf-profile.children.cycles-pp.ksys_mmap_pgoff
28.83 -0.7 28.13 perf-profile.children.cycles-pp.vm_mmap_pgoff
41.80 -0.6 41.15 perf-profile.children.cycles-pp.__mmap
3.77 ± 4% -0.5 3.26 ± 3% perf-profile.children.cycles-pp.vm_area_alloc
3.21 ± 5% -0.4 2.76 ± 3% perf-profile.children.cycles-pp.kmem_cache_alloc
4.97 -0.3 4.65 perf-profile.children.cycles-pp._cond_resched
2.68 -0.2 2.46 perf-profile.children.cycles-pp.vma_link
14.27 -0.2 14.08 perf-profile.children.cycles-pp.___might_sleep
1.26 -0.2 1.08 perf-profile.children.cycles-pp.__vma_link_rb
3.39 -0.2 3.22 perf-profile.children.cycles-pp.d_path
2.33 -0.2 2.17 perf-profile.children.cycles-pp.zap_pte_range
1.60 -0.2 1.45 perf-profile.children.cycles-pp.free_pgtables
3.04 -0.1 2.89 perf-profile.children.cycles-pp.shmem_get_unmapped_area
6.07 -0.1 5.92 perf-profile.children.cycles-pp.entry_SYSCALL_64
1.78 -0.1 1.64 perf-profile.children.cycles-pp.prepend_path
1.23 -0.1 1.10 perf-profile.children.cycles-pp.shmem_mmap
2.71 -0.1 2.59 perf-profile.children.cycles-pp.rcu_all_qs
1.47 -0.1 1.38 perf-profile.children.cycles-pp.find_vma
0.41 -0.1 0.32 perf-profile.children.cycles-pp.vma_set_page_prot
0.84 -0.1 0.74 perf-profile.children.cycles-pp.touch_atime
1.16 ± 2% -0.1 1.07 ± 2% perf-profile.children.cycles-pp.down_write
3.91 -0.1 3.82 perf-profile.children.cycles-pp.get_unmapped_area
0.99 -0.1 0.90 perf-profile.children.cycles-pp.unlink_file_vma
0.63 -0.1 0.54 perf-profile.children.cycles-pp.atime_needs_update
1.05 -0.1 0.97 perf-profile.children.cycles-pp.kmem_cache_alloc_trace
0.36 -0.1 0.29 ± 2% perf-profile.children.cycles-pp.apparmor_mmap_file
0.21 ± 3% -0.1 0.15 ± 10% perf-profile.children.cycles-pp.uprobe_mmap
0.29 ± 2% -0.1 0.23 perf-profile.children.cycles-pp.vma_merge
0.74 ± 2% -0.1 0.69 perf-profile.children.cycles-pp.kfree
0.36 -0.1 0.31 ± 8% perf-profile.children.cycles-pp.current_time
0.52 ± 2% -0.1 0.47 ± 5% perf-profile.children.cycles-pp.security_mmap_addr
0.31 -0.1 0.26 perf-profile.children.cycles-pp.sync_mm_rss
0.90 ± 3% -0.1 0.85 ± 2% perf-profile.children.cycles-pp.strlen
0.97 -0.0 0.92 perf-profile.children.cycles-pp.__might_sleep
0.33 ± 2% -0.0 0.29 ± 3% perf-profile.children.cycles-pp.__vm_enough_memory
1.07 -0.0 1.02 perf-profile.children.cycles-pp.vm_unmapped_area
0.39 -0.0 0.35 ± 2% perf-profile.children.cycles-pp.lru_add_drain
0.35 -0.0 0.30 ± 3% perf-profile.children.cycles-pp.cap_mmap_addr
0.64 -0.0 0.60 perf-profile.children.cycles-pp.common_file_perm
1.12 -0.0 1.08 perf-profile.children.cycles-pp.prepend
0.31 ± 3% -0.0 0.27 ± 5% perf-profile.children.cycles-pp.percpu_counter_add_batch
0.18 ± 2% -0.0 0.15 ± 5% perf-profile.children.cycles-pp.vm_pgprot_modify
0.38 -0.0 0.34 perf-profile.children.cycles-pp.obj_cgroup_charge
0.74 -0.0 0.71 perf-profile.children.cycles-pp.up_write
0.26 ± 6% -0.0 0.23 ± 8% perf-profile.children.cycles-pp.path_noexec
1.01 -0.0 0.98 perf-profile.children.cycles-pp.memcpy_erms
0.37 -0.0 0.34 perf-profile.children.cycles-pp.downgrade_write
0.31 ± 2% -0.0 0.28 ± 2% perf-profile.children.cycles-pp.lru_add_drain_cpu
0.18 ± 5% -0.0 0.15 perf-profile.children.cycles-pp.cap_capable
0.19 ± 3% -0.0 0.17 ± 3% perf-profile.children.cycles-pp.__x64_sys_mmap
0.32 -0.0 0.29 perf-profile.children.cycles-pp.tlb_gather_mmu
0.54 ± 2% -0.0 0.52 perf-profile.children.cycles-pp.cap_vm_enough_memory
0.24 ± 2% -0.0 0.21 ± 2% perf-profile.children.cycles-pp.vma_interval_tree_remove
0.22 ± 4% -0.0 0.20 ± 2% perf-profile.children.cycles-pp.unlink_anon_vmas
1.04 -0.0 1.02 perf-profile.children.cycles-pp.down_write_killable
0.45 ± 2% -0.0 0.43 perf-profile.children.cycles-pp.vmacache_find
0.14 ± 2% -0.0 0.12 ± 5% perf-profile.children.cycles-pp.timestamp_truncate
0.08 ± 5% -0.0 0.06 ± 6% perf-profile.children.cycles-pp.munmap@plt
0.43 ± 2% +0.0 0.45 perf-profile.children.cycles-pp.syscall_enter_from_user_mode
0.18 ± 3% +0.0 0.20 perf-profile.children.cycles-pp.tlb_flush_mmu
0.10 ± 6% +0.0 0.12 ± 8% perf-profile.children.cycles-pp.aa_file_perm
0.43 ± 3% +0.0 0.46 ± 2% perf-profile.children.cycles-pp.fput_many
0.05 ± 7% +0.0 0.09 ± 5% perf-profile.children.cycles-pp.__x86_indirect_thunk_r9
0.29 +0.0 0.33 perf-profile.children.cycles-pp.cap_mmap_file
0.19 ± 3% +0.0 0.23 ± 2% perf-profile.children.cycles-pp.userfaultfd_unmap_complete
0.73 ± 2% +0.0 0.78 perf-profile.children.cycles-pp.security_vm_enough_memory_mm
0.08 +0.1 0.13 ± 5% perf-profile.children.cycles-pp.get_align_mask
0.16 ± 2% +0.1 0.23 ± 3% perf-profile.children.cycles-pp.blocking_notifier_call_chain
0.08 ± 5% +0.1 0.15 ± 5% perf-profile.children.cycles-pp.__x86_retpoline_rbp
9.37 +0.1 9.45 perf-profile.children.cycles-pp.perf_event_mmap
0.38 ± 7% +0.1 0.46 ± 4% perf-profile.children.cycles-pp.exit_to_user_mode_prepare
0.42 +0.1 0.52 ± 2% perf-profile.children.cycles-pp.refill_obj_stock
6.10 +0.1 6.24 perf-profile.children.cycles-pp.syscall_return_via_sysret
0.33 ± 3% +0.2 0.49 perf-profile.children.cycles-pp.__x86_retpoline_r9
1.88 +0.2 2.05 perf-profile.children.cycles-pp.security_mmap_file
0.79 +0.2 0.96 ± 2% perf-profile.children.cycles-pp.perf_event_mmap_output
1.85 +0.5 2.35 perf-profile.children.cycles-pp.perf_iterate_sb
58.74 +0.6 59.34 perf-profile.children.cycles-pp.__munmap
45.84 +0.7 46.52 perf-profile.children.cycles-pp.__do_munmap
46.87 +0.7 47.58 perf-profile.children.cycles-pp.__vm_munmap
47.31 +0.7 48.03 perf-profile.children.cycles-pp.__x64_sys_munmap
38.46 +1.0 39.45 perf-profile.children.cycles-pp.unmap_region
6.70 +1.2 7.93 perf-profile.children.cycles-pp.free_pgd_range
12.02 -0.3 11.68 perf-profile.self.cycles-pp.___might_sleep
2.32 -0.2 2.09 perf-profile.self.cycles-pp._cond_resched
1.25 -0.2 1.07 perf-profile.self.cycles-pp.__vma_link_rb
0.90 ± 3% -0.1 0.78 ± 3% perf-profile.self.cycles-pp.shmem_get_unmapped_area
0.84 ± 2% -0.1 0.72 ± 6% perf-profile.self.cycles-pp.prepend_path
1.65 -0.1 1.54 perf-profile.self.cycles-pp.zap_pte_range
1.29 -0.1 1.19 perf-profile.self.cycles-pp.__do_munmap
5.36 -0.1 5.26 perf-profile.self.cycles-pp.entry_SYSCALL_64
0.47 ± 11% -0.1 0.38 ± 2% perf-profile.self.cycles-pp.get_obj_cgroup_from_current
1.45 -0.1 1.37 perf-profile.self.cycles-pp.perf_event_mmap
0.32 -0.1 0.25 perf-profile.self.cycles-pp.apparmor_mmap_file
0.28 ± 2% -0.1 0.21 perf-profile.self.cycles-pp.vma_merge
0.20 ± 3% -0.1 0.14 ± 11% perf-profile.self.cycles-pp.uprobe_mmap
0.54 -0.1 0.48 perf-profile.self.cycles-pp.common_file_perm
0.89 -0.1 0.82 perf-profile.self.cycles-pp.find_vma
2.04 -0.1 1.98 perf-profile.self.cycles-pp.rcu_all_qs
0.73 ± 2% -0.1 0.67 perf-profile.self.cycles-pp.kfree
0.53 -0.1 0.48 perf-profile.self.cycles-pp.vm_area_alloc
0.89 ± 3% -0.1 0.84 ± 2% perf-profile.self.cycles-pp.strlen
0.61 -0.0 0.56 perf-profile.self.cycles-pp.kmem_cache_alloc_trace
0.30 -0.0 0.26 perf-profile.self.cycles-pp.sync_mm_rss
0.89 -0.0 0.84 perf-profile.self.cycles-pp.__might_sleep
0.25 -0.0 0.20 ± 3% perf-profile.self.cycles-pp.__x64_sys_munmap
0.52 ± 3% -0.0 0.47 ± 3% perf-profile.self.cycles-pp.down_write
1.06 -0.0 1.01 perf-profile.self.cycles-pp.vm_unmapped_area
0.15 ± 2% -0.0 0.11 ± 4% perf-profile.self.cycles-pp.vma_set_page_prot
0.30 ± 2% -0.0 0.26 perf-profile.self.cycles-pp.cap_mmap_addr
0.29 ± 3% -0.0 0.25 ± 4% perf-profile.self.cycles-pp.percpu_counter_add_batch
0.17 ± 2% -0.0 0.14 ± 6% perf-profile.self.cycles-pp.vm_pgprot_modify
0.36 -0.0 0.33 perf-profile.self.cycles-pp.obj_cgroup_charge
0.72 -0.0 0.69 perf-profile.self.cycles-pp.up_write
0.30 -0.0 0.26 ± 3% perf-profile.self.cycles-pp.unmap_region
0.17 ± 5% -0.0 0.14 ± 3% perf-profile.self.cycles-pp.cap_capable
0.30 ± 2% -0.0 0.27 perf-profile.self.cycles-pp.lru_add_drain_cpu
0.35 -0.0 0.33 ± 2% perf-profile.self.cycles-pp.downgrade_write
0.44 -0.0 0.42 perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
0.46 ± 2% -0.0 0.44 perf-profile.self.cycles-pp.down_write_killable
0.39 -0.0 0.36 ± 2% perf-profile.self.cycles-pp.shmem_mmap
0.31 ± 2% -0.0 0.29 ± 3% perf-profile.self.cycles-pp.tlb_gather_mmu
0.19 ± 2% -0.0 0.17 ± 4% perf-profile.self.cycles-pp.vma_interval_tree_remove
0.18 ± 2% -0.0 0.15 ± 5% perf-profile.self.cycles-pp.__x64_sys_mmap
0.14 -0.0 0.12 ± 13% perf-profile.self.cycles-pp.current_time
0.20 ± 2% -0.0 0.18 ± 4% perf-profile.self.cycles-pp.unlink_anon_vmas
0.13 ± 3% -0.0 0.11 ± 6% perf-profile.self.cycles-pp.timestamp_truncate
0.08 -0.0 0.07 ± 13% perf-profile.self.cycles-pp.ktime_get_coarse_real_ts64
0.13 ± 3% -0.0 0.12 ± 3% perf-profile.self.cycles-pp.unlink_file_vma
0.08 -0.0 0.07 ± 6% perf-profile.self.cycles-pp.lru_add_drain
0.09 -0.0 0.08 perf-profile.self.cycles-pp.__vma_link_file
0.33 ± 2% +0.0 0.35 ± 2% perf-profile.self.cycles-pp.vm_mmap_pgoff
0.12 ± 3% +0.0 0.14 perf-profile.self.cycles-pp.tlb_flush_mmu
0.39 ± 2% +0.0 0.41 ± 2% perf-profile.self.cycles-pp.syscall_enter_from_user_mode
0.09 ± 4% +0.0 0.11 ± 9% perf-profile.self.cycles-pp.aa_file_perm
0.24 ± 2% +0.0 0.27 perf-profile.self.cycles-pp.cap_mmap_file
0.21 ± 2% +0.0 0.26 perf-profile.self.cycles-pp.vma_link
0.06 +0.0 0.10 ± 4% perf-profile.self.cycles-pp.get_align_mask
0.96 +0.0 1.01 perf-profile.self.cycles-pp.do_mmap
0.17 ± 3% +0.0 0.22 perf-profile.self.cycles-pp.userfaultfd_unmap_complete
0.12 ± 3% +0.1 0.18 ± 2% perf-profile.self.cycles-pp.security_vm_enough_memory_mm
0.00 +0.1 0.06 ± 6% perf-profile.self.cycles-pp.__x86_indirect_thunk_r9
0.15 ± 3% +0.1 0.22 perf-profile.self.cycles-pp.blocking_notifier_call_chain
0.06 ± 6% +0.1 0.13 ± 5% perf-profile.self.cycles-pp.__x86_retpoline_rbp
0.36 ± 2% +0.1 0.45 perf-profile.self.cycles-pp.security_mmap_file
0.36 ± 8% +0.1 0.44 ± 5% perf-profile.self.cycles-pp.exit_to_user_mode_prepare
0.31 ± 3% +0.1 0.40 ± 2% perf-profile.self.cycles-pp.get_unmapped_area
0.40 +0.1 0.50 ± 2% perf-profile.self.cycles-pp.refill_obj_stock
0.30 ± 2% +0.1 0.43 perf-profile.self.cycles-pp.__x86_retpoline_r9
6.09 +0.1 6.22 perf-profile.self.cycles-pp.syscall_return_via_sysret
0.75 +0.1 0.89 perf-profile.self.cycles-pp.perf_event_mmap_output
1.40 +0.1 1.54 perf-profile.self.cycles-pp.mmap_region
1.01 ± 2% +0.3 1.28 perf-profile.self.cycles-pp.perf_iterate_sb
10.66 +0.8 11.42 perf-profile.self.cycles-pp.unmap_page_range
6.66 +1.2 7.88 perf-profile.self.cycles-pp.free_pgd_range
will-it-scale.per_process_ops
224000 +------------------------------------------------------------------+
| |
222000 |-+ .+..... |
| .... +......+...... |
220000 |.+ + |
| |
218000 |-+ |
| |
216000 |-+ |
| |
214000 |-+ |
| |
212000 |-+ O O O O O O |
| O O O |
210000 +------------------------------------------------------------------+
will-it-scale.workload
2.32e+07 +----------------------------------------------------------------+
| .+.... |
2.3e+07 |-.... . ...+..... |
|. +... + |
2.28e+07 |-+ |
| |
2.26e+07 |-+ |
| |
2.24e+07 |-+ |
| |
2.22e+07 |-+ |
| O O O |
2.2e+07 |-+ O O O O |
| O O |
2.18e+07 +----------------------------------------------------------------+
[*] bisect-good sample
[O] bisect-bad sample
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
Thanks,
Rong Chen
View attachment "config-5.9.0-rc7-00001-ga4d63c3732f1a0" of type "text/plain" (170148 bytes)
View attachment "job-script" of type "text/plain" (7573 bytes)
View attachment "job.yaml" of type "text/plain" (5118 bytes)
View attachment "reproduce" of type "text/plain" (338 bytes)
Powered by blists - more mailing lists