[<prev] [next>] [day] [month] [year] [list]
Message-ID: <20201004132237.GQ393@shao2-debian>
Date: Sun, 4 Oct 2020 21:22:37 +0800
From: kernel test robot <rong.a.chen@...el.com>
To: Peter Xu <peterx@...hat.com>
Cc: Linus Torvalds <torvalds@...ux-foundation.org>,
LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
lkp@...el.com, ying.huang@...el.com, feng.tang@...el.com,
zhengjun.xing@...el.com
Subject: [mm] 70e806e4e6: will-it-scale.per_process_ops 2.7% improvement
Greeting,
FYI, we noticed a 2.7% improvement of will-it-scale.per_process_ops due to commit:
commit: 70e806e4e645019102d0e09d4933654fb5fb58ce ("mm: Do early cow for pinned pages during fork() for ptes")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
in testcase: will-it-scale
on test machine: 104 threads Skylake with 192G memory
with following parameters:
nr_task: 100%
mode: process
test: mmap2
cpufreq_governor: performance
ucode: 0x2006906
test-description: Will It Scale takes a testcase and runs it from 1 through to n parallel copies to see if the testcase will scale. It builds both a process and threads based test in order to see any differences between the two.
test-url: https://github.com/antonblanchard/will-it-scale
In addition to that, the commit also has significant impact on the following tests:
+------------------+---------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_process_ops 2.0% improvement |
| test machine | 104 threads Skylake with 192G memory |
| test parameters | cpufreq_governor=performance |
| | mode=process |
| | nr_task=100% |
| | test=mmap1 |
| | ucode=0x2006906 |
+------------------+---------------------------------------------------------------+
Details are as below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp install job.yaml # job file is attached in this email
bin/lkp run job.yaml
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
gcc-9/performance/x86_64-rhel-8.3/process/100%/debian-10.4-x86_64-20200603.cgz/lkp-skl-fpga01/mmap2/will-it-scale/0x2006906
commit:
7a4830c380 ("mm/fork: Pass new vma pointer into copy_page_range()")
70e806e4e6 ("mm: Do early cow for pinned pages during fork() for ptes")
7a4830c380f3a8b3 70e806e4e645019102d0e09d493
---------------- ---------------------------
%stddev %change %stddev
\ | \
215537 +2.7% 221349 will-it-scale.per_process_ops
22415915 +2.7% 23020376 will-it-scale.workload
839733 ± 3% -35.4% 542743 ± 17% cpuidle.C1.time
282931 ± 8% +13.9% 322181 ± 9% numa-numastat.node0.local_node
306399 ± 5% +8.4% 332275 ± 5% numa-numastat.node0.numa_hit
741.62 ± 7% +9.1% 809.08 ± 6% sched_debug.cfs_rq:/.util_avg.min
0.14 ± 7% +12.2% 0.16 ± 5% sched_debug.cpu.nr_running.stddev
795.75 ± 7% +35.8% 1080 ± 7% numa-vmstat.node0.nr_page_table_pages
11758 ± 8% +21.3% 14260 ± 4% numa-vmstat.node0.nr_slab_reclaimable
1282 ± 4% -21.9% 1002 ± 7% numa-vmstat.node1.nr_page_table_pages
12482 ± 7% -20.9% 9868 ± 8% numa-vmstat.node1.nr_slab_reclaimable
789100 ± 11% -19.3% 636572 ± 14% numa-vmstat.node1.numa_local
47033 ± 8% +21.3% 57044 ± 4% numa-meminfo.node0.KReclaimable
3183 ± 7% +35.8% 4323 ± 7% numa-meminfo.node0.PageTables
47033 ± 8% +21.3% 57044 ± 4% numa-meminfo.node0.SReclaimable
144339 ± 8% +12.7% 162692 ± 6% numa-meminfo.node0.Slab
49936 ± 7% -20.9% 39478 ± 8% numa-meminfo.node1.KReclaimable
5130 ± 4% -21.8% 4013 ± 7% numa-meminfo.node1.PageTables
49936 ± 7% -20.9% 39478 ± 8% numa-meminfo.node1.SReclaimable
567.00 ± 9% +181.8% 1597 ± 33% interrupts.CPU3.CAL:Function_call_interrupts
483.50 ± 8% +20.2% 581.25 ± 12% interrupts.CPU59.CAL:Function_call_interrupts
351.00 ± 12% +43.7% 504.25 ± 20% interrupts.CPU59.RES:Rescheduling_interrupts
322.00 ± 2% +30.7% 421.00 ± 16% interrupts.CPU60.RES:Rescheduling_interrupts
455.75 +17.8% 536.75 ± 17% interrupts.CPU61.CAL:Function_call_interrupts
316.50 +25.0% 395.75 ± 16% interrupts.CPU62.RES:Rescheduling_interrupts
462.75 ± 4% +4.9% 485.25 ± 5% interrupts.CPU72.CAL:Function_call_interrupts
994.00 ± 95% -67.9% 318.75 ± 3% interrupts.CPU98.RES:Rescheduling_interrupts
5.651e+10 +2.8% 5.809e+10 perf-stat.i.branch-instructions
0.48 +0.0 0.48 perf-stat.i.branch-miss-rate%
2.596e+08 +4.0% 2.699e+08 perf-stat.i.branch-misses
10.93 ± 6% -2.2 8.74 ± 4% perf-stat.i.cache-miss-rate%
1.18 -2.7% 1.15 perf-stat.i.cpi
44666128 +3.6% 46270200 perf-stat.i.dTLB-load-misses
6.013e+10 +2.8% 6.179e+10 perf-stat.i.dTLB-loads
42553 +14.9% 48901 ± 15% perf-stat.i.dTLB-store-misses
2.718e+10 +2.8% 2.793e+10 perf-stat.i.dTLB-stores
44533758 -36.9% 28097154 perf-stat.i.iTLB-load-misses
2.356e+11 +2.8% 2.422e+11 perf-stat.i.instructions
5549 +60.7% 8920 perf-stat.i.instructions-per-iTLB-miss
0.85 +2.8% 0.87 perf-stat.i.ipc
1382 +2.8% 1421 perf-stat.i.metric.M/sec
0.46 +0.0 0.46 perf-stat.overall.branch-miss-rate%
11.14 ± 10% -2.4 8.75 ± 4% perf-stat.overall.cache-miss-rate%
1.18 -2.7% 1.15 perf-stat.overall.cpi
0.00 +0.0 0.00 ± 15% perf-stat.overall.dTLB-store-miss-rate%
5292 +63.0% 8627 perf-stat.overall.instructions-per-iTLB-miss
0.85 +2.8% 0.87 perf-stat.overall.ipc
5.632e+10 +2.8% 5.789e+10 perf-stat.ps.branch-instructions
2.588e+08 +3.9% 2.69e+08 perf-stat.ps.branch-misses
44511150 +3.6% 46103111 perf-stat.ps.dTLB-load-misses
5.993e+10 +2.8% 6.158e+10 perf-stat.ps.dTLB-loads
42650 +14.6% 48890 ± 15% perf-stat.ps.dTLB-store-misses
2.709e+10 +2.8% 2.783e+10 perf-stat.ps.dTLB-stores
44371490 -36.9% 27977958 perf-stat.ps.iTLB-load-misses
2.348e+11 +2.8% 2.414e+11 perf-stat.ps.instructions
7.106e+13 +2.6% 7.29e+13 perf-stat.total.instructions
48.23 -0.9 47.34 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
39.18 -0.9 38.30 perf-profile.calltrace.cycles-pp.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
47.45 -0.9 46.59 perf-profile.calltrace.cycles-pp.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
47.90 -0.9 47.05 perf-profile.calltrace.cycles-pp.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
46.34 -0.8 45.51 perf-profile.calltrace.cycles-pp.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
52.66 -0.8 51.86 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__munmap
15.57 -0.7 14.88 perf-profile.calltrace.cycles-pp.___might_sleep.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
29.18 -0.7 28.50 perf-profile.calltrace.cycles-pp.unmap_vmas.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
58.71 -0.6 58.12 perf-profile.calltrace.cycles-pp.__munmap
26.14 -0.6 25.57 perf-profile.calltrace.cycles-pp.unmap_page_range.unmap_vmas.unmap_region.__do_munmap.__vm_munmap
2.89 -0.3 2.54 perf-profile.calltrace.cycles-pp.vma_link.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
7.01 -0.2 6.77 perf-profile.calltrace.cycles-pp.free_pgd_range.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
0.76 -0.1 0.65 perf-profile.calltrace.cycles-pp.__vma_rb_erase.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
0.66 -0.1 0.56 perf-profile.calltrace.cycles-pp.down_write.unlink_file_vma.free_pgtables.unmap_region.__do_munmap
3.21 -0.1 3.10 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__mmap
1.01 -0.1 0.94 perf-profile.calltrace.cycles-pp.memcpy_erms.prepend.d_path.perf_event_mmap.mmap_region
1.63 -0.1 1.58 perf-profile.calltrace.cycles-pp.prepend_path.d_path.perf_event_mmap.mmap_region.do_mmap
1.04 -0.0 0.99 perf-profile.calltrace.cycles-pp.unlink_file_vma.free_pgtables.unmap_region.__do_munmap.__vm_munmap
0.89 -0.0 0.85 perf-profile.calltrace.cycles-pp.prepend_name.prepend_path.d_path.perf_event_mmap.mmap_region
0.56 -0.0 0.53 ± 2% perf-profile.calltrace.cycles-pp.down_write.vma_link.mmap_region.do_mmap.vm_mmap_pgoff
1.21 -0.0 1.18 perf-profile.calltrace.cycles-pp.__vma_link_rb.vma_link.mmap_region.do_mmap.vm_mmap_pgoff
1.25 +0.0 1.28 perf-profile.calltrace.cycles-pp.prepend.d_path.perf_event_mmap.mmap_region.do_mmap
0.93 +0.0 0.98 perf-profile.calltrace.cycles-pp.strlen.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
1.32 +0.1 1.38 perf-profile.calltrace.cycles-pp.find_vma.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
0.57 ± 2% +0.1 0.65 perf-profile.calltrace.cycles-pp.common_file_perm.security_mmap_file.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
0.67 +0.1 0.76 perf-profile.calltrace.cycles-pp.kfree.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
2.24 +0.1 2.35 perf-profile.calltrace.cycles-pp.rcu_all_qs._cond_resched.unmap_page_range.unmap_vmas.unmap_region
1.98 +0.1 2.10 perf-profile.calltrace.cycles-pp.arch_get_unmapped_area_topdown.shmem_get_unmapped_area.get_unmapped_area.do_mmap.vm_mmap_pgoff
25.35 +0.1 25.48 perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.39 ± 57% +0.1 0.53 perf-profile.calltrace.cycles-pp.cap_vm_enough_memory.security_vm_enough_memory_mm.mmap_region.do_mmap.vm_mmap_pgoff
9.39 +0.2 9.54 perf-profile.calltrace.cycles-pp.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
1.72 +0.2 1.88 perf-profile.calltrace.cycles-pp.security_mmap_file.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.38 ± 57% +0.2 0.53 ± 2% perf-profile.calltrace.cycles-pp.down_write_killable.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
2.38 ± 6% +0.2 2.56 ± 2% perf-profile.calltrace.cycles-pp.kmem_cache_free.remove_vma.__do_munmap.__vm_munmap.__x64_sys_munmap
2.67 +0.2 2.86 perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.__mmap
2.58 +0.2 2.79 perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.__munmap
2.88 ± 4% +0.2 3.09 perf-profile.calltrace.cycles-pp.kmem_cache_alloc.vm_area_alloc.mmap_region.do_mmap.vm_mmap_pgoff
0.82 +0.2 1.04 perf-profile.calltrace.cycles-pp.vm_unmapped_area.arch_get_unmapped_area_topdown.shmem_get_unmapped_area.get_unmapped_area.do_mmap
4.18 ± 5% +0.2 4.40 perf-profile.calltrace.cycles-pp._cond_resched.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
3.49 ± 3% +0.2 3.73 perf-profile.calltrace.cycles-pp.vm_area_alloc.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
20.14 +0.3 20.46 perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
28.30 +0.4 28.71 perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
29.38 +0.4 29.82 perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
29.88 +0.5 30.34 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
2.16 +0.5 2.64 perf-profile.calltrace.cycles-pp.zap_pte_range.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
34.46 +0.5 34.99 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__mmap
41.05 +0.6 41.63 perf-profile.calltrace.cycles-pp.__mmap
39.24 -0.9 38.36 perf-profile.children.cycles-pp.unmap_region
47.93 -0.9 47.07 perf-profile.children.cycles-pp.__x64_sys_munmap
47.48 -0.9 46.62 perf-profile.children.cycles-pp.__vm_munmap
46.43 -0.8 45.59 perf-profile.children.cycles-pp.__do_munmap
28.42 -0.7 27.68 perf-profile.children.cycles-pp.unmap_page_range
29.20 -0.7 28.52 perf-profile.children.cycles-pp.unmap_vmas
59.12 -0.6 58.54 perf-profile.children.cycles-pp.__munmap
14.57 -0.5 14.04 perf-profile.children.cycles-pp.___might_sleep
78.17 -0.4 77.73 perf-profile.children.cycles-pp.do_syscall_64
2.94 -0.3 2.59 perf-profile.children.cycles-pp.vma_link
87.19 -0.3 86.92 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
7.03 -0.2 6.79 perf-profile.children.cycles-pp.free_pgd_range
0.32 ± 2% -0.2 0.15 ± 2% perf-profile.children.cycles-pp.__rb_insert_augmented
1.26 -0.1 1.13 perf-profile.children.cycles-pp.down_write
0.77 -0.1 0.65 perf-profile.children.cycles-pp.__vma_rb_erase
0.08 ± 5% -0.1 0.03 ±100% perf-profile.children.cycles-pp.memcpy
1.66 -0.1 1.60 perf-profile.children.cycles-pp.prepend_path
0.17 ± 5% -0.0 0.12 ± 3% perf-profile.children.cycles-pp.__vma_link_file
1.05 -0.0 1.00 perf-profile.children.cycles-pp.unlink_file_vma
0.92 -0.0 0.87 perf-profile.children.cycles-pp.prepend_name
0.36 -0.0 0.32 ± 2% perf-profile.children.cycles-pp.__x86_retpoline_r9
0.12 -0.0 0.08 ± 5% perf-profile.children.cycles-pp.get_align_mask
1.02 -0.0 0.98 perf-profile.children.cycles-pp.__might_sleep
0.17 ± 3% -0.0 0.13 ± 3% perf-profile.children.cycles-pp.timestamp_truncate
0.38 ± 2% -0.0 0.35 ± 3% perf-profile.children.cycles-pp.current_time
0.46 -0.0 0.43 ± 2% perf-profile.children.cycles-pp.syscall_enter_from_user_mode
1.21 -0.0 1.19 perf-profile.children.cycles-pp.__vma_link_rb
0.10 ± 4% -0.0 0.08 perf-profile.children.cycles-pp.get_mmap_base
0.18 ± 2% -0.0 0.17 perf-profile.children.cycles-pp.tlb_flush_mmu
0.07 -0.0 0.06 perf-profile.children.cycles-pp.munmap@plt
0.18 +0.0 0.20 ± 2% perf-profile.children.cycles-pp.testcase
0.35 +0.0 0.36 ± 2% perf-profile.children.cycles-pp.obj_cgroup_charge
0.29 ± 2% +0.0 0.31 ± 2% perf-profile.children.cycles-pp.lru_add_drain_cpu
0.09 +0.0 0.11 ± 3% perf-profile.children.cycles-pp.aa_file_perm
0.44 ± 2% +0.0 0.46 perf-profile.children.cycles-pp.refill_obj_stock
0.27 ± 3% +0.0 0.30 ± 2% perf-profile.children.cycles-pp.vma_merge
1.28 +0.0 1.31 perf-profile.children.cycles-pp.prepend
0.76 +0.0 0.79 perf-profile.children.cycles-pp.up_write
0.32 ± 2% +0.0 0.35 perf-profile.children.cycles-pp.downgrade_write
0.27 ± 3% +0.0 0.30 ± 2% perf-profile.children.cycles-pp.cap_mmap_file
0.07 ± 6% +0.0 0.11 ± 4% perf-profile.children.cycles-pp.ima_file_mmap
0.28 ± 2% +0.0 0.32 ± 2% perf-profile.children.cycles-pp.__vm_enough_memory
0.03 ±100% +0.0 0.07 perf-profile.children.cycles-pp.fput
0.93 +0.0 0.98 perf-profile.children.cycles-pp.strlen
0.34 ± 2% +0.0 0.39 perf-profile.children.cycles-pp.apparmor_mmap_file
0.26 +0.0 0.31 ± 2% perf-profile.children.cycles-pp.sync_mm_rss
0.22 ± 3% +0.1 0.27 ± 2% perf-profile.children.cycles-pp.vma_interval_tree_remove
0.25 +0.1 0.30 ± 6% perf-profile.children.cycles-pp.percpu_counter_add_batch
1.40 +0.1 1.45 perf-profile.children.cycles-pp.find_vma
0.14 ± 7% +0.1 0.20 ± 9% perf-profile.children.cycles-pp.vm_pgprot_modify
0.16 ± 5% +0.1 0.21 ± 2% perf-profile.children.cycles-pp.uprobe_mmap
1.13 +0.1 1.19 perf-profile.children.cycles-pp.memcpy_erms
0.59 ± 2% +0.1 0.66 perf-profile.children.cycles-pp.common_file_perm
4.85 +0.1 4.93 perf-profile.children.cycles-pp._cond_resched
0.67 ± 2% +0.1 0.77 perf-profile.children.cycles-pp.kfree
0.33 ± 3% +0.1 0.43 perf-profile.children.cycles-pp.vma_set_page_prot
2.01 +0.1 2.13 perf-profile.children.cycles-pp.arch_get_unmapped_area_topdown
25.40 +0.1 25.53 perf-profile.children.cycles-pp.do_mmap
9.46 +0.2 9.61 perf-profile.children.cycles-pp.perf_event_mmap
8.57 +0.2 8.73 perf-profile.children.cycles-pp.syscall_exit_to_user_mode
1.74 +0.2 1.90 perf-profile.children.cycles-pp.security_mmap_file
2.39 ± 5% +0.2 2.58 ± 2% perf-profile.children.cycles-pp.kmem_cache_free
0.83 +0.2 1.04 perf-profile.children.cycles-pp.vm_unmapped_area
2.96 ± 4% +0.2 3.18 perf-profile.children.cycles-pp.kmem_cache_alloc
3.49 ± 3% +0.2 3.73 perf-profile.children.cycles-pp.vm_area_alloc
20.27 +0.3 20.59 perf-profile.children.cycles-pp.mmap_region
5.89 +0.4 6.30 perf-profile.children.cycles-pp.syscall_return_via_sysret
28.34 +0.4 28.76 perf-profile.children.cycles-pp.vm_mmap_pgoff
29.42 +0.4 29.86 perf-profile.children.cycles-pp.ksys_mmap_pgoff
2.20 +0.5 2.66 perf-profile.children.cycles-pp.zap_pte_range
41.40 +0.6 41.99 perf-profile.children.cycles-pp.__mmap
11.45 -1.0 10.48 perf-profile.self.cycles-pp.unmap_page_range
12.18 -0.4 11.82 perf-profile.self.cycles-pp.___might_sleep
6.98 -0.2 6.74 perf-profile.self.cycles-pp.free_pgd_range
0.31 ± 2% -0.2 0.14 perf-profile.self.cycles-pp.__rb_insert_augmented
5.52 -0.1 5.40 perf-profile.self.cycles-pp.entry_SYSCALL_64
1.04 -0.1 0.93 ± 2% perf-profile.self.cycles-pp.do_mmap
0.75 -0.1 0.64 perf-profile.self.cycles-pp.__vma_rb_erase
0.31 ± 4% -0.1 0.24 ± 6% perf-profile.self.cycles-pp.get_unmapped_area
0.15 ± 2% -0.1 0.10 ± 4% perf-profile.self.cycles-pp.__vma_link_file
0.90 -0.0 0.85 perf-profile.self.cycles-pp.prepend_name
0.93 ± 2% -0.0 0.88 ± 3% perf-profile.self.cycles-pp.shmem_get_unmapped_area
0.54 ± 2% -0.0 0.50 perf-profile.self.cycles-pp.down_write
0.66 -0.0 0.61 ± 2% perf-profile.self.cycles-pp.__mmap
0.94 -0.0 0.90 perf-profile.self.cycles-pp.arch_get_unmapped_area_topdown
0.10 ± 4% -0.0 0.06 perf-profile.self.cycles-pp.get_align_mask
0.33 -0.0 0.29 perf-profile.self.cycles-pp.__x86_retpoline_r9
0.16 ± 2% -0.0 0.14 ± 3% perf-profile.self.cycles-pp.current_time
0.92 -0.0 0.89 perf-profile.self.cycles-pp.__might_sleep
0.15 ± 2% -0.0 0.12 ± 3% perf-profile.self.cycles-pp.timestamp_truncate
0.42 -0.0 0.39 ± 2% perf-profile.self.cycles-pp.syscall_enter_from_user_mode
0.14 ± 3% -0.0 0.12 perf-profile.self.cycles-pp.prepend
0.10 ± 4% -0.0 0.08 ± 6% perf-profile.self.cycles-pp.get_mmap_base
0.35 ± 2% -0.0 0.33 perf-profile.self.cycles-pp.security_mmap_file
0.15 ± 3% -0.0 0.14 ± 3% perf-profile.self.cycles-pp.vma_interval_tree_insert
0.13 ± 3% -0.0 0.12 perf-profile.self.cycles-pp.tlb_flush_mmu
0.06 ± 6% -0.0 0.05 perf-profile.self.cycles-pp.munmap@plt
0.05 +0.0 0.06 ± 6% perf-profile.self.cycles-pp.should_failslab
0.11 ± 6% +0.0 0.12 ± 4% perf-profile.self.cycles-pp.security_vm_enough_memory_mm
0.33 +0.0 0.35 ± 2% perf-profile.self.cycles-pp.obj_cgroup_charge
0.50 +0.0 0.52 perf-profile.self.cycles-pp.vm_area_alloc
0.30 +0.0 0.32 ± 3% perf-profile.self.cycles-pp.unmap_vmas
0.24 +0.0 0.26 perf-profile.self.cycles-pp.unmap_region
0.08 ± 5% +0.0 0.10 ± 4% perf-profile.self.cycles-pp.aa_file_perm
0.38 +0.0 0.40 perf-profile.self.cycles-pp.vmacache_find
0.42 ± 2% +0.0 0.45 perf-profile.self.cycles-pp.refill_obj_stock
0.25 +0.0 0.27 ± 2% perf-profile.self.cycles-pp.atime_needs_update
0.28 +0.0 0.30 ± 2% perf-profile.self.cycles-pp.vm_mmap_pgoff
0.26 +0.0 0.28 ± 2% perf-profile.self.cycles-pp.vma_merge
0.11 ± 7% +0.0 0.15 ± 7% perf-profile.self.cycles-pp.vma_set_page_prot
0.31 ± 2% +0.0 0.34 perf-profile.self.cycles-pp.downgrade_write
0.74 +0.0 0.77 perf-profile.self.cycles-pp.up_write
0.83 +0.0 0.87 perf-profile.self.cycles-pp.find_vma
0.07 ± 7% +0.0 0.10 ± 4% perf-profile.self.cycles-pp.ima_file_mmap
0.22 ± 3% +0.0 0.26 perf-profile.self.cycles-pp.cap_mmap_file
0.30 ± 2% +0.0 0.34 perf-profile.self.cycles-pp.apparmor_mmap_file
0.21 ± 2% +0.0 0.26 perf-profile.self.cycles-pp.__x64_sys_munmap
0.93 +0.0 0.97 perf-profile.self.cycles-pp.strlen
0.12 ± 4% +0.0 0.17 perf-profile.self.cycles-pp.free_pgtables
0.24 +0.0 0.29 ± 5% perf-profile.self.cycles-pp.percpu_counter_add_batch
0.26 +0.0 0.31 ± 2% perf-profile.self.cycles-pp.sync_mm_rss
0.17 ± 4% +0.1 0.22 ± 4% perf-profile.self.cycles-pp.vma_interval_tree_remove
0.13 ± 8% +0.1 0.18 ± 11% perf-profile.self.cycles-pp.vm_pgprot_modify
0.50 ± 2% +0.1 0.55 perf-profile.self.cycles-pp.common_file_perm
1.31 +0.1 1.36 perf-profile.self.cycles-pp.mmap_region
0.15 ± 3% +0.1 0.20 ± 2% perf-profile.self.cycles-pp.uprobe_mmap
2.22 +0.1 2.30 perf-profile.self.cycles-pp._cond_resched
1.06 +0.1 1.15 perf-profile.self.cycles-pp.memcpy_erms
0.66 ± 2% +0.1 0.76 perf-profile.self.cycles-pp.kfree
8.18 +0.2 8.36 perf-profile.self.cycles-pp.syscall_exit_to_user_mode
0.82 +0.2 1.04 perf-profile.self.cycles-pp.vm_unmapped_area
1.59 +0.4 2.00 perf-profile.self.cycles-pp.zap_pte_range
5.87 +0.4 6.29 perf-profile.self.cycles-pp.syscall_return_via_sysret
will-it-scale.per_process_ops
223000 +------------------------------------------------------------------+
| O |
222000 |-+ O O |
221000 |-+ O O |
| O O O |
220000 |-+ O |
219000 |-+ O |
| |
218000 |-+ +.. |
217000 |-+ .. .. |
|..... . .+.. |
216000 |-+ +.... .. +..... ...+.... .. .. |
215000 |-+ + +.. +..... ..+.....+. .|
| +.. |
214000 +------------------------------------------------------------------+
[*] bisect-good sample
[O] bisect-bad sample
***************************************************************************************************
lkp-skl-fpga01: 104 threads Skylake with 192G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
gcc-9/performance/x86_64-rhel-8.3/process/100%/debian-10.4-x86_64-20200603.cgz/lkp-skl-fpga01/mmap1/will-it-scale/0x2006906
commit:
7a4830c380 ("mm/fork: Pass new vma pointer into copy_page_range()")
70e806e4e6 ("mm: Do early cow for pinned pages during fork() for ptes")
7a4830c380f3a8b3 70e806e4e645019102d0e09d493
---------------- ---------------------------
%stddev %change %stddev
\ | \
238782 +2.0% 243561 will-it-scale.per_process_ops
9565 -0.9% 9477 will-it-scale.time.maximum_resident_set_size
24833379 +2.0% 25330430 will-it-scale.workload
18752 ± 9% -14.9% 15950 ± 12% numa-meminfo.node0.Mapped
4793 ± 9% -15.7% 4041 ± 13% numa-vmstat.node0.nr_mapped
12863 ± 12% +18.5% 15248 ± 11% sched_debug.cpu.sched_count.max
1740 ± 6% -12.1% 1530 slabinfo.kmalloc-rcl-512.active_objs
1740 ± 6% -12.1% 1530 slabinfo.kmalloc-rcl-512.num_objs
18707 +9.1% 20407 ± 3% softirqs.CPU16.RCU
17047 ± 2% +8.1% 18429 ± 4% softirqs.CPU40.RCU
82.25 -1.5% 81.00 vmstat.cpu.sy
16.00 +6.2% 17.00 vmstat.cpu.us
8495 ± 38% -96.9% 260.75 ± 82% proc-vmstat.numa_hint_faults
9348 ± 56% -93.2% 632.75 ±118% proc-vmstat.numa_pages_migrated
41007 ± 55% -86.3% 5613 ± 69% proc-vmstat.numa_pte_updates
9348 ± 56% -93.2% 632.75 ±118% proc-vmstat.pgmigrate_success
7387 -37.7% 4602 ± 34% interrupts.CPU10.NMI:Non-maskable_interrupts
7387 -37.7% 4602 ± 34% interrupts.CPU10.PMI:Performance_monitoring_interrupts
313.00 ± 2% +23.9% 387.75 ± 20% interrupts.CPU24.RES:Rescheduling_interrupts
353.75 ± 8% -9.7% 319.50 ± 5% interrupts.CPU37.RES:Rescheduling_interrupts
5248 ± 22% -37.7% 3269 ± 27% interrupts.CPU53.CAL:Function_call_interrupts
6467 ± 24% -28.8% 4602 ± 34% interrupts.CPU6.NMI:Non-maskable_interrupts
6467 ± 24% -28.8% 4602 ± 34% interrupts.CPU6.PMI:Performance_monitoring_interrupts
610.00 ± 23% -24.1% 463.00 interrupts.CPU62.CAL:Function_call_interrupts
6466 ± 24% -28.8% 4603 ± 34% interrupts.CPU7.NMI:Non-maskable_interrupts
6466 ± 24% -28.8% 4603 ± 34% interrupts.CPU7.PMI:Performance_monitoring_interrupts
453.50 +80.8% 819.75 ± 55% interrupts.CPU8.CAL:Function_call_interrupts
6467 ± 24% -28.8% 4602 ± 34% interrupts.CPU8.NMI:Non-maskable_interrupts
6467 ± 24% -28.8% 4602 ± 34% interrupts.CPU8.PMI:Performance_monitoring_interrupts
7388 -37.7% 4603 ± 34% interrupts.CPU9.NMI:Non-maskable_interrupts
7388 -37.7% 4603 ± 34% interrupts.CPU9.PMI:Performance_monitoring_interrupts
5.672e+10 +2.0% 5.784e+10 perf-stat.i.branch-instructions
2.382e+08 +2.7% 2.446e+08 perf-stat.i.branch-misses
1.18 -2.0% 1.16 perf-stat.i.cpi
49520439 +1.9% 50482111 perf-stat.i.dTLB-load-misses
5.872e+10 +2.0% 5.987e+10 perf-stat.i.dTLB-loads
44598 +4.4% 46565 perf-stat.i.dTLB-store-misses
2.605e+10 +2.0% 2.656e+10 perf-stat.i.dTLB-stores
96.00 -7.9 88.06 ± 6% perf-stat.i.iTLB-load-miss-rate%
49281952 +2.2% 50384621 perf-stat.i.iTLB-load-misses
1928684 ± 46% +259.5% 6934327 ± 58% perf-stat.i.iTLB-loads
2.354e+11 +2.0% 2.4e+11 perf-stat.i.instructions
0.85 +2.0% 0.86 perf-stat.i.ipc
1360 +2.0% 1387 perf-stat.i.metric.M/sec
2838 -0.9% 2811 perf-stat.i.minor-faults
2838 -0.9% 2812 perf-stat.i.page-faults
1.18 -2.0% 1.16 perf-stat.overall.cpi
0.00 +0.0 0.00 perf-stat.overall.dTLB-store-miss-rate%
96.28 -8.0 88.30 ± 6% perf-stat.overall.iTLB-load-miss-rate%
0.85 +2.0% 0.86 perf-stat.overall.ipc
5.653e+10 +2.0% 5.765e+10 perf-stat.ps.branch-instructions
2.375e+08 +2.7% 2.438e+08 perf-stat.ps.branch-misses
49350510 +1.9% 50309290 perf-stat.ps.dTLB-load-misses
5.852e+10 +2.0% 5.967e+10 perf-stat.ps.dTLB-loads
44555 +4.3% 46476 perf-stat.ps.dTLB-store-misses
2.596e+10 +2.0% 2.647e+10 perf-stat.ps.dTLB-stores
49109500 +2.2% 50209811 perf-stat.ps.iTLB-load-misses
1915468 ± 45% +260.7% 6909154 ± 58% perf-stat.ps.iTLB-loads
2.346e+11 +2.0% 2.392e+11 perf-stat.ps.instructions
2834 -1.0% 2805 perf-stat.ps.minor-faults
2834 -1.0% 2806 perf-stat.ps.page-faults
7.095e+13 +1.9% 7.23e+13 perf-stat.total.instructions
34.83 -1.1 33.77 perf-profile.calltrace.cycles-pp.unmap_vmas.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
44.65 -1.0 43.62 perf-profile.calltrace.cycles-pp.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
18.45 -0.9 17.52 perf-profile.calltrace.cycles-pp.___might_sleep.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
52.42 -0.7 51.69 perf-profile.calltrace.cycles-pp.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
53.69 -0.7 52.96 perf-profile.calltrace.cycles-pp.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
31.25 -0.7 30.52 perf-profile.calltrace.cycles-pp.unmap_page_range.unmap_vmas.unmap_region.__do_munmap.__vm_munmap
54.16 -0.7 53.47 perf-profile.calltrace.cycles-pp.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
54.49 -0.7 53.84 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
2.31 -0.5 1.77 perf-profile.calltrace.cycles-pp.vma_link.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
59.40 -0.4 58.98 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__munmap
1.62 -0.2 1.38 perf-profile.calltrace.cycles-pp.__vma_link_rb.vma_link.mmap_region.do_mmap.vm_mmap_pgoff
18.16 -0.2 17.93 perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
22.17 -0.2 21.94 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
21.60 -0.2 21.38 perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
21.31 -0.2 21.11 perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
1.68 ± 2% -0.1 1.54 perf-profile.calltrace.cycles-pp.find_vma.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
0.62 ± 2% -0.0 0.59 perf-profile.calltrace.cycles-pp.security_mmap_addr.get_unmapped_area.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
0.56 +0.0 0.58 perf-profile.calltrace.cycles-pp.tlb_finish_mmu.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
0.62 +0.0 0.65 perf-profile.calltrace.cycles-pp.cap_vm_enough_memory.security_vm_enough_memory_mm.mmap_region.do_mmap.vm_mmap_pgoff
2.55 +0.1 2.65 perf-profile.calltrace.cycles-pp.rcu_all_qs._cond_resched.unmap_page_range.unmap_vmas.unmap_region
3.15 +0.1 3.26 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__munmap
0.89 +0.1 1.01 ± 2% perf-profile.calltrace.cycles-pp.perf_event_mmap_output.perf_iterate_sb.perf_event_mmap.mmap_region.do_mmap
3.87 +0.2 4.03 perf-profile.calltrace.cycles-pp.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
4.65 +0.2 4.87 perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe.__munmap
2.72 ± 2% +0.2 2.95 perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.__mmap
1.94 +0.2 2.19 perf-profile.calltrace.cycles-pp.perf_iterate_sb.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
2.68 ± 2% +0.3 2.93 perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.__munmap
1.00 +0.3 1.26 perf-profile.calltrace.cycles-pp.vm_unmapped_area.arch_get_unmapped_area_topdown.get_unmapped_area.do_mmap.vm_mmap_pgoff
2.05 +0.3 2.32 perf-profile.calltrace.cycles-pp.arch_get_unmapped_area_topdown.get_unmapped_area.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
5.05 +0.3 5.32 perf-profile.calltrace.cycles-pp._cond_resched.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
2.80 ± 2% +0.4 3.15 perf-profile.calltrace.cycles-pp.zap_pte_range.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
2.93 ± 7% +0.4 3.33 ± 3% perf-profile.calltrace.cycles-pp.remove_vma.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
2.27 ± 9% +0.5 2.73 ± 3% perf-profile.calltrace.cycles-pp.kmem_cache_free.remove_vma.__do_munmap.__vm_munmap.__x64_sys_munmap
0.00 +0.6 0.58 perf-profile.calltrace.cycles-pp.free_pgtables.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
34.85 -1.1 33.80 perf-profile.children.cycles-pp.unmap_vmas
44.73 -1.0 43.70 perf-profile.children.cycles-pp.unmap_region
33.89 -1.0 32.89 perf-profile.children.cycles-pp.unmap_page_range
76.71 -0.9 75.83 perf-profile.children.cycles-pp.do_syscall_64
52.54 -0.7 51.79 perf-profile.children.cycles-pp.__do_munmap
53.73 -0.7 53.00 perf-profile.children.cycles-pp.__vm_munmap
16.84 -0.7 16.13 perf-profile.children.cycles-pp.___might_sleep
54.19 -0.7 53.49 perf-profile.children.cycles-pp.__x64_sys_munmap
86.59 -0.6 86.02 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
2.36 -0.5 1.82 perf-profile.children.cycles-pp.vma_link
1.64 -0.3 1.38 perf-profile.children.cycles-pp.__vma_link_rb
18.21 -0.2 17.98 perf-profile.children.cycles-pp.do_mmap
21.63 -0.2 21.41 perf-profile.children.cycles-pp.ksys_mmap_pgoff
21.36 -0.2 21.16 perf-profile.children.cycles-pp.vm_mmap_pgoff
0.30 ± 2% -0.2 0.11 ± 4% perf-profile.children.cycles-pp.__rb_insert_augmented
1.78 ± 2% -0.2 1.63 perf-profile.children.cycles-pp.find_vma
0.15 ± 2% -0.1 0.07 ± 6% perf-profile.children.cycles-pp.__vma_link_file
0.41 ± 3% -0.1 0.35 perf-profile.children.cycles-pp.cap_mmap_addr
0.76 -0.1 0.71 perf-profile.children.cycles-pp.__might_sleep
0.47 -0.0 0.42 perf-profile.children.cycles-pp.obj_cgroup_charge
0.43 ± 3% -0.0 0.39 ± 3% perf-profile.children.cycles-pp.tlb_gather_mmu
0.64 ± 2% -0.0 0.60 perf-profile.children.cycles-pp.security_mmap_addr
0.23 ± 3% -0.0 0.20 ± 2% perf-profile.children.cycles-pp.vmacache_update
0.25 -0.0 0.22 perf-profile.children.cycles-pp.strlen
0.45 -0.0 0.42 ± 2% perf-profile.children.cycles-pp.apparmor_mmap_file
0.22 -0.0 0.20 ± 2% perf-profile.children.cycles-pp.cap_capable
0.23 -0.0 0.22 perf-profile.children.cycles-pp.__x64_sys_mmap
0.10 +0.0 0.11 perf-profile.children.cycles-pp.vm_area_free
0.10 ± 4% +0.0 0.12 ± 3% perf-profile.children.cycles-pp.__x86_retpoline_rbp
0.10 ± 5% +0.0 0.11 perf-profile.children.cycles-pp.get_mmap_base
0.18 ± 2% +0.0 0.20 perf-profile.children.cycles-pp.testcase
0.08 ± 5% +0.0 0.10 perf-profile.children.cycles-pp.unlink_file_vma
0.58 +0.0 0.60 perf-profile.children.cycles-pp.tlb_finish_mmu
0.47 ± 2% +0.0 0.49 perf-profile.children.cycles-pp.__x86_retpoline_rax
0.29 ± 2% +0.0 0.32 perf-profile.children.cycles-pp.__x86_retpoline_r9
0.47 +0.0 0.50 ± 2% perf-profile.children.cycles-pp.syscall_enter_from_user_mode
0.38 +0.0 0.41 perf-profile.children.cycles-pp.downgrade_write
0.48 ± 3% +0.0 0.52 perf-profile.children.cycles-pp.vma_merge
0.22 ± 3% +0.0 0.26 perf-profile.children.cycles-pp.unlink_anon_vmas
0.16 ± 5% +0.0 0.20 ± 2% perf-profile.children.cycles-pp.blocking_notifier_call_chain
0.35 ± 2% +0.0 0.39 ± 2% perf-profile.children.cycles-pp.percpu_counter_add_batch
0.40 ± 2% +0.1 0.46 perf-profile.children.cycles-pp.refill_obj_stock
0.33 ± 2% +0.1 0.39 perf-profile.children.cycles-pp.__vm_enough_memory
2.90 +0.1 2.98 perf-profile.children.cycles-pp.rcu_all_qs
0.51 +0.1 0.61 perf-profile.children.cycles-pp.free_pgtables
0.33 +0.1 0.44 perf-profile.children.cycles-pp.cap_mmap_file
0.92 +0.1 1.03 ± 2% perf-profile.children.cycles-pp.perf_event_mmap_output
3.94 +0.2 4.09 perf-profile.children.cycles-pp.perf_event_mmap
5.32 +0.2 5.57 perf-profile.children.cycles-pp._cond_resched
1.97 +0.3 2.22 perf-profile.children.cycles-pp.perf_iterate_sb
1.01 +0.3 1.27 perf-profile.children.cycles-pp.vm_unmapped_area
2.09 +0.3 2.36 perf-profile.children.cycles-pp.arch_get_unmapped_area_topdown
9.39 +0.3 9.68 perf-profile.children.cycles-pp.syscall_exit_to_user_mode
2.83 ± 2% +0.4 3.19 perf-profile.children.cycles-pp.zap_pte_range
2.97 ± 6% +0.4 3.37 ± 3% perf-profile.children.cycles-pp.remove_vma
2.29 ± 9% +0.5 2.75 ± 3% perf-profile.children.cycles-pp.kmem_cache_free
6.16 +0.5 6.65 perf-profile.children.cycles-pp.syscall_return_via_sysret
13.46 -1.2 12.24 perf-profile.self.cycles-pp.unmap_page_range
14.10 -0.4 13.67 perf-profile.self.cycles-pp.___might_sleep
1.62 -0.2 1.37 perf-profile.self.cycles-pp.__vma_link_rb
0.59 ± 4% -0.2 0.40 perf-profile.self.cycles-pp.get_unmapped_area
0.86 ± 2% -0.2 0.67 ± 2% perf-profile.self.cycles-pp.do_mmap
0.29 ± 2% -0.2 0.11 ± 3% perf-profile.self.cycles-pp.__rb_insert_augmented
1.04 ± 2% -0.1 0.94 perf-profile.self.cycles-pp.find_vma
0.13 -0.1 0.04 ± 57% perf-profile.self.cycles-pp.__vma_link_file
0.63 -0.1 0.54 ± 3% perf-profile.self.cycles-pp.security_mmap_file
1.63 -0.1 1.56 perf-profile.self.cycles-pp.perf_event_mmap
0.36 ± 3% -0.1 0.29 ± 2% perf-profile.self.cycles-pp.cap_mmap_addr
0.70 -0.1 0.65 perf-profile.self.cycles-pp.__might_sleep
0.45 -0.0 0.41 perf-profile.self.cycles-pp.obj_cgroup_charge
0.42 ± 3% -0.0 0.38 ± 2% perf-profile.self.cycles-pp.tlb_gather_mmu
0.40 -0.0 0.37 ± 2% perf-profile.self.cycles-pp.apparmor_mmap_file
0.75 -0.0 0.71 perf-profile.self.cycles-pp.__mmap
0.21 ± 3% -0.0 0.18 ± 3% perf-profile.self.cycles-pp.vmacache_update
0.23 -0.0 0.20 ± 2% perf-profile.self.cycles-pp.vma_link
0.23 -0.0 0.21 ± 2% perf-profile.self.cycles-pp.strlen
0.13 ± 3% -0.0 0.11 ± 3% perf-profile.self.cycles-pp.security_mmap_addr
0.08 +0.0 0.09 perf-profile.self.cycles-pp.vm_area_free
0.09 +0.0 0.10 perf-profile.self.cycles-pp.get_mmap_base
0.07 +0.0 0.08 ± 5% perf-profile.self.cycles-pp.unlink_file_vma
0.09 ± 4% +0.0 0.10 perf-profile.self.cycles-pp.can_vma_merge_before
0.11 ± 3% +0.0 0.13 perf-profile.self.cycles-pp.testcase
0.22 ± 3% +0.0 0.25 perf-profile.self.cycles-pp.userfaultfd_unmap_prep
0.27 +0.0 0.29 perf-profile.self.cycles-pp.__x86_retpoline_r9
0.43 +0.0 0.46 ± 2% perf-profile.self.cycles-pp.syscall_enter_from_user_mode
0.21 ± 3% +0.0 0.24 perf-profile.self.cycles-pp.unlink_anon_vmas
0.15 ± 3% +0.0 0.19 ± 3% perf-profile.self.cycles-pp.blocking_notifier_call_chain
0.36 +0.0 0.40 perf-profile.self.cycles-pp.vm_mmap_pgoff
0.35 +0.0 0.39 perf-profile.self.cycles-pp.tlb_finish_mmu
0.34 ± 2% +0.0 0.37 ± 2% perf-profile.self.cycles-pp.percpu_counter_add_batch
0.51 +0.0 0.55 ± 2% perf-profile.self.cycles-pp.__munmap
0.36 +0.0 0.40 perf-profile.self.cycles-pp.downgrade_write
2.20 +0.0 2.24 perf-profile.self.cycles-pp.rcu_all_qs
0.21 ± 3% +0.1 0.27 perf-profile.self.cycles-pp.free_pgtables
0.38 +0.1 0.44 perf-profile.self.cycles-pp.refill_obj_stock
0.62 +0.1 0.69 perf-profile.self.cycles-pp.vm_area_alloc
0.26 +0.1 0.36 perf-profile.self.cycles-pp.cap_mmap_file
0.86 +0.1 0.97 ± 2% perf-profile.self.cycles-pp.perf_event_mmap_output
0.99 +0.1 1.10 ± 4% perf-profile.self.cycles-pp.perf_iterate_sb
2.45 +0.2 2.63 perf-profile.self.cycles-pp._cond_resched
1.71 ± 8% +0.2 1.94 ± 5% perf-profile.self.cycles-pp.kmem_cache_alloc
0.98 +0.3 1.24 perf-profile.self.cycles-pp.vm_unmapped_area
8.99 +0.3 9.27 perf-profile.self.cycles-pp.syscall_exit_to_user_mode
2.04 ± 2% +0.4 2.39 perf-profile.self.cycles-pp.zap_pte_range
1.65 ± 12% +0.4 2.02 ± 4% perf-profile.self.cycles-pp.kmem_cache_free
6.13 +0.5 6.63 perf-profile.self.cycles-pp.syscall_return_via_sysret
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
Thanks,
Rong Chen
View attachment "config-5.9.0-rc6-00341-g70e806e4e64501" of type "text/plain" (170148 bytes)
View attachment "job-script" of type "text/plain" (7548 bytes)
View attachment "job.yaml" of type "text/plain" (5121 bytes)
View attachment "reproduce" of type "text/plain" (338 bytes)
Powered by blists - more mailing lists