lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <20180528114019.GF9904@yexl-desktop>
Date:   Mon, 28 May 2018 19:40:19 +0800
From:   kernel test robot <xiaolong.ye@...el.com>
To:     Tejun Heo <tj@...nel.org>
Cc:     LKML <linux-kernel@...r.kernel.org>, lkp@...org
Subject: [lkp-robot] [mm, memcontrol]  309fe96bfc:  vm-scalability.throughput
 +23.0% improvement


Greeting,

FYI, we noticed a +23.0% improvement of vm-scalability.throughput due to commit:


commit: 309fe96bfc0ae387f53612927a8f0dc3eb056efd ("mm, memcontrol: implement memory.swap.events")
https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master

in testcase: vm-scalability
on test machine: 144 threads Intel(R) Xeon(R) CPU E7-8890 v3 @ 2.50GHz with 512G memory
with following parameters:

	runtime: 300s
	size: 1T
	test: lru-shm
	cpufreq_governor: performance

test-description: The motivation behind this suite is to exercise functions and regions of the mm/ of the Linux kernel which are of interest to us.
test-url: https://git.kernel.org/cgit/linux/kernel/git/wfg/vm-scalability.git/



Details are as below:
-------------------------------------------------------------------------------------------------->
=========================================================================================
compiler/cpufreq_governor/kconfig/rootfs/runtime/size/tbox_group/test/testcase:
  gcc-7/performance/x86_64-rhel-7.2/debian-x86_64-2016-08-31.cgz/300s/1T/lkp-hsx04/lru-shm/vm-scalability

commit: 
  ccc2f49f99 ("mm, memcontrol: move swap charge handling into get_swap_page()")
  309fe96bfc ("mm, memcontrol: implement memory.swap.events")

ccc2f49f991f17cd 309fe96bfc0ae387f53612927a 
---------------- -------------------------- 
         %stddev     %change         %stddev
             \          |                \  
  71207426           +23.0%   87612470        vm-scalability.throughput
      0.32 ±  8%     -80.2%       0.06 ±  2%  vm-scalability.free_time
    499213           +23.4%     616000        vm-scalability.median
      0.01 ±  9%     -43.6%       0.00 ± 22%  vm-scalability.median_stddev
  71207426           +23.0%   87612470        vm-scalability.throughput
    305.83            +3.5%     316.49        vm-scalability.time.elapsed_time
    305.83            +3.5%     316.49        vm-scalability.time.elapsed_time.max
 7.933e+08            +8.3%  8.594e+08        vm-scalability.time.minor_page_faults
      2610           -18.8%       2119        vm-scalability.time.percent_of_cpu_this_job_got
      5076           -20.1%       4056        vm-scalability.time.system_time
      2910            -8.9%       2651        vm-scalability.time.user_time
     24540            +8.2%      26563        vm-scalability.time.voluntary_context_switches
 3.566e+09            +8.3%  3.863e+09        vm-scalability.workload
   4435819 ±  2%     +13.1%    5015715 ±  4%  cpuidle.C1E.time
     58453           +12.6%      65828 ± 13%  cpuidle.POLL.time
    630.29            -1.9%     618.41        pmeter.Average_Active_Power
    112976           +25.4%     141673        pmeter.performance_per_watt
     26.00           -19.2%      21.00        vmstat.procs.r
    147037            -1.2%     145251        vmstat.system.in
      0.00 ±173%      +0.0        0.00 ±124%  mpstat.cpu.iowait%
     11.66            -2.6        9.02        mpstat.cpu.sys%
      6.66            -0.8        5.86        mpstat.cpu.usr%
    113669           -12.8%      99110        meminfo.Active
    112018           -13.0%      97459        meminfo.Active(anon)
  23274932           -21.5%   18277464        meminfo.Mapped
     51042           -22.7%      39479        meminfo.PageTables
   5675691           -22.6%    4394275 ±  2%  numa-meminfo.node0.Mapped
     12906 ±  9%     -24.0%       9808 ± 11%  numa-meminfo.node0.PageTables
   5564225 ±  2%     -20.1%    4445143        numa-meminfo.node1.Mapped
     12478 ±  6%     -23.3%       9573 ± 11%  numa-meminfo.node1.PageTables
   5605568 ±  2%     -20.3%    4467557 ±  2%  numa-meminfo.node2.Mapped
     12399 ±  8%     -23.0%       9545 ±  8%  numa-meminfo.node2.PageTables
   5747984 ±  3%     -19.3%    4638538 ±  3%  numa-meminfo.node3.Mapped
     11853 ±  6%     -16.8%       9867 ± 10%  numa-meminfo.node3.PageTables
     40006 ±  2%     +32.6%      53040 ± 23%  numa-meminfo.node3.SUnreclaim
   1394386           -21.2%    1099228 ±  2%  numa-vmstat.node0.nr_mapped
      3220 ±  9%     -23.7%       2457 ±  9%  numa-vmstat.node0.nr_page_table_pages
   1385184 ±  2%     -19.8%    1111569 ±  2%  numa-vmstat.node1.nr_mapped
      3096 ±  6%     -22.4%       2404 ± 11%  numa-vmstat.node1.nr_page_table_pages
   1392379 ±  2%     -18.4%    1135757 ±  2%  numa-vmstat.node2.nr_mapped
      3056 ±  7%     -20.7%       2422 ±  7%  numa-vmstat.node2.nr_page_table_pages
   1477487 ±  2%     -18.7%    1201163 ±  3%  numa-vmstat.node3.nr_mapped
      3074 ±  4%     -17.2%       2546 ± 11%  numa-vmstat.node3.nr_page_table_pages
     10001 ±  2%     +32.6%      13259 ± 23%  numa-vmstat.node3.nr_slab_unreclaimable
      4316 ± 19%     -26.2%       3183 ±  3%  syscalls.sys_mmap.med
     66119 ± 25%     -41.3%      38816 ± 10%  syscalls.sys_newfstat.max
  71070823 ± 55%  -6.4e+07     6980408 ± 20%  syscalls.sys_newfstat.noise.100%
  86378359 ± 45%  -6.5e+07    20983557 ±  6%  syscalls.sys_newfstat.noise.2%
  83896012 ± 47%  -6.5e+07    18902607 ±  7%  syscalls.sys_newfstat.noise.25%
  86279365 ± 46%  -6.5e+07    20864391 ±  6%  syscalls.sys_newfstat.noise.5%
  79533721 ± 49%  -6.4e+07    15258271 ±  8%  syscalls.sys_newfstat.noise.50%
  74988875 ± 52%  -6.4e+07    11205147 ± 13%  syscalls.sys_newfstat.noise.75%
      2034 ±  9%     -16.9%       1690 ±  4%  syscalls.sys_read.med
      1598 ±  6%     -10.5%       1431 ±  3%  syscalls.sys_write.med
 5.102e+12            +9.0%  5.559e+12        perf-stat.branch-instructions
      1.37           -21.4%       1.08        perf-stat.cpi
 2.479e+13           -14.0%  2.132e+13        perf-stat.cpu-cycles
     20771            +2.6%      21302        perf-stat.cpu-migrations
  4.59e+12 ±  2%      +9.2%  5.014e+12        perf-stat.dTLB-loads
 1.483e+12 ±  4%     +10.6%  1.639e+12        perf-stat.dTLB-stores
 2.527e+09            +7.3%  2.712e+09        perf-stat.iTLB-load-misses
 1.804e+13 ±  2%      +9.3%  1.972e+13        perf-stat.instructions
      0.73           +27.1%       0.93        perf-stat.ipc
 7.943e+08            +8.3%  8.605e+08        perf-stat.minor-faults
 2.416e+09            +4.3%  2.519e+09        perf-stat.node-stores
 7.943e+08            +8.3%  8.605e+08        perf-stat.page-faults
     27996           -13.0%      24359        proc-vmstat.nr_active_anon
    237.75            +6.2%     252.50        proc-vmstat.nr_dirtied
  33832161            -1.0%   33504100        proc-vmstat.nr_file_pages
  33515106            -1.0%   33189801        proc-vmstat.nr_inactive_anon
    485.50            +0.8%     489.25        proc-vmstat.nr_inactive_file
     23158            -1.0%      22915        proc-vmstat.nr_kernel_stack
   5811543           -24.2%    4407834        proc-vmstat.nr_mapped
     12781           -25.1%       9571        proc-vmstat.nr_page_table_pages
  33521883            -1.0%   33192863        proc-vmstat.nr_shmem
    222.00           +13.1%     251.00        proc-vmstat.nr_written
     28001           -13.0%      24362        proc-vmstat.nr_zone_active_anon
  33515101            -1.0%   33189795        proc-vmstat.nr_zone_inactive_anon
    485.50            +0.8%     489.25        proc-vmstat.nr_zone_inactive_file
 7.959e+08            +8.3%  8.621e+08        proc-vmstat.numa_hit
 7.958e+08            +8.3%  8.621e+08        proc-vmstat.numa_local
     11401 ±  8%     -69.4%       3491 ± 23%  proc-vmstat.pgactivate
 7.969e+08            +8.4%  8.635e+08        proc-vmstat.pgalloc_normal
 7.944e+08            +8.3%  8.605e+08        proc-vmstat.pgfault
 7.964e+08            +8.4%   8.63e+08        proc-vmstat.pgfree
     76.68 ±173%    -100.0%       0.00 ± 10%  sched_debug.cfs_rq:/.MIN_vruntime.stddev
     29153           -18.2%      23841        sched_debug.cfs_rq:/.exec_clock.avg
     48865 ±  7%     -14.6%      41739 ±  7%  sched_debug.cfs_rq:/.exec_clock.max
     26558           -18.4%      21662        sched_debug.cfs_rq:/.exec_clock.min
     76.68 ±173%    -100.0%       0.00 ± 10%  sched_debug.cfs_rq:/.max_vruntime.stddev
   4166046           -19.1%    3372283        sched_debug.cfs_rq:/.min_vruntime.avg
   4360622           -19.2%    3524394        sched_debug.cfs_rq:/.min_vruntime.max
   3816276           -17.3%    3154309        sched_debug.cfs_rq:/.min_vruntime.min
    105670 ± 15%     -32.9%      70895 ± 16%  sched_debug.cfs_rq:/.min_vruntime.stddev
   -361713           -46.2%    -194567        sched_debug.cfs_rq:/.spread0.min
    105504 ± 15%     -32.8%      70895 ± 16%  sched_debug.cfs_rq:/.spread0.stddev
    309.71 ± 13%     -22.3%     240.75 ± 21%  sched_debug.cfs_rq:/.util_est_enqueued.max
      6.42 ± 19%     -37.9%       3.99 ±  5%  sched_debug.cpu.clock.stddev
      6.42 ± 19%     -37.9%       3.98 ±  5%  sched_debug.cpu.clock_task.stddev
      5.91 ±  7%     -14.8%       5.04 ±  6%  sched_debug.cpu.cpu_load[4].avg
    355621 ± 22%     -28.6%     253956 ±  5%  sched_debug.cpu.nr_switches.max
     40018 ± 12%     -30.5%      27804 ± 16%  sched_debug.cpu.nr_switches.stddev
      0.00 ± 19%    +100.0%       0.01 ± 24%  sched_debug.cpu.nr_uninterruptible.avg
    364939 ± 24%     -26.2%     269378 ±  6%  sched_debug.cpu.sched_count.max
     41878 ± 12%     -27.3%      30433 ± 13%  sched_debug.cpu.sched_count.stddev
    179801 ± 22%     -33.2%     120078 ±  3%  sched_debug.cpu.ttwu_count.max
     20153 ± 12%     -32.8%      13538 ± 19%  sched_debug.cpu.ttwu_count.stddev
    174157 ± 23%     -33.1%     116564 ±  2%  sched_debug.cpu.ttwu_local.max
     19436 ± 12%     -34.2%      12782 ± 20%  sched_debug.cpu.ttwu_local.stddev
     66.14           -66.1        0.00        perf-profile.calltrace.cycles-pp.do_access
     44.18           -44.2        0.00        perf-profile.calltrace.cycles-pp.page_fault.do_access
     44.15           -44.1        0.00        perf-profile.calltrace.cycles-pp.do_page_fault.page_fault.do_access
     44.13           -44.1        0.00        perf-profile.calltrace.cycles-pp.__do_page_fault.do_page_fault.page_fault.do_access
     42.34           -42.3        0.00        perf-profile.calltrace.cycles-pp.handle_mm_fault.__do_page_fault.do_page_fault.page_fault.do_access
     17.02           -17.0        0.00        perf-profile.calltrace.cycles-pp.do_rw_once
      6.85 ± 14%      -6.9        0.00        perf-profile.calltrace.cycles-pp.__munmap
      6.81 ± 14%      -6.8        0.00        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__munmap
      6.81 ± 14%      -6.8        0.00        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
      6.81 ± 14%      -6.8        0.00        perf-profile.calltrace.cycles-pp.vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
      6.81 ± 14%      -6.8        0.00        perf-profile.calltrace.cycles-pp.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
      8.94            -6.4        2.52 ±173%  perf-profile.calltrace.cycles-pp.clear_page_erms.shmem_getpage_gfp.shmem_fault.__do_fault.__handle_mm_fault
      5.83            -5.8        0.00        perf-profile.calltrace.cycles-pp.native_irq_return_iret.do_access
      6.80 ± 14%      -5.6        1.25 ±145%  perf-profile.calltrace.cycles-pp.unmap_page_range.unmap_vmas.unmap_region.do_munmap.vm_munmap
      6.80 ± 14%      -5.6        1.25 ±145%  perf-profile.calltrace.cycles-pp.unmap_vmas.unmap_region.do_munmap.vm_munmap.__x64_sys_munmap
      6.81 ± 14%      -5.6        1.26 ±144%  perf-profile.calltrace.cycles-pp.do_munmap.vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
      6.81 ± 14%      -5.6        1.26 ±144%  perf-profile.calltrace.cycles-pp.unmap_region.do_munmap.vm_munmap.__x64_sys_munmap.do_syscall_64
      8.00            -5.2        2.79 ±173%  perf-profile.calltrace.cycles-pp.filemap_map_pages.__handle_mm_fault.handle_mm_fault.__do_page_fault.do_page_fault
      5.34            -4.8        0.57 ±173%  perf-profile.calltrace.cycles-pp.finish_fault.__handle_mm_fault.handle_mm_fault.__do_page_fault.do_page_fault
      5.30            -4.7        0.56 ±173%  perf-profile.calltrace.cycles-pp.alloc_set_pte.finish_fault.__handle_mm_fault.handle_mm_fault.__do_page_fault
      0.00            +1.1        1.13 ± 91%  perf-profile.calltrace.cycles-pp.native_irq_return_iret
      0.00            +1.2        1.18 ± 31%  perf-profile.calltrace.cycles-pp.tick_sched_handle.tick_sched_timer.__hrtimer_run_queues.hrtimer_interrupt.smp_apic_timer_interrupt
      0.00            +1.2        1.18 ± 32%  perf-profile.calltrace.cycles-pp.get_next_timer_interrupt.tick_nohz_next_event.tick_nohz_get_sleep_length.menu_select.do_idle
      0.00            +1.3        1.27 ± 31%  perf-profile.calltrace.cycles-pp.load_balance.rebalance_domains.__softirqentry_text_start.irq_exit.smp_apic_timer_interrupt
      0.00            +1.3        1.33 ± 33%  perf-profile.calltrace.cycles-pp.tick_sched_timer.__hrtimer_run_queues.hrtimer_interrupt.smp_apic_timer_interrupt.apic_timer_interrupt
      0.00            +1.8        1.78 ± 32%  perf-profile.calltrace.cycles-pp.rebalance_domains.__softirqentry_text_start.irq_exit.smp_apic_timer_interrupt.apic_timer_interrupt
      0.00            +1.9        1.88 ± 33%  perf-profile.calltrace.cycles-pp.tick_nohz_next_event.tick_nohz_get_sleep_length.menu_select.do_idle.cpu_startup_entry
      0.00            +2.3        2.27 ± 33%  perf-profile.calltrace.cycles-pp.tick_nohz_get_sleep_length.menu_select.do_idle.cpu_startup_entry.start_secondary
      0.00            +2.6        2.63 ± 32%  perf-profile.calltrace.cycles-pp.__hrtimer_run_queues.hrtimer_interrupt.smp_apic_timer_interrupt.apic_timer_interrupt.cpuidle_enter_state
      0.23 ±173%      +3.1        3.36 ± 64%  perf-profile.calltrace.cycles-pp.__vfs_write.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.23 ±173%      +3.2        3.38 ± 63%  perf-profile.calltrace.cycles-pp.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
      0.23 ±173%      +3.2        3.39 ± 63%  perf-profile.calltrace.cycles-pp.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
      0.23 ±173%      +3.2        3.41 ± 63%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
      0.23 ±173%      +3.2        3.42 ± 63%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.write
      0.00            +3.2        3.21 ± 31%  perf-profile.calltrace.cycles-pp.__softirqentry_text_start.irq_exit.smp_apic_timer_interrupt.apic_timer_interrupt.cpuidle_enter_state
      0.23 ±173%      +3.3        3.52 ± 61%  perf-profile.calltrace.cycles-pp.write
      0.00            +3.5        3.51 ± 32%  perf-profile.calltrace.cycles-pp.hrtimer_interrupt.smp_apic_timer_interrupt.apic_timer_interrupt.cpuidle_enter_state.do_idle
      0.00            +3.7        3.74 ± 30%  perf-profile.calltrace.cycles-pp.irq_exit.smp_apic_timer_interrupt.apic_timer_interrupt.cpuidle_enter_state.do_idle
      0.00            +4.3        4.35 ± 33%  perf-profile.calltrace.cycles-pp.menu_select.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64
      0.87 ±  3%      +7.6        8.51 ± 31%  perf-profile.calltrace.cycles-pp.smp_apic_timer_interrupt.apic_timer_interrupt.cpuidle_enter_state.do_idle.cpu_startup_entry
      0.87 ±  3%      +7.7        8.58 ± 31%  perf-profile.calltrace.cycles-pp.apic_timer_interrupt.cpuidle_enter_state.do_idle.cpu_startup_entry.start_secondary
      7.11           +46.8       53.92 ± 30%  perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.do_idle.cpu_startup_entry.start_secondary
      8.11           +55.7       63.77 ± 30%  perf-profile.calltrace.cycles-pp.cpuidle_enter_state.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64
      8.72           +61.2       69.92 ± 30%  perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64
      8.72           +61.2       69.92 ± 30%  perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.secondary_startup_64
      8.72           +61.2       69.92 ± 30%  perf-profile.calltrace.cycles-pp.start_secondary.secondary_startup_64
      8.73           +61.4       70.12 ± 30%  perf-profile.calltrace.cycles-pp.secondary_startup_64
     66.14           -66.1        0.00        perf-profile.children.cycles-pp.do_access
     17.02           -17.0        0.00        perf-profile.children.cycles-pp.do_rw_once
      6.85 ± 14%      -6.9        0.00        perf-profile.children.cycles-pp.__munmap
      6.81 ± 14%      -5.5        1.30 ±137%  perf-profile.children.cycles-pp.do_munmap
      6.81 ± 14%      -5.5        1.31 ±136%  perf-profile.children.cycles-pp.unmap_vmas
      6.81 ± 14%      -5.5        1.31 ±136%  perf-profile.children.cycles-pp.unmap_page_range
      6.81 ± 14%      -5.5        1.30 ±137%  perf-profile.children.cycles-pp.vm_munmap
      6.81 ± 14%      -5.5        1.30 ±137%  perf-profile.children.cycles-pp.unmap_region
      6.81 ± 14%      -5.5        1.30 ±137%  perf-profile.children.cycles-pp.__x64_sys_munmap
      5.88            -5.1        0.75 ±173%  perf-profile.children.cycles-pp.alloc_set_pte
      5.35            -4.8        0.57 ±173%  perf-profile.children.cycles-pp.finish_fault
      5.89            -4.8        1.13 ± 90%  perf-profile.children.cycles-pp.native_irq_return_iret
      4.50 ± 13%      -3.9        0.63 ±155%  perf-profile.children.cycles-pp.page_remove_rmap
      2.89 ±  8%      -2.2        0.72 ±167%  perf-profile.children.cycles-pp.shmem_alloc_page
      2.86 ±  8%      -2.1        0.71 ±167%  perf-profile.children.cycles-pp.alloc_pages_vma
      2.73 ±  9%      -2.1        0.67 ±165%  perf-profile.children.cycles-pp.__alloc_pages_nodemask
      2.48 ± 10%      -1.9        0.59 ±165%  perf-profile.children.cycles-pp.get_page_from_freelist
      1.57 ± 16%      -1.2        0.34 ±164%  perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
      1.25 ± 20%      -0.8        0.40 ± 48%  perf-profile.children.cycles-pp._raw_spin_lock
      0.00            +0.1        0.08 ± 21%  perf-profile.children.cycles-pp.ret_from_intr
      0.00            +0.1        0.08 ± 24%  perf-profile.children.cycles-pp.update_rq_clock
      0.00            +0.1        0.09 ± 26%  perf-profile.children.cycles-pp.update_group_capacity
      0.00            +0.1        0.09 ± 26%  perf-profile.children.cycles-pp.intel_pmu_disable_all
      0.00            +0.1        0.09 ± 28%  perf-profile.children.cycles-pp.perf_event_task_tick
      0.11 ±  6%      +0.1        0.22 ± 15%  perf-profile.children.cycles-pp.__indirect_thunk_start
      0.00            +0.1        0.11 ± 34%  perf-profile.children.cycles-pp.cpu_load_update
      0.00            +0.1        0.12 ± 25%  perf-profile.children.cycles-pp.run_posix_cpu_timers
      0.00            +0.1        0.12 ± 33%  perf-profile.children.cycles-pp.rb_next
      0.00            +0.1        0.12 ± 19%  perf-profile.children.cycles-pp.interrupt_entry
      0.01 ±173%      +0.1        0.13 ± 21%  perf-profile.children.cycles-pp.ktime_get_update_offsets_now
      0.00            +0.1        0.12 ± 36%  perf-profile.children.cycles-pp.rcu_eqs_exit
      0.00            +0.1        0.14 ± 39%  perf-profile.children.cycles-pp.nr_iowait_cpu
      0.00            +0.1        0.14 ± 38%  perf-profile.children.cycles-pp.rcu_dynticks_eqs_exit
      0.00            +0.1        0.14 ± 38%  perf-profile.children.cycles-pp.leave_mm
      0.00            +0.1        0.14 ± 26%  perf-profile.children.cycles-pp.__intel_pmu_enable_all
      0.00            +0.1        0.14 ± 36%  perf-profile.children.cycles-pp.irq_work_needs_cpu
      0.00            +0.1        0.14 ± 30%  perf-profile.children.cycles-pp.idle_cpu
      0.00            +0.1        0.14 ± 30%  perf-profile.children.cycles-pp.call_cpuidle
      0.00            +0.2        0.15 ± 35%  perf-profile.children.cycles-pp.rcu_irq_exit
      0.00            +0.2        0.16 ± 40%  perf-profile.children.cycles-pp.rcu_needs_cpu
      0.00            +0.2        0.16 ± 31%  perf-profile.children.cycles-pp.get_cpu_device
      0.00            +0.2        0.16 ± 28%  perf-profile.children.cycles-pp.tick_check_oneshot_broadcast_this_cpu
      0.00            +0.2        0.16 ± 38%  perf-profile.children.cycles-pp.native_apic_mem_write
      0.00            +0.2        0.16 ± 36%  perf-profile.children.cycles-pp.find_next_and_bit
      0.00            +0.2        0.17 ± 43%  perf-profile.children.cycles-pp.timekeeping_max_deferment
      0.00            +0.2        0.18 ± 34%  perf-profile.children.cycles-pp.cpumask_next_and
      0.00            +0.2        0.19 ± 37%  perf-profile.children.cycles-pp.timerqueue_add
      0.00            +0.2        0.19 ± 38%  perf-profile.children.cycles-pp.enqueue_hrtimer
      0.00            +0.2        0.20 ± 34%  perf-profile.children.cycles-pp.update_ts_time_stats
      0.00            +0.2        0.20 ± 30%  perf-profile.children.cycles-pp.rcu_idle_exit
      0.04 ± 58%      +0.2        0.25 ± 28%  perf-profile.children.cycles-pp.irq_work_run_list
      0.00            +0.2        0.21 ± 28%  perf-profile.children.cycles-pp.tick_nohz_irq_exit
      0.03 ±100%      +0.2        0.24 ± 28%  perf-profile.children.cycles-pp.irq_work_interrupt
      0.03 ±100%      +0.2        0.24 ± 28%  perf-profile.children.cycles-pp.smp_irq_work_interrupt
      0.03 ±100%      +0.2        0.24 ± 28%  perf-profile.children.cycles-pp.irq_work_run
      0.03 ±100%      +0.2        0.24 ± 28%  perf-profile.children.cycles-pp.printk
      0.00            +0.2        0.22 ± 36%  perf-profile.children.cycles-pp.arch_cpu_idle_enter
      0.00            +0.2        0.23 ± 39%  perf-profile.children.cycles-pp.tsc_verify_tsc_adjust
      0.00            +0.2        0.24 ± 35%  perf-profile.children.cycles-pp.hrtimer_get_next_event
      0.00            +0.2        0.24 ± 33%  perf-profile.children.cycles-pp._raw_spin_trylock
      0.01 ±173%      +0.2        0.25 ± 45%  perf-profile.children.cycles-pp.rcu_process_callbacks
      0.00            +0.3        0.26 ± 34%  perf-profile.children.cycles-pp.pm_qos_read_value
      0.01 ±173%      +0.3        0.28 ± 26%  perf-profile.children.cycles-pp.update_blocked_averages
      0.00            +0.3        0.27 ± 30%  perf-profile.children.cycles-pp.read_tsc
      0.00            +0.3        0.28 ± 36%  perf-profile.children.cycles-pp.timerqueue_del
      0.00            +0.3        0.29 ± 27%  perf-profile.children.cycles-pp.lapic_next_deadline
      0.06 ±  7%      +0.3        0.36 ± 28%  perf-profile.children.cycles-pp.rcu_check_callbacks
      0.03 ±173%      +0.3        0.34 ± 70%  perf-profile.children.cycles-pp.fbcon_putcs
      0.01 ±173%      +0.3        0.33 ± 33%  perf-profile.children.cycles-pp.__remove_hrtimer
      0.03 ±173%      +0.3        0.34 ± 71%  perf-profile.children.cycles-pp.bit_putcs
      0.00            +0.3        0.32 ± 33%  perf-profile.children.cycles-pp.__hrtimer_next_event_base
      0.03 ±173%      +0.3        0.35 ± 70%  perf-profile.children.cycles-pp.fbcon_redraw
      0.03 ±173%      +0.3        0.35 ± 70%  perf-profile.children.cycles-pp.lf
      0.03 ±173%      +0.3        0.35 ± 70%  perf-profile.children.cycles-pp.con_scroll
      0.03 ±173%      +0.3        0.35 ± 70%  perf-profile.children.cycles-pp.fbcon_scroll
      0.03 ±100%      +0.3        0.35 ± 26%  perf-profile.children.cycles-pp.run_rebalance_domains
      0.00            +0.3        0.33 ± 29%  perf-profile.children.cycles-pp.native_sched_clock
      0.03 ±173%      +0.3        0.36 ± 71%  perf-profile.children.cycles-pp.vt_console_print
      0.00            +0.3        0.34 ± 34%  perf-profile.children.cycles-pp.rcu_eqs_enter
      0.06 ±  7%      +0.3        0.40 ± 26%  perf-profile.children.cycles-pp.native_write_msr
      0.00            +0.3        0.35 ± 31%  perf-profile.children.cycles-pp.hrtimer_next_event_without
      0.00            +0.4        0.37 ± 29%  perf-profile.children.cycles-pp.sched_clock
      0.06 ± 11%      +0.4        0.48 ± 26%  perf-profile.children.cycles-pp.clockevents_program_event
      0.17 ±  5%      +0.4        0.59 ± 21%  perf-profile.children.cycles-pp.scheduler_tick
      0.05 ±  9%      +0.4        0.49 ± 42%  perf-profile.children.cycles-pp._raw_spin_unlock_irqrestore
      0.04 ± 57%      +0.5        0.51 ± 29%  perf-profile.children.cycles-pp.sched_clock_cpu
      0.07 ± 10%      +0.5        0.57 ± 34%  perf-profile.children.cycles-pp.run_timer_softirq
      0.06            +0.6        0.61 ± 30%  perf-profile.children.cycles-pp.find_next_bit
      0.11 ±  6%      +0.6        0.68 ± 28%  perf-profile.children.cycles-pp.perf_mux_hrtimer_handler
      0.07 ±  7%      +0.6        0.67 ± 31%  perf-profile.children.cycles-pp.tick_irq_enter
      0.09 ± 14%      +0.7        0.75 ± 29%  perf-profile.children.cycles-pp.ktime_get
      0.09 ±  9%      +0.8        0.85 ± 31%  perf-profile.children.cycles-pp.irq_enter
      0.09 ±  4%      +0.8        0.86 ± 32%  perf-profile.children.cycles-pp.find_busiest_group
      0.09 ±  4%      +0.8        0.90 ± 31%  perf-profile.children.cycles-pp.__next_timer_interrupt
      0.27 ±  5%      +0.9        1.15 ± 25%  perf-profile.children.cycles-pp.update_process_times
      0.28 ±  4%      +1.0        1.25 ± 25%  perf-profile.children.cycles-pp.tick_sched_handle
      0.30 ±  5%      +1.1        1.43 ± 26%  perf-profile.children.cycles-pp.tick_sched_timer
      0.12 ±  3%      +1.1        1.25 ± 32%  perf-profile.children.cycles-pp.get_next_timer_interrupt
      0.13 ±  5%      +1.2        1.29 ± 31%  perf-profile.children.cycles-pp.load_balance
      0.07 ±173%      +1.2        1.23 ± 66%  perf-profile.children.cycles-pp.delay_tsc
      0.14 ±173%      +1.5        1.67 ± 62%  perf-profile.children.cycles-pp.io_serial_in
      0.18 ±  4%      +1.6        1.81 ± 32%  perf-profile.children.cycles-pp.rebalance_domains
      0.19 ±  6%      +1.8        2.00 ± 34%  perf-profile.children.cycles-pp.tick_nohz_next_event
      0.22 ±  7%      +2.1        2.33 ± 33%  perf-profile.children.cycles-pp.tick_nohz_get_sleep_length
      0.50 ±  4%      +2.3        2.77 ± 28%  perf-profile.children.cycles-pp.__hrtimer_run_queues
      0.23 ±143%      +2.5        2.77 ± 64%  perf-profile.children.cycles-pp.serial8250_console_putchar
      0.23 ±143%      +2.6        2.83 ± 64%  perf-profile.children.cycles-pp.uart_console_write
      0.23 ±143%      +2.7        2.91 ± 64%  perf-profile.children.cycles-pp.wait_for_xmitr
      0.24 ±144%      +2.7        2.97 ± 64%  perf-profile.children.cycles-pp.serial8250_console_write
      0.38 ±  4%      +2.9        3.33 ± 30%  perf-profile.children.cycles-pp.__softirqentry_text_start
      0.22 ±173%      +3.0        3.20 ± 69%  perf-profile.children.cycles-pp.devkmsg_write
      0.22 ±173%      +3.0        3.20 ± 69%  perf-profile.children.cycles-pp.printk_emit
      0.64 ±  4%      +3.0        3.66 ± 28%  perf-profile.children.cycles-pp.hrtimer_interrupt
      0.27 ±147%      +3.1        3.34 ± 64%  perf-profile.children.cycles-pp.console_unlock
      0.24 ±159%      +3.1        3.37 ± 64%  perf-profile.children.cycles-pp.__vfs_write
      0.24 ±157%      +3.1        3.39 ± 63%  perf-profile.children.cycles-pp.vfs_write
      0.24 ±157%      +3.1        3.39 ± 63%  perf-profile.children.cycles-pp.ksys_write
      0.24 ±161%      +3.2        3.44 ± 63%  perf-profile.children.cycles-pp.vprintk_emit
      0.25 ±153%      +3.3        3.52 ± 61%  perf-profile.children.cycles-pp.write
      0.44 ±  4%      +3.4        3.85 ± 29%  perf-profile.children.cycles-pp.irq_exit
      0.43 ±  5%      +4.0        4.43 ± 33%  perf-profile.children.cycles-pp.menu_select
      1.22 ±  2%      +7.5        8.74 ± 29%  perf-profile.children.cycles-pp.smp_apic_timer_interrupt
      1.23 ±  2%      +7.6        8.81 ± 29%  perf-profile.children.cycles-pp.apic_timer_interrupt
      7.13           +47.0       54.12 ± 30%  perf-profile.children.cycles-pp.intel_idle
      8.18           +56.3       64.52 ± 30%  perf-profile.children.cycles-pp.cpuidle_enter_state
      8.72           +61.2       69.92 ± 30%  perf-profile.children.cycles-pp.start_secondary
      8.73           +61.4       70.12 ± 30%  perf-profile.children.cycles-pp.secondary_startup_64
      8.73           +61.4       70.12 ± 30%  perf-profile.children.cycles-pp.cpu_startup_entry
      8.74           +61.5       70.20 ± 30%  perf-profile.children.cycles-pp.do_idle
     16.94           -16.9        0.00        perf-profile.self.cycles-pp.do_rw_once
     10.66           -10.7        0.00        perf-profile.self.cycles-pp.do_access
      5.89            -4.8        1.13 ± 90%  perf-profile.self.cycles-pp.native_irq_return_iret
      3.75 ± 12%      -3.2        0.54 ±154%  perf-profile.self.cycles-pp.page_remove_rmap
      1.57 ± 16%      -1.2        0.34 ±164%  perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
      0.00            +0.1        0.07 ± 17%  perf-profile.self.cycles-pp.ret_from_intr
      0.00            +0.1        0.08 ± 31%  perf-profile.self.cycles-pp.rcu_idle_exit
      0.00            +0.1        0.08 ± 26%  perf-profile.self.cycles-pp.tick_irq_enter
      0.00            +0.1        0.09 ± 28%  perf-profile.self.cycles-pp.perf_event_task_tick
      0.00            +0.1        0.11 ± 15%  perf-profile.self.cycles-pp.ktime_get_update_offsets_now
      0.11 ±  6%      +0.1        0.22 ± 15%  perf-profile.self.cycles-pp.__indirect_thunk_start
      0.00            +0.1        0.11 ± 34%  perf-profile.self.cycles-pp.scheduler_tick
      0.00            +0.1        0.11 ± 34%  perf-profile.self.cycles-pp.cpu_load_update
      0.00            +0.1        0.11 ± 27%  perf-profile.self.cycles-pp.__remove_hrtimer
      0.00            +0.1        0.12 ± 25%  perf-profile.self.cycles-pp.run_posix_cpu_timers
      0.00            +0.1        0.12 ± 33%  perf-profile.self.cycles-pp.rb_next
      0.00            +0.1        0.12 ± 19%  perf-profile.self.cycles-pp.interrupt_entry
      0.00            +0.1        0.12 ± 38%  perf-profile.self.cycles-pp.timerqueue_add
      0.00            +0.1        0.13 ± 32%  perf-profile.self.cycles-pp.sched_clock_cpu
      0.00            +0.1        0.13 ± 35%  perf-profile.self.cycles-pp.hrtimer_interrupt
      0.00            +0.1        0.14 ± 39%  perf-profile.self.cycles-pp.nr_iowait_cpu
      0.00            +0.1        0.14 ± 21%  perf-profile.self.cycles-pp.smp_apic_timer_interrupt
      0.00            +0.1        0.14 ± 38%  perf-profile.self.cycles-pp.rcu_dynticks_eqs_exit
      0.00            +0.1        0.14 ± 38%  perf-profile.self.cycles-pp.leave_mm
      0.00            +0.1        0.14 ± 36%  perf-profile.self.cycles-pp.irq_work_needs_cpu
      0.00            +0.1        0.14 ± 30%  perf-profile.self.cycles-pp.idle_cpu
      0.00            +0.1        0.14 ± 30%  perf-profile.self.cycles-pp.call_cpuidle
      0.00            +0.2        0.16 ± 40%  perf-profile.self.cycles-pp.rcu_needs_cpu
      0.00            +0.2        0.16 ± 31%  perf-profile.self.cycles-pp.get_cpu_device
      0.00            +0.2        0.16 ± 28%  perf-profile.self.cycles-pp.tick_check_oneshot_broadcast_this_cpu
      0.00            +0.2        0.16 ± 38%  perf-profile.self.cycles-pp.native_apic_mem_write
      0.00            +0.2        0.16 ± 36%  perf-profile.self.cycles-pp.find_next_and_bit
      0.00            +0.2        0.17 ± 43%  perf-profile.self.cycles-pp.timekeeping_max_deferment
      0.00            +0.2        0.18 ± 29%  perf-profile.self.cycles-pp.perf_mux_hrtimer_handler
      0.00            +0.2        0.19 ± 30%  perf-profile.self.cycles-pp.__hrtimer_run_queues
      0.00            +0.2        0.19 ± 36%  perf-profile.self.cycles-pp.tsc_verify_tsc_adjust
      0.00            +0.2        0.19 ± 27%  perf-profile.self.cycles-pp.update_blocked_averages
      0.00            +0.2        0.23 ± 22%  perf-profile.self.cycles-pp.irq_exit
      0.00            +0.2        0.23 ± 35%  perf-profile.self.cycles-pp.get_next_timer_interrupt
      0.00            +0.2        0.24 ± 33%  perf-profile.self.cycles-pp._raw_spin_trylock
      0.00            +0.3        0.26 ± 34%  perf-profile.self.cycles-pp.pm_qos_read_value
      0.00            +0.3        0.27 ± 29%  perf-profile.self.cycles-pp.rcu_check_callbacks
      0.00            +0.3        0.27 ± 30%  perf-profile.self.cycles-pp.read_tsc
      0.00            +0.3        0.30 ± 35%  perf-profile.self.cycles-pp.rebalance_domains
      0.00            +0.3        0.30 ± 30%  perf-profile.self.cycles-pp.load_balance
      0.00            +0.3        0.32 ± 33%  perf-profile.self.cycles-pp.__hrtimer_next_event_base
      0.04 ± 57%      +0.3        0.36 ± 31%  perf-profile.self.cycles-pp.__softirqentry_text_start
      0.00            +0.3        0.33 ± 29%  perf-profile.self.cycles-pp.native_sched_clock
      0.00            +0.3        0.34 ± 34%  perf-profile.self.cycles-pp.rcu_eqs_enter
      0.06 ±  7%      +0.3        0.40 ± 26%  perf-profile.self.cycles-pp.native_write_msr
      0.05 ±  9%      +0.4        0.45 ± 34%  perf-profile.self.cycles-pp.run_timer_softirq
      0.00            +0.4        0.40 ± 38%  perf-profile.self.cycles-pp.tick_nohz_next_event
      0.03 ±100%      +0.4        0.45 ± 42%  perf-profile.self.cycles-pp._raw_spin_unlock_irqrestore
      0.01 ±173%      +0.4        0.44 ± 34%  perf-profile.self.cycles-pp.__next_timer_interrupt
      0.05 ± 59%      +0.5        0.52 ± 32%  perf-profile.self.cycles-pp.ktime_get
      0.05            +0.5        0.52 ± 31%  perf-profile.self.cycles-pp.do_idle
      0.11 ±  7%      +0.5        0.59 ± 34%  perf-profile.self.cycles-pp._raw_spin_lock_irqsave
      0.06            +0.6        0.61 ± 30%  perf-profile.self.cycles-pp.find_next_bit
      0.06 ±  6%      +0.6        0.62 ± 32%  perf-profile.self.cycles-pp.find_busiest_group
      0.11 ±  4%      +1.1        1.18 ± 33%  perf-profile.self.cycles-pp.cpuidle_enter_state
      0.07 ±173%      +1.2        1.23 ± 66%  perf-profile.self.cycles-pp.delay_tsc
      0.16 ±  2%      +1.4        1.52 ± 34%  perf-profile.self.cycles-pp.menu_select
      0.14 ±173%      +1.5        1.67 ± 62%  perf-profile.self.cycles-pp.io_serial_in
      7.12           +46.9       54.02 ± 30%  perf-profile.self.cycles-pp.intel_idle


                                                                                
                               vm-scalability.throughput                        
                                                                                
  9.5e+07 +-+---------------------------------------------------------------+   
          |                                                                 |   
    9e+07 +-+                    O O                                        |   
          O O O O O     O  O O O     O O O     O                            |   
          |                                O O                              |   
  8.5e+07 +-+                                                               |   
          |                                                                 |   
    8e+07 +-+       O O                                                     |   
          |                                                                 |   
  7.5e+07 +-+                    +.                                         |   
          |                     +  +.                                       |   
          |.+.+.+.+.   .+..+.+.+     +. .+.+.+.+.+.+.   .+..+.+.+.+.+.+.+.+.|   
    7e+07 +-+       +.+                +             +.+                    |   
          |                                                                 |   
  6.5e+07 +-+---------------------------------------------------------------+   
                                                                                
                                                                                                                                                                
                                                                                
[*] bisect-good sample
[O] bisect-bad  sample



Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


Thanks,
Xiaolong

View attachment "config-4.17.0-rc4-00145-g309fe96" of type "text/plain" (164401 bytes)

View attachment "job-script" of type "text/plain" (7336 bytes)

View attachment "job.yaml" of type "text/plain" (4884 bytes)

View attachment "reproduce" of type "text/plain" (299500 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ