[<prev] [next>] [day] [month] [year] [list]
Message-ID: <20181022011621.GF24195@shao2-debian>
Date: Mon, 22 Oct 2018 09:16:21 +0800
From: kernel test robot <rong.a.chen@...el.com>
To: "Rafael J. Wysocki" <rafael.j.wysocki@...el.com>
Cc: Peter Zijlstra <peterz@...radead.org>,
Daniel Lezcano <daniel.lezcano@...aro.org>,
LKML <linux-kernel@...r.kernel.org>,
Stephen Rothwell <sfr@...b.auug.org.au>, lkp@...org
Subject: [LKP] [cpuidle] 23e8ceb9ce: will-it-scale.per_process_ops 8.9%
improvement
Greeting,
FYI, we noticed a 8.9% improvement of will-it-scale.per_process_ops due to commit:
commit: 23e8ceb9ce766c81d62434053aef6e7efea6fcc3 ("cpuidle: menu: Compute first_idx when latency_req is known")
https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master
in testcase: will-it-scale
on test machine: 88 threads Intel(R) Xeon(R) CPU E5-2699 v4 @ 2.20GHz with 64G memory
with following parameters:
nr_task: 16
mode: process
test: context_switch1
cpufreq_governor: performance
test-description: Will It Scale takes a testcase and runs it from 1 through to n parallel copies to see if the testcase will scale. It builds both a process and threads based test in order to see any differences between the two.
test-url: https://github.com/antonblanchard/will-it-scale
Details are as below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp install job.yaml # job file is attached in this email
bin/lkp run job.yaml
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase:
gcc-7/performance/x86_64-rhel-7.2/process/16/debian-x86_64-2018-04-03.cgz/lkp-bdw-ep3d/context_switch1/will-it-scale
commit:
5f26bdceb9 ("cpuidle: menu: Fix wakeup statistics updates for polling state")
23e8ceb9ce ("cpuidle: menu: Compute first_idx when latency_req is known")
5f26bdceb9c0a5e6 23e8ceb9ce766c81d62434053a
---------------- --------------------------
fail:runs %reproduction fail:runs
| | |
:4 50% 2:4 dmesg.WARNING:at_ip_fsnotify/0x
1:4 -25% :4 kmsg.DHCP/BOOTP:Reply_not_for_us_on_eth#,op[#]xid[#]
%stddev %change %stddev
\ | \
343546 +8.9% 374136 will-it-scale.per_process_ops
5496750 +8.9% 5986197 will-it-scale.workload
8420559 +10.9% 9337969 vmstat.system.cs
89360 ± 2% +930.3% 920646 ± 3% vmstat.system.in
23.79 -8.3 15.53 ± 4% mpstat.cpu.idle%
69.01 +7.5 76.48 mpstat.cpu.sys%
7.19 +0.8 7.98 mpstat.cpu.usr%
7e+08 -100.0% 150525 ± 13% cpuidle.C1.time
8.899e+08 -100.0% 48417 cpuidle.C1.usage
5462325 ± 13% +17.7% 6429151 ± 4% cpuidle.POLL.time
14645584 ± 15% +4307.0% 6.454e+08 ± 3% cpuidle.POLL.usage
2982 ± 4% -19.9% 2388 ± 3% slabinfo.kmalloc-512.active_objs
2998 ± 5% -17.6% 2471 ± 3% slabinfo.kmalloc-512.num_objs
762.50 ± 9% -19.2% 616.25 ± 16% slabinfo.task_group.active_objs
762.50 ± 9% -19.2% 616.25 ± 16% slabinfo.task_group.num_objs
2883 +5.5% 3043 turbostat.Avg_MHz
8.899e+08 -100.0% 47808 turbostat.C1
9.55 -9.6 0.00 turbostat.C1%
5.92 ± 5% -89.8% 0.60 ± 41% turbostat.CPU%c1
27397006 +934.0% 2.833e+08 ± 3% turbostat.IRQ
151.20 +1.4% 153.26 turbostat.PkgWatt
60181 ± 3% +128.7% 137619 ± 2% sched_debug.cfs_rq:/.min_vruntime.stddev
80169 ± 17% +141.5% 193594 ± 6% sched_debug.cfs_rq:/.spread0.avg
170633 ± 15% +89.8% 323899 ± 5% sched_debug.cfs_rq:/.spread0.max
60168 ± 3% +128.7% 137608 ± 2% sched_debug.cfs_rq:/.spread0.stddev
17.88 ± 8% +16.6% 20.83 ± 3% sched_debug.cpu.cpu_load[4].min
984.49 ± 2% +9.5% 1078 ± 3% sched_debug.cpu.curr->pid.avg
53357881 +11.2% 59314624 sched_debug.cpu.nr_switches.avg
58617424 +15.3% 67562245 sched_debug.cpu.nr_switches.max
6233628 +62.8% 10147512 sched_debug.cpu.nr_switches.stddev
3.223e+12 +10.2% 3.551e+12 perf-stat.branch-instructions
1.44 +0.1 1.51 perf-stat.branch-miss-rate%
4.653e+10 +15.6% 5.376e+10 perf-stat.branch-misses
2.585e+09 +11.2% 2.874e+09 perf-stat.context-switches
1.37 -8.2% 1.26 perf-stat.cpi
0.01 ± 17% -0.0 0.01 ± 13% perf-stat.dTLB-load-miss-rate%
4.595e+12 +9.5% 5.031e+12 perf-stat.dTLB-loads
2.901e+12 +9.2% 3.167e+12 perf-stat.dTLB-stores
4.203e+10 -30.7% 2.912e+10 perf-stat.iTLB-loads
1.614e+13 +9.6% 1.768e+13 perf-stat.instructions
0.73 +8.9% 0.79 perf-stat.ipc
13.48 -13.5 0.00 perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.do_idle.cpu_startup_entry.start_secondary
14.14 -10.6 3.59 ± 8% perf-profile.calltrace.cycles-pp.cpuidle_enter_state.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64
28.40 ± 2% -7.3 21.15 perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64
28.48 ± 2% -7.2 21.33 perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.secondary_startup_64
28.48 ± 2% -7.1 21.34 perf-profile.calltrace.cycles-pp.secondary_startup_64
28.48 ± 2% -7.1 21.34 perf-profile.calltrace.cycles-pp.start_secondary.secondary_startup_64
1.37 ± 3% -0.1 1.25 perf-profile.calltrace.cycles-pp.cpumask_next_wrap.select_idle_sibling.select_task_rq_fair.try_to_wake_up.autoremove_wake_function
0.71 +0.0 0.74 ± 3% perf-profile.calltrace.cycles-pp.touch_atime.pipe_read.__vfs_read.vfs_read.ksys_read
0.56 ± 4% +0.0 0.61 perf-profile.calltrace.cycles-pp.finish_task_switch.__sched_text_start.schedule.pipe_wait.pipe_read
0.60 ± 3% +0.1 0.66 perf-profile.calltrace.cycles-pp.update_cfs_group.enqueue_entity.enqueue_task_fair.ttwu_do_activate.try_to_wake_up
0.67 +0.1 0.72 ± 2% perf-profile.calltrace.cycles-pp.prepare_to_wait.pipe_wait.pipe_read.__vfs_read.vfs_read
0.75 ± 2% +0.1 0.81 ± 2% perf-profile.calltrace.cycles-pp.__wake_up_common_lock.pipe_read.__vfs_read.vfs_read.ksys_read
0.58 ± 2% +0.1 0.65 ± 2% perf-profile.calltrace.cycles-pp.mutex_lock.pipe_write.__vfs_write.vfs_write.ksys_write
0.77 ± 3% +0.1 0.84 ± 2% perf-profile.calltrace.cycles-pp.file_has_perm.security_file_permission.vfs_write.ksys_write.do_syscall_64
0.98 +0.1 1.05 ± 2% perf-profile.calltrace.cycles-pp.selinux_file_permission.security_file_permission.vfs_read.ksys_read.do_syscall_64
0.99 +0.1 1.06 perf-profile.calltrace.cycles-pp.selinux_file_permission.security_file_permission.vfs_write.ksys_write.do_syscall_64
0.55 ± 3% +0.1 0.63 ± 3% perf-profile.calltrace.cycles-pp.mutex_lock.pipe_read.__vfs_read.vfs_read.ksys_read
0.71 ± 5% +0.1 0.80 perf-profile.calltrace.cycles-pp.copyin.copy_page_from_iter.pipe_write.__vfs_write.vfs_write
0.70 ± 3% +0.1 0.78 ± 2% perf-profile.calltrace.cycles-pp.file_has_perm.security_file_permission.vfs_read.ksys_read.do_syscall_64
0.54 ± 2% +0.1 0.62 perf-profile.calltrace.cycles-pp.update_cfs_group.dequeue_entity.dequeue_task_fair.__sched_text_start.schedule
1.12 +0.1 1.22 ± 2% perf-profile.calltrace.cycles-pp.native_write_msr
0.95 ± 4% +0.1 1.05 perf-profile.calltrace.cycles-pp.load_new_mm_cr3.switch_mm_irqs_off.switch_mm.__sched_text_start.schedule
1.16 ± 2% +0.1 1.27 ± 2% perf-profile.calltrace.cycles-pp.check_preempt_curr.ttwu_do_wakeup.try_to_wake_up.autoremove_wake_function.__wake_up_common
1.76 ± 2% +0.1 1.87 perf-profile.calltrace.cycles-pp.switch_mm_irqs_off.__sched_text_start.schedule_idle.do_idle.cpu_startup_entry
1.32 ± 2% +0.1 1.45 perf-profile.calltrace.cycles-pp.ttwu_do_wakeup.try_to_wake_up.autoremove_wake_function.__wake_up_common.__wake_up_common_lock
0.62 ± 2% +0.1 0.76 ± 3% perf-profile.calltrace.cycles-pp.tick_nohz_idle_enter.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64
0.56 ± 7% +0.1 0.70 perf-profile.calltrace.cycles-pp.hrtimer_next_event_without.tick_nohz_get_sleep_length.menu_select.do_idle.cpu_startup_entry
1.89 +0.1 2.04 perf-profile.calltrace.cycles-pp.security_file_permission.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.87 ± 4% +0.2 1.05 ± 2% perf-profile.calltrace.cycles-pp.find_next_bit.__next_timer_interrupt.get_next_timer_interrupt.tick_nohz_next_event.tick_nohz_get_sleep_length
1.07 ± 2% +0.2 1.25 perf-profile.calltrace.cycles-pp.update_curr.dequeue_entity.dequeue_task_fair.__sched_text_start.schedule
1.94 +0.2 2.12 perf-profile.calltrace.cycles-pp.copy_page_to_iter.pipe_read.__vfs_read.vfs_read.ksys_read
1.99 +0.2 2.18 perf-profile.calltrace.cycles-pp.security_file_permission.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.69 ± 2% +0.2 0.90 ± 2% perf-profile.calltrace.cycles-pp.copyout.copy_page_to_iter.pipe_read.__vfs_read.vfs_read
1.57 ± 2% +0.2 1.79 ± 2% perf-profile.calltrace.cycles-pp.switch_mm_irqs_off.switch_mm.__sched_text_start.schedule.pipe_wait
1.43 +0.2 1.65 perf-profile.calltrace.cycles-pp.update_load_avg.dequeue_entity.dequeue_task_fair.__sched_text_start.schedule
1.62 +0.2 1.85 ± 3% perf-profile.calltrace.cycles-pp.__switch_to_asm
2.03 ± 2% +0.2 2.26 perf-profile.calltrace.cycles-pp.__switch_to
1.73 ± 2% +0.2 1.97 perf-profile.calltrace.cycles-pp.copy_page_from_iter.pipe_write.__vfs_write.vfs_write.ksys_write
1.67 ± 2% +0.2 1.91 ± 2% perf-profile.calltrace.cycles-pp.switch_mm.__sched_text_start.schedule.pipe_wait.pipe_read
1.12 ± 3% +0.3 1.37 perf-profile.calltrace.cycles-pp.pick_next_task_idle.__sched_text_start.schedule.pipe_wait.pipe_read
0.25 ±100% +0.3 0.54 ± 3% perf-profile.calltrace.cycles-pp.avc_has_perm.file_has_perm.security_file_permission.vfs_write.ksys_write
0.26 ±100% +0.3 0.56 ± 2% perf-profile.calltrace.cycles-pp.mutex_lock.pipe_wait.pipe_read.__vfs_read.vfs_read
1.49 ± 3% +0.3 1.79 perf-profile.calltrace.cycles-pp.__next_timer_interrupt.get_next_timer_interrupt.tick_nohz_next_event.tick_nohz_get_sleep_length.menu_select
0.26 ±100% +0.3 0.59 ± 6% perf-profile.calltrace.cycles-pp.__update_load_avg_se.update_load_avg.dequeue_entity.dequeue_task_fair.__sched_text_start
1.33 ± 2% +0.3 1.66 ± 2% perf-profile.calltrace.cycles-pp.update_load_avg.enqueue_entity.enqueue_task_fair.ttwu_do_activate.try_to_wake_up
0.40 ± 57% +0.4 0.75 perf-profile.calltrace.cycles-pp.__update_idle_core.pick_next_task_idle.__sched_text_start.schedule.pipe_wait
0.13 ±173% +0.4 0.54 ± 3% perf-profile.calltrace.cycles-pp.__update_load_avg_cfs_rq.update_load_avg.dequeue_entity.dequeue_task_fair.__sched_text_start
0.14 ±173% +0.4 0.56 ± 2% perf-profile.calltrace.cycles-pp.account_entity_dequeue.dequeue_entity.dequeue_task_fair.__sched_text_start.schedule
0.13 ±173% +0.4 0.58 ± 4% perf-profile.calltrace.cycles-pp.__update_load_avg_se.update_load_avg.enqueue_entity.enqueue_task_fair.ttwu_do_activate
4.33 +0.5 4.81 perf-profile.calltrace.cycles-pp.syscall_return_via_sysret
2.33 ± 3% +0.5 2.83 perf-profile.calltrace.cycles-pp.get_next_timer_interrupt.tick_nohz_next_event.tick_nohz_get_sleep_length.menu_select.do_idle
0.00 +0.5 0.52 perf-profile.calltrace.cycles-pp.update_rq_clock.try_to_wake_up.autoremove_wake_function.__wake_up_common.__wake_up_common_lock
0.64 ± 4% +0.5 1.18 ± 2% perf-profile.calltrace.cycles-pp.tick_nohz_idle_exit.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64
0.00 +0.6 0.56 ± 4% perf-profile.calltrace.cycles-pp.__update_load_avg_cfs_rq.update_load_avg.enqueue_entity.enqueue_task_fair.ttwu_do_activate
2.71 ± 3% +0.6 3.28 perf-profile.calltrace.cycles-pp.tick_nohz_next_event.tick_nohz_get_sleep_length.menu_select.do_idle.cpu_startup_entry
4.37 +0.6 5.00 perf-profile.calltrace.cycles-pp.dequeue_entity.dequeue_task_fair.__sched_text_start.schedule.pipe_wait
3.83 +0.6 4.47 perf-profile.calltrace.cycles-pp.enqueue_entity.enqueue_task_fair.ttwu_do_activate.try_to_wake_up.autoremove_wake_function
6.87 +0.7 7.59 perf-profile.calltrace.cycles-pp.dequeue_task_fair.__sched_text_start.schedule.pipe_wait.pipe_read
6.18 +0.8 6.93 perf-profile.calltrace.cycles-pp.enqueue_task_fair.ttwu_do_activate.try_to_wake_up.autoremove_wake_function.__wake_up_common
3.43 ± 4% +0.8 4.21 perf-profile.calltrace.cycles-pp.tick_nohz_get_sleep_length.menu_select.do_idle.cpu_startup_entry.start_secondary
0.00 +0.8 0.83 ± 30% perf-profile.calltrace.cycles-pp.reschedule_interrupt.tick_nohz_idle_exit.do_idle.cpu_startup_entry.start_secondary
6.55 +0.9 7.49 perf-profile.calltrace.cycles-pp.ttwu_do_activate.try_to_wake_up.autoremove_wake_function.__wake_up_common.__wake_up_common_lock
5.05 ± 3% +1.2 6.25 perf-profile.calltrace.cycles-pp.menu_select.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64
16.76 +1.6 18.40 perf-profile.calltrace.cycles-pp.__sched_text_start.schedule.pipe_wait.pipe_read.__vfs_read
17.00 +1.7 18.66 perf-profile.calltrace.cycles-pp.schedule.pipe_wait.pipe_read.__vfs_read.vfs_read
14.34 ± 2% +1.7 16.03 perf-profile.calltrace.cycles-pp.try_to_wake_up.autoremove_wake_function.__wake_up_common.__wake_up_common_lock.pipe_write
14.83 ± 2% +1.7 16.55 perf-profile.calltrace.cycles-pp.autoremove_wake_function.__wake_up_common.__wake_up_common_lock.pipe_write.__vfs_write
15.12 ± 2% +1.8 16.88 perf-profile.calltrace.cycles-pp.__wake_up_common.__wake_up_common_lock.pipe_write.__vfs_write.vfs_write
15.91 +1.8 17.67 perf-profile.calltrace.cycles-pp.__wake_up_common_lock.pipe_write.__vfs_write.vfs_write.ksys_write
18.89 +1.9 20.80 perf-profile.calltrace.cycles-pp.pipe_wait.pipe_read.__vfs_read.vfs_read.ksys_read
20.19 +2.2 22.43 perf-profile.calltrace.cycles-pp.pipe_write.__vfs_write.vfs_write.ksys_write.do_syscall_64
20.70 +2.3 22.99 perf-profile.calltrace.cycles-pp.__vfs_write.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
24.89 +2.4 27.33 perf-profile.calltrace.cycles-pp.pipe_read.__vfs_read.vfs_read.ksys_read.do_syscall_64
25.49 +2.5 28.02 perf-profile.calltrace.cycles-pp.__vfs_read.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe
23.43 +2.5 25.96 perf-profile.calltrace.cycles-pp.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
24.23 +2.6 26.85 perf-profile.calltrace.cycles-pp.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
28.66 +2.8 31.48 perf-profile.calltrace.cycles-pp.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe
29.39 +2.9 32.30 perf-profile.calltrace.cycles-pp.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.00 +3.2 3.15 ± 9% perf-profile.calltrace.cycles-pp.poll_idle.cpuidle_enter_state.do_idle.cpu_startup_entry.start_secondary
55.40 +5.7 61.09 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe
55.92 +5.8 61.68 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe
13.49 -13.5 0.00 perf-profile.children.cycles-pp.intel_idle
14.17 -10.6 3.61 ± 8% perf-profile.children.cycles-pp.cpuidle_enter_state
28.48 ± 2% -7.1 21.34 perf-profile.children.cycles-pp.secondary_startup_64
28.48 ± 2% -7.1 21.34 perf-profile.children.cycles-pp.start_secondary
28.48 ± 2% -7.1 21.34 perf-profile.children.cycles-pp.cpu_startup_entry
28.56 ± 2% -7.1 21.42 perf-profile.children.cycles-pp.do_idle
0.47 ± 4% -0.2 0.23 ± 5% perf-profile.children.cycles-pp.resched_curr
1.45 ± 3% -0.1 1.33 perf-profile.children.cycles-pp.cpumask_next_wrap
0.23 ± 7% -0.1 0.16 ± 9% perf-profile.children.cycles-pp.menu_reflect
0.08 ± 6% +0.0 0.09 perf-profile.children.cycles-pp.tsc_verify_tsc_adjust
0.06 ± 7% +0.0 0.07 ± 5% perf-profile.children.cycles-pp.can_stop_idle_tick
0.05 +0.0 0.07 ± 12% perf-profile.children.cycles-pp.pm_qos_read_value
0.10 ± 4% +0.0 0.12 ± 7% perf-profile.children.cycles-pp.ktime_get_coarse_real_ts64
0.08 ± 5% +0.0 0.10 ± 4% perf-profile.children.cycles-pp.arch_cpu_idle_enter
0.15 ± 2% +0.0 0.17 ± 2% perf-profile.children.cycles-pp.rcu_needs_cpu
0.08 ± 13% +0.0 0.11 ± 9% perf-profile.children.cycles-pp.tick_check_broadcast_expired
0.35 +0.0 0.39 perf-profile.children.cycles-pp.entry_SYSCALL_64_stage2
0.33 +0.0 0.36 ± 3% perf-profile.children.cycles-pp.anon_pipe_buf_release
0.71 +0.0 0.75 ± 2% perf-profile.children.cycles-pp.touch_atime
0.30 ± 7% +0.0 0.33 ± 3% perf-profile.children.cycles-pp.cpuidle_governor_latency_req
0.12 ± 5% +0.0 0.16 ± 4% perf-profile.children.cycles-pp.tick_nohz_tick_stopped
0.70 ± 3% +0.0 0.74 ± 2% perf-profile.children.cycles-pp.account_entity_dequeue
0.27 ± 7% +0.0 0.32 ± 4% perf-profile.children.cycles-pp.__x64_sys_read
1.72 +0.0 1.77 perf-profile.children.cycles-pp.set_next_entity
0.46 ± 3% +0.0 0.51 ± 4% perf-profile.children.cycles-pp.file_update_time
0.47 ± 4% +0.0 0.52 ± 3% perf-profile.children.cycles-pp.current_time
0.88 ± 2% +0.0 0.93 perf-profile.children.cycles-pp.fsnotify
0.66 ± 2% +0.1 0.71 ± 2% perf-profile.children.cycles-pp.copy_user_generic_unrolled
0.15 ± 3% +0.1 0.20 ± 3% perf-profile.children.cycles-pp.__enqueue_entity
0.00 +0.1 0.06 ± 9% perf-profile.children.cycles-pp.cpuidle_get_cpu_driver
0.68 ± 2% +0.1 0.74 ± 2% perf-profile.children.cycles-pp.prepare_to_wait
0.25 ± 10% +0.1 0.30 ± 3% perf-profile.children.cycles-pp.__hrtimer_next_event_base
0.23 ± 4% +0.1 0.29 ± 3% perf-profile.children.cycles-pp.rcu_eqs_enter
0.27 ± 3% +0.1 0.33 ± 2% perf-profile.children.cycles-pp.rcu_all_qs
0.43 ± 5% +0.1 0.49 ± 3% perf-profile.children.cycles-pp.__fsnotify_parent
1.02 +0.1 1.09 perf-profile.children.cycles-pp.___perf_sw_event
0.00 +0.1 0.07 ± 11% perf-profile.children.cycles-pp.native_smp_send_reschedule
0.22 ± 5% +0.1 0.29 ± 4% perf-profile.children.cycles-pp.activate_task
0.00 +0.1 0.08 ± 12% perf-profile.children.cycles-pp.interrupt_entry
0.52 ± 2% +0.1 0.60 perf-profile.children.cycles-pp.read_tsc
0.62 ± 3% +0.1 0.71 perf-profile.children.cycles-pp.__might_fault
0.74 ± 5% +0.1 0.82 perf-profile.children.cycles-pp.copyin
0.40 ± 5% +0.1 0.48 perf-profile.children.cycles-pp.hrtimer_get_next_event
0.21 ± 3% +0.1 0.30 ± 4% perf-profile.children.cycles-pp.sched_ttwu_pending
0.76 ± 4% +0.1 0.85 perf-profile.children.cycles-pp.ktime_get
0.96 ± 2% +0.1 1.05 ± 2% perf-profile.children.cycles-pp.avc_has_perm
0.87 ± 5% +0.1 0.96 perf-profile.children.cycles-pp.finish_task_switch
1.03 +0.1 1.13 ± 3% perf-profile.children.cycles-pp.__fdget_pos
0.07 ± 6% +0.1 0.17 ± 28% perf-profile.children.cycles-pp.__task_rq_lock
0.52 ± 3% +0.1 0.61 perf-profile.children.cycles-pp._cond_resched
0.00 +0.1 0.10 ± 12% perf-profile.children.cycles-pp.x2apic_send_IPI
0.77 ± 2% +0.1 0.88 perf-profile.children.cycles-pp.__might_sleep
0.21 ± 3% +0.1 0.32 ± 4% perf-profile.children.cycles-pp.call_cpuidle
1.17 ± 2% +0.1 1.30 ± 2% perf-profile.children.cycles-pp.check_preempt_curr
0.08 ± 15% +0.1 0.20 ± 29% perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
1.99 +0.1 2.12 perf-profile.children.cycles-pp.selinux_file_permission
1.34 ± 2% +0.1 1.47 perf-profile.children.cycles-pp.ttwu_do_wakeup
0.64 ± 2% +0.1 0.78 ± 3% perf-profile.children.cycles-pp.tick_nohz_idle_enter
1.22 +0.1 1.35 perf-profile.children.cycles-pp.___might_sleep
2.00 +0.1 2.14 perf-profile.children.cycles-pp.update_cfs_group
1.48 ± 3% +0.1 1.62 perf-profile.children.cycles-pp._raw_spin_unlock_irqrestore
0.57 ± 7% +0.1 0.71 perf-profile.children.cycles-pp.hrtimer_next_event_without
1.53 ± 2% +0.2 1.68 ± 2% perf-profile.children.cycles-pp.file_has_perm
2.91 +0.2 3.07 perf-profile.children.cycles-pp.load_new_mm_cr3
0.35 ± 3% +0.2 0.51 ± 5% perf-profile.children.cycles-pp.rcu_idle_exit
1.50 +0.2 1.67 perf-profile.children.cycles-pp._raw_spin_lock_irqsave
1.41 ± 2% +0.2 1.60 ± 2% perf-profile.children.cycles-pp.__update_load_avg_cfs_rq
0.94 ± 3% +0.2 1.14 ± 2% perf-profile.children.cycles-pp.update_rq_clock
0.71 ± 3% +0.2 0.91 ± 7% perf-profile.children.cycles-pp._raw_spin_lock
1.98 +0.2 2.17 perf-profile.children.cycles-pp.copy_page_to_iter
1.62 +0.2 1.82 ± 3% perf-profile.children.cycles-pp.__update_load_avg_se
1.28 ± 4% +0.2 1.48 perf-profile.children.cycles-pp.native_sched_clock
0.71 ± 2% +0.2 0.93 ± 2% perf-profile.children.cycles-pp.copyout
0.33 ± 4% +0.2 0.54 perf-profile.children.cycles-pp.cpumask_next
1.34 ± 4% +0.2 1.56 perf-profile.children.cycles-pp.sched_clock
0.55 ± 4% +0.2 0.78 perf-profile.children.cycles-pp.__update_idle_core
0.00 +0.2 0.23 ± 8% perf-profile.children.cycles-pp.__x2apic_send_IPI_dest
2.32 ± 3% +0.2 2.55 perf-profile.children.cycles-pp.update_curr
1.67 +0.2 1.91 perf-profile.children.cycles-pp.mutex_lock
0.66 ± 5% +0.2 0.89 perf-profile.children.cycles-pp.copy_user_enhanced_fast_string
1.78 ± 2% +0.2 2.02 perf-profile.children.cycles-pp.copy_page_from_iter
1.93 ± 3% +0.2 2.18 ± 2% perf-profile.children.cycles-pp.find_next_bit
0.00 +0.2 0.25 ± 4% perf-profile.children.cycles-pp.native_apic_msr_eoi_write
1.74 ± 2% +0.3 1.99 ± 3% perf-profile.children.cycles-pp.switch_mm
1.15 ± 3% +0.3 1.41 perf-profile.children.cycles-pp.pick_next_task_idle
1.48 ± 4% +0.3 1.74 perf-profile.children.cycles-pp.sched_clock_cpu
0.00 +0.3 0.26 ± 5% perf-profile.children.cycles-pp.smp_reschedule_interrupt
2.27 ± 2% +0.3 2.55 perf-profile.children.cycles-pp.__switch_to
1.23 +0.3 1.52 ± 2% perf-profile.children.cycles-pp.native_write_msr
3.92 +0.3 4.25 perf-profile.children.cycles-pp.security_file_permission
0.03 ±100% +0.3 0.38 perf-profile.children.cycles-pp.native_irq_return_iret
1.71 ± 3% +0.4 2.07 perf-profile.children.cycles-pp.__next_timer_interrupt
4.77 +0.4 5.15 perf-profile.children.cycles-pp.switch_mm_irqs_off
1.79 +0.5 2.25 ± 2% perf-profile.children.cycles-pp.__switch_to_asm
2.34 ± 3% +0.5 2.85 perf-profile.children.cycles-pp.get_next_timer_interrupt
4.93 +0.6 5.49 perf-profile.children.cycles-pp.syscall_return_via_sysret
2.73 ± 3% +0.6 3.32 perf-profile.children.cycles-pp.tick_nohz_next_event
3.78 +0.6 4.36 perf-profile.children.cycles-pp.update_load_avg
4.47 +0.6 5.11 perf-profile.children.cycles-pp.dequeue_entity
3.93 +0.7 4.62 perf-profile.children.cycles-pp.enqueue_entity
6.90 +0.7 7.63 perf-profile.children.cycles-pp.dequeue_task_fair
6.20 +0.8 6.96 perf-profile.children.cycles-pp.enqueue_task_fair
3.46 ± 4% +0.8 4.24 perf-profile.children.cycles-pp.tick_nohz_get_sleep_length
6.55 +1.0 7.51 perf-profile.children.cycles-pp.ttwu_do_activate
0.06 ± 7% +1.0 1.05 ± 3% perf-profile.children.cycles-pp.reschedule_interrupt
0.67 ± 3% +1.0 1.68 ± 2% perf-profile.children.cycles-pp.tick_nohz_idle_exit
5.08 ± 3% +1.2 6.29 perf-profile.children.cycles-pp.menu_select
17.25 +1.6 18.90 perf-profile.children.cycles-pp.schedule
14.39 ± 2% +1.7 16.10 perf-profile.children.cycles-pp.try_to_wake_up
14.85 ± 2% +1.7 16.59 perf-profile.children.cycles-pp.autoremove_wake_function
15.24 ± 2% +1.8 17.01 perf-profile.children.cycles-pp.__wake_up_common
16.70 +1.8 18.53 perf-profile.children.cycles-pp.__wake_up_common_lock
18.94 +1.9 20.86 perf-profile.children.cycles-pp.pipe_wait
22.95 +2.1 25.07 perf-profile.children.cycles-pp.__sched_text_start
20.25 +2.2 22.49 perf-profile.children.cycles-pp.pipe_write
20.73 +2.3 23.02 perf-profile.children.cycles-pp.__vfs_write
24.95 +2.5 27.41 perf-profile.children.cycles-pp.pipe_read
25.53 +2.5 28.05 perf-profile.children.cycles-pp.__vfs_read
23.48 +2.5 26.02 perf-profile.children.cycles-pp.vfs_write
24.26 +2.6 26.88 perf-profile.children.cycles-pp.ksys_write
28.73 +2.8 31.56 perf-profile.children.cycles-pp.vfs_read
29.43 +2.9 32.34 perf-profile.children.cycles-pp.ksys_read
0.12 ± 11% +3.0 3.16 ± 9% perf-profile.children.cycles-pp.poll_idle
55.47 +5.7 61.16 perf-profile.children.cycles-pp.do_syscall_64
56.01 +5.8 61.76 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
13.47 -13.5 0.00 perf-profile.self.cycles-pp.intel_idle
0.47 ± 5% -0.3 0.22 ± 5% perf-profile.self.cycles-pp.resched_curr
0.72 ± 2% -0.1 0.61 ± 4% perf-profile.self.cycles-pp.copy_page_to_iter
0.18 ± 8% -0.0 0.14 ± 10% perf-profile.self.cycles-pp.menu_reflect
0.19 ± 7% -0.0 0.14 ± 3% perf-profile.self.cycles-pp.cpuidle_enter_state
0.05 +0.0 0.07 ± 7% perf-profile.self.cycles-pp.pm_qos_read_value
0.06 ± 7% +0.0 0.07 ± 5% perf-profile.self.cycles-pp.can_stop_idle_tick
0.11 ± 3% +0.0 0.13 ± 5% perf-profile.self.cycles-pp.__fdget_pos
0.18 ± 2% +0.0 0.21 ± 2% perf-profile.self.cycles-pp.touch_atime
0.14 ± 3% +0.0 0.16 ± 4% perf-profile.self.cycles-pp.rcu_needs_cpu
0.08 ± 6% +0.0 0.10 ± 7% perf-profile.self.cycles-pp.hrtimer_next_event_without
0.08 ± 8% +0.0 0.11 ± 10% perf-profile.self.cycles-pp.tick_check_broadcast_expired
0.25 ± 2% +0.0 0.28 perf-profile.self.cycles-pp._cond_resched
0.12 ± 9% +0.0 0.15 ± 3% perf-profile.self.cycles-pp.tick_nohz_idle_enter
0.23 ± 6% +0.0 0.26 ± 2% perf-profile.self.cycles-pp.current_time
0.39 ± 2% +0.0 0.42 ± 2% perf-profile.self.cycles-pp.file_has_perm
0.34 +0.0 0.37 perf-profile.self.cycles-pp.entry_SYSCALL_64_stage2
0.23 +0.0 0.26 ± 6% perf-profile.self.cycles-pp.ksys_read
0.17 ± 2% +0.0 0.20 ± 4% perf-profile.self.cycles-pp.switch_mm
0.14 ± 7% +0.0 0.17 ± 2% perf-profile.self.cycles-pp.sched_clock_cpu
0.23 ± 5% +0.0 0.27 ± 5% perf-profile.self.cycles-pp.__x64_sys_read
0.39 +0.0 0.43 perf-profile.self.cycles-pp.prepare_to_wait
0.38 ± 2% +0.0 0.42 ± 3% perf-profile.self.cycles-pp.__wake_up_common
0.29 ± 3% +0.0 0.32 ± 3% perf-profile.self.cycles-pp.security_file_permission
0.20 ± 5% +0.0 0.24 ± 5% perf-profile.self.cycles-pp.sched_ttwu_pending
0.10 ± 4% +0.0 0.14 ± 5% perf-profile.self.cycles-pp.tick_nohz_get_sleep_length
0.10 ± 7% +0.0 0.14 ± 3% perf-profile.self.cycles-pp.tick_nohz_tick_stopped
0.16 ± 2% +0.0 0.20 ± 3% perf-profile.self.cycles-pp.tick_nohz_next_event
0.11 ± 4% +0.0 0.16 ± 5% perf-profile.self.cycles-pp.__enqueue_entity
0.61 ± 3% +0.0 0.66 perf-profile.self.cycles-pp.copy_user_generic_unrolled
0.43 ± 6% +0.0 0.48 ± 2% perf-profile.self.cycles-pp.__vfs_write
0.14 ± 6% +0.0 0.18 ± 2% perf-profile.self.cycles-pp.get_next_timer_interrupt
0.39 ± 3% +0.0 0.43 ± 4% perf-profile.self.cycles-pp.vfs_read
0.00 +0.1 0.05 perf-profile.self.cycles-pp.schedule_idle
0.54 +0.1 0.59 perf-profile.self.cycles-pp.dequeue_entity
0.24 ± 3% +0.1 0.30 ± 6% perf-profile.self.cycles-pp.ksys_write
0.37 ± 5% +0.1 0.42 ± 3% perf-profile.self.cycles-pp.vfs_write
0.24 ± 5% +0.1 0.30 ± 2% perf-profile.self.cycles-pp.pipe_wait
0.18 ± 4% +0.1 0.24 perf-profile.self.cycles-pp.__hrtimer_next_event_base
0.90 ± 2% +0.1 0.96 perf-profile.self.cycles-pp.___perf_sw_event
0.22 ± 3% +0.1 0.28 ± 4% perf-profile.self.cycles-pp.rcu_eqs_enter
0.43 ± 2% +0.1 0.49 ± 3% perf-profile.self.cycles-pp.copy_page_from_iter
0.19 ± 4% +0.1 0.25 perf-profile.self.cycles-pp.rcu_all_qs
0.50 ± 7% +0.1 0.56 ± 3% perf-profile.self.cycles-pp.__vfs_read
0.56 ± 2% +0.1 0.62 ± 3% perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
0.38 ± 4% +0.1 0.45 ± 3% perf-profile.self.cycles-pp.__fsnotify_parent
0.21 ± 3% +0.1 0.28 ± 5% perf-profile.self.cycles-pp.activate_task
1.34 ± 2% +0.1 1.41 perf-profile.self.cycles-pp.reweight_entity
0.00 +0.1 0.07 ± 10% perf-profile.self.cycles-pp.native_smp_send_reschedule
0.24 ± 3% +0.1 0.32 ± 4% perf-profile.self.cycles-pp.update_rq_clock
0.70 ± 3% +0.1 0.78 ± 5% perf-profile.self.cycles-pp.enqueue_task_fair
0.61 ± 4% +0.1 0.69 ± 2% perf-profile.self.cycles-pp._raw_spin_lock
0.00 +0.1 0.08 ± 12% perf-profile.self.cycles-pp.interrupt_entry
0.49 ± 2% +0.1 0.57 ± 2% perf-profile.self.cycles-pp.read_tsc
0.81 ± 2% +0.1 0.90 ± 2% perf-profile.self.cycles-pp.dequeue_task_fair
0.73 +0.1 0.82 ± 2% perf-profile.self.cycles-pp.mutex_lock
0.20 ± 4% +0.1 0.29 ± 5% perf-profile.self.cycles-pp.rcu_idle_exit
0.58 ± 5% +0.1 0.68 ± 2% perf-profile.self.cycles-pp.finish_task_switch
0.00 +0.1 0.09 ± 11% perf-profile.self.cycles-pp.x2apic_send_IPI
0.95 ± 2% +0.1 1.04 ± 2% perf-profile.self.cycles-pp.avc_has_perm
0.78 +0.1 0.87 ± 2% perf-profile.self.cycles-pp.pipe_write
1.34 +0.1 1.44 perf-profile.self.cycles-pp.selinux_file_permission
0.71 +0.1 0.81 perf-profile.self.cycles-pp.__might_sleep
0.20 ± 2% +0.1 0.32 ± 3% perf-profile.self.cycles-pp.call_cpuidle
0.12 ± 6% +0.1 0.24 ± 3% perf-profile.self.cycles-pp.ttwu_do_activate
1.13 ± 2% +0.1 1.24 ± 3% perf-profile.self.cycles-pp.pipe_read
0.87 +0.1 0.99 ± 5% perf-profile.self.cycles-pp.do_syscall_64
0.08 ± 15% +0.1 0.20 ± 29% perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
1.94 +0.1 2.08 perf-profile.self.cycles-pp.update_cfs_group
1.17 +0.1 1.30 perf-profile.self.cycles-pp.___might_sleep
2.52 +0.1 2.66 perf-profile.self.cycles-pp.__sched_text_start
1.33 ± 2% +0.1 1.48 perf-profile.self.cycles-pp._raw_spin_unlock_irqrestore
2.90 +0.2 3.06 perf-profile.self.cycles-pp.load_new_mm_cr3
1.44 +0.2 1.59 perf-profile.self.cycles-pp._raw_spin_lock_irqsave
0.73 ± 2% +0.2 0.89 perf-profile.self.cycles-pp.enqueue_entity
1.16 ± 3% +0.2 1.32 perf-profile.self.cycles-pp.select_task_rq_fair
0.12 ± 7% +0.2 0.29 ± 4% perf-profile.self.cycles-pp.cpumask_next
1.64 ± 2% +0.2 1.81 ± 2% perf-profile.self.cycles-pp.find_next_bit
0.81 ± 3% +0.2 0.99 perf-profile.self.cycles-pp.__next_timer_interrupt
1.39 ± 2% +0.2 1.58 ± 2% perf-profile.self.cycles-pp.__update_load_avg_cfs_rq
1.56 +0.2 1.76 ± 3% perf-profile.self.cycles-pp.__update_load_avg_se
1.31 ± 2% +0.2 1.51 perf-profile.self.cycles-pp.update_load_avg
1.11 ± 4% +0.2 1.31 perf-profile.self.cycles-pp.update_curr
1.23 ± 3% +0.2 1.43 perf-profile.self.cycles-pp.native_sched_clock
1.86 +0.2 2.08 perf-profile.self.cycles-pp.switch_mm_irqs_off
0.77 ± 2% +0.2 0.99 ± 5% perf-profile.self.cycles-pp.do_idle
0.65 ± 5% +0.2 0.88 ± 2% perf-profile.self.cycles-pp.copy_user_enhanced_fast_string
0.00 +0.2 0.24 ± 5% perf-profile.self.cycles-pp.native_apic_msr_eoi_write
2.19 ± 2% +0.3 2.47 perf-profile.self.cycles-pp.__switch_to
1.22 ± 2% +0.3 1.51 ± 2% perf-profile.self.cycles-pp.native_write_msr
0.70 ± 4% +0.3 1.00 ± 5% perf-profile.self.cycles-pp.try_to_wake_up
1.22 ± 4% +0.3 1.56 perf-profile.self.cycles-pp.menu_select
0.03 ±100% +0.3 0.38 perf-profile.self.cycles-pp.native_irq_return_iret
1.79 +0.5 2.25 ± 2% perf-profile.self.cycles-pp.__switch_to_asm
0.13 ± 8% +0.5 0.64 ± 3% perf-profile.self.cycles-pp.tick_nohz_idle_exit
4.92 +0.6 5.48 perf-profile.self.cycles-pp.syscall_return_via_sysret
0.08 ± 6% +2.9 2.94 ± 9% perf-profile.self.cycles-pp.poll_idle
will-it-scale.per_process_ops
385000 +-+----------------------------------------------------------------+
380000 +-+ O O |
O O O OO O |
375000 +-O O O O O O O OO O O |
370000 +-+ O O O O O O O O |
| O O |
365000 +-+ |
360000 +-+ |
355000 +-+ |
| |
350000 +-+ |
345000 +-+ .+. .+.++.+.+.+. .+. |
|. .+. + +.+.+ +. .+. +.+. .+. .+. .+.+.+.+.++ +.|
340000 +-+ + + + + + ++.+ |
335000 +-+----------------------------------------------------------------+
will-it-scale.workload
6.2e+06 +-+---------------------------------------------------------------+
| O |
6.1e+06 +-+ O O |
O O O O O O O |
6e+06 +-O O OO O O O OO O O O OO |
5.9e+06 +-+ O O OO O |
| |
5.8e+06 +-+ |
| |
5.7e+06 +-+ |
5.6e+06 +-+ |
| .+ .+. |
5.5e+06 +-+ .+.+.+.+ + +.+.+. .+. .+.++. |
|.+.+.++ ++.+.+.+.+.++.+.+.+.+.++.+ + +.|
5.4e+06 +-+---------------------------------------------------------------+
[*] bisect-good sample
[O] bisect-bad sample
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
Thanks,
Rong Chen
View attachment "config-4.19.0-rc3-00006-g23e8ceb" of type "text/plain" (167686 bytes)
View attachment "job-script" of type "text/plain" (7030 bytes)
View attachment "job.yaml" of type "text/plain" (4674 bytes)
View attachment "reproduce" of type "text/plain" (319 bytes)
Powered by blists - more mailing lists