Date:   Wed, 14 Apr 2021 10:45:26 +0800
From:   kernel test robot <oliver.sang@...el.com>
To:     Like Xu <like.xu@...ux.intel.com>
Cc:     Peter Zijlstra <peterz@...radead.org>,
        kernel test robot <lkp@...el.com>,
        LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
        ying.huang@...el.com, feng.tang@...el.com, zhengjun.xing@...el.com
Subject: [x86/perf]  abd562df94:  stress-ng.timer.ops_per_sec 6.1% improvement



Greetings,

FYI, we noticed a 6.1% improvement in stress-ng.timer.ops_per_sec due to commit:


commit: abd562df94d19d0a9769971a35801b3f4991715d ("x86/perf: Use static_call for x86_pmu.guest_get_msrs")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master


in testcase: stress-ng
on test machine: 96 threads Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 512G memory
with the following parameters:

	nr_threads: 100%
	disk: 1HDD
	testtime: 60s
	class: interrupt
	test: timer
	cpufreq_governor: performance
	ucode: 0x5003006






Details are as below:
-------------------------------------------------------------------------------------------------->


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        bin/lkp install                job.yaml  # job file is attached to this email
        bin/lkp split-job --compatible job.yaml
        bin/lkp run                    compatible-job.yaml

=========================================================================================
class/compiler/cpufreq_governor/disk/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime/ucode:
  interrupt/gcc-9/performance/1HDD/x86_64-rhel-8.3/100%/debian-10.4-x86_64-20200603.cgz/lkp-csl-2sp7/timer/stress-ng/60s/0x5003006

commit: 
  9a7832ce3d ("perf/x86/intel/uncore: With > 8 nodes, get pci bus die id from NUMA info")
  abd562df94 ("x86/perf: Use static_call for x86_pmu.guest_get_msrs")

9a7832ce3d920426 abd562df94d19d0a9769971a358 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
      7378            +3.7%       7650        stress-ng.time.percent_of_cpu_this_job_got
 1.622e+09            +6.1%  1.721e+09        stress-ng.timer.ops
  27012179            +6.1%   28671694        stress-ng.timer.ops_per_sec
     16.51            -2.3       14.20 ±  2%  mpstat.cpu.all.irq%
  24897453            +7.2%   26690698        vmstat.system.in
      2254 ±161%    +248.6%       7857 ± 78%  proc-vmstat.numa_pages_migrated
      2254 ±161%    +248.6%       7857 ± 78%  proc-vmstat.pgmigrate_success
      2.09 ± 54%    +182.7%       5.92 ± 39%  sched_debug.cfs_rq:/.removed.runnable_avg.avg
     16.77 ± 44%    +112.3%      35.62 ± 25%  sched_debug.cfs_rq:/.removed.runnable_avg.stddev
      2.09 ± 54%    +177.6%       5.81 ± 38%  sched_debug.cfs_rq:/.removed.util_avg.avg
     16.77 ± 44%    +106.1%      34.57 ± 23%  sched_debug.cfs_rq:/.removed.util_avg.stddev
     30.33 ±  6%     +39.8%      42.42 ± 17%  sched_debug.cpu.nr_uninterruptible.max
      1849 ± 37%     -57.4%     787.67 ± 21%  interrupts.CPU1.CAL:Function_call_interrupts
      1585 ± 35%     -53.0%     745.00 ± 37%  interrupts.CPU2.CAL:Function_call_interrupts
     77.00 ± 14%    +376.6%     367.00 ±130%  interrupts.CPU28.RES:Rescheduling_interrupts
      1019 ± 65%     -45.9%     551.33 ± 20%  interrupts.CPU52.CAL:Function_call_interrupts
      1058 ± 49%     -47.0%     560.50 ± 20%  interrupts.CPU75.CAL:Function_call_interrupts
    116.67 ± 16%     -33.4%      77.67 ± 23%  interrupts.CPU75.RES:Rescheduling_interrupts
      1019 ± 61%     -45.6%     554.83 ± 20%  interrupts.CPU76.CAL:Function_call_interrupts
    115.17 ± 54%     -38.5%      70.83 ±  4%  interrupts.CPU76.RES:Rescheduling_interrupts
      0.64 ± 23%     -37.4%       0.40 ± 10%  perf-stat.i.MPKI
 2.853e+10            +6.3%  3.034e+10        perf-stat.i.branch-instructions
 3.319e+08            +5.5%  3.501e+08        perf-stat.i.branch-misses
  69976777 ± 25%     -52.1%   33544338 ± 18%  perf-stat.i.cache-references
      1.76            -7.5%       1.63        perf-stat.i.cpi
 2.441e+11            -1.8%  2.395e+11        perf-stat.i.cpu-cycles
    158.86            -3.0%     154.16        perf-stat.i.cpu-migrations
      0.01 ± 17%      -0.0        0.00 ± 24%  perf-stat.i.dTLB-load-miss-rate%
   1683057 ± 17%     -45.8%     911716 ± 30%  perf-stat.i.dTLB-load-misses
 4.055e+10            +6.5%   4.32e+10        perf-stat.i.dTLB-loads
  22577249 ±  8%     +14.5%   25859832 ±  3%  perf-stat.i.dTLB-store-misses
 2.411e+10            +6.6%   2.57e+10        perf-stat.i.dTLB-stores
 2.813e+08 ±  2%      +5.2%  2.959e+08        perf-stat.i.iTLB-load-misses
    952108            +6.8%    1017001        perf-stat.i.iTLB-loads
 1.364e+11            +6.4%  1.452e+11        perf-stat.i.instructions
      0.57            +7.7%       0.62        perf-stat.i.ipc
      2.54            -1.8%       2.50        perf-stat.i.metric.GHz
    971.81            +6.4%       1034        perf-stat.i.metric.M/sec
    453092 ± 37%     -49.7%     228059 ± 38%  perf-stat.i.node-loads
   1389772 ± 29%     -52.0%     666567 ± 53%  perf-stat.i.node-store-misses
      0.51 ± 26%     -55.0%       0.23 ± 18%  perf-stat.overall.MPKI
      1.79            -7.8%       1.65        perf-stat.overall.cpi
      0.00 ± 17%      -0.0        0.00 ± 30%  perf-stat.overall.dTLB-load-miss-rate%
      0.56            +8.4%       0.61        perf-stat.overall.ipc
     98.54            -2.3       96.20 ±  2%  perf-stat.overall.node-store-miss-rate%
 2.807e+10            +6.4%  2.986e+10        perf-stat.ps.branch-instructions
 3.266e+08            +5.5%  3.445e+08        perf-stat.ps.branch-misses
  68956196 ± 25%     -52.0%   33103941 ± 18%  perf-stat.ps.cache-references
 2.402e+11            -1.8%  2.358e+11        perf-stat.ps.cpu-cycles
    156.99            -3.0%     152.29        perf-stat.ps.cpu-migrations
   1686029 ± 17%     -45.6%     917216 ± 29%  perf-stat.ps.dTLB-load-misses
  3.99e+10            +6.5%  4.251e+10        perf-stat.ps.dTLB-loads
  22220401 ±  8%     +14.5%   25451152 ±  3%  perf-stat.ps.dTLB-store-misses
 2.373e+10            +6.6%  2.529e+10        perf-stat.ps.dTLB-stores
 2.768e+08            +5.2%  2.912e+08        perf-stat.ps.iTLB-load-misses
    937147            +6.8%    1000585        perf-stat.ps.iTLB-loads
 1.343e+11            +6.4%  1.429e+11        perf-stat.ps.instructions
    449302 ± 37%     -49.3%     227603 ± 38%  perf-stat.ps.node-loads
   1367419 ± 29%     -52.0%     656210 ± 53%  perf-stat.ps.node-store-misses
 8.555e+12            +6.4%  9.102e+12        perf-stat.total.instructions
     10.31 ± 14%      -3.6        6.74 ±  8%  perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.syscall_enter_from_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
      9.28 ± 14%      -3.2        6.04 ±  7%  perf-profile.calltrace.cycles-pp.syscall_enter_from_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
     17.22 ±  2%      -2.9       14.30 ±  2%  perf-profile.calltrace.cycles-pp.posixtimer_rearm.dequeue_signal.get_signal.arch_do_signal_or_restart.exit_to_user_mode_prepare
     19.20            -2.7       16.47 ±  2%  perf-profile.calltrace.cycles-pp.dequeue_signal.get_signal.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode
      5.14 ±  3%      -2.7        2.46 ± 17%  perf-profile.calltrace.cycles-pp.ktime_get.common_hrtimer_rearm.posixtimer_rearm.dequeue_signal.get_signal
      6.53 ± 15%      -2.6        3.91 ±  7%  perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.syscall_enter_from_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
     20.90            -2.6       18.30        perf-profile.calltrace.cycles-pp.get_signal.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
      6.20 ±  3%      -2.6        3.62 ± 11%  perf-profile.calltrace.cycles-pp.common_hrtimer_rearm.posixtimer_rearm.dequeue_signal.get_signal.arch_do_signal_or_restart
      5.91 ± 15%      -2.5        3.43 ±  7%  perf-profile.calltrace.cycles-pp.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.syscall_enter_from_user_mode.do_syscall_64
      5.88 ± 15%      -2.5        3.40 ±  7%  perf-profile.calltrace.cycles-pp.__sysvec_apic_timer_interrupt.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.syscall_enter_from_user_mode
      4.79 ± 24%      -1.9        2.90 ± 10%  perf-profile.calltrace.cycles-pp.__x64_sys_timer_getoverrun.do_syscall_64.entry_SYSCALL_64_after_hwframe
      7.99 ± 10%      -1.8        6.19 ±  3%  perf-profile.calltrace.cycles-pp.hrtimer_interrupt.__sysvec_apic_timer_interrupt.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
      2.22 ±  6%      -1.3        0.97 ± 18%  perf-profile.calltrace.cycles-pp.ktime_get_update_offsets_now.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
     37.07            -1.1       36.00        perf-profile.calltrace.cycles-pp.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
     37.73            -1.0       36.74        perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
      2.56 ± 27%      -1.0        1.61 ± 14%  perf-profile.calltrace.cycles-pp.__lock_timer.__x64_sys_timer_getoverrun.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.90 ± 14%      -0.8        1.08 ±  7%  perf-profile.calltrace.cycles-pp.clockevents_program_event.hrtimer_interrupt.__sysvec_apic_timer_interrupt.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt
      2.04 ±  3%      -0.8        1.29 ±  9%  perf-profile.calltrace.cycles-pp.ktime_get.clockevents_program_event.hrtimer_start_range_ns.posixtimer_rearm.dequeue_signal
     38.61            -0.7       37.96        perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
      1.28 ±  7%      -0.5        0.76 ± 11%  perf-profile.calltrace.cycles-pp.ktime_get.clockevents_program_event.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt
      0.94 ± 16%      -0.4        0.58 ±  5%  perf-profile.calltrace.cycles-pp._raw_spin_unlock_irqrestore.__x64_sys_timer_getoverrun.do_syscall_64.entry_SYSCALL_64_after_hwframe
      5.26            -0.3        4.95        perf-profile.calltrace.cycles-pp.clockevents_program_event.hrtimer_start_range_ns.posixtimer_rearm.dequeue_signal.get_signal
      8.91            -0.2        8.69        perf-profile.calltrace.cycles-pp.hrtimer_start_range_ns.posixtimer_rearm.dequeue_signal.get_signal.arch_do_signal_or_restart
      0.84            -0.1        0.72 ±  2%  perf-profile.calltrace.cycles-pp.clockevents_program_min_delta.hrtimer_start_range_ns.posixtimer_rearm.dequeue_signal.get_signal
      0.63            +0.1        0.68        perf-profile.calltrace.cycles-pp.__get_user_nocheck_4.__fpu__restore_sig.restore_sigcontext.__x64_sys_rt_sigreturn.do_syscall_64
      0.54            +0.1        0.60        perf-profile.calltrace.cycles-pp.__get_user_nocheck_4.copy_fpstate_to_sigframe.__setup_rt_frame.arch_do_signal_or_restart.exit_to_user_mode_prepare
      0.66 ±  2%      +0.1        0.72        perf-profile.calltrace.cycles-pp.read_tsc.ktime_get.clockevents_program_event.hrtimer_start_range_ns.posixtimer_rearm
      0.62 ±  3%      +0.1        0.69        perf-profile.calltrace.cycles-pp.enqueue_hrtimer.hrtimer_start_range_ns.posixtimer_rearm.dequeue_signal.get_signal
      0.67            +0.1        0.74        perf-profile.calltrace.cycles-pp.read_tsc.ktime_get.common_hrtimer_rearm.posixtimer_rearm.dequeue_signal
      0.62            +0.1        0.69        perf-profile.calltrace.cycles-pp.syscall_return_via_sysret
      0.87            +0.1        0.94        perf-profile.calltrace.cycles-pp.__set_current_blocked.signal_setup_done.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode
      0.59 ±  4%      +0.1        0.67 ±  2%  perf-profile.calltrace.cycles-pp.__irqentry_text_end
      0.79 ±  2%      +0.1        0.87        perf-profile.calltrace.cycles-pp.hrtimer_forward.common_hrtimer_rearm.posixtimer_rearm.dequeue_signal.get_signal
      1.02            +0.1        1.11        perf-profile.calltrace.cycles-pp.signal_setup_done.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
      0.85 ±  4%      +0.1        0.94        perf-profile.calltrace.cycles-pp.__dequeue_signal.dequeue_signal.get_signal.arch_do_signal_or_restart.exit_to_user_mode_prepare
      1.08 ±  2%      +0.1        1.19        perf-profile.calltrace.cycles-pp.__set_current_blocked.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.12            +0.1        1.23        perf-profile.calltrace.cycles-pp.__get_user_nocheck_8.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.77            +0.1        0.87        perf-profile.calltrace.cycles-pp.copy_init_pkru_to_fpregs.fpu__clear.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode
      0.88            +0.1        0.99        perf-profile.calltrace.cycles-pp.__put_user_nocheck_4.copy_fpstate_to_sigframe.__setup_rt_frame.arch_do_signal_or_restart.exit_to_user_mode_prepare
      1.03 ±  2%      +0.1        1.14 ±  2%  perf-profile.calltrace.cycles-pp.copy_user_enhanced_fast_string._copy_from_user.restore_sigcontext.__x64_sys_rt_sigreturn.do_syscall_64
      1.10            +0.1        1.22 ±  2%  perf-profile.calltrace.cycles-pp._copy_from_user.restore_altstack.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.46            +0.2        1.63 ±  2%  perf-profile.calltrace.cycles-pp._copy_from_user.restore_sigcontext.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.42            +0.2        1.59 ±  2%  perf-profile.calltrace.cycles-pp.restore_altstack.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.57 ± 10%      +0.2        0.78 ±  3%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
      0.87 ± 14%      +0.2        1.09 ±  8%  perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.__x64_sys_rt_sigpending.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.39 ± 71%      +0.3        0.65 ±  5%  perf-profile.calltrace.cycles-pp.irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
      1.48 ±  7%      +0.3        1.77 ±  5%  perf-profile.calltrace.cycles-pp.posix_timer_fn.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt
      1.49 ±  8%      +0.3        1.78 ±  5%  perf-profile.calltrace.cycles-pp.lapic_next_deadline.clockevents_program_event.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt
      0.79 ± 18%      +0.4        1.19 ± 13%  perf-profile.calltrace.cycles-pp.copy_user_enhanced_fast_string._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.68 ± 22%      +0.4        1.08 ± 14%  perf-profile.calltrace.cycles-pp.__sysvec_apic_timer_interrupt.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.copy_user_generic_unrolled
      2.79            +0.4        3.20        perf-profile.calltrace.cycles-pp.lapic_next_deadline.clockevents_program_event.hrtimer_start_range_ns.posixtimer_rearm.dequeue_signal
      0.62 ± 25%      +0.4        1.03 ± 17%  perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.copy_user_enhanced_fast_string._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64
      0.68 ± 23%      +0.4        1.09 ± 14%  perf-profile.calltrace.cycles-pp.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.copy_user_generic_unrolled._copy_to_user
      2.54 ±  6%      +0.4        2.96 ±  5%  perf-profile.calltrace.cycles-pp.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
      0.18 ±141%      +0.4        0.62 ± 18%  perf-profile.calltrace.cycles-pp.___might_sleep.__might_fault._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64
      0.19 ±141%      +0.5        0.65 ± 18%  perf-profile.calltrace.cycles-pp.__sysvec_apic_timer_interrupt.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.copy_user_enhanced_fast_string
      0.19 ±141%      +0.5        0.65 ± 18%  perf-profile.calltrace.cycles-pp.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.copy_user_enhanced_fast_string._copy_to_user
      1.23 ± 15%      +0.5        1.69 ± 18%  perf-profile.calltrace.cycles-pp.__might_fault._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64.entry_SYSCALL_64_after_hwframe
      4.29            +0.5        4.75        perf-profile.calltrace.cycles-pp.fpu__clear.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
      0.72 ± 21%      +0.5        1.19 ± 14%  perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.75 ± 23%      +0.5        1.23 ± 14%  perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.copy_user_generic_unrolled._copy_to_user.__x64_sys_rt_sigpending
      0.22 ±141%      +0.5        0.74 ± 17%  perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.copy_user_enhanced_fast_string._copy_to_user.__x64_sys_rt_sigpending
      0.21 ±141%      +0.5        0.75 ± 15%  perf-profile.calltrace.cycles-pp.__sysvec_apic_timer_interrupt.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt._copy_to_user
      0.22 ±141%      +0.5        0.75 ± 15%  perf-profile.calltrace.cycles-pp.asm_call_sysvec_on_stack.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt._copy_to_user.__x64_sys_rt_sigpending
      0.00            +0.5        0.54 ±  2%  perf-profile.calltrace.cycles-pp.recalc_sigpending.dequeue_signal.get_signal.arch_do_signal_or_restart.exit_to_user_mode_prepare
      0.00            +0.5        0.55 ±  3%  perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.syscall_exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
      0.24 ±141%      +0.6        0.86 ± 15%  perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64
      8.05 ±  3%      +0.7        8.73        perf-profile.calltrace.cycles-pp.copy_fpstate_to_sigframe.__setup_rt_frame.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode
      1.21 ± 17%      +0.7        1.92 ± 11%  perf-profile.calltrace.cycles-pp.copy_user_generic_unrolled._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64.entry_SYSCALL_64_after_hwframe
      6.71            +0.8        7.46        perf-profile.calltrace.cycles-pp.__fpu__restore_sig.restore_sigcontext.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.97 ± 23%      +0.8        1.81 ± 23%  perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.copy_user_generic_unrolled._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64
     10.17 ±  2%      +0.9       11.06        perf-profile.calltrace.cycles-pp.__setup_rt_frame.arch_do_signal_or_restart.exit_to_user_mode_prepare.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
      8.76            +1.0        9.75        perf-profile.calltrace.cycles-pp.restore_sigcontext.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
     13.12            +1.5       14.58        perf-profile.calltrace.cycles-pp.__x64_sys_rt_sigreturn.do_syscall_64.entry_SYSCALL_64_after_hwframe
      4.57 ± 17%      +2.5        7.03 ± 14%  perf-profile.calltrace.cycles-pp._copy_to_user.__x64_sys_rt_sigpending.do_syscall_64.entry_SYSCALL_64_after_hwframe
      7.80 ± 14%      +2.9       10.68 ± 12%  perf-profile.calltrace.cycles-pp.__x64_sys_rt_sigpending.do_syscall_64.entry_SYSCALL_64_after_hwframe
     10.54 ±  3%      -4.9        5.63 ± 13%  perf-profile.children.cycles-pp.ktime_get
     10.09 ± 14%      -3.4        6.67 ±  7%  perf-profile.children.cycles-pp.syscall_enter_from_user_mode
     20.36            -3.2       17.12 ±  2%  perf-profile.children.cycles-pp.hrtimer_interrupt
     21.14            -3.1       18.01 ±  2%  perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt
      4.95 ±  4%      -2.9        2.02 ± 19%  perf-profile.children.cycles-pp.ktime_get_update_offsets_now
     17.26 ±  2%      -2.9       14.34 ±  2%  perf-profile.children.cycles-pp.posixtimer_rearm
     23.48            -2.9       20.63        perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
     19.25            -2.7       16.53 ±  2%  perf-profile.children.cycles-pp.dequeue_signal
     20.94            -2.6       18.35        perf-profile.children.cycles-pp.get_signal
      6.24 ±  3%      -2.6        3.65 ± 11%  perf-profile.children.cycles-pp.common_hrtimer_rearm
     28.04            -2.5       25.50        perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
     11.95 ±  7%      -2.4        9.54 ±  4%  perf-profile.children.cycles-pp.asm_call_sysvec_on_stack
      4.94 ± 24%      -2.0        2.99 ± 10%  perf-profile.children.cycles-pp.__x64_sys_timer_getoverrun
     12.07            -1.3       10.76        perf-profile.children.cycles-pp.clockevents_program_event
     37.11            -1.1       36.04        perf-profile.children.cycles-pp.arch_do_signal_or_restart
     37.84            -1.0       36.88        perf-profile.children.cycles-pp.exit_to_user_mode_prepare
     38.73            -0.6       38.10        perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      8.98            -0.2        8.78        perf-profile.children.cycles-pp.hrtimer_start_range_ns
      0.84            -0.1        0.73 ±  2%  perf-profile.children.cycles-pp.clockevents_program_min_delta
      0.10 ±  4%      +0.0        0.12 ±  3%  perf-profile.children.cycles-pp.do_sigaltstack
      0.09 ±  7%      +0.0        0.11 ±  4%  perf-profile.children.cycles-pp.irqentry_exit_to_user_mode
      0.13 ±  5%      +0.0        0.15 ±  3%  perf-profile.children.cycles-pp.rcu_all_qs
      0.11 ±  7%      +0.0        0.13 ±  5%  perf-profile.children.cycles-pp.irq_enter_rcu
      0.22 ±  3%      +0.0        0.24 ±  2%  perf-profile.children.cycles-pp.complete_signal
      0.17 ±  2%      +0.0        0.19 ±  2%  perf-profile.children.cycles-pp._cond_resched
      0.18 ±  3%      +0.0        0.21 ±  4%  perf-profile.children.cycles-pp.prepare_signal
      0.23 ±  2%      +0.0        0.25        perf-profile.children.cycles-pp.hrtimer_reprogram
      0.30 ±  4%      +0.0        0.33 ±  2%  perf-profile.children.cycles-pp.tick_program_event
      0.43 ±  3%      +0.0        0.47 ±  3%  perf-profile.children.cycles-pp.entry_SYSCALL_64_safe_stack
      0.27 ±  3%      +0.0        0.31 ±  2%  perf-profile.children.cycles-pp.__hrtimer_get_next_event
      0.21 ±  3%      +0.0        0.25 ±  5%  perf-profile.children.cycles-pp.get_xsave_addr
      0.57            +0.1        0.62 ±  2%  perf-profile.children.cycles-pp.__local_bh_enable_ip
      0.48 ±  3%      +0.1        0.53        perf-profile.children.cycles-pp.timerqueue_add
      0.60 ±  2%      +0.1        0.66        perf-profile.children.cycles-pp.__lock_task_sighand
      0.50            +0.1        0.56 ±  2%  perf-profile.children.cycles-pp.__hrtimer_next_event_base
      0.52 ±  2%      +0.1        0.58        perf-profile.children.cycles-pp.recalc_sigpending_tsk
      0.73 ±  2%      +0.1        0.80 ±  2%  perf-profile.children.cycles-pp.fpregs_mark_activate
      0.64 ±  3%      +0.1        0.71        perf-profile.children.cycles-pp.enqueue_hrtimer
      0.90 ±  2%      +0.1        0.98        perf-profile.children.cycles-pp.__set_task_blocked
      0.59 ±  4%      +0.1        0.67 ±  2%  perf-profile.children.cycles-pp.__irqentry_text_end
      0.79 ±  2%      +0.1        0.87        perf-profile.children.cycles-pp.hrtimer_forward
      0.88 ±  4%      +0.1        0.97        perf-profile.children.cycles-pp.__dequeue_signal
      1.02            +0.1        1.11        perf-profile.children.cycles-pp.signal_setup_done
      0.78            +0.1        0.88        perf-profile.children.cycles-pp.copy_init_pkru_to_fpregs
      0.94 ±  2%      +0.1        1.05 ±  2%  perf-profile.children.cycles-pp.native_sched_clock
      0.99 ±  2%      +0.1        1.10 ±  2%  perf-profile.children.cycles-pp.sched_clock
      1.25            +0.1        1.36        perf-profile.children.cycles-pp.__get_user_nocheck_8
      1.02            +0.1        1.14        perf-profile.children.cycles-pp.__put_user_nocheck_4
      1.23 ±  4%      +0.1        1.36 ±  2%  perf-profile.children.cycles-pp.irq_exit_rcu
      1.30            +0.1        1.42        perf-profile.children.cycles-pp.__get_user_nocheck_4
      1.24            +0.1        1.37        perf-profile.children.cycles-pp.recalc_sigpending
      1.11 ±  2%      +0.1        1.24 ±  2%  perf-profile.children.cycles-pp.sched_clock_cpu
      0.83 ±  7%      +0.2        0.99 ±  5%  perf-profile.children.cycles-pp.__might_sleep
      1.88            +0.2        2.05        perf-profile.children.cycles-pp._raw_spin_lock_irq
      1.43            +0.2        1.60 ±  2%  perf-profile.children.cycles-pp.restore_altstack
      1.50 ±  2%      +0.2        1.68        perf-profile.children.cycles-pp.irqtime_account_irq
      2.01            +0.2        2.19        perf-profile.children.cycles-pp.__set_current_blocked
      2.74            +0.2        2.93        perf-profile.children.cycles-pp.read_tsc
      1.82            +0.2        2.03        perf-profile.children.cycles-pp.syscall_return_via_sysret
      2.20            +0.2        2.44        perf-profile.children.cycles-pp.send_sigqueue
      0.65 ± 11%      +0.3        0.92 ±  3%  perf-profile.children.cycles-pp.syscall_exit_to_user_mode_prepare
      1.37 ±  9%      +0.3        1.65 ±  4%  perf-profile.children.cycles-pp.sync_regs
      3.36            +0.3        3.66        perf-profile.children.cycles-pp.posix_timer_fn
      2.64            +0.3        2.94        perf-profile.children.cycles-pp._copy_from_user
      1.42 ±  6%      +0.3        1.77 ±  7%  perf-profile.children.cycles-pp.___might_sleep
      5.91            +0.4        6.30 ±  2%  perf-profile.children.cycles-pp.__hrtimer_run_queues
      4.33            +0.5        4.79        perf-profile.children.cycles-pp.fpu__clear
      2.77 ±  8%      +0.7        3.45 ±  9%  perf-profile.children.cycles-pp.__might_fault
      8.16 ±  3%      +0.7        8.86        perf-profile.children.cycles-pp.copy_fpstate_to_sigframe
      6.89            +0.7        7.62        perf-profile.children.cycles-pp.lapic_next_deadline
      2.77 ±  7%      +0.7        3.50 ±  5%  perf-profile.children.cycles-pp.copy_user_enhanced_fast_string
      6.79            +0.8        7.55        perf-profile.children.cycles-pp.__fpu__restore_sig
     10.22 ±  2%      +0.9       11.12        perf-profile.children.cycles-pp.__setup_rt_frame
      8.77            +0.9        9.68        perf-profile.children.cycles-pp.native_irq_return_iret
      8.78            +1.0        9.78        perf-profile.children.cycles-pp.restore_sigcontext
      2.93 ±  9%      +1.1        4.07 ±  6%  perf-profile.children.cycles-pp.copy_user_generic_unrolled
     13.19            +1.5       14.66        perf-profile.children.cycles-pp.__x64_sys_rt_sigreturn
      4.77 ± 17%      +2.6        7.38 ± 13%  perf-profile.children.cycles-pp._copy_to_user
      8.03 ± 13%      +3.0       11.01 ± 12%  perf-profile.children.cycles-pp.__x64_sys_rt_sigpending
      8.37 ±  5%      -5.1        3.27 ± 23%  perf-profile.self.cycles-pp.ktime_get
      4.29 ±  4%      -2.9        1.35 ± 29%  perf-profile.self.cycles-pp.ktime_get_update_offsets_now
      0.13 ±  3%      +0.0        0.15 ±  3%  perf-profile.self.cycles-pp.__set_task_blocked
      0.12 ±  3%      +0.0        0.14 ±  3%  perf-profile.self.cycles-pp.signal_setup_done
      0.11 ±  4%      +0.0        0.13 ±  3%  perf-profile.self.cycles-pp.rcu_all_qs
      0.11 ±  5%      +0.0        0.13 ±  3%  perf-profile.self.cycles-pp.sched_clock_cpu
      0.18 ±  2%      +0.0        0.19 ±  2%  perf-profile.self.cycles-pp.restore_altstack
      0.16 ±  3%      +0.0        0.18 ±  2%  perf-profile.self.cycles-pp.__hrtimer_get_next_event
      0.10 ±  5%      +0.0        0.12 ±  3%  perf-profile.self.cycles-pp.do_sigaltstack
      0.17 ±  5%      +0.0        0.19 ±  2%  perf-profile.self.cycles-pp.irq_exit_rcu
      0.19 ±  2%      +0.0        0.21        perf-profile.self.cycles-pp.hrtimer_reprogram
      0.17 ±  2%      +0.0        0.19 ±  4%  perf-profile.self.cycles-pp.prepare_signal
      0.21 ±  3%      +0.0        0.23 ±  2%  perf-profile.self.cycles-pp.complete_signal
      0.29 ±  4%      +0.0        0.32 ±  2%  perf-profile.self.cycles-pp.tick_program_event
      0.21 ±  2%      +0.0        0.24 ±  3%  perf-profile.self.cycles-pp.sysvec_apic_timer_interrupt
      0.29            +0.0        0.32        perf-profile.self.cycles-pp.posix_timer_fn
      0.40 ±  3%      +0.0        0.44 ±  3%  perf-profile.self.cycles-pp.entry_SYSCALL_64_safe_stack
      0.32 ±  2%      +0.0        0.35 ±  3%  perf-profile.self.cycles-pp.dequeue_signal
      0.20 ±  2%      +0.0        0.24 ±  6%  perf-profile.self.cycles-pp.get_xsave_addr
      0.30 ±  3%      +0.0        0.34 ±  2%  perf-profile.self.cycles-pp._copy_from_user
      0.44 ±  2%      +0.0        0.48        perf-profile.self.cycles-pp.__set_current_blocked
      0.40 ±  3%      +0.0        0.44 ±  2%  perf-profile.self.cycles-pp.irqtime_account_irq
      0.54            +0.0        0.59 ±  2%  perf-profile.self.cycles-pp.__local_bh_enable_ip
      0.61 ±  3%      +0.0        0.66 ±  3%  perf-profile.self.cycles-pp.hrtimer_start_range_ns
      0.44 ±  2%      +0.0        0.49        perf-profile.self.cycles-pp.restore_sigcontext
      0.63            +0.1        0.68        perf-profile.self.cycles-pp.get_signal
      0.41 ±  4%      +0.1        0.46        perf-profile.self.cycles-pp.timerqueue_add
      0.63            +0.1        0.68 ±  2%  perf-profile.self.cycles-pp.clockevents_program_event
      0.29 ±  5%      +0.1        0.35        perf-profile.self.cycles-pp.syscall_exit_to_user_mode
      0.49 ±  3%      +0.1        0.54        perf-profile.self.cycles-pp.recalc_sigpending_tsk
      0.50            +0.1        0.56 ±  2%  perf-profile.self.cycles-pp.__hrtimer_next_event_base
      0.74            +0.1        0.80        perf-profile.self.cycles-pp.recalc_sigpending
      0.55            +0.1        0.61        perf-profile.self.cycles-pp.copy_init_pkru_to_fpregs
      0.69 ±  2%      +0.1        0.76 ±  2%  perf-profile.self.cycles-pp.fpregs_mark_activate
      0.52 ±  3%      +0.1        0.59        perf-profile.self.cycles-pp.arch_do_signal_or_restart
      0.65 ±  2%      +0.1        0.72        perf-profile.self.cycles-pp.send_sigqueue
      0.61            +0.1        0.68 ±  2%  perf-profile.self.cycles-pp.__x64_sys_rt_sigreturn
      0.57 ±  4%      +0.1        0.65 ±  2%  perf-profile.self.cycles-pp.__irqentry_text_end
      0.39            +0.1        0.46 ±  6%  perf-profile.self.cycles-pp.do_syscall_64
      0.76 ±  3%      +0.1        0.84 ±  2%  perf-profile.self.cycles-pp.__dequeue_signal
      0.78 ±  2%      +0.1        0.86        perf-profile.self.cycles-pp.hrtimer_forward
      0.84            +0.1        0.93 ±  2%  perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.59 ±  4%      +0.1        0.68        perf-profile.self.cycles-pp.__might_sleep
      0.72 ±  2%      +0.1        0.82        perf-profile.self.cycles-pp.exit_to_user_mode_prepare
      0.31 ±  5%      +0.1        0.41 ±  2%  perf-profile.self.cycles-pp.syscall_exit_to_user_mode_prepare
      0.90            +0.1        1.01 ±  2%  perf-profile.self.cycles-pp.native_sched_clock
      1.21            +0.1        1.32        perf-profile.self.cycles-pp.__get_user_nocheck_8
      1.25            +0.1        1.37        perf-profile.self.cycles-pp.__get_user_nocheck_4
      0.98            +0.1        1.10        perf-profile.self.cycles-pp.__put_user_nocheck_4
      0.56 ±  4%      +0.1        0.68 ±  7%  perf-profile.self.cycles-pp.__might_fault
      1.07 ±  2%      +0.1        1.20 ±  2%  perf-profile.self.cycles-pp.hrtimer_interrupt
      1.80            +0.2        1.96        perf-profile.self.cycles-pp._raw_spin_lock_irq
      0.80 ±  4%      +0.2        0.98 ±  3%  perf-profile.self.cycles-pp.asm_sysvec_apic_timer_interrupt
      0.65 ±  8%      +0.2        0.84 ±  6%  perf-profile.self.cycles-pp.__x64_sys_rt_sigpending
      2.93            +0.2        3.11        perf-profile.self.cycles-pp._raw_spin_unlock_irqrestore
      2.68            +0.2        2.87        perf-profile.self.cycles-pp.read_tsc
      1.10 ±  2%      +0.2        1.29 ±  3%  perf-profile.self.cycles-pp.___might_sleep
      1.94            +0.2        2.14        perf-profile.self.cycles-pp.__setup_rt_frame
      1.81            +0.2        2.02        perf-profile.self.cycles-pp.syscall_return_via_sysret
      0.36 ± 12%      +0.2        0.58 ± 13%  perf-profile.self.cycles-pp._copy_to_user
      1.34 ±  9%      +0.3        1.61 ±  4%  perf-profile.self.cycles-pp.sync_regs
      2.97            +0.3        3.30        perf-profile.self.cycles-pp.fpu__clear
      2.11 ±  3%      +0.4        2.47 ±  2%  perf-profile.self.cycles-pp.copy_user_enhanced_fast_string
      4.35            +0.4        4.73        perf-profile.self.cycles-pp.copy_fpstate_to_sigframe
      1.93 ±  3%      +0.5        2.40 ±  2%  perf-profile.self.cycles-pp.copy_user_generic_unrolled
      4.39            +0.5        4.89        perf-profile.self.cycles-pp.__fpu__restore_sig
      6.87            +0.7        7.59        perf-profile.self.cycles-pp.lapic_next_deadline
      8.75            +0.9        9.65        perf-profile.self.cycles-pp.native_irq_return_iret
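
The absolute deltas in the rows above are rounded to one decimal place, so small
entries all show as "+0.0" or "+0.1". A minimal sketch for recomputing relative
changes from the two value columns follows. It assumes each row is laid out as
"base value [± stddev%]  change  new value [± stddev%]  metric name"; the names
parse_row and relative_change are hypothetical helpers and are not part of
lkp-tests.

```python
import re

# Assumed row layout (not confirmed by the report itself):
#   base [± stddev%]   change   new [± stddev%]   metric
ROW = re.compile(
    r"^\s*(?P<base>[\d.]+)(?:\s*±\s*(?P<base_sd>\d+)%)?"
    r"\s+(?P<change>[+-][\d.]+)"
    r"\s+(?P<new>[\d.]+)(?:\s*±\s*(?P<new_sd>\d+)%)?"
    r"\s+(?P<metric>\S+)\s*$"
)


def parse_row(line):
    """Return the fields of one comparison row, or None if it does not match."""
    m = ROW.match(line)
    if m is None:
        return None
    return {
        "metric": m["metric"],
        "base": float(m["base"]),
        "change": float(m["change"]),
        "new": float(m["new"]),
    }


def relative_change(row):
    """Relative change of the new value against the base value, in percent."""
    return (row["new"] - row["base"]) / row["base"] * 100.0


if __name__ == "__main__":
    # Two rows copied verbatim from the table above.
    sample = [
        "      0.36 ± 12%      +0.2        0.58 ± 13%  perf-profile.self.cycles-pp._copy_to_user",
        "      8.75            +0.9        9.65        perf-profile.self.cycles-pp.native_irq_return_iret",
    ]
    for line in sample:
        row = parse_row(line)
        print(f"{row['metric']}: {relative_change(row):+.1f}%")
```

Rows with a large "± N%" spread (for example _copy_to_user) are noisy, so a
relative change computed this way should be read with that spread in mind.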


                                                                                
                              stress-ng.timer.ops_per_sec                       
                                                                                
     3e+07 +----------------------------------------------------------------+   
           |                              O                                 |   
  2.95e+07 |-+     O O  O         O     O                                   |   
           |          O   O O   O           O  O     O     O                |   
   2.9e+07 |-O O              O     O O      O     O     O        O O       |   
           |                                     O     O         O    O O   |   
  2.85e+07 |-+   O                                           O O          O |   
           |                 .+                                             |   
   2.8e+07 |.+. .+. .+      +  +                                            |   
           |   +   + :     +    +.+. .+    .+                               |   
  2.75e+07 |-+        : +.+         +  :  + :  +.                           |   
           |          :+               : +   :+  +.+.+. .+.+.+              |   
   2.7e+07 |-+        +                 +    +         +      +             |   
           |                                                   +.+          |   
  2.65e+07 +----------------------------------------------------------------+   
                                                                                
                                                                                
[*] bisect-good sample
[O] bisect-bad  sample



Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


---
0DAY/LKP+ Test Infrastructure                   Open Source Technology Center
https://lists.01.org/hyperkitty/list/lkp@lists.01.org       Intel Corporation

Thanks,
Oliver Sang


View attachment "config-5.11.0-rc2-00003-gabd562df94d1" of type "text/plain" (172366 bytes)

View attachment "job-script" of type "text/plain" (8206 bytes)

View attachment "job.yaml" of type "text/plain" (5723 bytes)

View attachment "reproduce" of type "text/plain" (407 bytes)
