lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20200714052624.GA3874@shao2-debian>
Date:   Tue, 14 Jul 2020 13:26:24 +0800
From:   kernel test robot <rong.a.chen@...el.com>
To:     John Ogness <john.ogness@...utronix.de>
Cc:     Petr Mladek <pmladek@...e.com>,
        Peter Zijlstra <peterz@...radead.org>,
        Sergey Senozhatsky <sergey.senozhatsky.work@...il.com>,
        Sergey Senozhatsky <sergey.senozhatsky@...il.com>,
        Steven Rostedt <rostedt@...dmis.org>,
        Linus Torvalds <torvalds@...ux-foundation.org>,
        Greg Kroah-Hartman <gregkh@...uxfoundation.org>,
        Andrea Parri <parri.andrea@...il.com>,
        Thomas Gleixner <tglx@...utronix.de>,
        Paul McKenney <paulmck@...nel.org>, kexec@...ts.infradead.org,
        linux-kernel@...r.kernel.org, lkp@...ts.01.org
Subject: [printk] 96d5c61b1f: stress-ng.timerfd.ops_per_sec 8.3% improvement

Greeting,

FYI, we noticed a 8.3% improvement of stress-ng.timerfd.ops_per_sec due to commit:


commit: 96d5c61b1fff364edfae607e2bf2a66ee9616ff2 ("[PATCH v5 4/4] printk: use the lockless ringbuffer")
url: https://github.com/0day-ci/linux/commits/John-Ogness/printk-replace-ringbuffer/20200709-212623
base: https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git 9ebcfadb0610322ac537dd7aa5d9cbc2b2894c68

in testcase: stress-ng
on test machine: 96 threads Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 256G memory
with following parameters:

	nr_threads: 100%
	disk: 1HDD
	testtime: 30s
	class: interrupt
	cpufreq_governor: performance
	ucode: 0x5002f01






Details are as below:
-------------------------------------------------------------------------------------------------->


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        bin/lkp install job.yaml  # job file is attached in this email
        bin/lkp run     job.yaml

=========================================================================================
class/compiler/cpufreq_governor/disk/kconfig/nr_threads/rootfs/tbox_group/testcase/testtime/ucode:
  interrupt/gcc-9/performance/1HDD/x86_64-rhel-8.3/100%/debian-10.4-x86_64-20200603.cgz/lkp-csl-2sp6/stress-ng/30s/0x5002f01

commit: 
  72621a0b10 ("Revert "printk: lock/unlock console only for new logbuf entries"")
  96d5c61b1f ("printk: use the lockless ringbuffer")

72621a0b1077a8ad 96d5c61b1fff364edfae607e2bf 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
 2.274e+08           +18.6%  2.697e+08        stress-ng.sigq.ops
   7580582           +18.6%    8991276        stress-ng.sigq.ops_per_sec
 1.513e+08 ±  5%      -8.3%  1.387e+08 ±  2%  stress-ng.time.involuntary_context_switches
 3.768e+08 ±  3%      -5.9%  3.544e+08 ±  2%  stress-ng.time.voluntary_context_switches
 1.598e+09 ±  2%      +8.3%   1.73e+09        stress-ng.timerfd.ops
  53265971 ±  2%      +8.3%   57674750        stress-ng.timerfd.ops_per_sec
 3.716e+08 ±  2%     +10.7%  4.115e+08 ±  3%  cpuidle.C1.time
    174505 ± 10%     -14.5%     149157 ± 13%  numa-meminfo.node0.AnonHugePages
     32959            +2.1%      33668        proc-vmstat.nr_slab_reclaimable
     90647 ± 43%     -60.2%      36039 ±124%  numa-vmstat.node0.numa_other
     71315 ± 55%     +76.4%     125804 ± 35%  numa-vmstat.node1.numa_other
   1075081 ±  3%      -5.6%    1014755        vmstat.system.cs
   3762322            +4.7%    3940838        vmstat.system.in
     61530 ±  3%     -12.1%      54104 ±  4%  slabinfo.Acpi-State.active_objs
      1209 ±  3%     -11.9%       1065 ±  4%  slabinfo.Acpi-State.active_slabs
     61710 ±  3%     -11.9%      54359 ±  4%  slabinfo.Acpi-State.num_objs
      1209 ±  3%     -11.9%       1065 ±  4%  slabinfo.Acpi-State.num_slabs
     47136 ±  5%      +6.0%      49962 ±  6%  softirqs.CPU11.SCHED
     45028 ±  2%     +22.3%      55067 ± 10%  softirqs.CPU16.SCHED
     44744 ±  2%     +22.1%      54634 ±  5%  softirqs.CPU19.SCHED
     45345 ±  2%     +20.7%      54736 ±  8%  softirqs.CPU23.SCHED
     51163 ±  8%     -10.2%      45932        softirqs.CPU30.SCHED
     50701 ±  9%     -10.2%      45552        softirqs.CPU36.SCHED
     50848 ±  3%     -11.0%      45253        softirqs.CPU41.SCHED
     46949 ±  6%     +17.2%      55030 ±  4%  softirqs.CPU51.SCHED
     45567 ±  2%     +15.4%      52589 ± 11%  softirqs.CPU55.SCHED
     45734           +13.5%      51922 ±  9%  softirqs.CPU64.SCHED
     45342           +15.4%      52344 ±  5%  softirqs.CPU67.SCHED
     45901           +13.4%      52030 ±  9%  softirqs.CPU68.SCHED
     45578           +16.8%      53235 ±  7%  softirqs.CPU69.SCHED
     50784 ±  8%      -9.3%      46080        softirqs.CPU77.SCHED
   1187223 ± 26%     -47.3%     626108 ± 43%  sched_debug.cfs_rq:/.MIN_vruntime.avg
     44095 ± 13%     -37.3%      27628 ± 16%  sched_debug.cfs_rq:/.load.avg
     87004 ±  2%     -24.3%      65828 ± 13%  sched_debug.cfs_rq:/.load.stddev
   1187223 ± 26%     -47.3%     626108 ± 43%  sched_debug.cfs_rq:/.max_vruntime.avg
     87.24 ± 10%     -17.7%      71.80 ±  7%  sched_debug.cfs_rq:/.nr_spread_over.avg
     10.92 ±  8%     -25.6%       8.12 ± 10%  sched_debug.cfs_rq:/.nr_spread_over.min
     95.86 ± 12%     -15.1%      81.42 ±  9%  sched_debug.cfs_rq:/.nr_spread_over.stddev
    133.70 ± 30%     -81.5%      24.68 ±173%  sched_debug.cfs_rq:/.removed.load_avg.max
     58.28 ± 25%     -80.1%      11.62 ±173%  sched_debug.cfs_rq:/.removed.runnable_avg.max
      6.81 ± 25%     -77.8%       1.51 ±173%  sched_debug.cfs_rq:/.removed.runnable_avg.stddev
    744305 ± 35%     -57.0%     319901 ± 93%  sched_debug.cfs_rq:/.spread0.avg
     27562 ±  8%     -48.5%      14191 ± 13%  sched_debug.cpu.curr->pid.avg
     35526 ±  5%     -42.5%      20423 ± 11%  sched_debug.cpu.curr->pid.max
     14383 ± 47%     -55.9%       6346 ± 36%  sched_debug.cpu.curr->pid.min
     39.65 ±  4%     +16.5%      46.19 ±  7%  sched_debug.cpu.nr_running.avg
      1.17 ± 25%     -76.2%       0.28 ±163%  sched_debug.cpu.nr_uninterruptible.avg
   3769782 ±  4%      -8.4%    3451639 ±  3%  sched_debug.cpu.sched_count.max
      0.00       +2.2e+104%     216.75 ±138%  interrupts.102:PCI-MSI.31981635-edge.i40e-eth0-TxRx-66
   8246728 ±  4%      +7.0%    8826785        interrupts.CAL:Function_call_interrupts
     65853 ±  7%     +35.4%      89155 ± 18%  interrupts.CPU19.CAL:Function_call_interrupts
    133669 ±  7%     +79.6%     240043 ± 33%  interrupts.CPU19.RES:Rescheduling_interrupts
     85710 ±  6%     +13.6%      97401        interrupts.CPU32.CAL:Function_call_interrupts
    201257 ±  3%     -12.8%     175586 ±  4%  interrupts.CPU42.RES:Rescheduling_interrupts
      7732           -38.0%       4798 ± 34%  interrupts.CPU45.NMI:Non-maskable_interrupts
      7732           -38.0%       4798 ± 34%  interrupts.CPU45.PMI:Performance_monitoring_interrupts
    197808 ±  4%      -7.1%     183824 ±  6%  interrupts.CPU46.RES:Rescheduling_interrupts
    149600 ± 16%     +70.4%     254923 ± 25%  interrupts.CPU54.RES:Rescheduling_interrupts
    150479 ± 16%     +64.7%     247795 ± 37%  interrupts.CPU57.RES:Rescheduling_interrupts
    150695 ± 16%     +48.9%     224356 ± 22%  interrupts.CPU60.RES:Rescheduling_interrupts
     60718 ±  6%     +18.7%      72058 ±  3%  interrupts.CPU62.CAL:Function_call_interrupts
     68390 ±  4%     +20.6%      82506 ± 10%  interrupts.CPU63.CAL:Function_call_interrupts
      0.00       +2.2e+104%     216.25 ±138%  interrupts.CPU66.102:PCI-MSI.31981635-edge.i40e-eth0-TxRx-66
     62888 ±  4%     +22.3%      76902 ±  8%  interrupts.CPU7.CAL:Function_call_interrupts
     62071 ± 10%     +29.5%      80355 ± 10%  interrupts.CPU71.CAL:Function_call_interrupts
    231484 ± 21%     -25.0%     173696 ±  3%  interrupts.CPU75.RES:Rescheduling_interrupts
    234587 ± 12%     -17.0%     194613 ± 13%  interrupts.CPU77.RES:Rescheduling_interrupts
    209574 ±  3%     -15.0%     178038 ±  4%  interrupts.CPU83.RES:Rescheduling_interrupts
     87531 ±  4%     +31.1%     114721 ±  7%  interrupts.CPU88.CAL:Function_call_interrupts
      7732           -37.8%       4806 ± 34%  interrupts.CPU89.NMI:Non-maskable_interrupts
      7732           -37.8%       4806 ± 34%  interrupts.CPU89.PMI:Performance_monitoring_interrupts
     94766 ±  7%     +19.1%     112844 ± 10%  interrupts.CPU91.CAL:Function_call_interrupts
     91308 ±  6%     +22.9%     112175 ±  8%  interrupts.CPU92.CAL:Function_call_interrupts
     87571 ± 11%     +31.6%     115238 ± 14%  interrupts.CPU94.CAL:Function_call_interrupts
      7.54           +13.5%       8.56 ± 19%  perf-stat.i.MPKI
 1.312e+10            -2.8%  1.276e+10        perf-stat.i.branch-instructions
 1.246e+08            -1.4%  1.228e+08        perf-stat.i.branch-misses
  71092689 ±  3%     -19.7%   57085156 ±  2%  perf-stat.i.cache-misses
 2.672e+08           -10.2%    2.4e+08        perf-stat.i.cache-references
    100471            -4.4%      96054        perf-stat.i.cpu-clock
 2.436e+11            -5.8%  2.295e+11        perf-stat.i.cpu-cycles
     28918 ±  2%      +8.8%      31448 ±  2%  perf-stat.i.cycles-between-cache-misses
      0.03 ±  5%      +0.0        0.05 ± 68%  perf-stat.i.dTLB-load-miss-rate%
   3585193 ±  5%     -17.4%    2960337 ±  3%  perf-stat.i.dTLB-load-misses
      0.06            +0.0        0.06 ± 11%  perf-stat.i.dTLB-store-miss-rate%
   6767973            -6.1%    6356168 ±  2%  perf-stat.i.dTLB-store-misses
  1.05e+10            -1.4%  1.036e+10        perf-stat.i.dTLB-stores
   7143251 ±  4%      -7.7%    6593711 ±  2%  perf-stat.i.iTLB-loads
 6.422e+10            -2.4%  6.265e+10        perf-stat.i.instructions
      2.55            -2.7%       2.48        perf-stat.i.metric.GHz
      0.63 ±  3%     +10.7%       0.70 ±  5%  perf-stat.i.metric.K/sec
     92744            -3.1%      89912        perf-stat.i.minor-faults
     87.88            -1.4       86.43        perf-stat.i.node-load-miss-rate%
   8868728 ± 13%     -46.6%    4739379 ±  3%  perf-stat.i.node-loads
     94.74            -3.0       91.76 ±  2%  perf-stat.i.node-store-miss-rate%
   2508354 ± 11%     -33.3%    1673742        perf-stat.i.node-stores
    691526            -1.6%     680596        perf-stat.i.page-faults
     99599            -3.6%      96043        perf-stat.i.task-clock
      4.23            -4.5%       4.04        perf-stat.overall.MPKI
      0.95            +0.0        0.96        perf-stat.overall.branch-miss-rate%
      3.84            -2.4%       3.74        perf-stat.overall.cpi
      0.02 ±  3%      -0.0        0.02 ±  2%  perf-stat.overall.dTLB-load-miss-rate%
      0.06            -0.0        0.06 ±  2%  perf-stat.overall.dTLB-store-miss-rate%
      0.26            +2.5%       0.27        perf-stat.overall.ipc
     47.96            +4.3       52.28 ±  3%  perf-stat.overall.node-load-miss-rate%
     70.13            +2.3       72.43 ±  2%  perf-stat.overall.node-store-miss-rate%
 1.232e+10            +2.0%  1.257e+10        perf-stat.ps.branch-instructions
 1.166e+08            +3.3%  1.205e+08        perf-stat.ps.branch-misses
  67786783            -5.2%   64234227 ±  4%  perf-stat.ps.cache-misses
   1078561 ±  3%      -5.5%    1019000        perf-stat.ps.context-switches
 1.714e+10            +2.3%  1.753e+10        perf-stat.ps.dTLB-loads
 9.794e+09            +2.6%  1.005e+10        perf-stat.ps.dTLB-stores
  87676758            +4.4%   91553262        perf-stat.ps.iTLB-load-misses
  6.02e+10            +2.2%   6.15e+10        perf-stat.ps.instructions
   9026151           -12.8%    7873090 ± 10%  perf-stat.ps.node-loads
   2511686            -9.8%    2265767 ±  6%  perf-stat.ps.node-stores
    642630            +1.6%     652856        perf-stat.ps.page-faults
  3.27e+13            +2.2%  3.341e+13        perf-stat.total.instructions
     26.27 ±  2%      -2.7       23.56 ±  2%  perf-profile.calltrace.cycles-pp.security_task_kill.group_send_sig_info.kill_pid_info.do_rt_sigqueueinfo.__x64_sys_rt_sigqueueinfo
     26.23 ±  2%      -2.7       23.52 ±  2%  perf-profile.calltrace.cycles-pp.apparmor_task_kill.security_task_kill.group_send_sig_info.kill_pid_info.do_rt_sigqueueinfo
     26.45 ±  2%      -2.7       23.74 ±  2%  perf-profile.calltrace.cycles-pp.group_send_sig_info.kill_pid_info.do_rt_sigqueueinfo.__x64_sys_rt_sigqueueinfo.do_syscall_64
     13.52 ±  2%      -2.4       11.11 ±  2%  perf-profile.calltrace.cycles-pp.aa_get_task_label.apparmor_task_kill.security_task_kill.group_send_sig_info.kill_pid_info
     47.08            -2.4       44.67        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe
     47.16            -2.4       44.76        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe
     46.12            -2.3       43.82        perf-profile.calltrace.cycles-pp.kill_pid_info.do_rt_sigqueueinfo.__x64_sys_rt_sigqueueinfo.do_syscall_64.entry_SYSCALL_64_after_hwframe
     46.36            -2.3       44.07        perf-profile.calltrace.cycles-pp.do_rt_sigqueueinfo.__x64_sys_rt_sigqueueinfo.do_syscall_64.entry_SYSCALL_64_after_hwframe
     46.71            -2.3       44.43        perf-profile.calltrace.cycles-pp.__x64_sys_rt_sigqueueinfo.do_syscall_64.entry_SYSCALL_64_after_hwframe
      2.99 ± 14%      -1.0        1.98 ± 14%  perf-profile.calltrace.cycles-pp.__lll_unlock_wake
      2.83 ± 14%      -1.0        1.83 ± 15%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__lll_unlock_wake
      2.80 ± 14%      -1.0        1.81 ± 15%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__lll_unlock_wake
      2.67 ± 14%      -0.9        1.73 ± 15%  perf-profile.calltrace.cycles-pp.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe.__lll_unlock_wake
      2.63 ± 14%      -0.9        1.70 ± 15%  perf-profile.calltrace.cycles-pp.do_futex.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe.__lll_unlock_wake
      7.36 ±  3%      -0.7        6.70 ±  2%  perf-profile.calltrace.cycles-pp.__lll_lock_wait
      6.73 ±  3%      -0.6        6.14 ±  2%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__lll_lock_wait
      6.71 ±  3%      -0.6        6.13 ±  2%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__lll_lock_wait
      4.23 ±  4%      -0.5        3.77 ±  3%  perf-profile.calltrace.cycles-pp.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe.__lll_lock_wait
      4.19 ±  4%      -0.5        3.74 ±  3%  perf-profile.calltrace.cycles-pp.do_futex.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe.__lll_lock_wait
      5.59 ±  2%      -0.4        5.22 ±  2%  perf-profile.calltrace.cycles-pp.futex_wait.do_futex.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe
      4.78 ±  2%      -0.3        4.47 ±  2%  perf-profile.calltrace.cycles-pp.futex_wait_queue_me.futex_wait.do_futex.__x64_sys_futex.do_syscall_64
      0.52 ±  2%      -0.3        0.25 ±100%  perf-profile.calltrace.cycles-pp.__lock_task_sighand.do_send_sig_info.kill_pid_info.do_rt_sigqueueinfo.__x64_sys_rt_sigqueueinfo
      4.12 ±  2%      -0.2        3.88 ±  2%  perf-profile.calltrace.cycles-pp.schedule.futex_wait_queue_me.futex_wait.do_futex.__x64_sys_futex
      4.06 ±  2%      -0.2        3.81 ±  2%  perf-profile.calltrace.cycles-pp.__sched_text_start.schedule.futex_wait_queue_me.futex_wait.do_futex
      0.76 ±  4%      -0.1        0.62 ±  6%  perf-profile.calltrace.cycles-pp._raw_spin_lock.__sched_text_start.schedule.futex_wait_queue_me.futex_wait
      0.69 ±  4%      -0.1        0.56 ±  6%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock.__sched_text_start.schedule.futex_wait_queue_me
      1.08 ±  3%      -0.1        0.99        perf-profile.calltrace.cycles-pp.get_signal.do_signal.__prepare_exit_to_usermode.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.32 ±  2%      -0.1        1.24        perf-profile.calltrace.cycles-pp.stack_trace_save_tsk.__account_scheduler_latency.update_stats_enqueue_sleeper.enqueue_entity.enqueue_task_fair
      0.75 ±  4%      -0.1        0.68 ±  3%  perf-profile.calltrace.cycles-pp.dequeue_signal.get_signal.do_signal.__prepare_exit_to_usermode.do_syscall_64
      0.96 ±  3%      -0.1        0.90        perf-profile.calltrace.cycles-pp.generic_file_buffered_read.new_sync_read.vfs_read.ksys_pread64.do_syscall_64
      1.08 ±  3%      -0.1        1.02        perf-profile.calltrace.cycles-pp.new_sync_read.vfs_read.ksys_pread64.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.77            -0.1        0.71        perf-profile.calltrace.cycles-pp.__pthread_mutex_lock
      0.73 ±  2%      -0.0        0.68 ±  3%  perf-profile.calltrace.cycles-pp.dequeue_task_fair.__sched_text_start.schedule.futex_wait_queue_me.futex_wait
      0.55            +0.0        0.56        perf-profile.calltrace.cycles-pp.select_idle_sibling.select_task_rq_fair.try_to_wake_up.signal_wake_up_state.__send_signal
      0.64 ±  2%      +0.1        0.70 ±  2%  perf-profile.calltrace.cycles-pp.aa_may_signal.apparmor_task_kill.security_task_kill.group_send_sig_info.kill_pid_info
      1.47            +0.1        1.55        perf-profile.calltrace.cycles-pp.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe.pthread_cond_timedwait@@GLIBC_2.3.2
      1.43            +0.1        1.51        perf-profile.calltrace.cycles-pp.do_futex.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe.pthread_cond_timedwait@@GLIBC_2.3.2
      1.56            +0.1        1.65        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.pthread_cond_timedwait@@GLIBC_2.3.2
      1.58            +0.1        1.67        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.pthread_cond_timedwait@@GLIBC_2.3.2
      1.79            +0.1        1.88        perf-profile.calltrace.cycles-pp.pthread_cond_timedwait@@GLIBC_2.3.2
      0.74 ±  9%      +0.2        0.98 ±  4%  perf-profile.calltrace.cycles-pp.enqueue_entity.enqueue_task_fair.activate_task.ttwu_do_activate.sched_ttwu_pending
      0.76 ±  8%      +0.2        1.00 ±  4%  perf-profile.calltrace.cycles-pp.ttwu_do_activate.sched_ttwu_pending.__sysvec_call_function_single.asm_call_on_stack.sysvec_call_function_single
      0.76 ±  8%      +0.2        1.00 ±  4%  perf-profile.calltrace.cycles-pp.activate_task.ttwu_do_activate.sched_ttwu_pending.__sysvec_call_function_single.asm_call_on_stack
      0.76 ±  8%      +0.2        1.00 ±  4%  perf-profile.calltrace.cycles-pp.enqueue_task_fair.activate_task.ttwu_do_activate.sched_ttwu_pending.__sysvec_call_function_single
      0.81 ±  8%      +0.3        1.07 ±  4%  perf-profile.calltrace.cycles-pp.sched_ttwu_pending.__sysvec_call_function_single.asm_call_on_stack.sysvec_call_function_single.asm_sysvec_call_function_single
      0.84 ±  8%      +0.3        1.09 ±  4%  perf-profile.calltrace.cycles-pp.asm_call_on_stack.sysvec_call_function_single.asm_sysvec_call_function_single.finish_task_switch.__sched_text_start
      0.85 ±  7%      +0.3        1.11 ±  4%  perf-profile.calltrace.cycles-pp.sysvec_call_function_single.asm_sysvec_call_function_single.finish_task_switch.__sched_text_start.schedule_idle
      0.83 ±  8%      +0.3        1.09 ±  4%  perf-profile.calltrace.cycles-pp.__sysvec_call_function_single.asm_call_on_stack.sysvec_call_function_single.asm_sysvec_call_function_single.finish_task_switch
      0.91 ±  7%      +0.3        1.18 ±  4%  perf-profile.calltrace.cycles-pp.finish_task_switch.__sched_text_start.schedule_idle.do_idle.cpu_startup_entry
      0.88 ±  7%      +0.3        1.15 ±  4%  perf-profile.calltrace.cycles-pp.asm_sysvec_call_function_single.finish_task_switch.__sched_text_start.schedule_idle.do_idle
      1.57 ± 10%      +0.4        1.95 ±  5%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock.try_to_wake_up.wake_up_q.futex_wake
      1.60 ± 10%      +0.4        1.98 ±  5%  perf-profile.calltrace.cycles-pp._raw_spin_lock.try_to_wake_up.wake_up_q.futex_wake.do_futex
      4.32 ±  7%      +0.7        5.00 ±  3%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock.__sched_text_start.schedule_idle.do_idle
      4.38 ±  6%      +0.7        5.06 ±  3%  perf-profile.calltrace.cycles-pp._raw_spin_lock.__sched_text_start.schedule_idle.do_idle.cpu_startup_entry
      6.08 ±  6%      +1.0        7.08 ±  3%  perf-profile.calltrace.cycles-pp.__sched_text_start.schedule_idle.do_idle.cpu_startup_entry.start_secondary
      6.10 ±  6%      +1.0        7.10 ±  3%  perf-profile.calltrace.cycles-pp.schedule_idle.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64
      5.92 ±  6%      +1.1        7.03 ±  3%  perf-profile.calltrace.cycles-pp.do_futex.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe.pthread_cond_signal@@GLIBC_2.3.2
      5.94 ±  6%      +1.1        7.05 ±  3%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.pthread_cond_signal@@GLIBC_2.3.2
      5.95 ±  6%      +1.1        7.07 ±  3%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.pthread_cond_signal@@GLIBC_2.3.2
      5.93 ±  6%      +1.1        7.04 ±  3%  perf-profile.calltrace.cycles-pp.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe.pthread_cond_signal@@GLIBC_2.3.2
      6.08 ±  6%      +1.1        7.20 ±  3%  perf-profile.calltrace.cycles-pp.pthread_cond_signal@@GLIBC_2.3.2
      9.26 ±  7%      +1.8       11.02 ±  3%  perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.do_idle.cpu_startup_entry
      9.55 ±  7%      +1.8       11.38 ±  3%  perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.do_idle.cpu_startup_entry.start_secondary
      9.56 ±  7%      +1.8       11.40 ±  3%  perf-profile.calltrace.cycles-pp.cpuidle_enter.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64
     16.74 ±  6%      +3.0       19.71 ±  3%  perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64
     16.75 ±  6%      +3.0       19.73 ±  3%  perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.secondary_startup_64
     16.75 ±  6%      +3.0       19.73 ±  3%  perf-profile.calltrace.cycles-pp.start_secondary.secondary_startup_64
     16.90 ±  6%      +3.0       19.93 ±  3%  perf-profile.calltrace.cycles-pp.secondary_startup_64
     76.06            -2.9       73.12        perf-profile.children.cycles-pp.do_syscall_64
     76.41            -2.9       73.48        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     26.27 ±  2%      -2.7       23.56 ±  2%  perf-profile.children.cycles-pp.security_task_kill
     26.25 ±  2%      -2.7       23.54 ±  2%  perf-profile.children.cycles-pp.apparmor_task_kill
     26.45 ±  2%      -2.7       23.75 ±  2%  perf-profile.children.cycles-pp.group_send_sig_info
     13.53 ±  2%      -2.4       11.12 ±  2%  perf-profile.children.cycles-pp.aa_get_task_label
     46.12            -2.3       43.83        perf-profile.children.cycles-pp.kill_pid_info
     46.36            -2.3       44.07        perf-profile.children.cycles-pp.do_rt_sigqueueinfo
     46.72            -2.3       44.44        perf-profile.children.cycles-pp.__x64_sys_rt_sigqueueinfo
      3.02 ± 14%      -1.0        2.01 ± 14%  perf-profile.children.cycles-pp.__lll_unlock_wake
      7.39 ±  3%      -0.7        6.73 ±  2%  perf-profile.children.cycles-pp.__lll_lock_wait
      5.47 ±  3%      -0.4        5.04 ±  3%  perf-profile.children.cycles-pp.schedule
      5.59 ±  2%      -0.4        5.22 ±  2%  perf-profile.children.cycles-pp.futex_wait
      4.30 ±  5%      -0.4        3.94 ±  2%  perf-profile.children.cycles-pp.__prepare_exit_to_usermode
      4.79 ±  2%      -0.3        4.48 ±  2%  perf-profile.children.cycles-pp.futex_wait_queue_me
      1.53 ±  3%      -0.1        1.41        perf-profile.children.cycles-pp.get_signal
      0.66 ±  5%      -0.1        0.56 ±  3%  perf-profile.children.cycles-pp.switch_mm_irqs_off
      1.09 ±  3%      -0.1        0.99 ±  2%  perf-profile.children.cycles-pp.dequeue_signal
      0.62 ±  4%      -0.1        0.54 ±  4%  perf-profile.children.cycles-pp.__task_rq_lock
      0.89 ±  4%      -0.1        0.80 ±  2%  perf-profile.children.cycles-pp.__set_current_blocked
      0.53 ±  5%      -0.1        0.45 ±  3%  perf-profile.children.cycles-pp.recalc_sigpending
      0.67 ±  5%      -0.1        0.60 ±  3%  perf-profile.children.cycles-pp.__switch_to
      0.52 ±  4%      -0.1        0.45 ±  5%  perf-profile.children.cycles-pp.futex_wait_setup
      1.09 ±  2%      -0.1        1.03        perf-profile.children.cycles-pp.new_sync_read
      0.97 ±  3%      -0.1        0.91        perf-profile.children.cycles-pp.generic_file_buffered_read
      0.65 ±  4%      -0.1        0.59 ±  2%  perf-profile.children.cycles-pp._raw_spin_lock_irq
      0.20 ±  5%      -0.0        0.15 ±  3%  perf-profile.children.cycles-pp.put_task_stack
      0.51 ±  3%      -0.0        0.46 ±  2%  perf-profile.children.cycles-pp.__sigqueue_free
      0.77            -0.0        0.72        perf-profile.children.cycles-pp.__pthread_mutex_lock
      0.41 ±  4%      -0.0        0.36 ±  2%  perf-profile.children.cycles-pp.__set_task_blocked
      0.64 ±  2%      -0.0        0.60        perf-profile.children.cycles-pp.pagecache_get_page
      0.30 ±  5%      -0.0        0.26 ±  4%  perf-profile.children.cycles-pp.fpregs_mark_activate
      0.13 ±  9%      -0.0        0.10 ±  7%  perf-profile.children.cycles-pp.switch_fpu_return
      0.24 ±  4%      -0.0        0.21 ±  3%  perf-profile.children.cycles-pp.kmem_cache_free
      0.52 ±  2%      -0.0        0.49 ±  2%  perf-profile.children.cycles-pp.__lock_task_sighand
      0.46 ±  2%      -0.0        0.44 ±  2%  perf-profile.children.cycles-pp.signal_setup_done
      0.26            -0.0        0.23 ±  2%  perf-profile.children.cycles-pp.mark_wake_futex
      0.20 ±  2%      -0.0        0.18 ±  2%  perf-profile.children.cycles-pp.file_remove_privs
      0.15            -0.0        0.13 ±  3%  perf-profile.children.cycles-pp.dentry_needs_remove_privs
      0.14 ±  3%      -0.0        0.12        perf-profile.children.cycles-pp.__unwind_start
      0.07 ±  5%      -0.0        0.06 ±  7%  perf-profile.children.cycles-pp.clockevents_program_event
      0.12 ±  4%      -0.0        0.11        perf-profile.children.cycles-pp.security_inode_need_killpriv
      0.07 ±  7%      -0.0        0.05        perf-profile.children.cycles-pp.__vfs_getxattr
      0.18 ±  2%      -0.0        0.17 ±  4%  perf-profile.children.cycles-pp.__orc_find
      0.08            +0.0        0.09        perf-profile.children.cycles-pp.task_curr
      0.10 ±  4%      +0.0        0.11        perf-profile.children.cycles-pp.hrtimer_cancel
      0.15            +0.0        0.16 ±  2%  perf-profile.children.cycles-pp.check_kill_permission
      0.06 ±  7%      +0.0        0.07        perf-profile.children.cycles-pp.set_next_task_idle
      0.15 ±  4%      +0.0        0.17 ±  3%  perf-profile.children.cycles-pp.__might_sleep
      0.08 ±  5%      +0.0        0.10 ±  4%  perf-profile.children.cycles-pp.__next_timer_interrupt
      0.13 ±  3%      +0.0        0.15 ±  2%  perf-profile.children.cycles-pp._find_next_bit
      0.81            +0.0        0.82        perf-profile.children.cycles-pp.select_idle_sibling
      0.28            +0.0        0.30        perf-profile.children.cycles-pp.__copy_siginfo_from_user
      0.73            +0.0        0.75        perf-profile.children.cycles-pp.getuid
      0.41            +0.0        0.43        perf-profile.children.cycles-pp.available_idle_cpu
      0.43 ±  2%      +0.0        0.46        perf-profile.children.cycles-pp._copy_from_user
      0.93            +0.0        0.97        perf-profile.children.cycles-pp.dequeue_entity
      0.42            +0.0        0.46        perf-profile.children.cycles-pp.menu_select
      0.11 ±  6%      +0.0        0.16 ±  5%  perf-profile.children.cycles-pp.flush_smp_call_function_from_idle
      0.64 ±  2%      +0.1        0.70 ±  2%  perf-profile.children.cycles-pp.aa_may_signal
      1.79            +0.1        1.89        perf-profile.children.cycles-pp.pthread_cond_timedwait@@GLIBC_2.3.2
      1.83 ±  3%      +0.3        2.11        perf-profile.children.cycles-pp.finish_task_switch
      2.64 ±  3%      +0.3        2.98 ±  2%  perf-profile.children.cycles-pp.asm_call_on_stack
      2.03 ±  4%      +0.4        2.39 ±  2%  perf-profile.children.cycles-pp.sysvec_call_function_single
      1.99 ±  4%      +0.4        2.35 ±  2%  perf-profile.children.cycles-pp.__sysvec_call_function_single
      2.09 ±  4%      +0.4        2.46 ±  2%  perf-profile.children.cycles-pp.asm_sysvec_call_function_single
      2.04 ±  4%      +0.4        2.45 ±  2%  perf-profile.children.cycles-pp.sched_ttwu_pending
     11.57            +0.6       12.15        perf-profile.children.cycles-pp.__sched_text_start
     17.81            +0.6       18.45        perf-profile.children.cycles-pp.enqueue_task_fair
     17.83            +0.6       18.47        perf-profile.children.cycles-pp.activate_task
     17.83            +0.6       18.48        perf-profile.children.cycles-pp.ttwu_do_activate
     16.12            +0.6       16.77        perf-profile.children.cycles-pp.update_stats_enqueue_sleeper
     16.08            +0.6       16.72        perf-profile.children.cycles-pp.__account_scheduler_latency
     17.16            +0.7       17.84        perf-profile.children.cycles-pp.enqueue_entity
     25.29            +0.7       26.00        perf-profile.children.cycles-pp.try_to_wake_up
     14.49            +0.7       15.20        perf-profile.children.cycles-pp._raw_spin_lock_irqsave
     13.01 ±  4%      +0.9       13.90        perf-profile.children.cycles-pp._raw_spin_lock
      6.16 ±  6%      +1.0        7.18 ±  3%  perf-profile.children.cycles-pp.schedule_idle
      6.09 ±  6%      +1.1        7.20 ±  3%  perf-profile.children.cycles-pp.pthread_cond_signal@@GLIBC_2.3.2
     25.45 ±  3%      +1.7       27.18        perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
      9.33 ±  7%      +1.8       11.12 ±  3%  perf-profile.children.cycles-pp.intel_idle
      9.64 ±  7%      +1.9       11.50 ±  3%  perf-profile.children.cycles-pp.cpuidle_enter_state
      9.64 ±  7%      +1.9       11.50 ±  3%  perf-profile.children.cycles-pp.cpuidle_enter
     16.75 ±  6%      +3.0       19.73 ±  3%  perf-profile.children.cycles-pp.start_secondary
     16.90 ±  6%      +3.0       19.93 ±  3%  perf-profile.children.cycles-pp.secondary_startup_64
     16.90 ±  6%      +3.0       19.93 ±  3%  perf-profile.children.cycles-pp.cpu_startup_entry
     16.89 ±  6%      +3.0       19.91 ±  3%  perf-profile.children.cycles-pp.do_idle
     13.42 ±  2%      -2.4       11.04 ±  2%  perf-profile.self.cycles-pp.aa_get_task_label
      0.88 ±  4%      -0.1        0.77        perf-profile.self.cycles-pp._raw_spin_lock
      0.51 ±  5%      -0.1        0.41 ±  3%  perf-profile.self.cycles-pp.switch_mm_irqs_off
      0.43 ±  5%      -0.1        0.35 ±  4%  perf-profile.self.cycles-pp.futex_wait_queue_me
      0.44 ±  6%      -0.1        0.37 ±  3%  perf-profile.self.cycles-pp.recalc_sigpending
      0.63 ±  5%      -0.1        0.56 ±  3%  perf-profile.self.cycles-pp.__switch_to
      0.52 ±  4%      -0.1        0.45 ±  2%  perf-profile.self.cycles-pp._raw_spin_lock_irq
      0.73            -0.1        0.67        perf-profile.self.cycles-pp.__pthread_mutex_lock
      0.20 ±  5%      -0.0        0.15 ±  3%  perf-profile.self.cycles-pp.put_task_stack
      0.51 ±  3%      -0.0        0.46 ±  2%  perf-profile.self.cycles-pp.__sigqueue_free
      0.30 ±  3%      -0.0        0.26 ±  9%  perf-profile.self.cycles-pp.futex_wait_setup
      0.21 ±  3%      -0.0        0.17 ±  4%  perf-profile.self.cycles-pp.pagecache_get_page
      0.29 ±  4%      -0.0        0.25 ±  4%  perf-profile.self.cycles-pp.__lll_lock_wait
      0.29 ±  5%      -0.0        0.25 ±  4%  perf-profile.self.cycles-pp.fpregs_mark_activate
      0.35 ±  3%      -0.0        0.32        perf-profile.self.cycles-pp.generic_file_buffered_read
      0.13 ±  8%      -0.0        0.10 ±  8%  perf-profile.self.cycles-pp.switch_fpu_return
      0.18            -0.0        0.16 ±  4%  perf-profile.self.cycles-pp.simple_write_end
      0.12 ±  3%      -0.0        0.10 ±  8%  perf-profile.self.cycles-pp.ktime_get
      0.10 ± 10%      -0.0        0.08 ±  5%  perf-profile.self.cycles-pp.wake_up_q
      0.15 ±  3%      -0.0        0.13        perf-profile.self.cycles-pp.select_task_rq_fair
      0.08            -0.0        0.07 ±  7%  perf-profile.self.cycles-pp.generic_file_write_iter
      0.18 ±  2%      -0.0        0.17 ±  4%  perf-profile.self.cycles-pp.__orc_find
      0.11 ±  3%      -0.0        0.10        perf-profile.self.cycles-pp.__unqueue_futex
      0.08            -0.0        0.07        perf-profile.self.cycles-pp.arch_stack_walk
      0.08            +0.0        0.09        perf-profile.self.cycles-pp.task_curr
      0.19 ±  2%      +0.0        0.21 ±  2%  perf-profile.self.cycles-pp.prepare_signal
      0.13 ±  3%      +0.0        0.15 ±  3%  perf-profile.self.cycles-pp.__dequeue_signal
      0.13 ±  3%      +0.0        0.14 ±  3%  perf-profile.self.cycles-pp._find_next_bit
      0.09 ±  4%      +0.0        0.11        perf-profile.self.cycles-pp.menu_select
      0.41            +0.0        0.42 ±  2%  perf-profile.self.cycles-pp.available_idle_cpu
      0.63 ±  3%      +0.1        0.70 ±  2%  perf-profile.self.cycles-pp.aa_may_signal
     25.44 ±  3%      +1.7       27.17        perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
      9.33 ±  7%      +1.8       11.12 ±  3%  perf-profile.self.cycles-pp.intel_idle


                                                                                
                                  stress-ng.sigq.ops                            
                                                                                
    3e+08 +-----------------------------------------------------------------+   
          |  O O  O O    O  O O  O O  O O  O  O O  O O  O O  O O    O  O O  |   
  2.5e+08 |-+                                                     O         |   
          |..+.+..+.+..+.+.. .+..         .+..+.+..+.+..+                   |   
          |                 +    +.+..+.+.                                  |   
    2e+08 |-+                                                               |   
          |                                                                 |   
  1.5e+08 |-+                                                               |   
          |                                                                 |   
    1e+08 |-+                                                               |   
          |                                                                 |   
          |                                                                 |   
    5e+07 |-+                                                               |   
          |                                                                 |   
        0 +-----------------------------------------------------------------+   
                                                                                
                                                                                                                                                                
                             stress-ng.sigq.ops_per_sec                         
                                                                                
  1e+07 +-------------------------------------------------------------------+   
  9e+06 |-+O O  O O     O O  O  O O  O O  O  O O  O O  O  O O  O    O  O O  |   
        |                                                         O         |   
  8e+06 |..+.+..+.+..+..+.  .+..         .+..+.+..+.+..+                    |   
  7e+06 |-+               +.    +.+..+.+.                                   |   
        |                                                                   |   
  6e+06 |-+                                                                 |   
  5e+06 |-+                                                                 |   
  4e+06 |-+                                                                 |   
        |                                                                   |   
  3e+06 |-+                                                                 |   
  2e+06 |-+                                                                 |   
        |                                                                   |   
  1e+06 |-+                                                                 |   
      0 +-------------------------------------------------------------------+   
                                                                                
                                                                                
[*] bisect-good sample
[O] bisect-bad  sample



Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


Thanks,
Rong Chen


View attachment "config-5.8.0-rc3-00004-g96d5c61b1fff3" of type "text/plain" (158352 bytes)

View attachment "job-script" of type "text/plain" (7709 bytes)

View attachment "job.yaml" of type "text/plain" (5246 bytes)

View attachment "reproduce" of type "text/plain" (464 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ