lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Date:   Tue, 8 Oct 2019 16:57:55 +0800
From:   kernel test robot <rong.a.chen@...el.com>
To:     Song Liu <songliubraving@...com>
Cc:     linux-kernel@...r.kernel.org, kernel-team@...com,
        Song Liu <songliubraving@...com>, stable@...r.kernel.org,
        Peter Zijlstra <peterz@...radead.org>,
        Arnaldo Carvalho de Melo <acme@...nel.org>,
        Thomas Gleixner <tglx@...utronix.de>, lkp@...org
Subject: [perf/core]  a449d92620:  vm-scalability.median -6.8% regression

Greeting,

FYI, we noticed a -6.8% regression of vm-scalability.median due to commit:


commit: a449d926203c733fecddc1fd077cf3d6ecef7f15 ("[PATCH] perf/core: fix corner case in perf_rotate_context()")
url: https://github.com/0day-ci/linux/commits/Song-Liu/perf-core-fix-corner-case-in-perf_rotate_context/20191003-144432


in testcase: vm-scalability
on test machine: 4 threads Intel(R) Core(TM) i3-3220 CPU @ 3.30GHz with 8G memory
with following parameters:

	runtime: 300s
	size: 8T
	test: anon-w-seq
	cpufreq_governor: performance
	ucode: 0x21

test-description: The motivation behind this suite is to exercise functions and regions of the mm/ of the Linux kernel which are of interest to us.
test-url: https://git.kernel.org/cgit/linux/kernel/git/wfg/vm-scalability.git/



If you fix the issue, kindly add following tag
Reported-by: kernel test robot <rong.a.chen@...el.com>


Details are as below:
-------------------------------------------------------------------------------------------------->


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        bin/lkp install job.yaml  # job file is attached in this email
        bin/lkp run     job.yaml

=========================================================================================
compiler/cpufreq_governor/kconfig/rootfs/runtime/size/tbox_group/test/testcase/ucode:
  gcc-7/performance/x86_64-rhel-7.6/debian-x86_64-2019-09-23.cgz/300s/8T/lkp-ivb-d02/anon-w-seq/vm-scalability/0x21

commit: 
  e336b40277 ("kprobes: Prohibit probing on BUG() and WARN() address")
  a449d92620 ("perf/core: fix corner case in perf_rotate_context()")

e336b4027775cb45 a449d926203c733fecddc1fd077 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
      0.00 ±  2%      +3.8%       0.00        vm-scalability.free_time
   1235827            -6.8%    1151821        vm-scalability.median
      0.02 ±  8%     +51.1%       0.03 ±  5%  vm-scalability.median_stddev
   4931119            -6.6%    4606513        vm-scalability.throughput
   1624883            -6.4%    1521193        vm-scalability.time.minor_page_faults
    228.43 ±  3%     +20.6%     275.45        vm-scalability.time.system_time
    919.50            -5.6%     867.64        vm-scalability.time.user_time
     11336            -3.9%      10895        vm-scalability.time.voluntary_context_switches
 1.432e+09            -7.1%   1.33e+09        vm-scalability.workload
      2.07 ±  4%      +0.7        2.81 ± 14%  mpstat.cpu.all.idle%
     20.04 ±  3%      +3.8       23.89        mpstat.cpu.all.sys%
     77.00            -5.8%      72.50        vmstat.cpu.us
      7171            -3.0%       6958        vmstat.system.cs
  12591513 ± 16%     +60.0%   20142845 ± 27%  cpuidle.C6.time
     21824 ± 11%     +44.6%      31557 ± 19%  cpuidle.C6.usage
      1123 ± 17%     +86.2%       2092 ± 24%  cpuidle.POLL.usage
      8568           +11.3%       9539        interrupts.CAL:Function_call_interrupts
      7614 ± 33%     +87.5%      14280 ± 25%  interrupts.CPU3.RES:Rescheduling_interrupts
     15566 ± 20%    +120.5%      34321 ± 28%  interrupts.NMI:Non-maskable_interrupts
     15566 ± 20%    +120.5%      34321 ± 28%  interrupts.PMI:Performance_monitoring_interrupts
     37492 ±  9%     +33.8%      50168 ±  2%  interrupts.RES:Rescheduling_interrupts
     31894 ±  6%     -19.7%      25618 ±  5%  softirqs.CPU0.RCU
     32409 ±  6%     -21.5%      25453 ±  8%  softirqs.CPU1.RCU
     35578 ±  3%     -26.0%      26329 ± 11%  softirqs.CPU2.RCU
     33765 ±  9%     -28.8%      24030 ±  3%  softirqs.CPU3.RCU
    133648 ±  4%     -24.1%     101433 ±  5%  softirqs.RCU
     21819 ± 11%     +44.6%      31556 ± 19%  turbostat.C6
      1.04 ± 16%      +0.6        1.66 ± 26%  turbostat.C6%
      0.27 ± 29%    +247.2%       0.94 ± 41%  turbostat.CPU%c6
     18.86            -2.1%      18.46        turbostat.CorWatt
      0.02 ± 44%    +520.0%       0.15 ± 40%  turbostat.Pkg%pc2
      0.02 ±101%   +1466.7%       0.35 ± 63%  turbostat.Pkg%pc6
     37.19            -1.5%      36.64        turbostat.PkgWatt
   2167377           +51.8%    3289728 ±  2%  meminfo.Active
   2165895           +51.8%    3288244 ±  2%  meminfo.Active(anon)
   2006718           +53.7%    3085007 ±  2%  meminfo.AnonHugePages
   2151900           +49.3%    3213568 ±  2%  meminfo.AnonPages
    141552           -60.7%      55678 ±  8%  meminfo.CmaFree
   4584114           -24.5%    3458742        meminfo.MemAvailable
   4694537           -24.0%    3569088        meminfo.MemFree
   3337073           +33.7%    4462514        meminfo.Memused
      7677           +27.5%       9791        meminfo.PageTables
      9532 ± 12%     -38.9%       5820 ± 20%  sched_debug.cfs_rq:/.min_vruntime.stddev
     29.67 ±  4%     -17.8%      24.38 ±  3%  sched_debug.cfs_rq:/.nr_spread_over.max
     11.21 ±  6%     -16.5%       9.36 ±  2%  sched_debug.cfs_rq:/.nr_spread_over.stddev
    103675 ± 28%     -22.0%      80844 ±  6%  sched_debug.cfs_rq:/.runnable_weight.stddev
     20991 ± 40%     -72.0%       5881 ± 52%  sched_debug.cfs_rq:/.spread0.max
      9528 ± 12%     -38.9%       5820 ± 20%  sched_debug.cfs_rq:/.spread0.stddev
    371922 ± 10%     +22.9%     456951 ±  8%  sched_debug.cpu.avg_idle.min
    191541 ± 12%     -19.1%     154992 ±  8%  sched_debug.cpu.avg_idle.stddev
    318.92 ± 68%     -59.2%     130.17 ±124%  sched_debug.cpu.curr->pid.stddev
    540668           +35.4%     732070 ±  3%  proc-vmstat.nr_active_anon
    536027           +30.6%     700171 ±  4%  proc-vmstat.nr_anon_pages
    977.25           +34.4%       1313 ±  4%  proc-vmstat.nr_anon_transparent_hugepages
    114025           -16.8%      94860 ±  2%  proc-vmstat.nr_dirty_background_threshold
    228332           -16.8%     189953 ±  2%  proc-vmstat.nr_dirty_threshold
     35358           -49.5%      17856 ±  8%  proc-vmstat.nr_free_cma
   1174409           -16.3%     982468 ±  2%  proc-vmstat.nr_free_pages
      1920           +17.3%       2252 ±  2%  proc-vmstat.nr_page_table_pages
      6783            +3.2%       7000        proc-vmstat.nr_shmem
    540670           +35.4%     732059 ±  3%  proc-vmstat.nr_zone_active_anon
   2505028            -5.5%    2366487        proc-vmstat.numa_hit
   2505028            -5.5%    2366487        proc-vmstat.numa_local
      1609 ±  5%     +30.7%       2103 ± 10%  proc-vmstat.pgactivate
  72313983            -7.6%   66816607 ±  2%  proc-vmstat.pgalloc_dma32
 2.472e+08            -6.9%  2.301e+08        proc-vmstat.pgalloc_normal
   2007636            -5.0%    1907338        proc-vmstat.pgfault
 3.193e+08            -7.2%  2.963e+08        proc-vmstat.pgfree
    619835            -7.2%     575294        proc-vmstat.thp_deferred_split_page
    620290            -7.1%     576268        proc-vmstat.thp_fault_alloc
 4.948e+09            -8.6%  4.525e+09        perf-stat.i.branch-instructions
  71787571            -6.5%   67122869        perf-stat.i.cache-misses
  79483511            -7.0%   73906455        perf-stat.i.cache-references
      7224            -2.9%       7017        perf-stat.i.context-switches
      0.82            +7.9%       0.88        perf-stat.i.cpi
 1.272e+10            -1.1%  1.258e+10        perf-stat.i.cpu-cycles
     96.49 ± 10%     -22.6%      74.70 ±  2%  perf-stat.i.cpu-migrations
    185.50            +3.7%     192.42        perf-stat.i.cycles-between-cache-misses
 3.354e+09            -6.7%  3.129e+09        perf-stat.i.dTLB-loads
      0.01 ±  5%      +0.0        0.01 ±  4%  perf-stat.i.dTLB-store-miss-rate%
 1.419e+09            -6.9%  1.321e+09        perf-stat.i.dTLB-stores
 1.558e+10            -7.8%  1.437e+10        perf-stat.i.instructions
      1.22            -6.8%       1.14        perf-stat.i.ipc
      6581            -5.4%       6226        perf-stat.i.minor-faults
      6581            -5.4%       6226        perf-stat.i.page-faults
      0.82            +7.2%       0.88        perf-stat.overall.cpi
    177.25            +5.8%     187.47        perf-stat.overall.cycles-between-cache-misses
      1.22            -6.7%       1.14        perf-stat.overall.ipc
 4.936e+09            -8.6%  4.513e+09        perf-stat.ps.branch-instructions
  71615773            -6.5%   66940324        perf-stat.ps.cache-misses
  79293323            -7.0%   73705360        perf-stat.ps.cache-references
      7207            -2.9%       6998        perf-stat.ps.context-switches
 1.269e+10            -1.1%  1.255e+10        perf-stat.ps.cpu-cycles
     96.25 ± 10%     -22.6%      74.50 ±  2%  perf-stat.ps.cpu-migrations
 3.346e+09            -6.7%   3.12e+09        perf-stat.ps.dTLB-loads
 1.416e+09            -6.9%  1.318e+09        perf-stat.ps.dTLB-stores
 1.554e+10            -7.8%  1.433e+10        perf-stat.ps.instructions
      6565            -5.4%       6209        perf-stat.ps.minor-faults
      6565            -5.4%       6209        perf-stat.ps.page-faults
 4.691e+12            -7.6%  4.333e+12        perf-stat.total.instructions
     67.81            -3.9       63.88        perf-profile.calltrace.cycles-pp.do_rw_once
      0.89 ±  8%      -0.3        0.54 ± 57%  perf-profile.calltrace.cycles-pp.smp_apic_timer_interrupt.apic_timer_interrupt.do_rw_once
      0.92 ±  8%      -0.2        0.69 ± 15%  perf-profile.calltrace.cycles-pp.apic_timer_interrupt.do_rw_once
     45.88            +3.6       49.44        perf-profile.calltrace.cycles-pp.do_access
     14.92 ±  4%      +6.1       20.99 ±  4%  perf-profile.calltrace.cycles-pp.clear_page_erms.clear_subpage.clear_huge_page.do_huge_pmd_anonymous_page.__handle_mm_fault
     15.42 ±  3%      +6.3       21.76 ±  4%  perf-profile.calltrace.cycles-pp.clear_subpage.clear_huge_page.do_huge_pmd_anonymous_page.__handle_mm_fault.handle_mm_fault
     15.92 ±  3%      +6.6       22.52 ±  4%  perf-profile.calltrace.cycles-pp.clear_huge_page.do_huge_pmd_anonymous_page.__handle_mm_fault.handle_mm_fault.__do_page_fault
     17.18 ±  3%      +6.6       23.79 ±  4%  perf-profile.calltrace.cycles-pp.__handle_mm_fault.handle_mm_fault.__do_page_fault.do_page_fault.page_fault
     17.00 ±  3%      +6.7       23.68 ±  4%  perf-profile.calltrace.cycles-pp.do_huge_pmd_anonymous_page.__handle_mm_fault.handle_mm_fault.__do_page_fault.do_page_fault
     16.45            +7.0       23.43        perf-profile.calltrace.cycles-pp.page_fault.do_access
     16.29            +7.0       23.27        perf-profile.calltrace.cycles-pp.handle_mm_fault.__do_page_fault.do_page_fault.page_fault.do_access
     16.39            +7.0       23.38        perf-profile.calltrace.cycles-pp.__do_page_fault.do_page_fault.page_fault.do_access
     16.43            +7.0       23.42        perf-profile.calltrace.cycles-pp.do_page_fault.page_fault.do_access
     68.56            -4.7       63.88        perf-profile.children.cycles-pp.do_rw_once
      1.62 ±  4%      -0.3        1.31 ± 13%  perf-profile.children.cycles-pp.smp_apic_timer_interrupt
      1.73 ±  4%      -0.3        1.42 ± 12%  perf-profile.children.cycles-pp.apic_timer_interrupt
      1.15 ±  6%      -0.2        0.90 ± 13%  perf-profile.children.cycles-pp.__hrtimer_run_queues
      0.83 ±  3%      -0.2        0.63 ± 17%  perf-profile.children.cycles-pp.tick_sched_timer
      0.66 ± 12%      -0.2        0.51 ± 18%  perf-profile.children.cycles-pp.update_process_times
      0.24 ± 32%      -0.1        0.11 ± 15%  perf-profile.children.cycles-pp.irq_work_run_list
      0.19 ± 21%      -0.1        0.09 ± 21%  perf-profile.children.cycles-pp.irq_work_interrupt
      0.19 ± 21%      -0.1        0.09 ± 21%  perf-profile.children.cycles-pp.smp_irq_work_interrupt
      0.18 ± 22%      -0.1        0.09 ± 21%  perf-profile.children.cycles-pp.irq_work_run
      0.18 ± 22%      -0.1        0.09 ± 21%  perf-profile.children.cycles-pp.printk
      0.10 ± 14%      -0.0        0.07 ± 25%  perf-profile.children.cycles-pp.try_to_wake_up
      0.26 ± 13%      +0.0        0.31 ±  4%  perf-profile.children.cycles-pp.__free_pages_ok
      0.34 ±  8%      +0.1        0.41 ±  2%  perf-profile.children.cycles-pp.__x64_sys_munmap
      0.34 ±  8%      +0.1        0.41 ±  3%  perf-profile.children.cycles-pp.__vm_munmap
      0.32 ± 11%      +0.1        0.39 ±  2%  perf-profile.children.cycles-pp.tlb_finish_mmu
      0.32 ± 11%      +0.1        0.39 ±  2%  perf-profile.children.cycles-pp.tlb_flush_mmu
      0.33 ± 12%      +0.1        0.40 ±  3%  perf-profile.children.cycles-pp.release_pages
      0.34 ±  8%      +0.1        0.42 ±  4%  perf-profile.children.cycles-pp.__do_munmap
      0.34 ±  8%      +0.1        0.41 ±  3%  perf-profile.children.cycles-pp.unmap_region
      0.28 ±  9%      +0.1        0.37 ±  5%  perf-profile.children.cycles-pp.munmap
      0.18 ±  8%      +0.1        0.29 ±  9%  perf-profile.children.cycles-pp._cond_resched
      0.08 ± 26%      +0.1        0.19 ± 12%  perf-profile.children.cycles-pp._raw_spin_lock_irqsave
      0.12 ± 15%      +0.1        0.24 ± 13%  perf-profile.children.cycles-pp.rcu_all_qs
      0.30 ± 12%      +0.1        0.43 ±  5%  perf-profile.children.cycles-pp.___might_sleep
      0.00            +0.1        0.15 ±  7%  perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
     45.22            +4.3       49.49        perf-profile.children.cycles-pp.do_access
     15.19 ±  3%      +6.4       21.58 ±  3%  perf-profile.children.cycles-pp.clear_page_erms
     15.47 ±  3%      +6.6       22.03 ±  3%  perf-profile.children.cycles-pp.clear_subpage
     17.56 ±  3%      +6.8       24.35 ±  3%  perf-profile.children.cycles-pp.page_fault
     15.98 ±  3%      +6.8       22.78 ±  3%  perf-profile.children.cycles-pp.clear_huge_page
     17.35 ±  3%      +6.8       24.15 ±  3%  perf-profile.children.cycles-pp.handle_mm_fault
     17.30 ±  3%      +6.8       24.10 ±  3%  perf-profile.children.cycles-pp.__handle_mm_fault
     17.51 ±  3%      +6.8       24.32 ±  3%  perf-profile.children.cycles-pp.do_page_fault
     17.47 ±  3%      +6.8       24.29 ±  3%  perf-profile.children.cycles-pp.__do_page_fault
     17.02 ±  2%      +6.9       23.91 ±  3%  perf-profile.children.cycles-pp.do_huge_pmd_anonymous_page
     59.31            -4.4       54.88        perf-profile.self.cycles-pp.do_rw_once
     19.43 ±  2%      -1.9       17.50        perf-profile.self.cycles-pp.do_access
      0.23 ± 12%      +0.1        0.29 ±  7%  perf-profile.self.cycles-pp.__free_pages_ok
      0.29 ± 10%      +0.1        0.40 ±  7%  perf-profile.self.cycles-pp.___might_sleep
      0.07 ±  7%      +0.1        0.18 ± 11%  perf-profile.self.cycles-pp.rcu_all_qs
      0.00            +0.1        0.15 ±  7%  perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
      0.31 ± 19%      +0.2        0.51 ± 13%  perf-profile.self.cycles-pp.clear_subpage
     14.82 ±  3%      +6.4       21.21 ±  3%  perf-profile.self.cycles-pp.clear_page_erms


                                                                                
                           vm-scalability.time.user_time                        
                                                                                
  940 +-+-------------------------------------------------------------------+   
      |                                               +                     |   
  930 +-+          +              +              +. .. +             +    + |   
  920 +-++.      .. +  .+       .. +            +  +    +..+.+..    + :  + :|   
      |..  +..+.+    +.  +  .+.+    +..+.  .+. +                +. +  : +  :|   
  910 +-+                 +.             +.   +                   +    +    |   
      |                                                                     |   
  900 +-+                                                                   |   
      |                                                                     |   
  890 +-+                                                                   |   
  880 +-+                                                                   |   
      O                                                                     |   
  870 +-+     O O         O  O    O    O    O         O O       O O         |   
      |  O O         O  O           O    O    O  O         O                |   
  860 +-+----------O-----------O-------------------O---------O--------------+   
                                                                                
                                                                                                                                                                
                          vm-scalability.time.system_time                       
                                                                                
  290 +-+-------------------------------------------------------------------+   
      |                        O                                            |   
  280 +-+O O    O    O    O         O    O       O         O                |   
  270 O-+     O    O    O    O    O    O    O O    O  O O    O  O O         |   
      |                                                                     |   
  260 +-+                                                                   |   
      |                                                                     |   
  250 +-+                                                                   |   
      |                                                                     |   
  240 +-+                                +..  +                  .+         |   
  230 +-+  +..+.          +..+.+..  +.. +    + +            .+..+  +   +.. :|   
      |   +     +.. .+.. +         +   +    +   + .+.. .+..+        + +    :|   
  220 +-++         +    +         +              +    +              +    + |   
      |                                                                     |   
  210 +-+-------------------------------------------------------------------+   
                                                                                
                                                                                                                                                                
                               vm-scalability.throughput                        
                                                                                
     5e+06 +-+--------------------------------------------------------------+   
           |                                                                |   
  4.95e+06 +-+ .+.    .+.+.+..   .+..+.    .+. .+.+..+.                  .+.|   
           | +.   +.+.        +.+      +.+.   +        +.+..+. .+..+.+.+.   |   
   4.9e+06 +-+                                                +             |   
  4.85e+06 +-+                                                              |   
           |                                                                |   
   4.8e+06 +-+                                                              |   
           |                                                                |   
  4.75e+06 +-+                                                              |   
   4.7e+06 +-+                                                              |   
           |                                                                |   
  4.65e+06 +-+                                                              |   
           |    O   O                O O      O O                           |   
   4.6e+06 O-O----O----O-O-O--O-O-O------O--O-----O--O-O-O--O-O-O--O--------+   
                                                                                
                                                                                                                                                                
                                 vm-scalability.median                          
                                                                                
  1.25e+06 +-+--------------------------------------------------------------+   
  1.24e+06 +-+ .+.     +. .+..             .+. .+.                          |   
           | +.   +. ..  +    +.+.+..+.+.+.   +   +..+.+.+..+.      .+.+..+.|   
  1.23e+06 +-+      +                                         +.+..+        |   
  1.22e+06 +-+                                                              |   
           |                                                                |   
  1.21e+06 +-+                                                              |   
   1.2e+06 +-+                                                              |   
  1.19e+06 +-+                                                              |   
           |                                                                |   
  1.18e+06 +-+                                                              |   
  1.17e+06 +-+                                                              |   
           |                                                                |   
  1.16e+06 O-+  O   O      O         O O      O O                           |   
  1.15e+06 +-O----O----O-O----O-O-O------O--O-----O--O-O-O--O-O-O--O--------+   
                                                                                
                                                                                                                                                                
                                vm-scalability.workload                         
                                                                                
  1.46e+09 +-+--------------------------------------------------------------+   
           |                                                                |   
  1.44e+09 +-+ .+.     +. .+..      .+.    .+. .+.  .+.                     |   
           | +.   +. ..  +    +.+.+.   +.+.   +   +.   +.+..+.+.+..+.+.+..+.|   
  1.42e+09 +-+      +                                                       |   
           |                                                                |   
   1.4e+09 +-+                                                              |   
           |                                                                |   
  1.38e+09 +-+                                                              |   
           |                                                                |   
  1.36e+09 +-+                                                              |   
           |                                                                |   
  1.34e+09 O-+      O                           O                           |   
           | O  O O      O O  O O O  O O O  O O   O  O O O  O O O  O        |   
  1.32e+09 +-+---------O----------------------------------------------------+   
                                                                                
                                                                                
[*] bisect-good sample
[O] bisect-bad  sample



Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


Thanks,
Rong Chen


View attachment "config-5.3.0-rc6-00606-ga449d926203c7" of type "text/plain" (199452 bytes)

View attachment "job-script" of type "text/plain" (7707 bytes)

View attachment "job.yaml" of type "text/plain" (5263 bytes)

View attachment "reproduce" of type "text/plain" (18052 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ