lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Date:	Fri, 13 Feb 2015 13:37:12 +0800
From:	Huang Ying <ying.huang@...el.com>
To:	Will Deacon <will.deacon@....com>
Cc:	LKML <linux-kernel@...r.kernel.org>, LKP ML <lkp@...org>
Subject: [LKP] [mmu_gather] fb7332a9fed: +5.4%
 tlbflush.mem_acc_time_thread_ms -6.2% will-it-scale.per_thread_ops

FYI, we noticed the below changes on

commit fb7332a9fedfd62b1ba6530c86f39f0fa38afd49 ("mmu_gather: move minimal range calculations into generic code")


testbox/testcase/testparams: ivb42/will-it-scale/performance-brk1

63648dd20fa0780a  fb7332a9fedfd62b1ba6530c86  
----------------  --------------------------  
         %stddev     %change         %stddev
             \          |                \  
    733067 ±  0%      -6.2%     687433 ±  0%  will-it-scale.per_thread_ops
   3728462 ±  0%      -5.7%    3516927 ±  0%  will-it-scale.per_process_ops
      0.46 ±  0%      -3.3%       0.44 ±  0%  will-it-scale.scalability
    140435 ± 22%     -75.2%      34877 ± 32%  sched_debug.cpu#19.ttwu_count
        85 ± 32%    +144.6%        208 ± 30%  sched_debug.cfs_rq[8]:/.blocked_load_avg
    892437 ± 34%     -55.5%     397177 ± 16%  sched_debug.cpu#5.nr_switches
    445593 ± 34%     -55.5%     198460 ± 16%  sched_debug.cpu#5.sched_goidle
        92 ± 29%    +136.7%        219 ± 29%  sched_debug.cfs_rq[8]:/.tg_load_contrib
    374967 ± 14%     -56.3%     163891 ± 35%  sched_debug.cpu#5.ttwu_count
    924309 ± 32%     -51.0%     452960 ±  9%  sched_debug.cpu#5.sched_count
        90 ± 41%     +78.2%        161 ± 18%  sched_debug.cfs_rq[30]:/.tg_load_contrib
    205529 ± 15%     -38.3%     126863 ± 32%  sched_debug.cpu#40.ttwu_count
      1152 ± 33%     +50.4%       1734 ± 22%  sched_debug.cpu#13.ttwu_local
      3.44 ± 38%     +86.6%       6.42 ± 18%  perf-profile.cpu-cycles.rwsem_spin_on_owner.rwsem_down_write_failed.call_rwsem_down_write_failed.sys_brk.system_call_fastpath
         6 ± 39%    +100.0%         13 ±  9%  sched_debug.cpu#2.cpu_load[0]
      1.03 ± 11%     +46.6%       1.52 ± 12%  perf-profile.cpu-cycles.find_vma.do_munmap.sys_brk.system_call_fastpath.brk
      0.76 ± 32%     +65.2%       1.26 ± 14%  perf-profile.cpu-cycles.up_write.vma_adjust.vma_merge.do_brk.sys_brk
    538886 ± 38%     +70.3%     917481 ± 20%  sched_debug.cpu#26.ttwu_count
      0.76 ± 21%     +69.6%       1.28 ± 14%  perf-profile.cpu-cycles.find_vma.sys_brk.system_call_fastpath.brk
        16 ±  5%     +25.8%         20 ± 18%  sched_debug.cpu#30.cpu_load[1]
      2224 ± 11%     +21.8%       2709 ± 16%  sched_debug.cpu#41.curr->pid
      3.94 ±  9%     -30.5%       2.74 ± 17%  perf-profile.cpu-cycles._raw_spin_lock.try_to_wake_up.wake_up_process.__rwsem_do_wake.rwsem_wake
        28 ± 14%     -32.1%         19 ± 14%  sched_debug.cfs_rq[25]:/.load
        16 ±  4%     +25.0%         20 ± 12%  sched_debug.cpu#32.cpu_load[2]
        17 ± 16%     +20.3%         20 ± 10%  sched_debug.cfs_rq[34]:/.runnable_load_avg
    180505 ± 26%     -43.4%     102128 ± 18%  sched_debug.cpu#44.ttwu_count
      2135 ±  7%     +28.7%       2747 ± 22%  sched_debug.cpu#44.curr->pid
     13.14 ± 10%     +20.4%      15.82 ±  5%  perf-profile.cpu-cycles.call_rwsem_down_write_failed.sys_brk.system_call_fastpath.brk
     13.05 ± 10%     +20.4%      15.71 ±  5%  perf-profile.cpu-cycles.rwsem_down_write_failed.call_rwsem_down_write_failed.sys_brk.system_call_fastpath.brk
      2.30 ± 10%     +24.1%       2.86 ±  7%  perf-profile.cpu-cycles.vma_adjust.vma_merge.do_brk.sys_brk.system_call_fastpath
      1.70 ±  6%     -13.2%       1.47 ± 11%  perf-profile.cpu-cycles.clockevents_program_event.tick_program_event.__hrtimer_start_range_ns.hrtimer_start_range_ns.tick_nohz_restart
      5512 ±  1%     +27.7%       7040 ± 22%  sched_debug.cfs_rq[20]:/.exec_clock
        17 ±  5%     -30.9%         11 ± 32%  sched_debug.cpu#40.load
      2.73 ±  9%     +21.0%       3.30 ±  7%  perf-profile.cpu-cycles.vma_merge.do_brk.sys_brk.system_call_fastpath.brk
    505131 ± 13%     +26.4%     638526 ±  7%  sched_debug.cpu#32.ttwu_count
      1.09 ±  7%     -14.9%       0.93 ± 11%  perf-profile.cpu-cycles._raw_spin_unlock_irqrestore.rwsem_wake.call_rwsem_wake.sys_brk.system_call_fastpath
        16 ±  2%     +13.8%         18 ±  8%  sched_debug.cpu#32.cpu_load[3]
      1.73 ±  6%     -12.5%       1.52 ± 11%  perf-profile.cpu-cycles.tick_program_event.__hrtimer_start_range_ns.hrtimer_start_range_ns.tick_nohz_restart.tick_nohz_idle_exit
      1.89 ±  5%     -12.2%       1.66 ±  1%  perf-profile.cpu-cycles.set_next_entity.pick_next_task_fair.__sched_text_start.schedule_preempt_disabled.cpu_startup_entry
     17.50 ±  5%     -14.4%      14.98 ±  6%  perf-profile.cpu-cycles.try_to_wake_up.wake_up_process.__rwsem_do_wake.rwsem_wake.call_rwsem_wake
     18.55 ±  5%     -13.8%      16.00 ±  6%  perf-profile.cpu-cycles.wake_up_process.__rwsem_do_wake.rwsem_wake.call_rwsem_wake.sys_brk
       229 ±  6%     -10.5%        205 ±  0%  sched_debug.cfs_rq[2]:/.tg_runnable_contrib
     10557 ±  6%     -10.2%       9478 ±  0%  sched_debug.cfs_rq[2]:/.avg->runnable_avg_sum
     18.66 ±  5%     -13.4%      16.16 ±  5%  perf-profile.cpu-cycles.__rwsem_do_wake.rwsem_wake.call_rwsem_wake.sys_brk.system_call_fastpath
    745968 ±  4%      +9.1%     813977 ±  5%  sched_debug.cpu#10.avg_idle
      3.53 ±  2%     +13.0%       3.98 ±  5%  perf-profile.cpu-cycles.unmap_region.do_munmap.sys_brk.system_call_fastpath.brk
    738761 ±  5%     +13.5%     838441 ±  3%  sched_debug.cpu#2.avg_idle
     21.09 ±  5%     -13.4%      18.27 ±  6%  perf-profile.cpu-cycles.rwsem_wake.call_rwsem_wake.sys_brk.system_call_fastpath.brk
        16 ±  6%     +12.1%         18 ±  2%  sched_debug.cpu#29.cpu_load[2]
     21.21 ±  5%     -13.2%      18.40 ±  6%  perf-profile.cpu-cycles.call_rwsem_wake.sys_brk.system_call_fastpath.brk
     11.09 ±  3%     +11.3%      12.35 ±  2%  perf-profile.cpu-cycles.do_munmap.sys_brk.system_call_fastpath.brk
    678615 ±  4%      +7.4%     728663 ±  3%  sched_debug.cpu#25.avg_idle
      2757 ±  5%      -7.4%       2551 ±  2%  sched_debug.cpu#25.curr->pid
       885 ± 11%     +22.0%       1080 ± 11%  slabinfo.RAW.num_objs
       885 ± 11%     +22.0%       1080 ± 11%  slabinfo.RAW.active_objs
      2528 ±  3%      +4.7%       2647 ±  5%  sched_debug.cpu#35.curr->pid
       230 ±  4%     -10.0%        207 ±  1%  sched_debug.cfs_rq[8]:/.tg_runnable_contrib

testbox/testcase/testparams: lkp-t410/tlbflush/performance-200%-32x-512

63648dd20fa0780a  fb7332a9fedfd62b1ba6530c86  
----------------  --------------------------  
       113 ±  0%      -5.5%        107 ±  0%  tlbflush.mem_acc_cost_ns_time
      8758 ±  0%      +5.4%       9227 ±  0%  tlbflush.mem_acc_time_thread_ms
      3314 ± 11%     -21.2%       2610 ± 15%  slabinfo.anon_vma.num_objs

ivb42: Ivytown Ivy Bridge-EP
Memory: 64G

lkp-t410: Westmere
Memory: 2G




                          tlbflush.mem_acc_time_thread_ms

  9300 ++-O-------------------O--------O-------------------O----------------O
       |                   O        O                            O O     O  |
  9200 O+       O  O    O        O        O O  O  O  O  O     O       O     |
  9100 ++    O                                                              |
       |                                                                    |
  9000 ++            O                                                      |
       |                                                                    |
  8900 ++                                        .*                         |
       |  *                     .*..      *    *.  +                        |
  8800 ++. +        .*..*     *.         + + ..     +  .*..  .*..           |
  8700 *+   +     .*     :   :      *.. +   *        *.    *.    *          |
       |     *..*.        :  :         *                                    |
  8600 ++                 : :                                               |
       |                   *                                                |
  8500 ++-------------------------------------------------------------------+


        [*] bisect-good sample
        [O] bisect-bad  sample

To reproduce:

        apt-get install ruby
        git clone git://git.kernel.org/pub/scm/linux/kernel/git/wfg/lkp-tests.git
        cd lkp-tests
        bin/setup-local job.yaml # the job file attached in this email
        bin/run-local   job.yaml


Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


Thanks,
Fengguang



View attachment "job.yaml" of type "text/plain" (1647 bytes)

View attachment "reproduce" of type "text/plain" (1020 bytes)

_______________________________________________
LKP mailing list
LKP@...ux.intel.com

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ