Message-ID: <20180626060338.GU12146@yexl-desktop>
Date:   Tue, 26 Jun 2018 14:03:38 +0800
From:   Ye Xiaolong <xiaolong.ye@...el.com>
To:     Christoph Hellwig <hch@....de>
Cc:     Greg Kroah-Hartman <gregkh@...uxfoundation.org>,
        "Darrick J. Wong" <darrick.wong@...cle.com>,
        LKML <linux-kernel@...r.kernel.org>,
        Linus Torvalds <torvalds@...ux-foundation.org>, lkp@...org,
        viro@...iv.linux.org.uk
Subject: Re: [lkp-robot] [fs]  3deb642f0d:  will-it-scale.per_process_ops
 -8.8% regression

Hi,

On 06/22, Christoph Hellwig wrote:
>Hi Xiaolong,
>
>can you retest this workload on the following branch:
>
>    git://git.infradead.org/users/hch/vfs.git remove-get-poll-head
>
>Gitweb:
>
>    http://git.infradead.org/users/hch/vfs.git/shortlog/refs/heads/remove-get-poll-head

Here is the comparison between commit 3deb642f0d and commit 8fbedc1 ("fs: replace f_ops->get_poll_head with a static ->f_poll_head pointer") from the remove-get-poll-head branch.

3deb642f0de4c14f  8fbedc19c94fd25a2b9b327015  
----------------  --------------------------  
         %stddev      change         %stddev
             \          |                \  
    457120              -7%     424608        will-it-scale.per_process_ops
    238978                      233862        will-it-scale.per_thread_ops
      1755 ± 13%         8%       1899 ± 18%  will-it-scale.time.voluntary_context_switches
      2332                        2342        will-it-scale.time.system_time
       310                         310        will-it-scale.time.elapsed_time
       310                         310        will-it-scale.time.elapsed_time.max
      4096                        4096        will-it-scale.time.page_size
      0.54                        0.54        will-it-scale.scalability
       807                         806        will-it-scale.time.percent_of_cpu_this_job_got
     17218                       17179        will-it-scale.time.minor_page_faults
      9931                        9862        will-it-scale.time.maximum_resident_set_size
       173              -6%        163        will-it-scale.time.user_time
  49024375              -6%   46155690        will-it-scale.workload
     17818 ± 10%       -19%      14397 ±  4%  will-it-scale.time.involuntary_context_switches
    116842 ± 12%        -4%     112098 ±  5%  interrupts.CAL:Function_call_interrupts
     32735                       32635        vmstat.system.in
      2112 ±  7%       -13%       1845 ±  3%  vmstat.system.cs
       150                         150        turbostat.PkgWatt
       123                         122        turbostat.CorWatt
      1573                        1573        turbostat.Avg_MHz
     15.73              13%      17.77 ± 19%  boot-time.kernel_boot
     15.07              12%      16.93 ± 21%  boot-time.dhcp
       771               8%        834 ± 12%  boot-time.idle
     25.69               8%      27.69 ± 12%  boot-time.boot
      1755 ± 13%         8%       1899 ± 18%  time.voluntary_context_switches
      2332                        2342        time.system_time
       310                         310        time.elapsed_time
       310                         310        time.elapsed_time.max
      4096                        4096        time.page_size
       807                         806        time.percent_of_cpu_this_job_got
     17218                       17179        time.minor_page_faults
      9931                        9862        time.maximum_resident_set_size
       173              -6%        163        time.user_time
     17818 ± 10%       -19%      14397 ±  4%  time.involuntary_context_switches
    428813 ±  9%        57%     672385        proc-vmstat.pgalloc_normal
     41736 ± 15%        22%      50828        proc-vmstat.nr_free_cma
     18116               8%      19506 ±  8%  proc-vmstat.nr_slab_unreclaimable
      1029                        1033        proc-vmstat.nr_page_table_pages
      8453                        8471        proc-vmstat.nr_kernel_stack
      6486                        6499        proc-vmstat.nr_mapped
   3193607                     3194517        proc-vmstat.nr_dirty_threshold
   1594853                     1595308        proc-vmstat.nr_dirty_background_threshold
  16061877                    16064831        proc-vmstat.nr_free_pages
     20009                       20005        proc-vmstat.nr_anon_pages
      6303                        6294        proc-vmstat.numa_other
    799772                      797937        proc-vmstat.pgfault
    667803                      665906        proc-vmstat.pgfree
    666440                      663786        proc-vmstat.numa_hit
    660136                      657491        proc-vmstat.numa_local
    313125                      310062        proc-vmstat.nr_file_pages
      1941 ±  5%                  1917 ±  8%  proc-vmstat.numa_pte_updates
      1448 ±  7%                  1421 ±  9%  proc-vmstat.numa_hint_faults_local
      1596 ±  6%                  1558 ± 10%  proc-vmstat.numa_hint_faults
     12893              -6%      12152 ± 11%  proc-vmstat.nr_slab_reclaimable
     22885            -100%          0        proc-vmstat.nr_indirectly_reclaimable
    245443 ± 16%      -100%          0        proc-vmstat.pgalloc_movable
  19861107 ± 14%        34%   26619357 ± 35%  perf-stat.node-load-misses
  51734389 ±  5%        22%   63014695 ± 25%  perf-stat.node-loads
 1.924e+09 ±  3%        21%   2.32e+09 ±  5%  perf-stat.iTLB-load-misses
 2.342e+09 ±  8%        15%  2.695e+09 ±  4%  perf-stat.cache-references
 3.251e+08 ±  7%        11%  3.622e+08 ±  5%  perf-stat.iTLB-loads
 2.106e+08 ±  4%        10%  2.323e+08 ± 11%  perf-stat.cache-misses
      0.74               7%       0.79        perf-stat.cpi
 1.605e+08 ±  7%         6%  1.703e+08 ±  6%  perf-stat.node-stores
  50804799 ± 16%         5%   53535896 ± 18%  perf-stat.node-store-misses
     27.63 ±  8%         5%      29.07 ±  8%  perf-stat.node-load-miss-rate%
     85.55                       86.49        perf-stat.iTLB-load-miss-rate%
      0.25                        0.25        perf-stat.branch-miss-rate%
    778741                      776946        perf-stat.minor-faults
    778753                      776948        perf-stat.page-faults
     23.93 ±  9%                 23.75 ± 12%  perf-stat.node-store-miss-rate%
      9117 ±  4%                  8969 ±  4%  perf-stat.cpu-migrations
  1.59e+13              -4%  1.533e+13        perf-stat.cpu-cycles
    439328 ±  3%        -5%     419250 ±  5%  perf-stat.path-length
      9.05 ±  8%        -5%       8.62 ±  9%  perf-stat.cache-miss-rate%
      0.44 ± 39%        -6%       0.42 ± 31%  perf-stat.dTLB-load-miss-rate%
      1.35              -7%       1.26        perf-stat.ipc
 3.294e+12 ±  3%        -9%  2.988e+12 ±  3%  perf-stat.dTLB-stores
 5.451e+12 ±  4%       -10%  4.905e+12 ±  4%  perf-stat.dTLB-loads
 4.667e+12 ±  3%       -10%  4.195e+12 ±  4%  perf-stat.branch-instructions
 2.154e+13 ±  3%       -10%  1.935e+13 ±  4%  perf-stat.instructions
 1.161e+10 ±  4%       -10%  1.043e+10 ±  5%  perf-stat.branch-misses
 2.401e+10 ± 34%       -13%  2.093e+10 ± 36%  perf-stat.dTLB-load-misses
    653927 ±  8%       -13%     568299 ±  3%  perf-stat.context-switches
     11203 ±  4%       -26%       8344        perf-stat.instructions-per-iTLB-miss
      0.02 ± 41%       -50%       0.01 ± 47%  perf-stat.dTLB-store-miss-rate%
 7.557e+08 ± 37%       -53%  3.521e+08 ± 49%  perf-stat.dTLB-store-misses
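
For anyone reading the table above, the "change" column appears to be the relative difference of the right-hand value against the left-hand one, rounded to a whole percent. The following is a minimal sketch under that assumption, not the LKP tooling itself; the helper names `pct_change` and `ipc` are made up for illustration, and the numbers are copied from the table rows. It also checks that the reported perf-stat.ipc is consistent with perf-stat.instructions divided by perf-stat.cpu-cycles.

```python
def pct_change(base: float, new: float) -> float:
    """Relative change of `new` versus `base`, in percent."""
    return (new - base) / base * 100.0


def ipc(instructions: float, cycles: float) -> float:
    """Instructions per cycle from raw perf-stat counters."""
    return instructions / cycles


# (base, new) pairs copied from the table: 3deb642f0d vs 8fbedc1.
rows = {
    "will-it-scale.per_process_ops": (457120, 424608),
    "will-it-scale.workload": (49024375, 46155690),
    "proc-vmstat.pgalloc_normal": (428813, 672385),
}

for name, (base, new) in rows.items():
    print(f"{name}: {pct_change(base, new):+.1f}%")

# Cross-check the reported IPC against the raw counters in the table.
print(f"ipc base: {ipc(2.154e13, 1.59e13):.2f}")
print(f"ipc new:  {ipc(1.935e13, 1.533e13):.2f}")
```

Under this reading, per_process_ops on 8fbedc1 is still about 7% below the 3deb642f0d value, and the IPC drop from 1.35 to 1.26 matches the counter ratios.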

Thanks,
Xiaolong
