lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Date:   Fri, 10 Mar 2023 17:22:51 +0800
From:   kernel test robot <yujie.liu@...el.com>
To:     Christoph Hellwig <hch@....de>
CC:     <oe-lkp@...ts.linux.dev>, <lkp@...el.com>,
        <linux-kernel@...r.kernel.org>,
        Andrew Morton <akpm@...ux-foundation.org>,
        Dan Williams <dan.j.williams@...el.com>,
        Keith Busch <kbusch@...nel.org>,
        Dave Jiang <dave.jiang@...el.com>,
        Ira Weiny <ira.weiny@...el.com>, Jens Axboe <axboe@...nel.dk>,
        Minchan Kim <minchan@...nel.org>,
        Sergey Senozhatsky <senozhatsky@...omium.org>,
        Vishal Verma <vishal.l.verma@...el.com>,
        <linux-fsdevel@...r.kernel.org>, <ying.huang@...el.com>,
        <feng.tang@...el.com>, <zhengjun.xing@...ux.intel.com>,
        <fengwei.yin@...el.com>
Subject: [linus:master] [mpage] 7d28631786: fio.write_iops 27.8% improvement

Greeting,

FYI, we noticed a 27.8% improvement of fio.write_iops due to commit:

commit: 7d28631786b2333c5d48ad25172eb159aaa2945f ("mpage: stop using bdev_{read,write}_page")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

in testcase: fio-basic
on test machine: 96 threads 2 sockets Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz (Cascade Lake) with 512G memory
with following parameters:

	disk: 2pmem
	fs: ext2
	runtime: 200s
	nr_task: 50%
	time_based: tb
	rw: randwrite
	bs: 2M
	ioengine: io_uring
	test_size: 200G
	cpufreq_governor: performance

test-description: Fio is a tool that will spawn a number of threads or processes doing a particular type of I/O action as specified by the user.
test-url: https://github.com/axboe/fio


Details are as below:

=========================================================================================
bs/compiler/cpufreq_governor/disk/fs/ioengine/kconfig/nr_task/rootfs/runtime/rw/tbox_group/test_size/testcase/time_based:
  2M/gcc-11/performance/2pmem/ext2/io_uring/x86_64-rhel-8.3/50%/debian-11.1-x86_64-20220510.cgz/200s/randwrite/lkp-csl-2sp7/200G/fio-basic/tb

commit: 
  9e5fa0ae52 ("mm: refactor va_remove_mappings")
  7d28631786 ("mpage: stop using bdev_{read,write}_page")

9e5fa0ae52fc67de 7d28631786b2333c5d48ad25172 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
     14.89 ± 10%     -10.8        4.13 ± 14%  fio.latency_1000ms%
      0.09 ±  9%      +0.0        0.13 ±  8%  fio.latency_10ms%
      7.55 ± 12%      -4.0        3.56 ± 25%  fio.latency_2000ms%
     10.25 ± 11%      +3.9       14.13 ± 17%  fio.latency_250ms%
     13.67 ± 17%     +19.5       33.22 ± 27%  fio.latency_500ms%
      0.33 ± 31%      +0.3        0.61 ± 31%  fio.latency_50ms%
      1.14 ±  3%      -0.4        0.71 ± 17%  fio.latency_>=2000ms%
 2.053e+09           +27.5%  2.618e+09 ±  3%  fio.time.file_system_outputs
      8052 ±  5%     +53.4%      12353 ± 12%  fio.time.involuntary_context_switches
    296137 ±  3%     +14.2%     338218 ±  5%  fio.time.minor_page_faults
      1041 ±  4%     +23.9%       1289 ± 13%  fio.time.percent_of_cpu_this_job_got
      1978 ±  4%     +22.3%       2419 ± 14%  fio.time.system_time
    125.49 ±  8%     +45.3%     182.35 ± 10%  fio.time.user_time
   4440324 ±  8%     -46.7%    2366863 ± 11%  fio.time.voluntary_context_switches
    501829           +27.6%     640275 ±  3%  fio.workload
      4986           +27.8%       6371 ±  3%  fio.write_bw_MBps
 9.731e+08           -28.7%  6.935e+08 ±  5%  fio.write_clat_90%_us
 1.075e+09           -12.9%  9.367e+08 ±  8%  fio.write_clat_95%_us
 2.266e+09 ±  4%     -24.4%  1.714e+09 ±  7%  fio.write_clat_99%_us
 6.123e+08           -21.6%  4.802e+08 ±  4%  fio.write_clat_mean_us
      2493           +27.8%       3185 ±  3%  fio.write_iops
    232414 ±  7%     +15.7%     268867 ±  7%  fio.write_slat_mean_us
  41428534           -13.3%   35901780        cpuidle..usage
      1137 ±  4%     -21.9%     888.17 ± 16%  meminfo.Mlocked
    297.67 ±122%     -83.0%      50.67 ±141%  numa-meminfo.node1.Mlocked
     11.50 ±  4%      +2.3       13.82 ± 12%  mpstat.cpu.all.sys%
      0.67 ±  8%      +0.3        0.96 ±  9%  mpstat.cpu.all.usr%
     49.17 ± 27%  +1.3e+07%    6178756 ±  4%  vmstat.io.bo
     78001 ±  9%     -56.3%      34101 ± 14%  vmstat.system.cs
     37.37            -6.3%      35.01 ±  3%  iostat.cpu.iowait
     12.25 ±  4%     +18.8%      14.55 ± 11%  iostat.cpu.system
      0.66 ±  8%     +42.9%       0.95 ±  9%  iostat.cpu.user
      2931 ± 18%     +55.5%       4558 ± 23%  sched_debug.cpu.avg_idle.min
     51959 ± 15%     -55.0%      23376 ± 13%  sched_debug.cpu.nr_switches.avg
    591108 ± 22%     -75.2%     146571 ± 30%  sched_debug.cpu.nr_switches.max
     90813 ± 17%     -75.3%      22412 ± 18%  sched_debug.cpu.nr_switches.stddev
    401.00 ±  3%     +18.0%     473.17 ± 10%  turbostat.Avg_MHz
     14.51 ±  3%      +2.5       16.98 ± 10%  turbostat.Busy%
   6339607 ± 12%     -75.7%    1541772 ± 22%  turbostat.POLL
    195.79            +5.1%     205.76 ±  2%  turbostat.PkgWatt
     45.88            +5.0%      48.19        turbostat.RAMWatt
  39919870 ±  4%     +30.5%   52080482 ±  3%  numa-numastat.node0.local_node
  93131711 ±  3%     +22.7%  1.143e+08 ±  8%  numa-numastat.node0.numa_foreign
  39955233 ±  4%     +30.5%   52138480 ±  3%  numa-numastat.node0.numa_hit
 1.284e+08 ±  2%     +29.2%  1.658e+08 ±  4%  numa-numastat.node1.local_node
 1.284e+08 ±  2%     +29.2%  1.658e+08 ±  4%  numa-numastat.node1.numa_hit
  93183098 ±  3%     +22.6%  1.143e+08 ±  8%  numa-numastat.node1.numa_miss
  93196371 ±  3%     +22.7%  1.143e+08 ±  8%  numa-numastat.node1.other_node
  37631959 ±  4%     +31.9%   49655093 ±  3%  numa-vmstat.node0.nr_dirtied
  35783356 ±  4%     +33.8%   47880969 ±  3%  numa-vmstat.node0.nr_written
  93131711 ±  3%     +22.7%  1.143e+08 ±  8%  numa-vmstat.node0.numa_foreign
  39955168 ±  4%     +30.5%   52138531 ±  3%  numa-vmstat.node0.numa_hit
  39919805 ±  4%     +30.5%   52080533 ±  3%  numa-vmstat.node0.numa_local
  2.19e+08           +26.7%  2.776e+08 ±  4%  numa-vmstat.node1.nr_dirtied
 2.081e+08           +28.4%  2.673e+08 ±  4%  numa-vmstat.node1.nr_written
 1.284e+08 ±  2%     +29.2%  1.658e+08 ±  4%  numa-vmstat.node1.numa_hit
 1.284e+08 ±  2%     +29.2%  1.658e+08 ±  4%  numa-vmstat.node1.numa_local
  93183098 ±  3%     +22.6%  1.143e+08 ±  8%  numa-vmstat.node1.numa_miss
  93196371 ±  3%     +22.7%  1.143e+08 ±  8%  numa-vmstat.node1.numa_other
 2.567e+08           +27.5%  3.272e+08 ±  3%  proc-vmstat.nr_dirtied
  13024849            -3.1%   12619595        proc-vmstat.nr_dirty
  28301145            -2.1%   27702967        proc-vmstat.nr_file_pages
  47040923            +1.3%   47658720        proc-vmstat.nr_free_pages
  27529221            -2.2%   26931136        proc-vmstat.nr_inactive_file
    810128            -1.9%     795051        proc-vmstat.nr_slab_reclaimable
 2.441e+08           +29.1%  3.151e+08 ±  4%  proc-vmstat.nr_written
  27529221            -2.2%   26931135        proc-vmstat.nr_zone_inactive_file
  13024848            -3.1%   12621112        proc-vmstat.nr_zone_write_pending
  93131711 ±  3%     +22.7%  1.143e+08 ±  8%  proc-vmstat.numa_foreign
    251090 ±  3%     +14.8%     288318 ±  6%  proc-vmstat.numa_hint_faults
    232531 ±  3%     +14.7%     266672 ±  7%  proc-vmstat.numa_hint_faults_local
 1.684e+08           +29.5%   2.18e+08 ±  4%  proc-vmstat.numa_hit
     13951 ±  3%     +15.9%      16170 ±  6%  proc-vmstat.numa_huge_pte_updates
 1.683e+08           +29.5%  2.179e+08 ±  4%  proc-vmstat.numa_local
  93183098 ±  3%     +22.6%  1.143e+08 ±  8%  proc-vmstat.numa_miss
  93241897 ±  3%     +22.6%  1.144e+08 ±  8%  proc-vmstat.numa_other
   7429923 ±  3%     +16.1%    8622634 ±  6%  proc-vmstat.numa_pte_updates
 2.629e+08           +26.9%  3.336e+08 ±  3%  proc-vmstat.pgalloc_normal
    975147            +5.0%    1024241        proc-vmstat.pgfault
 2.367e+08           +29.9%  3.075e+08 ±  4%  proc-vmstat.pgfree
     10020 ± 28%  +1.3e+07%   1.26e+09 ±  4%  proc-vmstat.pgpgout
     93.33 ± 20%  +1.3e+06%    1198216 ± 10%  proc-vmstat.pgrotated
     21.90            -9.9%      19.74        perf-stat.i.MPKI
 3.348e+09           +21.5%  4.068e+09 ±  5%  perf-stat.i.branch-instructions
      0.36            -0.0        0.35        perf-stat.i.branch-miss-rate%
  12061197           +12.4%   13558900 ±  4%  perf-stat.i.branch-misses
     74.33            +5.2       79.58        perf-stat.i.cache-miss-rate%
 2.787e+08           +18.8%  3.312e+08 ±  4%  perf-stat.i.cache-misses
 3.866e+08            +8.5%  4.194e+08 ±  5%  perf-stat.i.cache-references
     79217 ±  9%     -56.7%      34320 ± 14%  perf-stat.i.context-switches
 3.787e+10 ±  3%     +18.1%  4.472e+10 ± 10%  perf-stat.i.cpu-cycles
    211.78           +23.5%     261.65 ±  8%  perf-stat.i.cpu-migrations
 4.812e+09           +22.2%  5.879e+09 ±  4%  perf-stat.i.dTLB-loads
  2.91e+09           +21.9%  3.547e+09 ±  4%  perf-stat.i.dTLB-stores
  1.77e+10           +22.5%  2.168e+10 ±  4%  perf-stat.i.instructions
     15015           +24.9%      18748 ±  4%  perf-stat.i.instructions-per-iTLB-miss
      0.39 ±  3%     +18.1%       0.47 ± 10%  perf-stat.i.metric.GHz
    869.18           +25.6%       1091 ±  6%  perf-stat.i.metric.K/sec
    119.36           +21.4%     144.94 ±  4%  perf-stat.i.metric.M/sec
      3926            +6.2%       4169 ±  2%  perf-stat.i.minor-faults
     35.58            -6.1       29.50 ± 13%  perf-stat.i.node-load-miss-rate%
  27642154           +46.7%   40562253 ± 12%  perf-stat.i.node-loads
     47.43            -6.3       41.18 ±  7%  perf-stat.i.node-store-miss-rate%
  19442397           +36.2%   26480148 ±  9%  perf-stat.i.node-stores
      3927            +6.2%       4170 ±  2%  perf-stat.i.page-faults
     21.85           -11.5%      19.35        perf-stat.overall.MPKI
      0.36            -0.0        0.33        perf-stat.overall.branch-miss-rate%
     72.09            +6.9       78.95        perf-stat.overall.cache-miss-rate%
     15115 ±  2%     +24.8%      18861 ±  4%  perf-stat.overall.instructions-per-iTLB-miss
     36.17            -6.9       29.29 ± 12%  perf-stat.overall.node-load-miss-rate%
     48.39            -6.8       41.56 ±  7%  perf-stat.overall.node-store-miss-rate%
   7111992            -4.1%    6818682        perf-stat.overall.path-length
 3.334e+09           +21.5%  4.051e+09 ±  5%  perf-stat.ps.branch-instructions
  12009193           +12.4%   13497817 ±  4%  perf-stat.ps.branch-misses
 2.776e+08           +18.8%  3.297e+08 ±  4%  perf-stat.ps.cache-misses
  3.85e+08            +8.5%  4.177e+08 ±  5%  perf-stat.ps.cache-references
     78798 ±  9%     -56.6%      34220 ± 14%  perf-stat.ps.context-switches
 3.782e+10 ±  3%     +18.2%   4.47e+10 ± 10%  perf-stat.ps.cpu-cycles
    211.41           +23.5%     261.13 ±  8%  perf-stat.ps.cpu-migrations
 4.791e+09           +22.2%  5.853e+09 ±  4%  perf-stat.ps.dTLB-loads
 2.897e+09           +21.9%  3.531e+09 ±  4%  perf-stat.ps.dTLB-stores
 1.762e+10           +22.5%  2.159e+10 ±  4%  perf-stat.ps.instructions
      3921            +6.2%       4164 ±  2%  perf-stat.ps.minor-faults
  27527246           +46.7%   40382491 ± 12%  perf-stat.ps.node-loads
  19376729           +36.1%   26378339 ±  9%  perf-stat.ps.node-stores
      3922            +6.2%       4165 ±  2%  perf-stat.ps.page-faults
 3.569e+12           +22.4%  4.367e+12 ±  4%  perf-stat.total.instructions
     19.48 ± 13%     -19.5        0.00        perf-profile.calltrace.cycles-pp.bdev_write_page.__mpage_writepage.write_cache_pages.mpage_writepages.do_writepages
     16.17 ± 13%     -16.2        0.00        perf-profile.calltrace.cycles-pp.pmem_rw_page.bdev_write_page.__mpage_writepage.write_cache_pages.mpage_writepages
      8.50 ± 14%      -8.5        0.00        perf-profile.calltrace.cycles-pp.folio_end_writeback.pmem_rw_page.bdev_write_page.__mpage_writepage.write_cache_pages
      7.60 ± 18%      -7.6        0.00        perf-profile.calltrace.cycles-pp.pmem_do_write.pmem_rw_page.bdev_write_page.__mpage_writepage.write_cache_pages
      7.52 ± 18%      -7.5        0.00        perf-profile.calltrace.cycles-pp.__memcpy_flushcache.pmem_do_write.pmem_rw_page.bdev_write_page.__mpage_writepage
      5.79 ± 19%      -5.8        0.00        perf-profile.calltrace.cycles-pp.__folio_end_writeback.folio_end_writeback.pmem_rw_page.bdev_write_page.__mpage_writepage
      4.38 ± 13%      -1.1        3.32 ±  9%  perf-profile.calltrace.cycles-pp.balance_dirty_pages_ratelimited_flags.generic_perform_write.__generic_file_write_iter.generic_file_write_iter.io_write
      0.76 ± 12%      +0.2        0.98 ±  8%  perf-profile.calltrace.cycles-pp.__mod_lruvec_page_state.folio_account_dirtied.__folio_mark_dirty.mark_buffer_dirty.__block_commit_write
      1.34 ± 14%      +0.3        1.67 ±  8%  perf-profile.calltrace.cycles-pp.fault_in_readable.fault_in_iov_iter_readable.generic_perform_write.__generic_file_write_iter.generic_file_write_iter
      0.27 ±100%      +0.4        0.64 ±  9%  perf-profile.calltrace.cycles-pp.__mod_memcg_lruvec_state.__mod_lruvec_page_state.folio_account_dirtied.__folio_mark_dirty.mark_buffer_dirty
      0.39 ± 71%      +0.5        0.88 ± 14%  perf-profile.calltrace.cycles-pp.free_buffer_head.try_to_free_buffers.mapping_evict_folio.invalidate_mapping_pagevec.generic_fadvise
      0.50 ± 46%      +0.5        0.99 ± 11%  perf-profile.calltrace.cycles-pp.drop_buffers.try_to_free_buffers.mapping_evict_folio.invalidate_mapping_pagevec.generic_fadvise
      0.39 ± 71%      +0.5        0.88 ± 17%  perf-profile.calltrace.cycles-pp.find_lock_entries.invalidate_mapping_pagevec.generic_fadvise.ksys_fadvise64_64.__x64_sys_fadvise64
      0.26 ±100%      +0.5        0.79 ± 17%  perf-profile.calltrace.cycles-pp.__free_one_page.free_pcppages_bulk.free_unref_page_list.release_pages.__pagevec_release
      0.09 ±223%      +0.6        0.69 ± 14%  perf-profile.calltrace.cycles-pp.__mod_lruvec_page_state.filemap_unaccount_folio.__filemap_remove_folio.__remove_mapping.remove_mapping
      0.10 ±223%      +0.6        0.72 ± 15%  perf-profile.calltrace.cycles-pp.filemap_unaccount_folio.__filemap_remove_folio.__remove_mapping.remove_mapping.invalidate_mapping_pagevec
      0.46 ± 71%      +0.6        1.09 ± 18%  perf-profile.calltrace.cycles-pp.free_pcppages_bulk.free_unref_page_list.release_pages.__pagevec_release.invalidate_mapping_pagevec
      0.78 ± 46%      +0.7        1.44 ± 13%  perf-profile.calltrace.cycles-pp.__filemap_remove_folio.__remove_mapping.remove_mapping.invalidate_mapping_pagevec.generic_fadvise
      0.76 ± 46%      +0.7        1.46 ± 17%  perf-profile.calltrace.cycles-pp.free_unref_page_list.release_pages.__pagevec_release.invalidate_mapping_pagevec.generic_fadvise
      2.19 ± 12%      +0.8        3.02 ± 13%  perf-profile.calltrace.cycles-pp.get_io_u
      1.01 ± 46%      +0.8        1.84 ± 13%  perf-profile.calltrace.cycles-pp.__remove_mapping.remove_mapping.invalidate_mapping_pagevec.generic_fadvise.ksys_fadvise64_64
      1.03 ± 46%      +0.8        1.86 ± 13%  perf-profile.calltrace.cycles-pp.remove_mapping.invalidate_mapping_pagevec.generic_fadvise.ksys_fadvise64_64.__x64_sys_fadvise64
      1.16 ± 25%      +0.9        2.02 ± 12%  perf-profile.calltrace.cycles-pp.try_to_free_buffers.mapping_evict_folio.invalidate_mapping_pagevec.generic_fadvise.ksys_fadvise64_64
      1.19 ± 25%      +0.9        2.06 ± 12%  perf-profile.calltrace.cycles-pp.mapping_evict_folio.invalidate_mapping_pagevec.generic_fadvise.ksys_fadvise64_64.__x64_sys_fadvise64
      0.16 ±223%      +1.1        1.22 ± 52%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.release_pages.__pagevec_release.invalidate_mapping_pagevec
      0.16 ±223%      +1.1        1.23 ± 52%  perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.release_pages.__pagevec_release.invalidate_mapping_pagevec.generic_fadvise
      1.66 ± 29%      +1.2        2.87 ± 17%  perf-profile.calltrace.cycles-pp.rmqueue_bulk.rmqueue.get_page_from_freelist.__alloc_pages.folio_alloc
      0.89 ± 61%      +1.2        2.11 ± 24%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.rmqueue_bulk.rmqueue.get_page_from_freelist
      2.03 ± 25%      +1.2        3.26 ± 15%  perf-profile.calltrace.cycles-pp.rmqueue.get_page_from_freelist.__alloc_pages.folio_alloc.__filemap_get_folio
      0.90 ± 61%      +1.2        2.13 ± 23%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.rmqueue_bulk.rmqueue.get_page_from_freelist.__alloc_pages
      3.04 ± 19%      +1.3        4.33 ± 11%  perf-profile.calltrace.cycles-pp.folio_alloc.__filemap_get_folio.pagecache_get_page.block_write_begin.ext2_write_begin
      2.92 ± 20%      +1.3        4.22 ± 11%  perf-profile.calltrace.cycles-pp.__alloc_pages.folio_alloc.__filemap_get_folio.pagecache_get_page.block_write_begin
      2.73 ± 21%      +1.3        4.03 ± 12%  perf-profile.calltrace.cycles-pp.get_page_from_freelist.__alloc_pages.folio_alloc.__filemap_get_folio.pagecache_get_page
      0.00            +1.4        1.38 ± 24%  perf-profile.calltrace.cycles-pp.__folio_start_writeback.__mpage_writepage.write_cache_pages.mpage_writepages.do_writepages
      7.60 ± 16%      +1.9        9.53 ±  9%  perf-profile.calltrace.cycles-pp.pagecache_get_page.block_write_begin.ext2_write_begin.generic_perform_write.__generic_file_write_iter
      7.57 ± 16%      +1.9        9.50 ± 10%  perf-profile.calltrace.cycles-pp.__filemap_get_folio.pagecache_get_page.block_write_begin.ext2_write_begin.generic_perform_write
      1.89 ± 36%      +2.1        3.94 ± 22%  perf-profile.calltrace.cycles-pp.release_pages.__pagevec_release.invalidate_mapping_pagevec.generic_fadvise.ksys_fadvise64_64
      1.90 ± 36%      +2.1        3.95 ± 22%  perf-profile.calltrace.cycles-pp.__pagevec_release.invalidate_mapping_pagevec.generic_fadvise.ksys_fadvise64_64.__x64_sys_fadvise64
      0.00            +2.2        2.20 ± 17%  perf-profile.calltrace.cycles-pp.__folio_end_writeback.folio_end_writeback.mpage_end_io.__submit_bio.__submit_bio_noacct
      0.00            +2.5        2.50 ± 18%  perf-profile.calltrace.cycles-pp.folio_end_writeback.mpage_end_io.__submit_bio.__submit_bio_noacct.__mpage_writepage
      0.00            +2.9        2.94 ± 18%  perf-profile.calltrace.cycles-pp.mpage_end_io.__submit_bio.__submit_bio_noacct.__mpage_writepage.write_cache_pages
      4.79 ± 30%      +4.2        8.94 ± 16%  perf-profile.calltrace.cycles-pp.invalidate_mapping_pagevec.generic_fadvise.ksys_fadvise64_64.__x64_sys_fadvise64.do_syscall_64
      0.00            +9.7        9.74 ± 13%  perf-profile.calltrace.cycles-pp.__memcpy_flushcache.pmem_do_write.pmem_submit_bio.__submit_bio.__submit_bio_noacct
      0.00            +9.8        9.84 ± 13%  perf-profile.calltrace.cycles-pp.pmem_do_write.pmem_submit_bio.__submit_bio.__submit_bio_noacct.__mpage_writepage
      0.00            +9.9        9.94 ± 13%  perf-profile.calltrace.cycles-pp.pmem_submit_bio.__submit_bio.__submit_bio_noacct.__mpage_writepage.write_cache_pages
      0.00           +12.9       12.90 ± 14%  perf-profile.calltrace.cycles-pp.__submit_bio_noacct.__mpage_writepage.write_cache_pages.mpage_writepages.do_writepages
      0.00           +12.9       12.90 ± 14%  perf-profile.calltrace.cycles-pp.__submit_bio.__submit_bio_noacct.__mpage_writepage.write_cache_pages.mpage_writepages
     19.50 ± 13%     -19.5        0.00        perf-profile.children.cycles-pp.bdev_write_page
     16.17 ± 13%     -16.2        0.00        perf-profile.children.cycles-pp.pmem_rw_page
      8.51 ± 14%      -6.0        2.51 ± 17%  perf-profile.children.cycles-pp.folio_end_writeback
      5.82 ± 19%      -3.6        2.24 ± 18%  perf-profile.children.cycles-pp.__folio_end_writeback
      2.97 ± 26%      -1.4        1.55 ± 18%  perf-profile.children.cycles-pp.__folio_start_writeback
      4.40 ± 13%      -1.1        3.33 ±  9%  perf-profile.children.cycles-pp.balance_dirty_pages_ratelimited_flags
      0.31 ± 19%      -0.3        0.04 ± 72%  perf-profile.children.cycles-pp.fprop_reflect_period_percpu
      0.42 ± 19%      -0.2        0.22 ± 23%  perf-profile.children.cycles-pp.__fprop_add_percpu
      0.32 ± 23%      -0.2        0.15 ± 17%  perf-profile.children.cycles-pp.__list_add_valid
      0.41 ± 16%      -0.1        0.32 ±  9%  perf-profile.children.cycles-pp.__irq_exit_rcu
      0.30 ± 12%      -0.1        0.22 ± 20%  perf-profile.children.cycles-pp._raw_spin_unlock_irqrestore
      0.06 ± 49%      +0.0        0.10 ± 16%  perf-profile.children.cycles-pp.uncharge_folio
      0.10 ± 14%      +0.0        0.15 ± 20%  perf-profile.children.cycles-pp.xas_find_marked
      0.05 ± 71%      +0.1        0.10 ± 18%  perf-profile.children.cycles-pp.xas_find
      0.00            +0.1        0.06 ± 17%  perf-profile.children.cycles-pp.folio_mapping
      0.13 ± 19%      +0.1        0.20 ±  7%  perf-profile.children.cycles-pp.node_page_state
      0.00            +0.1        0.07 ± 16%  perf-profile.children.cycles-pp.__bio_try_merge_page
      0.08 ± 51%      +0.1        0.16 ± 17%  perf-profile.children.cycles-pp.page_counter_uncharge
      0.16 ± 27%      +0.1        0.25 ±  9%  perf-profile.children.cycles-pp.__slab_free
      0.37 ± 17%      +0.1        0.46 ± 12%  perf-profile.children.cycles-pp.__count_memcg_events
      0.12 ± 53%      +0.1        0.22 ± 16%  perf-profile.children.cycles-pp.uncharge_batch
      0.00            +0.1        0.12 ± 17%  perf-profile.children.cycles-pp.bio_add_page
      0.24 ± 15%      +0.1        0.36 ± 18%  perf-profile.children.cycles-pp.__xa_clear_mark
      0.20 ± 36%      +0.1        0.34 ± 15%  perf-profile.children.cycles-pp.__mem_cgroup_uncharge_list
      0.53 ±  8%      +0.2        0.72 ±  8%  perf-profile.children.cycles-pp.__mod_node_page_state
      0.63 ±  9%      +0.2        0.85 ±  8%  perf-profile.children.cycles-pp.__mod_lruvec_state
      0.33 ± 25%      +0.2        0.57 ± 12%  perf-profile.children.cycles-pp.kmem_cache_free
      0.64 ± 18%      +0.3        0.90 ± 21%  perf-profile.children.cycles-pp.filemap_get_folios_tag
      1.36 ± 15%      +0.3        1.69 ±  7%  perf-profile.children.cycles-pp.fault_in_readable
      0.46 ± 24%      +0.3        0.81 ± 17%  perf-profile.children.cycles-pp.__free_one_page
      0.38 ± 36%      +0.3        0.73 ± 14%  perf-profile.children.cycles-pp.filemap_unaccount_folio
      0.51 ± 27%      +0.4        0.88 ± 14%  perf-profile.children.cycles-pp.free_buffer_head
      0.50 ± 27%      +0.4        0.89 ± 17%  perf-profile.children.cycles-pp.find_lock_entries
      0.01 ±223%      +0.4        0.40 ± 23%  perf-profile.children.cycles-pp.page_endio
      0.55 ± 25%      +0.4        1.00 ± 11%  perf-profile.children.cycles-pp.drop_buffers
      1.53 ± 14%      +0.5        1.99 ±  9%  perf-profile.children.cycles-pp.cgroup_rstat_updated
      0.59 ± 27%      +0.5        1.11 ± 18%  perf-profile.children.cycles-pp.free_pcppages_bulk
      1.98 ± 14%      +0.6        2.56 ±  9%  perf-profile.children.cycles-pp.__mod_memcg_lruvec_state
      0.84 ± 28%      +0.6        1.45 ± 13%  perf-profile.children.cycles-pp.__filemap_remove_folio
      1.52 ± 23%      +0.6        2.14 ± 21%  perf-profile.children.cycles-pp.folio_batch_move_lru
      0.83 ± 27%      +0.6        1.47 ± 17%  perf-profile.children.cycles-pp.free_unref_page_list
      1.10 ± 27%      +0.8        1.85 ± 13%  perf-profile.children.cycles-pp.__remove_mapping
      1.11 ± 27%      +0.8        1.87 ± 13%  perf-profile.children.cycles-pp.remove_mapping
      2.20 ± 12%      +0.8        3.03 ± 13%  perf-profile.children.cycles-pp.get_io_u
      1.16 ± 26%      +0.9        2.02 ± 12%  perf-profile.children.cycles-pp.try_to_free_buffers
      1.19 ± 25%      +0.9        2.06 ± 12%  perf-profile.children.cycles-pp.mapping_evict_folio
      2.87 ± 12%      +0.9        3.77 ±  9%  perf-profile.children.cycles-pp.__mod_lruvec_page_state
      1.69 ± 28%      +1.2        2.90 ± 17%  perf-profile.children.cycles-pp.rmqueue_bulk
      2.08 ± 24%      +1.2        3.30 ± 15%  perf-profile.children.cycles-pp.rmqueue
      2.98 ± 20%      +1.3        4.27 ± 11%  perf-profile.children.cycles-pp.__alloc_pages
      2.78 ± 21%      +1.3        4.08 ± 12%  perf-profile.children.cycles-pp.get_page_from_freelist
      3.04 ± 19%      +1.3        4.34 ± 11%  perf-profile.children.cycles-pp.folio_alloc
      0.92 ± 63%      +1.4        2.30 ± 46%  perf-profile.children.cycles-pp.folio_lruvec_lock_irqsave
      7.62 ± 16%      +1.9        9.56 ±  9%  perf-profile.children.cycles-pp.__filemap_get_folio
      7.64 ± 16%      +1.9        9.58 ±  9%  perf-profile.children.cycles-pp.pagecache_get_page
      1.99 ± 34%      +2.1        4.06 ± 22%  perf-profile.children.cycles-pp.__pagevec_release
      2.09 ± 32%      +2.1        4.17 ± 21%  perf-profile.children.cycles-pp.release_pages
      7.53 ± 18%      +2.2        9.77 ± 13%  perf-profile.children.cycles-pp.__memcpy_flushcache
      7.60 ± 18%      +2.3        9.86 ± 13%  perf-profile.children.cycles-pp.pmem_do_write
      0.00            +2.9        2.94 ± 18%  perf-profile.children.cycles-pp.mpage_end_io
      4.79 ± 30%      +4.2        8.94 ± 16%  perf-profile.children.cycles-pp.invalidate_mapping_pagevec
      0.00           +10.0        9.96 ± 13%  perf-profile.children.cycles-pp.pmem_submit_bio
      0.00           +12.9       12.92 ± 14%  perf-profile.children.cycles-pp.__submit_bio_noacct
      0.00           +12.9       12.92 ± 14%  perf-profile.children.cycles-pp.__submit_bio
      2.63 ± 12%      -2.4        0.26 ± 17%  perf-profile.self.cycles-pp.folio_end_writeback
      3.43 ±  6%      -1.4        2.01 ± 12%  perf-profile.self.cycles-pp._raw_spin_lock_irqsave
      1.53 ± 24%      -1.1        0.42 ± 11%  perf-profile.self.cycles-pp.balance_dirty_pages_ratelimited_flags
      1.34 ± 17%      -0.9        0.41 ± 19%  perf-profile.self.cycles-pp.__folio_end_writeback
      0.81 ± 13%      -0.5        0.35 ± 19%  perf-profile.self.cycles-pp.__folio_start_writeback
      0.30 ± 19%      -0.3        0.03 ±102%  perf-profile.self.cycles-pp.fprop_reflect_period_percpu
      0.30 ± 25%      -0.2        0.14 ± 17%  perf-profile.self.cycles-pp.__list_add_valid
      0.24 ± 18%      -0.1        0.13 ±  6%  perf-profile.self.cycles-pp.folio_account_dirtied
      0.12 ± 13%      -0.0        0.09 ± 15%  perf-profile.self.cycles-pp._raw_spin_unlock_irqrestore
      0.11 ± 14%      +0.0        0.14 ± 11%  perf-profile.self.cycles-pp.__mod_zone_page_state
      0.06 ± 46%      +0.0        0.10 ± 12%  perf-profile.self.cycles-pp.free_unref_page_commit
      0.06 ± 46%      +0.0        0.09 ± 15%  perf-profile.self.cycles-pp.__filemap_remove_folio
      0.06 ± 50%      +0.0        0.10 ± 18%  perf-profile.self.cycles-pp.uncharge_folio
      0.01 ±223%      +0.1        0.06 ± 15%  perf-profile.self.cycles-pp.try_to_free_buffers
      0.10 ± 18%      +0.1        0.15 ± 21%  perf-profile.self.cycles-pp.xas_find_marked
      0.12 ± 31%      +0.1        0.18 ± 16%  perf-profile.self.cycles-pp.free_unref_page_list
      0.12 ± 19%      +0.1        0.18 ±  6%  perf-profile.self.cycles-pp.node_page_state
      0.00            +0.1        0.07 ± 13%  perf-profile.self.cycles-pp.__bio_try_merge_page
      0.10 ± 45%      +0.1        0.16 ± 13%  perf-profile.self.cycles-pp.__remove_mapping
      0.07 ± 51%      +0.1        0.14 ± 20%  perf-profile.self.cycles-pp.page_counter_uncharge
      0.16 ± 27%      +0.1        0.24 ± 11%  perf-profile.self.cycles-pp.__slab_free
      0.00            +0.1        0.10 ± 27%  perf-profile.self.cycles-pp.pmem_submit_bio
      0.16 ± 28%      +0.1        0.29 ± 10%  perf-profile.self.cycles-pp.kmem_cache_free
      0.38 ± 13%      +0.2        0.54 ± 12%  perf-profile.self.cycles-pp.release_pages
      0.52 ±  9%      +0.2        0.70 ±  8%  perf-profile.self.cycles-pp.__mod_node_page_state
      0.77 ± 13%      +0.2        0.98 ±  9%  perf-profile.self.cycles-pp.__mod_memcg_lruvec_state
      0.53 ± 18%      +0.2        0.74 ± 21%  perf-profile.self.cycles-pp.filemap_get_folios_tag
      0.73 ± 14%      +0.3        1.00 ± 11%  perf-profile.self.cycles-pp.__mod_lruvec_page_state
      0.41 ± 24%      +0.3        0.72 ± 17%  perf-profile.self.cycles-pp.__free_one_page
      1.33 ± 14%      +0.3        1.66 ±  7%  perf-profile.self.cycles-pp.fault_in_readable
      0.45 ± 26%      +0.3        0.79 ± 17%  perf-profile.self.cycles-pp.find_lock_entries
      0.92 ± 16%      +0.4        1.28 ± 13%  perf-profile.self.cycles-pp.cgroup_rstat_updated
      0.94 ± 11%      +0.4        1.31 ± 11%  perf-profile.self.cycles-pp.xas_load
      0.01 ±223%      +0.4        0.39 ± 24%  perf-profile.self.cycles-pp.page_endio
      0.54 ± 25%      +0.4        0.99 ± 11%  perf-profile.self.cycles-pp.drop_buffers
      0.50 ± 19%      +0.5        1.02 ± 19%  perf-profile.self.cycles-pp.__mpage_writepage
      2.18 ± 12%      +0.8        3.00 ± 14%  perf-profile.self.cycles-pp.get_io_u
      7.47 ± 18%      +2.2        9.66 ± 13%  perf-profile.self.cycles-pp.__memcpy_flushcache


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        sudo bin/lkp install job.yaml           # job file is attached in this email
        bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
        sudo bin/lkp run generated-yaml-file

        # if come across any failure that blocks the test,
        # please remove ~/.lkp and /lkp dir to run from a clean state.


Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests

View attachment "config-6.2.0-rc4-00383-g7d28631786b2" of type "text/plain" (167513 bytes)

View attachment "job-script" of type "text/plain" (8681 bytes)

View attachment "job.yaml" of type "text/plain" (6230 bytes)

View attachment "reproduce" of type "text/plain" (938 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ