lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Date:   Thu, 12 May 2022 16:51:00 +0800
From:   kernel test robot <oliver.sang@...el.com>
To:     Matthew Wilcox <willy@...radead.org>
Cc:     Guo Xuenan <guoxuenan@...wei.com>,
        LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
        lkp@...el.com, ying.huang@...el.com, feng.tang@...el.com,
        zhengjun.xing@...ux.intel.com, fengwei.yin@...el.com
Subject: [mm/readahead]  b9ff43dd27:  fio.read_iops 24.6% improvement



Greeting,

FYI, we noticed a 24.6% improvement of fio.read_iops due to commit:


commit: b9ff43dd27434dbd850b908e2e0e1f6e794efd9b ("mm/readahead: Fix readahead with large folios")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

in testcase: fio-basic
on test machine: 96 threads 2 sockets Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 512G memory
with following parameters:

	disk: 2pmem
	fs: xfs
	runtime: 200s
	nr_task: 50%
	time_based: tb
	rw: read
	bs: 2M
	ioengine: libaio
	test_size: 200G
	cpufreq_governor: performance
	ucode: 0x500320a

test-description: Fio is a tool that will spawn a number of threads or processes doing a particular type of I/O action as specified by the user.
test-url: https://github.com/axboe/fio





Details are as below:
-------------------------------------------------------------------------------------------------->


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        sudo bin/lkp install job.yaml           # job file is attached in this email
        bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
        sudo bin/lkp run generated-yaml-file

        # if come across any failure that blocks the test,
        # please remove ~/.lkp and /lkp dir to run from a clean state.

=========================================================================================
bs/compiler/cpufreq_governor/disk/fs/ioengine/kconfig/nr_task/rootfs/runtime/rw/tbox_group/test_size/testcase/time_based/ucode:
  2M/gcc-11/performance/2pmem/xfs/libaio/x86_64-rhel-8.3/50%/debian-10.4-x86_64-20200603.cgz/200s/read/lkp-csl-2sp7/200G/fio-basic/tb/0x500320a

commit: 
  170f37d6aa ("block: Do not call folio_next() on an unreferenced folio")
  b9ff43dd27 ("mm/readahead: Fix readahead with large folios")

170f37d6aa6ad458 b9ff43dd27434dbd850b908e2e0 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
      0.58 ± 19%      +2.0        2.60 ± 26%  fio.latency_100ms%
      0.05 ±  7%      +0.0        0.06 ±  4%  fio.latency_10ms%
      0.08 ±  3%      +0.0        0.10        fio.latency_20ms%
      0.01            +0.0        0.02 ± 19%  fio.latency_20us%
     50.64 ±  6%     +21.6       72.24 ±  4%  fio.latency_250ms%
      0.01 ± 15%      +0.0        0.02 ±  9%  fio.latency_4ms%
     37.61 ± 10%     -18.5       19.12 ± 10%  fio.latency_500ms%
      0.29 ± 13%      +0.3        0.59 ± 23%  fio.latency_50ms%
      0.03 ±  2%      -0.0        0.02 ± 14%  fio.latency_50us%
     10527 ±  3%     +24.6%      13112 ±  2%  fio.read_bw_MBps
 5.019e+08           -34.1%  3.307e+08 ±  4%  fio.read_clat_90%_us
 5.578e+08 ±  3%     +11.5%  6.222e+08        fio.read_clat_99%_us
 2.797e+08 ±  3%     -23.6%  2.136e+08 ±  3%  fio.read_clat_mean_us
 1.281e+08 ±  3%      -8.6%  1.171e+08 ±  4%  fio.read_clat_stddev
      5263 ±  3%     +24.6%       6556 ±  2%  fio.read_iops
   9048337 ±  3%     -23.6%    6917228 ±  3%  fio.read_slat_mean_us
 4.313e+09 ±  3%     +24.6%  5.374e+09 ±  2%  fio.time.file_system_inputs
     23336           +19.0%      27773        fio.time.involuntary_context_switches
     20117            +2.4%      20595        fio.time.voluntary_context_switches
   1052766 ±  3%     +24.6%    1311719 ±  2%  fio.workload
    444.29            +2.6%     455.70        pmeter.Average_Active_Power
    745.95 ± 15%     -25.4%     556.35 ± 26%  sched_debug.cpu.clock_task.stddev
      0.34 ±  4%      -0.1        0.28 ±  9%  mpstat.cpu.all.soft%
      0.46 ±  4%      -0.1        0.40 ±  4%  mpstat.cpu.all.usr%
  10622900 ±  3%     +24.2%   13191820 ±  2%  vmstat.io.bi
      1834            +5.4%       1934 ±  2%  vmstat.system.cs
      0.02           +50.0%       0.03        turbostat.IPC
    270.10            +2.2%     276.04        turbostat.PkgWatt
     55.53            +5.2%      58.42        turbostat.RAMWatt
    109247           +79.2%     195797        meminfo.KReclaimable
    109247           +79.2%     195797        meminfo.SReclaimable
    217681           +19.1%     259279        meminfo.SUnreclaim
    326930           +39.2%     455077        meminfo.Slab
   5737532 ±  8%     -37.1%    3607917 ± 12%  numa-meminfo.node0.MemFree
    199707 ±  9%     +18.8%     237210 ±  8%  numa-meminfo.node0.Slab
  70882880            -7.9%   65286368        numa-meminfo.node1.FilePages
     34447 ± 46%    +181.2%      96863 ± 21%  numa-meminfo.node1.KReclaimable
     34447 ± 46%    +181.2%      96863 ± 21%  numa-meminfo.node1.SReclaimable
     92783 ± 13%     +30.4%     120973 ±  4%  numa-meminfo.node1.SUnreclaim
    127231 ± 16%     +71.2%     217837 ± 10%  numa-meminfo.node1.Slab
   1999077 ± 10%   +2766.7%   57307801 ±  9%  numa-numastat.node0.local_node
   1563227 ±  6%   +1028.4%   17638837 ± 13%  numa-numastat.node0.numa_foreign
   2029367 ±  8%   +2725.7%   57343211 ±  9%  numa-numastat.node0.numa_hit
   2483793         +1789.7%   46937375 ±  3%  numa-numastat.node1.local_node
   2540984 ±  2%   +1748.9%   46979938 ±  3%  numa-numastat.node1.numa_hit
   1563227 ±  6%   +1028.5%   17641255 ± 13%  numa-numastat.node1.numa_miss
   1620419 ±  5%    +991.7%   17690341 ± 13%  numa-numastat.node1.other_node
   1443532 ±  9%     -36.5%     916868 ± 11%  numa-vmstat.node0.nr_free_pages
   1563227 ±  6%   +1028.1%   17634435 ± 13%  numa-vmstat.node0.numa_foreign
   2028586 ±  8%   +2726.7%   57342654 ±  9%  numa-vmstat.node0.numa_hit
   1998296 ± 10%   +2767.8%   57307244 ±  9%  numa-vmstat.node0.numa_local
     36.67 ± 53%   +1811.4%     700.83 ± 77%  numa-vmstat.node0.workingset_nodes
  17728775            -8.0%   16313974        numa-vmstat.node1.nr_file_pages
      8608 ± 46%    +181.3%      24217 ± 21%  numa-vmstat.node1.nr_slab_reclaimable
     23193 ± 13%     +30.4%      30249 ±  4%  numa-vmstat.node1.nr_slab_unreclaimable
   2539777 ±  2%   +1749.7%   46977008 ±  3%  numa-vmstat.node1.numa_hit
   2482586         +1790.5%   46934445 ±  3%  numa-vmstat.node1.numa_local
   1563227 ±  6%   +1028.3%   17638159 ± 13%  numa-vmstat.node1.numa_miss
   1620419 ±  5%    +991.5%   17687245 ± 13%  numa-vmstat.node1.numa_other
     13.33 ± 75%    +825.0%     123.33 ± 49%  numa-vmstat.node1.workingset_nodes
   1763507 ± 55%    +780.7%   15530641 ± 22%  proc-vmstat.compact_free_scanned
    119086 ± 23%   +2522.4%    3122933 ± 22%  proc-vmstat.compact_isolated
  15615625 ± 21%    +187.5%   44901482 ± 20%  proc-vmstat.compact_migrate_scanned
  27095029            -3.3%   26209667        proc-vmstat.nr_file_pages
  49156688            +1.8%   50045472        proc-vmstat.nr_free_pages
  26487821            -3.3%   25602197        proc-vmstat.nr_inactive_file
     27308           +79.2%      48936        proc-vmstat.nr_slab_reclaimable
     54414           +19.1%      64800        proc-vmstat.nr_slab_unreclaimable
  26487796            -3.3%   25602050        proc-vmstat.nr_zone_inactive_file
   1563227 ±  6%   +1028.4%   17638837 ± 13%  proc-vmstat.numa_foreign
   4571892 ±  5%   +2181.9%  1.043e+08 ±  5%  proc-vmstat.numa_hit
   4484412 ±  5%   +2224.7%  1.042e+08 ±  5%  proc-vmstat.numa_local
   1563227 ±  6%   +1028.5%   17641255 ± 13%  proc-vmstat.numa_miss
   1650404 ±  6%    +974.2%   17728591 ± 13%  proc-vmstat.numa_other
   4869791 ±  7%    +193.1%   14274576 ± 15%  proc-vmstat.pgalloc_dma32
 5.349e+08 ±  3%     +44.0%  7.701e+08 ±  2%  proc-vmstat.pgalloc_normal
 5.148e+08 ±  3%     +48.5%  7.648e+08 ±  2%  proc-vmstat.pgfree
    438716 ±  3%    +348.4%    1967046 ± 18%  proc-vmstat.pgmigrate_success
 2.156e+09 ±  3%     +24.6%  2.687e+09 ±  2%  proc-vmstat.pgpgin
      0.83 ± 82%  +93540.0%     780.33 ± 49%  proc-vmstat.pgrotated
   4204993 ± 60%    +348.7%   18868882 ± 48%  proc-vmstat.pgscan_file
   4203671 ± 60%    +348.8%   18865293 ± 48%  proc-vmstat.pgscan_kswapd
   4204381 ± 60%    +343.2%   18635881 ± 48%  proc-vmstat.pgsteal_file
   4203058 ± 60%    +343.3%   18632292 ± 48%  proc-vmstat.pgsteal_kswapd
     49.17 ± 54%   +1609.8%     840.67 ± 76%  proc-vmstat.workingset_nodes
     60.67           -16.5%      50.68        perf-stat.i.MPKI
 1.803e+09 ±  3%     +64.5%  2.966e+09 ±  2%  perf-stat.i.branch-instructions
   6184540 ±  2%     +65.7%   10250205 ±  4%  perf-stat.i.branch-misses
     92.07            -5.4       86.67        perf-stat.i.cache-miss-rate%
 5.857e+08 ±  3%     +22.3%  7.161e+08 ±  2%  perf-stat.i.cache-misses
 6.342e+08 ±  3%     +30.4%  8.268e+08 ±  3%  perf-stat.i.cache-references
      1701            +6.1%       1805 ±  2%  perf-stat.i.context-switches
     14.93 ±  3%     -37.1%       9.38 ±  3%  perf-stat.i.cpi
    294.07 ±  2%     -18.7%     239.15 ±  4%  perf-stat.i.cycles-between-cache-misses
      0.01 ±  6%      +0.0        0.01 ±  6%  perf-stat.i.dTLB-load-miss-rate%
     94987 ±  5%    +259.1%     341113 ±  8%  perf-stat.i.dTLB-load-misses
 1.937e+09 ±  3%     +68.9%   3.27e+09 ±  2%  perf-stat.i.dTLB-loads
      0.00 ±  2%      +0.0        0.01 ±  3%  perf-stat.i.dTLB-store-miss-rate%
     70696 ±  3%    +268.4%     260434 ±  5%  perf-stat.i.dTLB-store-misses
 1.673e+09 ±  3%     +51.9%  2.542e+09 ±  2%  perf-stat.i.dTLB-stores
     40.11 ±  6%      +6.7       46.85 ±  5%  perf-stat.i.iTLB-load-miss-rate%
    979065 ±  4%     +60.4%    1570764 ±  7%  perf-stat.i.iTLB-load-misses
 1.035e+10 ±  3%     +57.2%  1.627e+10 ±  2%  perf-stat.i.instructions
      0.08 ±  2%     +57.1%       0.12 ±  2%  perf-stat.i.ipc
     15.20            -2.9%      14.75        perf-stat.i.major-faults
      1584 ±  3%     -25.4%       1182 ±  8%  perf-stat.i.metric.K/sec
     64.32 ±  3%     +59.1%     102.30 ±  2%  perf-stat.i.metric.M/sec
     39.71 ±  5%     -10.0       29.70 ±  5%  perf-stat.i.node-load-miss-rate%
  39204201 ±  3%      -7.6%   36215971 ±  2%  perf-stat.i.node-load-misses
  83792143 ±  5%     +29.0%  1.081e+08 ±  3%  perf-stat.i.node-loads
     38.00 ±  5%     -12.9       25.08 ±  7%  perf-stat.i.node-store-miss-rate%
  49443672 ±  2%     -19.6%   39741233 ±  5%  perf-stat.i.node-store-misses
 1.042e+08 ±  4%     +40.2%  1.461e+08 ±  4%  perf-stat.i.node-stores
     61.24           -17.1%      50.79        perf-stat.overall.MPKI
     92.36            -5.7       86.66        perf-stat.overall.cache-miss-rate%
     13.47 ±  3%     -36.9%       8.50 ±  2%  perf-stat.overall.cpi
    238.08 ±  3%     -18.9%     193.04 ±  2%  perf-stat.overall.cycles-between-cache-misses
      0.00 ±  6%      +0.0        0.01 ±  7%  perf-stat.overall.dTLB-load-miss-rate%
      0.00 ±  2%      +0.0        0.01 ±  4%  perf-stat.overall.dTLB-store-miss-rate%
     40.38 ±  6%      +8.1       48.52 ±  5%  perf-stat.overall.iTLB-load-miss-rate%
      0.07 ±  2%     +58.5%       0.12 ±  2%  perf-stat.overall.ipc
     32.50 ±  5%      -7.1       25.37 ±  4%  perf-stat.overall.node-load-miss-rate%
     32.71 ±  4%     -11.1       21.63 ±  7%  perf-stat.overall.node-store-miss-rate%
   1954187           +26.8%    2478091        perf-stat.overall.path-length
 1.778e+09 ±  3%     +65.0%  2.934e+09 ±  2%  perf-stat.ps.branch-instructions
   6104343 ±  2%     +65.9%   10124383 ±  4%  perf-stat.ps.branch-misses
 5.775e+08 ±  3%     +22.8%   7.09e+08 ±  2%  perf-stat.ps.cache-misses
 6.252e+08 ±  3%     +30.8%  8.181e+08 ±  3%  perf-stat.ps.cache-references
      1685            +6.1%       1788 ±  2%  perf-stat.ps.context-switches
    130.84            +1.3%     132.52        perf-stat.ps.cpu-migrations
     93751 ±  5%    +259.7%     337214 ±  8%  perf-stat.ps.dTLB-load-misses
 1.911e+09 ±  3%     +69.3%  3.236e+09 ±  2%  perf-stat.ps.dTLB-loads
     69811 ±  3%    +268.8%     257492 ±  5%  perf-stat.ps.dTLB-store-misses
 1.651e+09 ±  3%     +52.4%  2.515e+09 ±  2%  perf-stat.ps.dTLB-stores
    971749 ±  4%     +59.6%    1550920 ±  7%  perf-stat.ps.iTLB-load-misses
 1.021e+10 ±  3%     +57.7%   1.61e+10 ±  2%  perf-stat.ps.instructions
  39356027 ±  3%      -8.0%   36218463 ±  2%  perf-stat.ps.node-load-misses
  81906530 ±  5%     +30.3%  1.067e+08 ±  4%  perf-stat.ps.node-loads
  49511032 ±  3%     -19.7%   39746307 ±  5%  perf-stat.ps.node-store-misses
  1.02e+08 ±  5%     +41.5%  1.443e+08 ±  4%  perf-stat.ps.node-stores
 2.057e+12 ±  3%     +58.0%   3.25e+12 ±  2%  perf-stat.total.instructions
      0.00            +0.7        0.67 ±  7%  perf-profile.calltrace.cycles-pp.folio_mark_accessed.filemap_read.xfs_file_buffered_read.xfs_file_read_iter.aio_read
      0.00            +0.7        0.71 ±  8%  perf-profile.calltrace.cycles-pp.rmqueue.get_page_from_freelist.__alloc_pages.folio_alloc.page_cache_ra_order
      0.00            +0.8        0.80 ±  9%  perf-profile.calltrace.cycles-pp.get_page_from_freelist.__alloc_pages.folio_alloc.page_cache_ra_order.filemap_get_pages
      0.00            +0.8        0.84 ±  8%  perf-profile.calltrace.cycles-pp.__alloc_pages.folio_alloc.page_cache_ra_order.filemap_get_pages.filemap_read
      0.00            +0.9        0.87 ±  8%  perf-profile.calltrace.cycles-pp.folio_alloc.page_cache_ra_order.filemap_get_pages.filemap_read.xfs_file_buffered_read
      0.00            +1.1        1.12 ± 29%  perf-profile.calltrace.cycles-pp.__filemap_add_folio.filemap_add_folio.page_cache_ra_order.filemap_get_pages.filemap_read
      0.00            +1.7        1.68 ± 30%  perf-profile.calltrace.cycles-pp.filemap_add_folio.page_cache_ra_order.filemap_get_pages.filemap_read.xfs_file_buffered_read
      0.00            +1.8        1.77 ± 64%  perf-profile.calltrace.cycles-pp.uncharge_batch.__mem_cgroup_uncharge.free_compound_page.release_pages.__pagevec_release
      0.00            +1.8        1.78 ± 63%  perf-profile.calltrace.cycles-pp.__mem_cgroup_uncharge.free_compound_page.release_pages.__pagevec_release.invalidate_mapping_pagevec
      0.00            +1.8        1.78 ± 63%  perf-profile.calltrace.cycles-pp.free_compound_page.release_pages.__pagevec_release.invalidate_mapping_pagevec.generic_fadvise
      0.78 ± 10%      +1.9        2.65 ± 19%  perf-profile.calltrace.cycles-pp.page_cache_ra_order.filemap_get_pages.filemap_read.xfs_file_buffered_read.xfs_file_read_iter
      0.00            +3.8        3.78 ±  9%  perf-profile.calltrace.cycles-pp.__submit_bio.__submit_bio_noacct.iomap_readahead.read_pages.page_cache_ra_unbounded
      0.00            +3.8        3.78 ±  9%  perf-profile.calltrace.cycles-pp.__submit_bio_noacct.iomap_readahead.read_pages.page_cache_ra_unbounded.filemap_get_pages
      0.00            +3.9        3.87 ±  9%  perf-profile.calltrace.cycles-pp.iomap_readahead.read_pages.page_cache_ra_unbounded.filemap_get_pages.filemap_read
      0.00            +3.9        3.88 ±  9%  perf-profile.calltrace.cycles-pp.read_pages.page_cache_ra_unbounded.filemap_get_pages.filemap_read.xfs_file_buffered_read
      0.00            +4.1        4.06 ± 41%  perf-profile.calltrace.cycles-pp.release_pages.__pagevec_release.invalidate_mapping_pagevec.generic_fadvise.ksys_fadvise64_64
      0.00            +4.1        4.07 ± 41%  perf-profile.calltrace.cycles-pp.__pagevec_release.invalidate_mapping_pagevec.generic_fadvise.ksys_fadvise64_64.__x64_sys_fadvise64
      0.09 ±223%      +4.7        4.80 ± 37%  perf-profile.calltrace.cycles-pp.invalidate_mapping_pagevec.generic_fadvise.ksys_fadvise64_64.__x64_sys_fadvise64.do_syscall_64
      0.09 ±223%      +4.7        4.80 ± 37%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.posix_fadvise
      0.09 ±223%      +4.7        4.80 ± 37%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.posix_fadvise
      0.09 ±223%      +4.7        4.80 ± 37%  perf-profile.calltrace.cycles-pp.posix_fadvise
      0.09 ±223%      +4.7        4.80 ± 37%  perf-profile.calltrace.cycles-pp.__x64_sys_fadvise64.do_syscall_64.entry_SYSCALL_64_after_hwframe.posix_fadvise
      0.09 ±223%      +4.7        4.80 ± 37%  perf-profile.calltrace.cycles-pp.ksys_fadvise64_64.__x64_sys_fadvise64.do_syscall_64.entry_SYSCALL_64_after_hwframe.posix_fadvise
      0.09 ±223%      +4.7        4.80 ± 37%  perf-profile.calltrace.cycles-pp.generic_fadvise.ksys_fadvise64_64.__x64_sys_fadvise64.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.00            +4.7        4.74 ±  8%  perf-profile.calltrace.cycles-pp.page_cache_ra_unbounded.filemap_get_pages.filemap_read.xfs_file_buffered_read.xfs_file_read_iter
      0.07 ± 15%      +0.0        0.11 ± 12%  perf-profile.children.cycles-pp.kmem_cache_free
      0.03 ±100%      +0.1        0.08 ± 11%  perf-profile.children.cycles-pp.iomap_page_create
      0.02 ±141%      +0.1        0.07 ± 10%  perf-profile.children.cycles-pp.iomap_iop_set_range_uptodate
      0.00            +0.1        0.06 ±  7%  perf-profile.children.cycles-pp.smpboot_thread_fn
      0.00            +0.1        0.06 ± 14%  perf-profile.children.cycles-pp.rcu_do_batch
      0.14 ± 14%      +0.1        0.21 ± 10%  perf-profile.children.cycles-pp.iomap_readpage_iter
      0.00            +0.1        0.07 ± 17%  perf-profile.children.cycles-pp.rcu_core
      0.06 ± 16%      +0.1        0.13 ± 11%  perf-profile.children.cycles-pp.__mod_node_page_state
      0.00            +0.1        0.08 ± 12%  perf-profile.children.cycles-pp.get_mem_cgroup_from_mm
      0.06 ± 17%      +0.1        0.14 ±  9%  perf-profile.children.cycles-pp.__mod_lruvec_state
      0.00            +0.1        0.09 ±  6%  perf-profile.children.cycles-pp.kmem_cache_alloc_lru
      0.00            +0.1        0.09 ±  6%  perf-profile.children.cycles-pp.xas_alloc
      0.00            +0.1        0.10 ± 33%  perf-profile.children.cycles-pp.memcg_check_events
      0.00            +0.1        0.11 ± 13%  perf-profile.children.cycles-pp.xa_load
      0.00            +0.1        0.12 ±  5%  perf-profile.children.cycles-pp.xas_create
      0.00            +0.1        0.12 ± 18%  perf-profile.children.cycles-pp.__mod_memcg_lruvec_state
      0.07 ± 15%      +0.1        0.20 ± 11%  perf-profile.children.cycles-pp.xas_load
      0.10 ± 10%      +0.2        0.25 ±  9%  perf-profile.children.cycles-pp.filemap_get_read_batch
      0.13 ± 59%      +0.2        0.29 ± 19%  perf-profile.children.cycles-pp.filemap_unaccount_folio
      0.03 ± 70%      +0.2        0.19 ± 15%  perf-profile.children.cycles-pp.__pagevec_lru_add_fn
      0.08 ±  8%      +0.2        0.24 ±  9%  perf-profile.children.cycles-pp.iomap_read_end_io
      0.03 ±105%      +0.2        0.23 ± 21%  perf-profile.children.cycles-pp.__free_pages_ok
      0.06 ± 49%      +0.2        0.27 ± 12%  perf-profile.children.cycles-pp.__list_del_entry_valid
      0.06 ± 17%      +0.2        0.28 ± 22%  perf-profile.children.cycles-pp.__mod_lruvec_page_state
      0.01 ±223%      +0.2        0.23 ±  6%  perf-profile.children.cycles-pp.xas_store
      0.14 ± 58%      +0.2        0.38 ± 17%  perf-profile.children.cycles-pp.__filemap_remove_folio
      0.00            +0.2        0.24 ± 23%  perf-profile.children.cycles-pp.__free_one_page
      0.00            +0.3        0.26 ± 17%  perf-profile.children.cycles-pp.find_lock_entries
      0.14 ± 58%      +0.3        0.40 ± 18%  perf-profile.children.cycles-pp.remove_mapping
      0.14 ± 58%      +0.3        0.42 ± 18%  perf-profile.children.cycles-pp.__remove_mapping
      0.00            +0.3        0.28 ± 54%  perf-profile.children.cycles-pp.free_transhuge_page
      0.12 ± 52%      +0.3        0.45 ± 39%  perf-profile.children.cycles-pp.ret_from_fork
      0.12 ± 54%      +0.3        0.45 ± 39%  perf-profile.children.cycles-pp.kthread
      0.00            +0.3        0.33 ± 59%  perf-profile.children.cycles-pp.__mem_cgroup_uncharge_list
      0.00            +0.3        0.34 ± 10%  perf-profile.children.cycles-pp.rmqueue_bulk
      0.01 ±223%      +0.5        0.50 ± 44%  perf-profile.children.cycles-pp.__page_cache_release
      0.07 ± 23%      +0.5        0.59 ± 44%  perf-profile.children.cycles-pp.page_counter_try_charge
      0.09 ± 19%      +0.6        0.69 ± 56%  perf-profile.children.cycles-pp.__pagevec_lru_add
      0.36 ±  8%      +0.6        0.96 ±  8%  perf-profile.children.cycles-pp.rmqueue
      0.08 ± 17%      +0.6        0.70 ± 55%  perf-profile.children.cycles-pp.folio_add_lru
      0.00            +0.7        0.66 ± 30%  perf-profile.children.cycles-pp._raw_spin_lock
      0.00            +0.7        0.66 ± 65%  perf-profile.children.cycles-pp.propagate_protected_usage
      0.40 ±  8%      +0.7        1.08 ±  8%  perf-profile.children.cycles-pp.get_page_from_freelist
      0.00            +0.7        0.68 ±  7%  perf-profile.children.cycles-pp.folio_mark_accessed
      0.08 ± 26%      +0.7        0.77 ± 44%  perf-profile.children.cycles-pp.try_charge_memcg
      0.44 ±  8%      +0.7        1.15 ±  8%  perf-profile.children.cycles-pp.__alloc_pages
      0.11 ± 20%      +0.7        0.84 ± 42%  perf-profile.children.cycles-pp.charge_memcg
      0.43 ±  8%      +0.7        1.16 ±  8%  perf-profile.children.cycles-pp.folio_alloc
      0.00            +0.8        0.75 ± 30%  perf-profile.children.cycles-pp.free_pcppages_bulk
      0.15 ± 16%      +0.8        0.92 ± 39%  perf-profile.children.cycles-pp.__mem_cgroup_charge
      0.00            +0.8        0.78 ± 30%  perf-profile.children.cycles-pp.free_unref_page
      0.02 ±142%      +0.9        0.91 ± 61%  perf-profile.children.cycles-pp.folio_lruvec_lock_irqsave
      0.26 ± 14%      +1.2        1.46 ± 28%  perf-profile.children.cycles-pp.__filemap_add_folio
      0.11 ± 26%      +1.5        1.60 ± 39%  perf-profile.children.cycles-pp._raw_spin_lock_irqsave
      0.02 ±142%      +1.7        1.72 ± 33%  perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
      0.00            +1.8        1.81 ± 63%  perf-profile.children.cycles-pp.free_compound_page
      0.34 ± 13%      +1.8        2.16 ± 28%  perf-profile.children.cycles-pp.filemap_add_folio
      0.78 ± 10%      +1.9        2.65 ± 19%  perf-profile.children.cycles-pp.page_cache_ra_order
      0.00            +1.9        1.93 ± 61%  perf-profile.children.cycles-pp.__mem_cgroup_uncharge
      0.00            +2.1        2.11 ± 62%  perf-profile.children.cycles-pp.page_counter_uncharge
      0.00            +2.2        2.22 ± 61%  perf-profile.children.cycles-pp.uncharge_batch
      0.11 ± 58%      +4.0        4.07 ± 41%  perf-profile.children.cycles-pp.__pagevec_release
      0.12 ± 55%      +4.0        4.14 ± 41%  perf-profile.children.cycles-pp.release_pages
      0.27 ± 58%      +4.5        4.80 ± 37%  perf-profile.children.cycles-pp.posix_fadvise
      0.27 ± 58%      +4.5        4.80 ± 37%  perf-profile.children.cycles-pp.__x64_sys_fadvise64
      0.27 ± 58%      +4.5        4.80 ± 37%  perf-profile.children.cycles-pp.ksys_fadvise64_64
      0.27 ± 58%      +4.5        4.80 ± 37%  perf-profile.children.cycles-pp.generic_fadvise
      0.27 ± 58%      +4.5        4.80 ± 37%  perf-profile.children.cycles-pp.invalidate_mapping_pagevec
      0.00            +4.7        4.74 ±  8%  perf-profile.children.cycles-pp.page_cache_ra_unbounded
      0.07 ±  8%      +0.0        0.09 ±  6%  perf-profile.self.cycles-pp.copy_page_to_iter
      0.00            +0.1        0.05 ±  7%  perf-profile.self.cycles-pp.xas_store
      0.10 ± 10%      +0.1        0.16 ± 10%  perf-profile.self.cycles-pp.filemap_read
      0.00            +0.1        0.06 ± 14%  perf-profile.self.cycles-pp._raw_spin_lock
      0.00            +0.1        0.06 ± 11%  perf-profile.self.cycles-pp.iomap_readahead
      0.00            +0.1        0.06 ±  7%  perf-profile.self.cycles-pp.rmqueue
      0.00            +0.1        0.07 ± 29%  perf-profile.self.cycles-pp.__pagevec_lru_add_fn
      0.06 ± 16%      +0.1        0.13 ± 11%  perf-profile.self.cycles-pp.__mod_node_page_state
      0.00            +0.1        0.07 ± 12%  perf-profile.self.cycles-pp.get_mem_cgroup_from_mm
      0.00            +0.1        0.08 ±  8%  perf-profile.self.cycles-pp.get_page_from_freelist
      0.00            +0.1        0.09 ± 17%  perf-profile.self.cycles-pp.rmqueue_bulk
      0.00            +0.1        0.12 ± 21%  perf-profile.self.cycles-pp.__mod_memcg_lruvec_state
      0.00            +0.1        0.14 ± 11%  perf-profile.self.cycles-pp.iomap_read_end_io
      0.03 ±100%      +0.1        0.17 ± 11%  perf-profile.self.cycles-pp.xas_load
      0.00            +0.1        0.14 ± 30%  perf-profile.self.cycles-pp.__mod_lruvec_page_state
      0.00            +0.2        0.15 ± 21%  perf-profile.self.cycles-pp.release_pages
      0.00            +0.2        0.17 ± 11%  perf-profile.self.cycles-pp.filemap_get_read_batch
      0.00            +0.2        0.18 ± 46%  perf-profile.self.cycles-pp.try_charge_memcg
      0.00            +0.2        0.19 ± 25%  perf-profile.self.cycles-pp.__free_one_page
      0.06 ± 49%      +0.2        0.27 ± 12%  perf-profile.self.cycles-pp.__list_del_entry_valid
      0.00            +0.2        0.23 ± 17%  perf-profile.self.cycles-pp.find_lock_entries
      0.06 ± 51%      +0.4        0.46 ± 41%  perf-profile.self.cycles-pp.page_counter_try_charge
      0.08 ± 16%      +0.4        0.49 ± 33%  perf-profile.self.cycles-pp._raw_spin_lock_irqsave
      0.00            +0.7        0.66 ± 65%  perf-profile.self.cycles-pp.propagate_protected_usage
      0.00            +0.7        0.67 ±  7%  perf-profile.self.cycles-pp.folio_mark_accessed
      0.00            +1.6        1.58 ± 60%  perf-profile.self.cycles-pp.page_counter_uncharge
      0.02 ±142%      +1.7        1.71 ± 33%  perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath




Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://01.org/lkp



View attachment "config-5.18.0-rc5-00030-gb9ff43dd2743" of type "text/plain" (162679 bytes)

View attachment "job-script" of type "text/plain" (8701 bytes)

View attachment "job.yaml" of type "text/plain" (5915 bytes)

View attachment "reproduce" of type "text/plain" (934 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ