Date:   Tue, 4 Feb 2020 17:13:48 +0800
From:   kernel test robot <rong.a.chen@...el.com>
To:     Matthew Bobrowski <mbobrowski@...browski.org>
Cc:     Theodore Ts'o <tytso@....edu>, Jan Kara <jack@...e.cz>,
        Ritesh Harjani <riteshh@...ux.ibm.com>,
        LKML <linux-kernel@...r.kernel.org>,
        Linus Torvalds <torvalds@...ux-foundation.org>,
        lkp@...ts.01.org
Subject: [ext4] 569342dc24: fio.read_bw_MBps 56.4% improvement

Greetings,

FYI, we noticed a 56.4% improvement in fio.read_bw_MBps due to commit:


commit: 569342dc2485392e95b6a626281708c25014ba37 ("ext4: move inode extension/truncate code out from ->iomap_end() callback")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

in testcase: fio-basic
on test machine: 96 threads Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 256G memory
with following parameters:

	disk: 2pmem
	fs: ext4
	mount_option: dax
	runtime: 200s
	nr_task: 50%
	time_based: tb
	rw: rw
	bs: 4k
	ioengine: libaio
	test_size: 200G
	cpufreq_governor: performance
	ucode: 0x500002c

test-description: Fio is a tool that will spawn a number of threads or processes doing a particular type of I/O action as specified by the user.
test-url: https://github.com/axboe/fio
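
For readers unfamiliar with how such parameters translate into a fio run, here is a rough sketch that builds a fio command line from the values listed above. It is not the actual lkp-tests job logic: the reading of nr_task as a percentage of hardware threads, the job name, and the mount point /mnt/pmem0 are all assumptions made for illustration. The fio options used (--ioengine, --rw, --bs, --runtime, --time_based, --numjobs, --directory, --group_reporting) are standard fio options.

```python
import shlex

# Parameters copied from the report above.
PARAMS = {
    "rw": "rw",
    "bs": "4k",
    "ioengine": "libaio",
    "runtime": "200s",
    "nr_task": "50%",
    "time_based": "tb",
}


def fio_command(params, nr_cpus, directory):
    """Build an illustrative fio argv from LKP-style parameters."""
    # Assumption: nr_task is a percentage of the machine's hardware threads.
    numjobs = nr_cpus * int(params["nr_task"].rstrip("%")) // 100
    argv = [
        "fio",
        "--name=fio-basic-sketch",       # hypothetical job name
        f"--directory={directory}",      # hypothetical DAX mount point
        f"--ioengine={params['ioengine']}",
        f"--rw={params['rw']}",
        f"--bs={params['bs']}",
        f"--runtime={params['runtime'].rstrip('s')}",
        f"--numjobs={numjobs}",
        "--group_reporting",
    ]
    if params.get("time_based") == "tb":
        argv.append("--time_based")
    return argv


if __name__ == "__main__":
    # 96 hardware threads, as on the test machine described above.
    print(shlex.join(fio_command(PARAMS, 96, "/mnt/pmem0")))
```

Under that assumption, nr_task: 50% on the 96-thread machine would correspond to 48 concurrent fio jobs. The test_size parameter is left out because how LKP divides it among jobs is not stated in this report.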

In addition, the commit has a significant impact on the following tests:

+------------------+----------------------------------------------------------------------+
| testcase: change | fio-basic: fio.write_bw_MBps 54.0% improvement                       |
| test machine     | 96 threads Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 256G memory |
| test parameters  | bs=4k                                                                |
|                  | cpufreq_governor=performance                                         |
|                  | disk=2pmem                                                           |
|                  | fs=ext4                                                              |
|                  | ioengine=sync                                                        |
|                  | mount_option=dax                                                     |
|                  | nr_task=50%                                                          |
|                  | runtime=200s                                                         |
|                  | rw=randwrite                                                         |
|                  | test_size=200G                                                       |
|                  | time_based=tb                                                        |
|                  | ucode=0x500002c                                                      |
+------------------+----------------------------------------------------------------------+
| testcase: change | fio-basic: fio.write_bw_MBps 68.8% improvement                       |
| test machine     | 96 threads Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 256G memory |
| test parameters  | bs=4k                                                                |
|                  | cpufreq_governor=performance                                         |
|                  | disk=2pmem                                                           |
|                  | fs=ext4                                                              |
|                  | ioengine=libaio                                                      |
|                  | mount_option=dax                                                     |
|                  | nr_task=50%                                                          |
|                  | runtime=200s                                                         |
|                  | rw=randwrite                                                         |
|                  | test_size=200G                                                       |
|                  | time_based=tb                                                        |
|                  | ucode=0x500002c                                                      |
+------------------+----------------------------------------------------------------------+




Details are below:
-------------------------------------------------------------------------------------------------->


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        bin/lkp install job.yaml  # job file is attached in this email
        bin/lkp run     job.yaml

=========================================================================================
bs/compiler/cpufreq_governor/disk/fs/ioengine/kconfig/mount_option/nr_task/rootfs/runtime/rw/tbox_group/test_size/testcase/time_based/ucode:
  4k/gcc-7/performance/2pmem/ext4/libaio/x86_64-rhel-7.6/dax/50%/debian-x86_64-20191114.cgz/200s/rw/lkp-csl-2sp6/200G/fio-basic/tb/0x500002c

commit: 
  b1b4705d54 ("ext4: introduce direct I/O read using iomap infrastructure")
  569342dc24 ("ext4: move inode extension/truncate code out from ->iomap_end() callback")

b1b4705d54abedfd 569342dc2485392e95b6a626281 
---------------- --------------------------- 
       fail:runs  %reproduction    fail:runs
           |             |             |    
           :4           25%           1:4     dmesg.WARNING:at#for_ip_swapgs_restore_regs_and_return_to_usermode/0x
           :4           25%           1:4     dmesg.WARNING:at_ip__fsnotify_parent/0x
          1:4          -25%            :4     dmesg.WARNING:at_ip_aio_read/0x
           :4           25%           1:4     dmesg.WARNING:stack_recursion
         %stddev     %change         %stddev
             \          |                \  
     14.31 ± 30%     -14.3        0.01        fio.latency_1000us%
      3.05 ± 72%     +20.2       23.21 ± 69%  fio.latency_250us%
      0.62 ± 83%      -0.6        0.01        fio.latency_2ms%
     39.71 ± 26%     -31.5        8.24 ± 83%  fio.latency_750us%
      5456           +56.4%       8534        fio.read_bw_MBps
    794624 ±  5%     -40.5%     472576 ± 10%  fio.read_clat_90%_us
    849920 ±  6%     -40.8%     502784 ± 10%  fio.read_clat_95%_us
    948224 ±  7%     -41.1%     558592 ±  9%  fio.read_clat_99%_us
    532777           -36.1%     340678        fio.read_clat_mean_us
    194664 ± 13%     -49.1%      99057 ± 34%  fio.read_clat_stddev
   1396934           +56.4%    2184757        fio.read_iops
      4719 ±  2%      -5.4%       4464        fio.read_slat_mean_us
      4020 ±  5%     -17.9%       3300 ± 10%  fio.read_slat_stddev
      4567 ±  9%     +11.3%       5084 ±  9%  fio.time.involuntary_context_switches
      9322            -1.7%       9160        fio.time.system_time
    286.59 ±  2%     +56.4%     448.31 ±  2%  fio.time.user_time
 5.588e+08           +56.4%  8.739e+08        fio.workload
      5456           +56.4%       8534        fio.write_bw_MBps
    796672 ±  4%     -40.7%     472576 ± 10%  fio.write_clat_90%_us
    849920 ±  6%     -40.7%     503808 ± 10%  fio.write_clat_95%_us
    948224 ±  7%     -41.1%     558592 ±  9%  fio.write_clat_99%_us
    533381           -36.1%     341017        fio.write_clat_mean_us
    194676 ± 13%     -49.1%      99161 ± 34%  fio.write_clat_stddev
   1396910           +56.4%    2184882        fio.write_iops
     28403           -42.8%      16259        fio.write_slat_mean_us
     12283 ± 11%     -45.7%       6666 ± 22%  fio.write_slat_stddev
      1.50 ±  2%      +0.8        2.29 ±  4%  mpstat.cpu.all.usr%
      5899 ± 79%    +151.9%      14862 ± 51%  softirqs.CPU48.SCHED
     47.80            -3.5%      46.11 ±  2%  iostat.cpu.system
      1.49 ±  2%     +52.9%       2.27 ±  4%  iostat.cpu.user
     18587           -82.9%       3179 ±170%  numa-numastat.node0.other_node
    103.25 ± 33%  +14978.9%      15569 ± 34%  numa-numastat.node1.other_node
     45512 ± 23%     -59.2%      18576 ±144%  sched_debug.cfs_rq:/.spread0.avg
     15.56 ± 11%     +33.3%      20.75 ±  7%  sched_debug.cpu.nr_uninterruptible.max
     38681 ±  3%    +535.2%     245689 ±143%  turbostat.C1
    126.48            +4.2%     131.76        turbostat.RAMWatt
      9311 ±  8%     +20.5%      11221 ±  6%  numa-vmstat.node0.nr_slab_reclaimable
     19397 ±  2%     -79.7%       3942 ±142%  numa-vmstat.node0.numa_other
      9935 ±  7%     -19.6%       7983 ± 10%  numa-vmstat.node1.nr_slab_reclaimable
    156898            +9.7%     172186 ±  3%  numa-vmstat.node1.numa_other
     37246 ±  8%     +20.5%      44887 ±  6%  numa-meminfo.node0.KReclaimable
     37246 ±  8%     +20.5%      44887 ±  6%  numa-meminfo.node0.SReclaimable
    137281 ±  3%      +7.8%     148012 ±  2%  numa-meminfo.node0.Slab
     39742 ±  7%     -19.6%      31935 ± 10%  numa-meminfo.node1.KReclaimable
     39742 ±  7%     -19.6%      31935 ± 10%  numa-meminfo.node1.SReclaimable
     47.25 ±137%    -100.0%       0.00        interrupts.46:PCI-MSI.31981579-edge.i40e-eth0-TxRx-10
     59.50 ± 61%     -71.0%      17.25 ±109%  interrupts.CPU10.TLB:TLB_shootdowns
     83.25 ± 45%     -70.9%      24.25 ± 81%  interrupts.CPU11.TLB:TLB_shootdowns
     70.50 ± 50%     -76.6%      16.50 ± 90%  interrupts.CPU16.TLB:TLB_shootdowns
     87.00 ± 44%     -73.9%      22.75 ± 93%  interrupts.CPU17.TLB:TLB_shootdowns
     75.75 ± 46%     -59.1%      31.00 ± 98%  interrupts.CPU20.TLB:TLB_shootdowns
     78.00 ± 42%     -72.4%      21.50 ± 67%  interrupts.CPU22.TLB:TLB_shootdowns
      7327 ± 12%     -40.0%       4393 ± 46%  interrupts.CPU26.NMI:Non-maskable_interrupts
      7327 ± 12%     -40.0%       4393 ± 46%  interrupts.CPU26.PMI:Performance_monitoring_interrupts
      7.25 ± 65%   +1220.7%      95.75 ± 82%  interrupts.CPU3.RES:Rescheduling_interrupts
      1965 ±  7%      -9.4%       1781 ±  9%  interrupts.CPU30.CAL:Function_call_interrupts
      1951 ±  6%     -10.0%       1756 ±  7%  interrupts.CPU31.CAL:Function_call_interrupts
      1957 ±  7%     -10.2%       1757 ±  7%  interrupts.CPU32.CAL:Function_call_interrupts
      1853 ± 13%     +25.4%       2324 ± 16%  interrupts.CPU43.CAL:Function_call_interrupts
      7367 ± 12%     -45.3%       4027 ± 58%  interrupts.CPU44.NMI:Non-maskable_interrupts
      7367 ± 12%     -45.3%       4027 ± 58%  interrupts.CPU44.PMI:Performance_monitoring_interrupts
     13.25 ± 35%    +300.0%      53.00 ± 83%  interrupts.CPU48.RES:Rescheduling_interrupts
     64.25 ± 64%     -60.3%      25.50 ± 54%  interrupts.CPU55.TLB:TLB_shootdowns
     74.00 ± 57%     -74.7%      18.75 ± 81%  interrupts.CPU7.TLB:TLB_shootdowns
     40.50 ± 50%    +108.0%      84.25 ± 21%  interrupts.CPU75.TLB:TLB_shootdowns
      5558 ± 32%     -52.9%       2617 ± 21%  interrupts.CPU8.NMI:Non-maskable_interrupts
      5558 ± 32%     -52.9%       2617 ± 21%  interrupts.CPU8.PMI:Performance_monitoring_interrupts
      5.50 ± 91%   +1440.9%      84.75 ± 34%  interrupts.CPU8.RES:Rescheduling_interrupts
     40.25 ± 38%     +91.3%      77.00 ± 33%  interrupts.CPU83.TLB:TLB_shootdowns
     41.00 ± 46%     +95.7%      80.25 ± 22%  interrupts.CPU90.TLB:TLB_shootdowns
 4.867e+09           +46.6%  7.133e+09 ±  3%  perf-stat.i.branch-instructions
  35659892 ±  2%     +40.1%   49959300        perf-stat.i.branch-misses
     48.56 ±  2%      +6.7       55.23 ±  2%  perf-stat.i.cache-miss-rate%
 1.806e+08 ±  3%     +24.0%  2.239e+08 ±  3%  perf-stat.i.cache-misses
 3.709e+08 ±  2%      +7.7%  3.993e+08 ±  2%  perf-stat.i.cache-references
      5.12           -31.6%       3.50 ±  3%  perf-stat.i.cpi
    750.82 ±  3%     -16.1%     630.03 ±  9%  perf-stat.i.cycles-between-cache-misses
  7.13e+09 ±  9%     +47.4%  1.051e+10 ±  8%  perf-stat.i.dTLB-loads
 4.827e+09 ±  8%     +45.6%  7.027e+09 ±  3%  perf-stat.i.dTLB-stores
  38590154 ±  4%     +23.4%   47627762 ±  9%  perf-stat.i.iTLB-load-misses
 2.618e+10           +46.5%  3.835e+10 ±  3%  perf-stat.i.instructions
    686.01 ±  3%     +19.6%     820.13 ±  7%  perf-stat.i.instructions-per-iTLB-miss
      0.20           +47.3%       0.29        perf-stat.i.ipc
   1537507 ±  4%     +17.0%    1798882 ±  7%  perf-stat.i.node-loads
  13902646 ±  3%      -8.9%   12667482 ±  3%  perf-stat.i.node-store-misses
     17738 ±  3%     +16.9%      20736 ±  4%  perf-stat.i.node-stores
     14.17 ±  2%     -26.5%      10.41        perf-stat.overall.MPKI
      0.73            -0.0        0.70        perf-stat.overall.branch-miss-rate%
     48.71 ±  2%      +7.4       56.08        perf-stat.overall.cache-miss-rate%
      5.13           -32.9%       3.44        perf-stat.overall.cpi
    744.79 ±  3%     -20.8%     589.98        perf-stat.overall.cycles-between-cache-misses
     86.06            +1.7       87.73        perf-stat.overall.iTLB-load-miss-rate%
    679.44 ±  3%     +19.2%     810.22 ±  6%  perf-stat.overall.instructions-per-iTLB-miss
      0.19           +49.0%       0.29        perf-stat.overall.ipc
      9405            -4.4%       8987        perf-stat.overall.path-length
 4.842e+09           +46.6%  7.097e+09 ±  3%  perf-stat.ps.branch-instructions
  35478652 ±  2%     +40.1%   49709486        perf-stat.ps.branch-misses
 1.797e+08 ±  3%     +24.0%  2.228e+08 ±  3%  perf-stat.ps.cache-misses
  3.69e+08 ±  2%      +7.7%  3.973e+08 ±  2%  perf-stat.ps.cache-references
 7.093e+09 ±  9%     +47.4%  1.046e+10 ±  8%  perf-stat.ps.dTLB-loads
 4.803e+09 ±  8%     +45.6%  6.992e+09 ±  3%  perf-stat.ps.dTLB-stores
  38394613 ±  4%     +23.4%   47389655 ±  9%  perf-stat.ps.iTLB-load-misses
 2.605e+10           +46.5%  3.816e+10 ±  3%  perf-stat.ps.instructions
   1529726 ±  4%     +17.0%    1789899 ±  7%  perf-stat.ps.node-loads
  13832194 ±  3%      -8.9%   12604236 ±  3%  perf-stat.ps.node-store-misses
     17673 ±  3%     +16.9%      20659 ±  4%  perf-stat.ps.node-stores
 5.255e+12           +49.5%  7.854e+12        perf-stat.total.instructions
     20.14 ±  9%     -20.1        0.00        perf-profile.calltrace.cycles-pp.ext4_iomap_end.iomap_apply.dax_iomap_rw.ext4_file_write_iter.aio_write
     14.28 ±  9%     -14.3        0.00        perf-profile.calltrace.cycles-pp.jbd2__journal_start.ext4_iomap_end.iomap_apply.dax_iomap_rw.ext4_file_write_iter
     14.20 ±  9%     -14.2        0.00        perf-profile.calltrace.cycles-pp.start_this_handle.jbd2__journal_start.ext4_iomap_end.iomap_apply.dax_iomap_rw
      0.60 ± 10%      +0.4        1.05 ±  7%  perf-profile.calltrace.cycles-pp.do_io_getevents.__x64_sys_io_getevents.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.64 ± 10%      +0.5        1.11 ±  7%  perf-profile.calltrace.cycles-pp.__x64_sys_io_getevents.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.80 ± 12%      +0.6        1.39 ± 10%  perf-profile.calltrace.cycles-pp.__srcu_read_unlock.dax_iomap_actor.iomap_apply.dax_iomap_rw.ext4_file_write_iter
      0.12 ±173%      +0.6        0.75 ±  7%  perf-profile.calltrace.cycles-pp.read_events.do_io_getevents.__x64_sys_io_getevents.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.00            +0.7        0.68 ±  7%  perf-profile.calltrace.cycles-pp.aio_read_events.read_events.do_io_getevents.__x64_sys_io_getevents.do_syscall_64
      1.02 ± 11%      +0.7        1.74 ±  4%  perf-profile.calltrace.cycles-pp.__copy_user_nocache.__copy_user_flushcache._copy_from_iter_flushcache.dax_iomap_actor.iomap_apply
      1.02 ± 12%      +0.7        1.75 ±  4%  perf-profile.calltrace.cycles-pp.__copy_user_flushcache._copy_from_iter_flushcache.dax_iomap_actor.iomap_apply.dax_iomap_rw
      1.04 ± 11%      +0.7        1.78 ±  4%  perf-profile.calltrace.cycles-pp._copy_from_iter_flushcache.dax_iomap_actor.iomap_apply.dax_iomap_rw.ext4_file_write_iter
      1.09 ±  7%      +0.8        1.87 ±  7%  perf-profile.calltrace.cycles-pp.__memcpy_mcsafe.copyout_mcsafe._copy_to_iter_mcsafe.dax_iomap_actor.iomap_apply
      1.11 ±  8%      +0.8        1.91 ±  8%  perf-profile.calltrace.cycles-pp.copyout_mcsafe._copy_to_iter_mcsafe.dax_iomap_actor.iomap_apply.dax_iomap_rw
      1.16 ±  8%      +0.8        1.98 ±  8%  perf-profile.calltrace.cycles-pp._copy_to_iter_mcsafe.dax_iomap_actor.iomap_apply.dax_iomap_rw.ext4_file_read_iter
      1.24 ±  8%      +0.9        2.14 ±  7%  perf-profile.calltrace.cycles-pp.dax_iomap_actor.iomap_apply.dax_iomap_rw.ext4_file_read_iter.aio_read
      2.00 ±  7%      +1.4        3.38 ±  7%  perf-profile.calltrace.cycles-pp.ext4_journal_check_start.__ext4_journal_start_sb.ext4_iomap_begin.iomap_apply.dax_iomap_rw
      1.91 ±  8%      +1.4        3.29 ±  6%  perf-profile.calltrace.cycles-pp.dax_iomap_actor.iomap_apply.dax_iomap_rw.ext4_file_write_iter.aio_write
      2.01 ±  7%      +1.4        3.40 ±  7%  perf-profile.calltrace.cycles-pp.__ext4_journal_start_sb.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_write_iter
      2.81 ±  9%      +1.5        4.34 ±  6%  perf-profile.calltrace.cycles-pp._raw_read_lock.start_this_handle.jbd2__journal_start.ext4_iomap_begin.iomap_apply
      3.50 ±  9%      +2.1        5.63 ±  6%  perf-profile.calltrace.cycles-pp.add_transaction_credits.start_this_handle.jbd2__journal_start.ext4_iomap_begin.iomap_apply
      4.08 ±  8%      +2.9        6.94 ±  7%  perf-profile.calltrace.cycles-pp._raw_read_lock.jbd2_transaction_committed.ext4_set_iomap.ext4_iomap_begin.iomap_apply
      7.23 ±  9%      +3.6       10.86 ±  5%  perf-profile.calltrace.cycles-pp.jbd2_journal_stop.__ext4_journal_stop.ext4_iomap_begin.iomap_apply.dax_iomap_rw
      5.24 ±  8%      +3.8        8.99 ±  7%  perf-profile.calltrace.cycles-pp.ext4_set_iomap.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_write_iter
      5.21 ±  7%      +3.8        8.98 ±  8%  perf-profile.calltrace.cycles-pp.ext4_set_iomap.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_read_iter
      7.33 ±  9%      +3.8       11.12 ±  5%  perf-profile.calltrace.cycles-pp.__ext4_journal_stop.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_write_iter
      5.36 ±  8%      +3.9        9.24 ±  8%  perf-profile.calltrace.cycles-pp.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_read_iter.aio_read
      6.67 ±  7%      +4.8       11.48 ±  8%  perf-profile.calltrace.cycles-pp.iomap_apply.dax_iomap_rw.ext4_file_read_iter.aio_read.io_submit_one
      6.68 ±  7%      +4.8       11.51 ±  8%  perf-profile.calltrace.cycles-pp.dax_iomap_rw.ext4_file_read_iter.aio_read.io_submit_one.__x64_sys_io_submit
      7.75 ±  8%      +5.0       12.72 ±  8%  perf-profile.calltrace.cycles-pp.ext4_file_read_iter.aio_read.io_submit_one.__x64_sys_io_submit.do_syscall_64
      7.98 ±  8%      +5.1       13.13 ±  8%  perf-profile.calltrace.cycles-pp.aio_read.io_submit_one.__x64_sys_io_submit.do_syscall_64.entry_SYSCALL_64_after_hwframe
     10.41 ±  8%      +7.5       17.87 ±  7%  perf-profile.calltrace.cycles-pp.jbd2_transaction_committed.ext4_set_iomap.ext4_iomap_begin.iomap_apply.dax_iomap_rw
     14.19 ±  9%      +8.7       22.93 ±  7%  perf-profile.calltrace.cycles-pp.start_this_handle.jbd2__journal_start.ext4_iomap_begin.iomap_apply.dax_iomap_rw
     14.26 ±  9%      +8.8       23.05 ±  7%  perf-profile.calltrace.cycles-pp.jbd2__journal_start.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_write_iter
     29.03 ±  8%     +17.9       46.90 ±  6%  perf-profile.calltrace.cycles-pp.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_write_iter.aio_write
     20.16 ±  9%     -20.2        0.00        perf-profile.children.cycles-pp.ext4_iomap_end
     29.63 ±  9%      -5.5       24.17 ±  7%  perf-profile.children.cycles-pp.jbd2__journal_start
     29.48 ±  9%      -5.4       24.04 ±  7%  perf-profile.children.cycles-pp.start_this_handle
      7.30 ±  9%      -1.4        5.90 ±  7%  perf-profile.children.cycles-pp.add_transaction_credits
      4.17 ±  7%      -0.6        3.54 ±  7%  perf-profile.children.cycles-pp.ext4_journal_check_start
      4.19 ±  7%      -0.6        3.56 ±  7%  perf-profile.children.cycles-pp.__ext4_journal_start_sb
      0.06 ± 17%      +0.0        0.09 ±  9%  perf-profile.children.cycles-pp._raw_spin_unlock_irqrestore
      0.08 ± 15%      +0.0        0.11 ±  4%  perf-profile.children.cycles-pp.aio_complete_rw
      0.06 ±  6%      +0.0        0.10 ±  4%  perf-profile.children.cycles-pp._cond_resched
      0.05 ±  8%      +0.0        0.09 ±  8%  perf-profile.children.cycles-pp.put_reqs_available
      0.04 ± 58%      +0.0        0.08 ± 10%  perf-profile.children.cycles-pp.__get_user_8
      0.09 ± 13%      +0.0        0.13 ±  6%  perf-profile.children.cycles-pp._raw_spin_lock_irqsave
      0.07 ± 10%      +0.0        0.12 ±  7%  perf-profile.children.cycles-pp.__check_object_size
      0.04 ± 58%      +0.0        0.09 ±  7%  perf-profile.children.cycles-pp.__put_user_4
      0.07 ± 14%      +0.1        0.12 ± 12%  perf-profile.children.cycles-pp.__fget
      0.06 ± 11%      +0.1        0.11 ± 11%  perf-profile.children.cycles-pp.mutex_lock
      0.01 ±173%      +0.1        0.06 ± 13%  perf-profile.children.cycles-pp.fsnotify
      0.01 ±173%      +0.1        0.07 ±  7%  perf-profile.children.cycles-pp.fput_many
      0.01 ±173%      +0.1        0.07 ± 13%  perf-profile.children.cycles-pp.__virt_addr_valid
      0.00            +0.1        0.05 ±  8%  perf-profile.children.cycles-pp.__pmem_direct_access
      0.18 ±  8%      +0.1        0.23 ±  8%  perf-profile.children.cycles-pp.kmem_cache_alloc
      0.04 ± 57%      +0.1        0.09 ±  4%  perf-profile.children.cycles-pp.__get_reqs_available
      0.01 ±173%      +0.1        0.07 ±  6%  perf-profile.children.cycles-pp.mutex_unlock
      0.00            +0.1        0.06 ±  7%  perf-profile.children.cycles-pp.refill_reqs_available
      0.01 ±173%      +0.1        0.07 ± 10%  perf-profile.children.cycles-pp.__srcu_read_lock
      0.01 ±173%      +0.1        0.07 ±  5%  perf-profile.children.cycles-pp.__inode_security_revalidate
      0.00            +0.1        0.06 ± 13%  perf-profile.children.cycles-pp.ext4_write_checks
      0.00            +0.1        0.07 ± 17%  perf-profile.children.cycles-pp.ext4_data_block_valid
      0.08 ± 13%      +0.1        0.15 ± 17%  perf-profile.children.cycles-pp.aio_prep_rw
      0.03 ±100%      +0.1        0.09 ± 15%  perf-profile.children.cycles-pp.__check_block_validity
      0.01 ±173%      +0.1        0.08 ± 10%  perf-profile.children.cycles-pp.copy_user_generic_unrolled
      0.00            +0.1        0.07 ± 16%  perf-profile.children.cycles-pp.__sb_start_write
      0.00            +0.1        0.07 ± 17%  perf-profile.children.cycles-pp.current_time
      0.12 ± 10%      +0.1        0.19 ±  9%  perf-profile.children.cycles-pp.__might_sleep
      0.11 ±  7%      +0.1        0.18 ±  8%  perf-profile.children.cycles-pp.__get_user_4
      0.02 ±173%      +0.1        0.10 ± 17%  perf-profile.children.cycles-pp.ext4_meta_trans_blocks
      0.11 ±  7%      +0.1        0.19 ±  6%  perf-profile.children.cycles-pp.copy_user_enhanced_fast_string
      0.00            +0.1        0.08 ±  5%  perf-profile.children.cycles-pp.dax_direct_access
      0.11 ±  9%      +0.1        0.19 ±  9%  perf-profile.children.cycles-pp.selinux_file_permission
      0.10 ± 12%      +0.1        0.18 ± 10%  perf-profile.children.cycles-pp._copy_to_user
      0.12 ±  7%      +0.1        0.20 ±  7%  perf-profile.children.cycles-pp.entry_SYSCALL_64
      0.12 ±  8%      +0.1        0.22 ±  8%  perf-profile.children.cycles-pp._copy_from_user
      0.15 ±  8%      +0.1        0.26 ±  8%  perf-profile.children.cycles-pp.syscall_return_via_sysret
      0.13 ± 10%      +0.1        0.24 ±  8%  perf-profile.children.cycles-pp.ext4_es_lookup_extent
      0.15 ± 11%      +0.1        0.26 ±  8%  perf-profile.children.cycles-pp.security_file_permission
      0.18 ± 12%      +0.1        0.30 ±  7%  perf-profile.children.cycles-pp.___might_sleep
      0.18 ± 11%      +0.1        0.32 ±  9%  perf-profile.children.cycles-pp.__might_fault
      0.20 ± 11%      +0.2        0.38 ± 10%  perf-profile.children.cycles-pp.ext4_map_blocks
      0.30 ± 10%      +0.2        0.52 ±  7%  perf-profile.children.cycles-pp.lookup_ioctx
      0.39 ± 10%      +0.3        0.69 ±  7%  perf-profile.children.cycles-pp.aio_read_events
      0.43 ± 10%      +0.3        0.76 ±  7%  perf-profile.children.cycles-pp.read_events
      0.61 ± 10%      +0.4        1.05 ±  7%  perf-profile.children.cycles-pp.do_io_getevents
      0.64 ± 10%      +0.5        1.12 ±  7%  perf-profile.children.cycles-pp.__x64_sys_io_getevents
      0.81 ± 12%      +0.6        1.42 ± 10%  perf-profile.children.cycles-pp.__srcu_read_unlock
      1.02 ± 11%      +0.7        1.74 ±  4%  perf-profile.children.cycles-pp.__copy_user_nocache
      1.03 ± 12%      +0.7        1.75 ±  4%  perf-profile.children.cycles-pp.__copy_user_flushcache
      1.04 ± 11%      +0.7        1.78 ±  4%  perf-profile.children.cycles-pp._copy_from_iter_flushcache
      1.09 ±  7%      +0.8        1.88 ±  7%  perf-profile.children.cycles-pp.__memcpy_mcsafe
      1.12 ±  7%      +0.8        1.92 ±  7%  perf-profile.children.cycles-pp.copyout_mcsafe
      1.16 ±  7%      +0.8        1.98 ±  8%  perf-profile.children.cycles-pp._copy_to_iter_mcsafe
      9.94 ±  8%      +1.7       11.60 ±  7%  perf-profile.children.cycles-pp._raw_read_lock
      3.15 ±  8%      +2.3        5.44 ±  7%  perf-profile.children.cycles-pp.dax_iomap_actor
      7.75 ±  8%      +5.0       12.72 ±  8%  perf-profile.children.cycles-pp.ext4_file_read_iter
      7.98 ±  8%      +5.2       13.14 ±  8%  perf-profile.children.cycles-pp.aio_read
     10.41 ±  8%      +7.5       17.88 ±  7%  perf-profile.children.cycles-pp.jbd2_transaction_committed
     10.46 ±  8%      +7.5       17.98 ±  7%  perf-profile.children.cycles-pp.ext4_set_iomap
     34.40 ±  8%     +21.8       56.16 ±  7%  perf-profile.children.cycles-pp.ext4_iomap_begin
     16.32 ±  9%      -2.8       13.53 ±  8%  perf-profile.self.cycles-pp.start_this_handle
      7.28 ±  9%      -1.4        5.87 ±  6%  perf-profile.self.cycles-pp.add_transaction_credits
      4.12 ±  7%      -0.6        3.50 ±  7%  perf-profile.self.cycles-pp.ext4_journal_check_start
      0.07 ± 12%      +0.0        0.10 ±  5%  perf-profile.self.cycles-pp.aio_complete_rw
      0.06 ± 14%      +0.0        0.09 ± 10%  perf-profile.self.cycles-pp._raw_spin_unlock_irqrestore
      0.08 ± 13%      +0.0        0.12 ±  5%  perf-profile.self.cycles-pp.kmem_cache_alloc
      0.05 ±  8%      +0.0        0.09 ±  8%  perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.05 ±  8%      +0.0        0.09 ±  8%  perf-profile.self.cycles-pp.put_reqs_available
      0.09 ± 13%      +0.0        0.13 ±  9%  perf-profile.self.cycles-pp._raw_spin_lock_irqsave
      0.07 ± 13%      +0.0        0.11 ±  7%  perf-profile.self.cycles-pp.__x64_sys_io_submit
      0.04 ± 58%      +0.0        0.08 ±  5%  perf-profile.self.cycles-pp.__put_user_4
      0.04 ± 57%      +0.0        0.08 ± 10%  perf-profile.self.cycles-pp.do_syscall_64
      0.04 ± 57%      +0.0        0.08 ± 10%  perf-profile.self.cycles-pp.__get_user_8
      0.08 ± 15%      +0.0        0.13 ±  8%  perf-profile.self.cycles-pp.ext4_iomap_begin
      0.07 ± 12%      +0.0        0.12 ±  9%  perf-profile.self.cycles-pp.selinux_file_permission
      0.01 ±173%      +0.0        0.06 ± 11%  perf-profile.self.cycles-pp.fsnotify
      0.01 ±173%      +0.0        0.06 ± 11%  perf-profile.self.cycles-pp.kmem_cache_free
      0.04 ± 58%      +0.0        0.09 ±  7%  perf-profile.self.cycles-pp.ext4_es_lookup_extent
      0.07 ± 14%      +0.1        0.12 ± 12%  perf-profile.self.cycles-pp.__fget
      0.00            +0.1        0.05        perf-profile.self.cycles-pp.copyout_mcsafe
      0.04 ± 57%      +0.1        0.09 ±  7%  perf-profile.self.cycles-pp.__get_reqs_available
      0.03 ±100%      +0.1        0.08 ±  8%  perf-profile.self.cycles-pp.dax_iomap_actor
      0.01 ±173%      +0.1        0.07 ±  7%  perf-profile.self.cycles-pp.__srcu_read_lock
      0.01 ±173%      +0.1        0.07 ± 13%  perf-profile.self.cycles-pp.__virt_addr_valid
      0.00            +0.1        0.06 ±  9%  perf-profile.self.cycles-pp.aio_write
      0.00            +0.1        0.06 ±  9%  perf-profile.self.cycles-pp.ext4_map_blocks
      0.01 ±173%      +0.1        0.07 ±  6%  perf-profile.self.cycles-pp.mutex_unlock
      0.00            +0.1        0.06 ±  7%  perf-profile.self.cycles-pp.refill_reqs_available
      0.00            +0.1        0.06 ± 11%  perf-profile.self.cycles-pp.ext4_file_read_iter
      0.01 ±173%      +0.1        0.07 ± 11%  perf-profile.self.cycles-pp.mutex_lock
      0.04 ± 57%      +0.1        0.10 ± 25%  perf-profile.self.cycles-pp.ext4_set_iomap
      0.00            +0.1        0.06 ±  6%  perf-profile.self.cycles-pp.fput_many
      0.09 ± 11%      +0.1        0.16 ±  7%  perf-profile.self.cycles-pp.iomap_apply
      0.01 ±173%      +0.1        0.08 ± 15%  perf-profile.self.cycles-pp.copy_user_generic_unrolled
      0.11 ±  7%      +0.1        0.17 ±  8%  perf-profile.self.cycles-pp.__might_sleep
      0.00            +0.1        0.07 ± 12%  perf-profile.self.cycles-pp.__x64_sys_io_getevents
      0.00            +0.1        0.07 ± 12%  perf-profile.self.cycles-pp.__might_fault
      0.08 ± 15%      +0.1        0.15 ± 17%  perf-profile.self.cycles-pp.aio_prep_rw
      0.11 ±  7%      +0.1        0.18 ±  6%  perf-profile.self.cycles-pp.__get_user_4
      0.00            +0.1        0.07 ± 20%  perf-profile.self.cycles-pp.ext4_file_write_iter
      0.11 ±  7%      +0.1        0.19 ±  6%  perf-profile.self.cycles-pp.copy_user_enhanced_fast_string
      0.10 ±  4%      +0.1        0.18 ±  6%  perf-profile.self.cycles-pp.aio_read_events
      0.12 ±  7%      +0.1        0.20 ±  7%  perf-profile.self.cycles-pp.entry_SYSCALL_64
      0.13 ± 14%      +0.1        0.22 ±  7%  perf-profile.self.cycles-pp.lookup_ioctx
      0.15 ±  8%      +0.1        0.26 ±  8%  perf-profile.self.cycles-pp.syscall_return_via_sysret
      0.18 ± 11%      +0.1        0.29 ±  8%  perf-profile.self.cycles-pp.___might_sleep
      0.19 ±  6%      +0.2        0.35 ±  7%  perf-profile.self.cycles-pp.io_submit_one
      0.81 ± 12%      +0.6        1.41 ± 10%  perf-profile.self.cycles-pp.__srcu_read_unlock
      1.02 ± 11%      +0.7        1.73 ±  4%  perf-profile.self.cycles-pp.__copy_user_nocache
      1.08 ±  7%      +0.8        1.86 ±  8%  perf-profile.self.cycles-pp.__memcpy_mcsafe
      9.89 ±  9%      +1.6       11.54 ±  7%  perf-profile.self.cycles-pp._raw_read_lock
      6.29 ±  8%      +4.6       10.88 ±  7%  perf-profile.self.cycles-pp.jbd2_transaction_committed
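
The comparison rows above follow a fixed layout: base value, optional "± N%" stddev, change, new value, optional stddev, metric name. A minimal sketch for parsing such rows and recomputing the relative change is given here. The regular expression and the helper names parse_row and pct_change are illustrative and not part of lkp-tests. Rows whose change column has no "%" sign (for example the perf-profile rows) report an absolute difference in percentage points, so pct_change does not apply to them.

```python
import re

# One data row: <base> [± N%]  <change>  <new> [± N%]  <metric>
ROW = re.compile(
    r"^\s*(?P<base>[\d.e+]+)(?:\s*±\s*\d+%)?"
    r"\s+(?P<change>[+-][\d.]+%?)"
    r"\s+(?P<new>[\d.e+]+)(?:\s*±\s*\d+%)?"
    r"\s+(?P<metric>\S+)\s*$"
)


def parse_row(line):
    """Return (metric, base, new, reported_change), or None for non-data lines."""
    m = ROW.match(line)
    if m is None:
        return None
    return m["metric"], float(m["base"]), float(m["new"]), m["change"]


def pct_change(base, new):
    """Relative change of `new` against `base`, in percent."""
    return (new - base) / base * 100.0


if __name__ == "__main__":
    sample = [
        "      5456           +56.4%       8534        fio.read_bw_MBps",
        "    794624 ±  5%     -40.5%     472576 ± 10%  fio.read_clat_90%_us",
        " 5.588e+08           +56.4%  8.739e+08        fio.workload",
    ]
    for line in sample:
        metric, base, new, reported = parse_row(line)
        print(f"{metric}: reported {reported}, recomputed {pct_change(base, new):+.1f}%")
```

For the three sample rows, the recomputed values agree with the reported %change column to one decimal place.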


                                                                                
                                  fio.read_bw_MBps                              
                                                                                
  10000 +-+-----------------------------------------------------------------+   
   9000 O-OO O   O  O    O    O O OO                                        |   
        |      O  O   O    OO        O O  O  O O O OO                       |   
   8000 +-+                             O   O                               |   
   7000 +-+                                                                 |   
        |                                                                   |   
   6000 +-+     .+ .+.++.+.++.+.+.++.+.++.+.++.+.+.++.+.++.+.++.+.+. +.+. +.|   
   5000 +-++.+.+  +                                                 +    +  |   
   4000 +-+                                                                 |   
        |                                                                   |   
   3000 +-+                                                                 |   
   2000 +-+                                                                 |   
        |                                                                   |   
   1000 +-+                                                                 |   
      0 +-+------------O----------------------------------------------------+   
                                                                                
                                                                                                                                                                
                                     fio.read_iops                              
                                                                                
  2.5e+06 +-+---------------------------------------------------------------+   
          O OO O OO O OO   OO OO O OO O                                     |   
          |                             OO O OO O OO O                      |   
    2e+06 +-+                                                               |   
          |                                                                 |   
          |                                                                 |   
  1.5e+06 +-++.+. +. .++.+.++.++.+.++.+.++.+.++.+.++.+.++.+.++.++.+. +.+.++.|   
          |.+    +  +                                               +       |   
    1e+06 +-+                                                               |   
          |                                                                 |   
          |                                                                 |   
   500000 +-+                                                               |   
          |                                                                 |   
          |                                                                 |   
        0 +-+------------O--------------------------------------------------+   
                                                                                
                                                                                                                                                                
                                fio.read_clat_mean_us                           
                                                                                
  600000 +-+----------------------------------------------------------------+   
         |.++.+  +.+.                                              + :  .+  |   
  500000 +-+         ++.+.++.+.++.+.++.+.+.++.+.++.+.++.+.++.+.++.+  +.+  +.|   
         |                                                                  |   
         |                                                                  |   
  400000 +-+                                                                |   
         |    O O  O  O    O O  O   OO O O OO O OO O O                      |   
  300000 O-OO    O   O    O    O  O                                         |   
         |                                                                  |   
  200000 +-+                                                                |   
         |                                                                  |   
         |                                                                  |   
  100000 +-+                                                                |   
         |                                                                  |   
       0 +-+------------O---------------------------------------------------+   
                                                                                
                                                                                                                                                                
                                  fio.write_bw_MBps                             
                                                                                
  10000 +-+-----------------------------------------------------------------+   
   9000 O-OO O   O  O    O    O O OO                                        |   
        |      O  O   O    OO        O O  O  O O O OO                       |   
   8000 +-+                             O   O                               |   
   7000 +-+                                                                 |   
        |                                                                   |   
   6000 +-+     .+ .+.++.+.++.+.+.++.+.++.+.++.+.+.++.+.++.+.++.+.+. +.+. +.|   
   5000 +-++.+.+  +                                                 +    +  |   
   4000 +-+                                                                 |   
        |                                                                   |   
   3000 +-+                                                                 |   
   2000 +-+                                                                 |   
        |                                                                   |   
   1000 +-+                                                                 |   
      0 +-+------------O----------------------------------------------------+   
                                                                                
                                                                                                                                                                
                                    fio.write_iops                              
                                                                                
  2.5e+06 +-+---------------------------------------------------------------+   
          O OO O OO O OO   OO OO O OO O                                     |   
          |                             OO O OO O OO O                      |   
    2e+06 +-+                                                               |   
          |                                                                 |   
          |                                                                 |   
  1.5e+06 +-++.+. +. .++.+.++.++.+.++.+.++.+.++.+.++.+.++.+.++.++.+. +.+.++.|   
          |.+    +  +                                               +       |   
    1e+06 +-+                                                               |   
          |                                                                 |   
          |                                                                 |   
   500000 +-+                                                               |   
          |                                                                 |   
          |                                                                 |   
        0 +-+------------O--------------------------------------------------+   
                                                                                
                                                                                                                                                                
                               fio.write_clat_mean_us                           
                                                                                
  600000 +-+----------------------------------------------------------------+   
         |.++.+  +  +                                              + :  .+  |   
  500000 +-+         ++.+.++.+.++.+.++.+.+.++.+.++.+.++.+.++.+.++.+  +.+  +.|   
         |                                                                  |   
         |                                                                  |   
  400000 +-+                                                                |   
         |    O O  O  O    O O  O   OO O O OO O OO O O                      |   
  300000 O-OO    O   O    O    O  O                                         |   
         |                                                                  |   
  200000 +-+                                                                |   
         |                                                                  |   
         |                                                                  |   
  100000 +-+                                                                |   
         |                                                                  |   
       0 +-+------------O---------------------------------------------------+   
                                                                                
                                                                                                                                                                
                               fio.write_slat_mean_us                           
                                                                                
  35000 +-+-----------------------------------------------------------------+   
        |                                                                   |   
  30000 +-+   .+. +.                                               .+   .+  |   
        |  +.+   +  +.++.+.++.+.+.++.+.++.+.++.+.+.++. .++.+.++.+.+  +.+  +.|   
  25000 +-+                                           +                     |   
        |                                                                   |   
  20000 +-+                                                                 |   
        |                               O O OO O    O                       |   
  15000 O-OO O O OO O O  O OO O O OO O O         O O                        |   
        |                                                                   |   
  10000 +-+                                                                 |   
        |                                                                   |   
   5000 +-+                                                                 |   
        |                                                                   |   
      0 +-+------------O----------------------------------------------------+   
                                                                                
                                                                                                                                                                
                                    fio.workload                                
                                                                                
  1e+09 +-+-----------------------------------------------------------------+   
  9e+08 O-OO O O OO O O  O OO O O OO O                                      |   
        |                              OO O OO O O OO                       |   
  8e+08 +-+                                                                 |   
  7e+08 +-+                                                                 |   
        |                                                                   |   
  6e+08 +-++.+. .+ .+.++.+.++.+.+.++.+.++.+.++.+.+.++.+.++.+.++.+.+. +.+.++.|   
  5e+08 +-+    +  +                                                 +       |   
  4e+08 +-+                                                                 |   
        |                                                                   |   
  3e+08 +-+                                                                 |   
  2e+08 +-+                                                                 |   
        |                                                                   |   
  1e+08 +-+                                                                 |   
      0 +-+------------O----------------------------------------------------+   
                                                                                
                                                                                
[*] bisect-good sample
[O] bisect-bad  sample

***************************************************************************************************
lkp-csl-2sp6: 96 threads Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 256G memory
=========================================================================================
bs/compiler/cpufreq_governor/disk/fs/ioengine/kconfig/mount_option/nr_task/rootfs/runtime/rw/tbox_group/test_size/testcase/time_based/ucode:
  4k/gcc-7/performance/2pmem/ext4/sync/x86_64-rhel-7.6/dax/50%/debian-x86_64-20191114.cgz/200s/randwrite/lkp-csl-2sp6/200G/fio-basic/tb/0x500002c

commit: 
  b1b4705d54 ("ext4: introduce direct I/O read using iomap infrastructure")
  569342dc24 ("ext4: move inode extension/truncate code out from ->iomap_end() callback")

b1b4705d54abedfd 569342dc2485392e95b6a626281 
---------------- --------------------------- 
       fail:runs  %reproduction    fail:runs
           |             |             |    
           :4           25%           1:4     dmesg.WARNING:at#for_ip_interrupt_entry/0x
         %stddev     %change         %stddev
             \          |                \  
      4.08 ± 25%      -3.9        0.15 ± 34%  fio.latency_100us%
      0.81 ± 45%     +15.6       16.45 ± 47%  fio.latency_10us%
     63.62 ±  3%     -27.6       36.07 ± 17%  fio.latency_50us%
    175.73           +50.8%     265.02 ±  2%  fio.time.user_time
 3.401e+08 ±  2%     +54.0%  5.237e+08        fio.workload
      6642 ±  2%     +54.0%      10229        fio.write_bw_MBps
     43392 ±  4%     -36.1%      27712 ±  5%  fio.write_clat_90%_us
     48384 ±  4%     -35.7%      31104 ±  5%  fio.write_clat_95%_us
     59136 ±  3%     -33.3%      39424 ±  5%  fio.write_clat_99%_us
     27760 ±  2%     -35.7%      17839        fio.write_clat_mean_us
     12867 ±  9%     -33.9%       8506 ± 18%  fio.write_clat_stddev
   1700412 ±  2%     +54.0%    2618693        fio.write_iops
      2011            +1.2%       2034        boot-time.idle
      0.94           +48.6%       1.39 ±  2%  iostat.cpu.user
      2972 ±  3%     +21.0%       3597        meminfo.Dirty
     66317 ±  2%     +24.3%      82442 ± 20%  softirqs.CPU48.TIMER
      0.00 ±  9%      -0.0        0.00 ± 24%  mpstat.cpu.all.iowait%
      0.94            +0.5        1.40 ±  2%  mpstat.cpu.all.usr%
     13981 ± 56%     -99.2%     109.75 ± 95%  numa-numastat.node0.other_node
      4771 ±167%    +292.0%      18700        numa-numastat.node1.other_node
    745.50 ±  3%     +21.1%     903.00        proc-vmstat.nr_dirty
    743.75 ±  3%     +21.5%     903.75        proc-vmstat.nr_zone_write_pending
    238649 ± 16%     +28.0%     305583 ±  7%  numa-meminfo.node0.Active
    208040 ± 24%     +37.6%     286262 ±  5%  numa-meminfo.node0.Active(anon)
    229500 ± 26%     -36.3%     146157 ±  7%  numa-meminfo.node1.Active(anon)
  19606483           -18.4%   16003967 ± 20%  turbostat.C1E
     49.52           -12.5       37.06 ± 30%  turbostat.C1E%
    124.19            +3.3%     128.34        turbostat.RAMWatt
   9.6e+09           -25.2%  7.183e+09 ± 30%  cpuidle.C1E.time
  19607054           -18.4%   16005122 ± 20%  cpuidle.C1E.usage
  42226648 ± 44%   +5712.0%  2.454e+09 ± 88%  cpuidle.C6.time
     40856 ± 44%   +8800.0%    3636237 ± 90%  cpuidle.C6.usage
     51953 ± 24%     +37.6%      71503 ±  5%  numa-vmstat.node0.nr_active_anon
     51953 ± 24%     +37.6%      71503 ±  5%  numa-vmstat.node0.nr_zone_active_anon
     14639 ± 52%     -95.4%     678.00 ± 85%  numa-vmstat.node0.numa_other
     57365 ± 26%     -36.3%      36564 ±  7%  numa-vmstat.node1.nr_active_anon
     57365 ± 26%     -36.3%      36564 ±  7%  numa-vmstat.node1.nr_zone_active_anon
    161195 ±  4%      +9.1%     175918        numa-vmstat.node1.numa_other
      1476 ±  3%     -14.7%       1259 ±  4%  slabinfo.ext4_extent_status.active_slabs
    150614 ±  3%     -14.7%     128502 ±  4%  slabinfo.ext4_extent_status.num_objs
      1476 ±  3%     -14.7%       1259 ±  4%  slabinfo.ext4_extent_status.num_slabs
      1096 ±  7%     -14.8%     934.25 ±  7%  slabinfo.khugepaged_mm_slot.active_objs
      1096 ±  7%     -14.8%     934.25 ±  7%  slabinfo.khugepaged_mm_slot.num_objs
    400.00 ± 13%     +28.2%     513.00 ±  8%  slabinfo.skbuff_fclone_cache.active_objs
    400.00 ± 13%     +28.2%     513.00 ±  8%  slabinfo.skbuff_fclone_cache.num_objs
      2785 ±  7%     +14.5%       3188 ±  7%  slabinfo.sock_inode_cache.active_objs
      2785 ±  7%     +14.5%       3188 ±  7%  slabinfo.sock_inode_cache.num_objs
 2.393e+09 ±  2%     +35.2%  3.235e+09        perf-stat.i.branch-instructions
      0.98            -0.0        0.93 ±  2%  perf-stat.i.branch-miss-rate%
  23967630 ±  2%     +28.1%   30704745 ±  3%  perf-stat.i.branch-misses
     10.10 ±  2%     -26.6%       7.41        perf-stat.i.cpi
 4.179e+09 ±  4%     +38.3%  5.779e+09 ±  8%  perf-stat.i.dTLB-loads
 2.663e+09           +36.1%  3.625e+09 ±  7%  perf-stat.i.dTLB-stores
     79.97            +3.0       82.93        perf-stat.i.iTLB-load-miss-rate%
  26249647           +31.2%   34434951 ±  4%  perf-stat.i.iTLB-load-misses
   6552634            +8.0%    7076093 ±  3%  perf-stat.i.iTLB-loads
 1.347e+10 ±  2%     +35.8%   1.83e+10        perf-stat.i.instructions
      0.10 ±  2%     +34.3%       0.14 ±  2%  perf-stat.i.ipc
      1.00            -0.1        0.95        perf-stat.overall.branch-miss-rate%
      9.99 ±  2%     -26.4%       7.35        perf-stat.overall.cpi
      0.00 ± 21%      -0.0        0.00 ± 26%  perf-stat.overall.dTLB-store-miss-rate%
     80.02            +2.9       82.94        perf-stat.overall.iTLB-load-miss-rate%
      0.10 ±  2%     +35.9%       0.14        perf-stat.overall.ipc
      7939           -11.9%       6995        perf-stat.overall.path-length
 2.381e+09 ±  2%     +35.2%  3.218e+09        perf-stat.ps.branch-instructions
  23847364 ±  2%     +28.1%   30544365 ±  3%  perf-stat.ps.branch-misses
 4.157e+09 ±  4%     +38.3%  5.749e+09 ±  8%  perf-stat.ps.dTLB-loads
 2.649e+09           +36.1%  3.606e+09 ±  7%  perf-stat.ps.dTLB-stores
  26117405           +31.2%   34259896 ±  4%  perf-stat.ps.iTLB-load-misses
   6519655            +8.0%    7039502 ±  3%  perf-stat.ps.iTLB-loads
 1.341e+10 ±  2%     +35.8%  1.821e+10        perf-stat.ps.instructions
   2.7e+12 ±  2%     +35.7%  3.664e+12        perf-stat.total.instructions
     57.00 ± 88%     -96.5%       2.00 ±173%  interrupts.55:PCI-MSI.31981588-edge.i40e-eth0-TxRx-19
      1760 ±  4%     +17.3%       2065 ±  4%  interrupts.CPU0.CAL:Function_call_interrupts
      5733           -24.1%       4353 ± 21%  interrupts.CPU0.NMI:Non-maskable_interrupts
      5733           -24.1%       4353 ± 21%  interrupts.CPU0.PMI:Performance_monitoring_interrupts
      1761 ±  5%     +26.4%       2226 ± 10%  interrupts.CPU1.CAL:Function_call_interrupts
     14.75 ± 49%   +2852.5%     435.50 ±111%  interrupts.CPU1.RES:Rescheduling_interrupts
      1734 ±  5%     +17.9%       2044 ± 13%  interrupts.CPU12.CAL:Function_call_interrupts
      1681 ±  6%     +23.1%       2068 ±  8%  interrupts.CPU15.CAL:Function_call_interrupts
      1757 ±  5%     +15.9%       2037 ±  2%  interrupts.CPU2.CAL:Function_call_interrupts
      1829 ±  9%     +19.8%       2191 ±  7%  interrupts.CPU31.CAL:Function_call_interrupts
      1757 ±  4%     +19.6%       2101 ±  9%  interrupts.CPU4.CAL:Function_call_interrupts
      1761 ±  4%     +17.6%       2070 ±  4%  interrupts.CPU5.CAL:Function_call_interrupts
      1753 ±  4%     +18.8%       2082 ±  3%  interrupts.CPU6.CAL:Function_call_interrupts
      6375 ± 25%     -39.4%       3861 ± 30%  interrupts.CPU60.NMI:Non-maskable_interrupts
      6375 ± 25%     -39.4%       3861 ± 30%  interrupts.CPU60.PMI:Performance_monitoring_interrupts
      6.50 ± 93%    +807.7%      59.00 ± 90%  interrupts.CPU65.RES:Rescheduling_interrupts
      6333 ± 25%     -31.0%       4370 ± 46%  interrupts.CPU66.NMI:Non-maskable_interrupts
      6333 ± 25%     -31.0%       4370 ± 46%  interrupts.CPU66.PMI:Performance_monitoring_interrupts
      1754 ±  4%     +16.8%       2050 ±  3%  interrupts.CPU7.CAL:Function_call_interrupts
      6364 ± 25%     -30.9%       4399 ± 46%  interrupts.CPU71.NMI:Non-maskable_interrupts
      6364 ± 25%     -30.9%       4399 ± 46%  interrupts.CPU71.PMI:Performance_monitoring_interrupts
     61.50 ± 77%     -68.3%      19.50 ±108%  interrupts.CPU77.RES:Rescheduling_interrupts
      1750 ±  4%     +17.6%       2058 ±  4%  interrupts.CPU8.CAL:Function_call_interrupts
      6785 ± 15%     -36.5%       4311 ± 23%  interrupts.CPU85.NMI:Non-maskable_interrupts
      6785 ± 15%     -36.5%       4311 ± 23%  interrupts.CPU85.PMI:Performance_monitoring_interrupts
      6262 ± 14%     -27.2%       4557 ± 15%  interrupts.CPU86.NMI:Non-maskable_interrupts
      6262 ± 14%     -27.2%       4557 ± 15%  interrupts.CPU86.PMI:Performance_monitoring_interrupts
      1752 ±  4%     +17.8%       2064 ±  4%  interrupts.CPU9.CAL:Function_call_interrupts
     21.45 ±  8%     -21.4        0.00        perf-profile.calltrace.cycles-pp.ext4_iomap_end.iomap_apply.dax_iomap_rw.ext4_file_write_iter.new_sync_write
     14.50 ±  8%     -14.5        0.00        perf-profile.calltrace.cycles-pp.jbd2__journal_start.ext4_iomap_end.iomap_apply.dax_iomap_rw.ext4_file_write_iter
     14.42 ±  8%     -14.4        0.00        perf-profile.calltrace.cycles-pp.start_this_handle.jbd2__journal_start.ext4_iomap_end.iomap_apply.dax_iomap_rw
     45.47 ±  9%      -8.0       37.44 ± 16%  perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.do_idle.cpu_startup_entry
     45.61 ±  9%      -8.0       37.66 ± 16%  perf-profile.calltrace.cycles-pp.secondary_startup_64
     44.50 ±  9%      -7.6       36.90 ± 16%  perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.do_idle.cpu_startup_entry.start_secondary
     44.50 ±  9%      -7.6       36.91 ± 16%  perf-profile.calltrace.cycles-pp.cpuidle_enter.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64
     44.58 ±  9%      -7.6       37.00 ± 16%  perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64
     44.58 ±  9%      -7.6       37.00 ± 16%  perf-profile.calltrace.cycles-pp.start_secondary.secondary_startup_64
     44.58 ±  9%      -7.6       37.00 ± 16%  perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.secondary_startup_64
      4.87 ±  8%      -4.9        0.00        perf-profile.calltrace.cycles-pp.__ext4_journal_stop.ext4_iomap_end.iomap_apply.dax_iomap_rw.ext4_file_write_iter
      4.74 ±  8%      -4.7        0.00        perf-profile.calltrace.cycles-pp.jbd2_journal_stop.__ext4_journal_stop.ext4_iomap_end.iomap_apply.dax_iomap_rw
      0.77 ±  7%      +0.5        1.31 ±  5%  perf-profile.calltrace.cycles-pp.__srcu_read_unlock.dax_iomap_actor.iomap_apply.dax_iomap_rw.ext4_file_write_iter
      0.80 ± 12%      +0.9        1.70 ± 26%  perf-profile.calltrace.cycles-pp.__copy_user_nocache.__copy_user_flushcache._copy_from_iter_flushcache.dax_iomap_actor.iomap_apply
      0.81 ± 12%      +0.9        1.71 ± 26%  perf-profile.calltrace.cycles-pp.__copy_user_flushcache._copy_from_iter_flushcache.dax_iomap_actor.iomap_apply.dax_iomap_rw
      0.83 ± 12%      +0.9        1.74 ± 25%  perf-profile.calltrace.cycles-pp._copy_from_iter_flushcache.dax_iomap_actor.iomap_apply.dax_iomap_rw.ext4_file_write_iter
      1.66 ±  6%      +1.5        3.18 ± 16%  perf-profile.calltrace.cycles-pp.dax_iomap_actor.iomap_apply.dax_iomap_rw.ext4_file_write_iter.new_sync_write
      2.29 ±  9%      +1.7        3.98 ± 13%  perf-profile.calltrace.cycles-pp._raw_read_lock.jbd2_transaction_committed.ext4_set_iomap.ext4_iomap_begin.iomap_apply
      2.00 ± 11%      +2.0        3.96 ± 12%  perf-profile.calltrace.cycles-pp.ext4_journal_check_start.__ext4_journal_start_sb.ext4_iomap_begin.iomap_apply.dax_iomap_rw
      2.01 ± 11%      +2.0        4.03 ± 12%  perf-profile.calltrace.cycles-pp.__ext4_journal_start_sb.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_write_iter
      2.94 ±  9%      +2.5        5.40 ±  9%  perf-profile.calltrace.cycles-pp._raw_read_lock.start_this_handle.jbd2__journal_start.ext4_iomap_begin.iomap_apply
      3.23 ±  9%      +3.6        6.81 ± 14%  perf-profile.calltrace.cycles-pp.add_transaction_credits.start_this_handle.jbd2__journal_start.ext4_iomap_begin.iomap_apply
      5.56 ±  9%      +4.8       10.40 ± 14%  perf-profile.calltrace.cycles-pp.jbd2_transaction_committed.ext4_set_iomap.ext4_iomap_begin.iomap_apply.dax_iomap_rw
      5.58 ±  9%      +5.0       10.54 ± 13%  perf-profile.calltrace.cycles-pp.ext4_set_iomap.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_write_iter
      7.53 ±  9%      +6.4       13.92 ±  6%  perf-profile.calltrace.cycles-pp.jbd2_journal_stop.__ext4_journal_stop.ext4_iomap_begin.iomap_apply.dax_iomap_rw
      7.63 ±  9%      +6.5       14.15 ±  6%  perf-profile.calltrace.cycles-pp.__ext4_journal_stop.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_write_iter
     54.07 ±  8%      +7.8       61.91 ± 10%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe
     54.10 ±  8%      +7.9       61.97 ± 10%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe
     13.99 ±  8%     +13.3       27.25 ± 10%  perf-profile.calltrace.cycles-pp.start_this_handle.jbd2__journal_start.ext4_iomap_begin.iomap_apply.dax_iomap_rw
     14.08 ±  8%     +13.3       27.40 ± 10%  perf-profile.calltrace.cycles-pp.jbd2__journal_start.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_write_iter
     29.51 ±  8%     +27.1       56.59 ±  9%  perf-profile.calltrace.cycles-pp.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_write_iter.new_sync_write
     21.45 ±  8%     -21.4        0.00        perf-profile.children.cycles-pp.ext4_iomap_end
     45.47 ±  9%      -8.0       37.44 ± 16%  perf-profile.children.cycles-pp.intel_idle
     45.53 ±  9%      -8.0       37.55 ± 16%  perf-profile.children.cycles-pp.cpuidle_enter
     45.53 ±  9%      -8.0       37.55 ± 16%  perf-profile.children.cycles-pp.cpuidle_enter_state
     45.61 ±  9%      -8.0       37.66 ± 16%  perf-profile.children.cycles-pp.do_idle
     45.61 ±  9%      -8.0       37.66 ± 16%  perf-profile.children.cycles-pp.secondary_startup_64
     45.61 ±  9%      -8.0       37.66 ± 16%  perf-profile.children.cycles-pp.cpu_startup_entry
     44.58 ±  9%      -7.6       37.00 ± 16%  perf-profile.children.cycles-pp.start_secondary
      0.07 ±  5%      +0.0        0.09 ± 14%  perf-profile.children.cycles-pp.update_process_times
      0.07 ±  5%      +0.0        0.10 ± 15%  perf-profile.children.cycles-pp.tick_sched_handle
      0.09 ±  4%      +0.0        0.12 ± 15%  perf-profile.children.cycles-pp.tick_sched_timer
      0.06 ± 11%      +0.0        0.10 ±  8%  perf-profile.children.cycles-pp.selinux_file_permission
      0.05 ±  8%      +0.0        0.09 ± 11%  perf-profile.children.cycles-pp.__fdget_pos
      0.07 ± 12%      +0.0        0.11 ± 11%  perf-profile.children.cycles-pp.security_file_permission
      0.04 ± 58%      +0.1        0.10 ± 18%  perf-profile.children.cycles-pp.ext4_llseek
      0.04 ± 58%      +0.1        0.10 ±  9%  perf-profile.children.cycles-pp.entry_SYSCALL_64
      0.09 ±  8%      +0.1        0.15 ±  8%  perf-profile.children.cycles-pp.syscall_return_via_sysret
      0.01 ±173%      +0.1        0.07 ± 10%  perf-profile.children.cycles-pp.__fget_light
      0.06 ± 58%      +0.1        0.12 ± 36%  perf-profile.children.cycles-pp.ktime_get
      0.00            +0.1        0.08 ± 19%  perf-profile.children.cycles-pp.__sb_start_write
      0.10 ±  4%      +0.1        0.18 ± 11%  perf-profile.children.cycles-pp.ksys_lseek
      0.24 ± 13%      +0.1        0.34 ± 15%  perf-profile.children.cycles-pp.hrtimer_interrupt
      0.06 ±  6%      +0.1        0.16 ± 45%  perf-profile.children.cycles-pp.ext4_es_lookup_extent
      0.28 ± 11%      +0.1        0.39 ± 23%  perf-profile.children.cycles-pp.smp_apic_timer_interrupt
      0.31 ±  9%      +0.1        0.44 ± 21%  perf-profile.children.cycles-pp.apic_timer_interrupt
      0.11 ± 17%      +0.2        0.29 ± 52%  perf-profile.children.cycles-pp.ext4_map_blocks
      0.77 ±  7%      +0.5        1.31 ±  5%  perf-profile.children.cycles-pp.__srcu_read_unlock
      0.80 ± 12%      +0.9        1.71 ± 26%  perf-profile.children.cycles-pp.__copy_user_nocache
      0.81 ± 12%      +0.9        1.71 ± 26%  perf-profile.children.cycles-pp.__copy_user_flushcache
      0.83 ± 12%      +0.9        1.74 ± 25%  perf-profile.children.cycles-pp._copy_from_iter_flushcache
      1.66 ±  6%      +1.5        3.19 ± 16%  perf-profile.children.cycles-pp.dax_iomap_actor
     12.68 ±  8%      +1.6       14.32 ±  6%  perf-profile.children.cycles-pp.__ext4_journal_stop
     12.46 ±  8%      +1.6       14.10 ±  6%  perf-profile.children.cycles-pp.jbd2_journal_stop
      5.56 ±  9%      +4.9       10.41 ± 14%  perf-profile.children.cycles-pp.jbd2_transaction_committed
      5.58 ±  9%      +5.0       10.54 ± 13%  perf-profile.children.cycles-pp.ext4_set_iomap
     54.18 ±  8%      +7.8       62.02 ± 10%  perf-profile.children.cycles-pp.do_syscall_64
     54.21 ±  8%      +7.9       62.07 ± 10%  perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     29.51 ±  8%     +27.1       56.60 ±  9%  perf-profile.children.cycles-pp.ext4_iomap_begin
     45.47 ±  9%      -8.0       37.44 ± 16%  perf-profile.self.cycles-pp.intel_idle
      0.01 ±173%      +0.1        0.06 ± 17%  perf-profile.self.cycles-pp.___might_sleep
      0.00            +0.1        0.05 ±  8%  perf-profile.self.cycles-pp.do_syscall_64
      0.04 ± 58%      +0.1        0.10 ±  9%  perf-profile.self.cycles-pp.entry_SYSCALL_64
      0.09 ±  8%      +0.1        0.15 ±  8%  perf-profile.self.cycles-pp.syscall_return_via_sysret
      0.00            +0.1        0.06 ± 14%  perf-profile.self.cycles-pp.selinux_file_permission
      0.01 ±173%      +0.1        0.07 ± 10%  perf-profile.self.cycles-pp.__fget_light
      0.05 ± 58%      +0.1        0.11 ± 33%  perf-profile.self.cycles-pp.ktime_get
      0.00            +0.1        0.06 ± 20%  perf-profile.self.cycles-pp.ext4_es_lookup_extent
      0.03 ±100%      +0.1        0.09 ± 16%  perf-profile.self.cycles-pp.iomap_apply
      0.03 ±100%      +0.1        0.10 ± 18%  perf-profile.self.cycles-pp.ext4_llseek
      0.00            +0.1        0.07 ± 15%  perf-profile.self.cycles-pp.ext4_iomap_begin
      0.02 ±173%      +0.1        0.14 ± 56%  perf-profile.self.cycles-pp.ext4_file_write_iter
      0.76 ±  7%      +0.5        1.30 ±  6%  perf-profile.self.cycles-pp.__srcu_read_unlock
      0.80 ± 12%      +0.9        1.69 ± 26%  perf-profile.self.cycles-pp.__copy_user_nocache
      3.25 ±  9%      +3.1        6.40 ± 14%  perf-profile.self.cycles-pp.jbd2_transaction_committed
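
For readers who want to re-check the comparison rows above, here is a minimal Python sketch. It is not part of the LKP tooling. The row layout (base mean, optional "± N%" stddev, change, new mean, optional "± N%" stddev, metric name) is inferred from the table itself, and the names ROW, parse_row and recompute_change are hypothetical. It assumes that a change printed with a trailing "%" is relative to the base commit, and that a change printed without one (the latency bucket and perf-profile rows) is a plain difference between the two columns.

```python
import re

# One comparison row: base mean, optional "± N%", change, new mean,
# optional "± N%", metric name. Layout inferred from the report, not from LKP.
ROW = re.compile(
    r"^\s*(?P<base>[\d.e+]+)(?:\s*±\s*\d+%)?"
    r"\s+(?P<change>[+-][\d.]+%?)"
    r"\s+(?P<new>[\d.e+]+)(?:\s*±\s*\d+%)?"
    r"\s+(?P<metric>\S+)\s*$"
)


def parse_row(line):
    """Return the fields of one comparison row, or None if the line is not a row."""
    m = ROW.match(line)
    if m is None:
        return None
    return {
        "metric": m["metric"],
        "base": float(m["base"]),
        "new": float(m["new"]),
        "change": m["change"],
    }


def recompute_change(row):
    """Recompute the change column from the two printed means."""
    if row["change"].endswith("%"):
        # Relative change against the base commit.
        rel = (row["new"] - row["base"]) / row["base"] * 100.0
        return f"{rel:+.1f}%"
    # Plain difference, used for metrics that are already percentages.
    return f"{row['new'] - row['base']:+.1f}"


if __name__ == "__main__":
    bw = parse_row("       6642 ±  2%     +54.0%      10229        fio.write_bw_MBps")
    lat = parse_row("      4.08 ± 25%      -3.9        0.15 ± 34%  fio.latency_100us%")
    print(bw["metric"], recompute_change(bw))    # fio.write_bw_MBps +54.0%
    print(lat["metric"], recompute_change(lat))  # fio.latency_100us% -3.9
```

Because the report prints rounded means, a recomputed value can differ from the printed change in the last digit on some rows.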



***************************************************************************************************
lkp-csl-2sp6: 96 threads Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 256G memory
=========================================================================================
bs/compiler/cpufreq_governor/disk/fs/ioengine/kconfig/mount_option/nr_task/rootfs/runtime/rw/tbox_group/test_size/testcase/time_based/ucode:
  4k/gcc-7/performance/2pmem/ext4/libaio/x86_64-rhel-7.6/dax/50%/debian-x86_64-20191114.cgz/200s/randwrite/lkp-csl-2sp6/200G/fio-basic/tb/0x500002c

commit: 
  b1b4705d54 ("ext4: introduce direct I/O read using iomap infrastructure")
  569342dc24 ("ext4: move inode extension/truncate code out from ->iomap_end() callback")

b1b4705d54abedfd 569342dc2485392e95b6a626281 
---------------- --------------------------- 
       fail:runs  %reproduction    fail:runs
           |             |             |    
          1:4          -25%            :4     dmesg.WARNING:at_ip__x64_sys_io_submit/0x
           :4           25%           1:4     dmesg.WARNING:at_ip_aio_write/0x
         %stddev     %change         %stddev
             \          |                \  
     22.50 ± 85%     -16.5        6.00 ±150%  fio.latency_1000us%
      0.01            +3.3        3.29 ±158%  fio.latency_250us%
     43.58 ± 27%     -43.5        0.04 ±104%  fio.latency_2ms%
     22.38 ± 16%     +40.0       62.34 ± 24%  fio.latency_750us%
      9422            -1.2%       9304        fio.time.system_time
    185.76 ±  3%     +63.1%     303.05 ±  2%  fio.time.user_time
 3.191e+08           +68.8%  5.384e+08        fio.workload
      6231           +68.8%      10516        fio.write_bw_MBps
   1327104 ±  7%     -46.3%     712704 ± 10%  fio.write_clat_90%_us
   1380352 ±  6%     -46.6%     737280 ± 10%  fio.write_clat_95%_us
   1503232 ±  7%     -47.4%     790528 ± 10%  fio.write_clat_99%_us
    933218           -40.7%     553062        fio.write_clat_mean_us
    310780 ± 29%     -57.8%     131049 ± 33%  fio.write_clat_stddev
   1595317           +68.8%    2692113        fio.write_iops
     29413           -41.6%      17164        fio.write_slat_mean_us
     14463 ± 30%     -52.9%       6814 ± 13%  fio.write_slat_stddev
   7159296 ± 12%     -11.5%    6334976        meminfo.DirectMap2M
     21181            -1.0%      20962        proc-vmstat.nr_slab_reclaimable
     48.55            -1.4%      47.88        iostat.cpu.system
      0.99 ±  3%     +60.9%       1.59 ±  2%  iostat.cpu.user
      0.00 ± 17%      -0.0        0.00 ± 24%  mpstat.cpu.all.iowait%
      0.99 ±  3%      +0.6        1.60 ±  2%  mpstat.cpu.all.usr%
    383524 ± 37%     -42.5%     220431 ± 57%  numa-numastat.node1.local_node
    392957 ± 35%     -40.3%     234425 ± 50%  numa-numastat.node1.numa_hit
    248.01            +2.5%     254.11        turbostat.PkgWatt
    124.91            +3.3%     129.03        turbostat.RAMWatt
     40962 ± 10%     -16.3%      34274 ±  3%  numa-vmstat.node0.nr_anon_pages
     26868 ± 16%     +25.4%      33684 ±  3%  numa-vmstat.node1.nr_anon_pages
      8600 ± 12%     +17.7%      10124 ± 10%  numa-vmstat.node1.nr_slab_reclaimable
    163884 ± 10%     -16.3%     137230 ±  3%  numa-meminfo.node0.AnonPages
    138488 ± 14%     +22.8%     170029        numa-meminfo.node1.Active
     67242 ± 22%     +25.5%      84358 ±  6%  numa-meminfo.node1.AnonHugePages
    107464 ± 16%     +25.2%     134574 ±  3%  numa-meminfo.node1.AnonPages
     34404 ± 12%     +17.7%      40494 ± 10%  numa-meminfo.node1.KReclaimable
     34404 ± 12%     +17.7%      40494 ± 10%  numa-meminfo.node1.SReclaimable
     70842 ±  6%      -7.7%      65369 ±  3%  softirqs.CPU3.TIMER
      8500 ±105%    +128.3%      19405 ± 45%  softirqs.CPU35.SCHED
      8786 ±103%    +123.1%      19604 ± 46%  softirqs.CPU40.SCHED
      8981 ± 98%    +116.0%      19402 ± 44%  softirqs.CPU42.SCHED
     68254 ±  5%      +6.0%      72342 ±  4%  softirqs.CPU44.TIMER
     65323 ±  3%     +10.5%      72152 ±  2%  softirqs.CPU58.TIMER
     19561 ± 49%     -71.6%       5557 ± 91%  softirqs.CPU82.SCHED
      1744 ±  6%     +23.5%       2154 ±  3%  slabinfo.UNIX.active_objs
      1744 ±  6%     +23.5%       2154 ±  3%  slabinfo.UNIX.num_objs
    111978 ±  4%     -17.4%      92440 ±  4%  slabinfo.ext4_extent_status.active_objs
      1416 ±  4%     -22.0%       1104 ±  4%  slabinfo.ext4_extent_status.active_slabs
    144495 ±  4%     -22.0%     112722 ±  4%  slabinfo.ext4_extent_status.num_objs
      1416 ±  4%     -22.0%       1104 ±  4%  slabinfo.ext4_extent_status.num_slabs
      2737 ±  4%     +20.5%       3298 ±  4%  slabinfo.sock_inode_cache.active_objs
      2737 ±  4%     +20.5%       3298 ±  4%  slabinfo.sock_inode_cache.num_objs
    535573 ±  2%      +8.1%     579164 ±  3%  sched_debug.cfs_rq:/.load.avg
    516.79 ±  2%      +7.8%     557.21 ±  3%  sched_debug.cfs_rq:/.runnable_load_avg.avg
    535510 ±  2%      +8.1%     579108 ±  3%  sched_debug.cfs_rq:/.runnable_weight.avg
      0.01 ± 24%     -50.0%       0.00 ± 24%  sched_debug.cpu.nr_uninterruptible.avg
     14089 ±  4%     -21.0%      11124 ± 12%  sched_debug.cpu.sched_count.max
      2552 ±  9%     -13.5%       2208 ± 12%  sched_debug.cpu.sched_count.stddev
      6997 ±  5%     -21.1%       5521 ± 12%  sched_debug.cpu.sched_goidle.max
      1261 ±  8%     -13.7%       1089 ± 11%  sched_debug.cpu.sched_goidle.stddev
      5749 ±  5%     -23.7%       4386 ± 13%  sched_debug.cpu.ttwu_local.max
     23.50 ± 66%    +293.6%      92.50 ± 39%  interrupts.CPU12.TLB:TLB_shootdowns
      4.75 ± 63%   +3384.2%     165.50 ±162%  interrupts.CPU13.RES:Rescheduling_interrupts
      2382 ± 14%     -24.1%       1809 ± 10%  interrupts.CPU32.CAL:Function_call_interrupts
      2351 ± 11%     -19.1%       1901 ±  9%  interrupts.CPU36.CAL:Function_call_interrupts
      2259 ± 11%     -16.2%       1894 ±  8%  interrupts.CPU38.CAL:Function_call_interrupts
      2428 ± 12%     -17.6%       2001 ± 12%  interrupts.CPU60.CAL:Function_call_interrupts
      5774 ± 25%     -48.5%       2973 ± 56%  interrupts.CPU7.NMI:Non-maskable_interrupts
      5774 ± 25%     -48.5%       2973 ± 56%  interrupts.CPU7.PMI:Performance_monitoring_interrupts
     80.50 ± 38%     +60.2%     129.00 ± 20%  interrupts.CPU73.TLB:TLB_shootdowns
     54.50 ± 84%     -83.9%       8.75 ±100%  interrupts.CPU83.RES:Rescheduling_interrupts
     91.75 ±100%     -86.4%      12.50 ±122%  interrupts.CPU89.RES:Rescheduling_interrupts
     18.16 ± 13%     -36.0%      11.62 ±  5%  perf-stat.i.MPKI
 2.855e+09           +51.8%  4.333e+09        perf-stat.i.branch-instructions
      0.80            -0.1        0.73        perf-stat.i.branch-miss-rate%
  23116045           +37.2%   31705911        perf-stat.i.branch-misses
      8.56           -34.6%       5.59        perf-stat.i.cpi
 4.579e+09           +66.3%  7.613e+09 ± 10%  perf-stat.i.dTLB-loads
 3.192e+09 ±  2%     +45.8%  4.654e+09 ±  8%  perf-stat.i.dTLB-stores
     81.32            +2.0       83.28        perf-stat.i.iTLB-load-miss-rate%
  28448462 ±  4%     +18.0%   33581121 ±  3%  perf-stat.i.iTLB-load-misses
 1.588e+10           +52.2%  2.417e+10        perf-stat.i.instructions
    573.58 ±  3%     +27.9%     733.82 ±  4%  perf-stat.i.instructions-per-iTLB-miss
      0.12           +50.6%       0.18        perf-stat.i.ipc
  14520205           -10.4%   13007330 ± 13%  perf-stat.i.node-store-misses
     17811 ±  3%     +21.2%      21596 ±  3%  perf-stat.i.node-stores
     18.03 ± 13%     -35.8%      11.57 ±  5%  perf-stat.overall.MPKI
      0.81            -0.1        0.73        perf-stat.overall.branch-miss-rate%
      8.48           -34.4%       5.56        perf-stat.overall.cpi
     81.36            +2.0       83.34        perf-stat.overall.iTLB-load-miss-rate%
    559.06 ±  3%     +29.0%     721.05 ±  4%  perf-stat.overall.instructions-per-iTLB-miss
      0.12           +52.4%       0.18        perf-stat.overall.ipc
      9977            -9.8%       8996        perf-stat.overall.path-length
  2.84e+09           +51.8%  4.311e+09        perf-stat.ps.branch-instructions
  23003415           +37.1%   31544372        perf-stat.ps.branch-misses
 4.556e+09           +66.2%  7.574e+09 ± 10%  perf-stat.ps.dTLB-loads
 3.176e+09 ±  2%     +45.8%   4.63e+09 ±  8%  perf-stat.ps.dTLB-stores
  28304001 ±  4%     +18.0%   33410816 ±  3%  perf-stat.ps.iTLB-load-misses
  1.58e+10           +52.2%  2.405e+10        perf-stat.ps.instructions
  14446128           -10.4%   12941633 ± 13%  perf-stat.ps.node-store-misses
     17748 ±  3%     +21.0%      21481 ±  3%  perf-stat.ps.node-stores
 3.184e+12           +52.1%  4.844e+12        perf-stat.total.instructions
     24.52 ±  9%     -24.5        0.00        perf-profile.calltrace.cycles-pp.ext4_iomap_end.iomap_apply.dax_iomap_rw.ext4_file_write_iter.aio_write
     17.04 ± 10%     -17.0        0.00        perf-profile.calltrace.cycles-pp.jbd2__journal_start.ext4_iomap_end.iomap_apply.dax_iomap_rw.ext4_file_write_iter
     16.95 ± 10%     -16.9        0.00        perf-profile.calltrace.cycles-pp.start_this_handle.jbd2__journal_start.ext4_iomap_end.iomap_apply.dax_iomap_rw
      5.24 ±  7%      -5.2        0.00        perf-profile.calltrace.cycles-pp.__ext4_journal_stop.ext4_iomap_end.iomap_apply.dax_iomap_rw.ext4_file_write_iter
      5.09 ±  7%      -5.1        0.00        perf-profile.calltrace.cycles-pp.jbd2_journal_stop.__ext4_journal_stop.ext4_iomap_end.iomap_apply.dax_iomap_rw
      0.00            +0.6        0.58 ±  8%  perf-profile.calltrace.cycles-pp.do_io_getevents.__x64_sys_io_getevents.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.00            +0.6        0.62 ±  8%  perf-profile.calltrace.cycles-pp.__x64_sys_io_getevents.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.97 ±  5%      +0.6        1.60 ± 22%  perf-profile.calltrace.cycles-pp.__srcu_read_unlock.dax_iomap_actor.iomap_apply.dax_iomap_rw.ext4_file_write_iter
      1.13 ± 12%      +1.0        2.08 ± 12%  perf-profile.calltrace.cycles-pp.__copy_user_nocache.__copy_user_flushcache._copy_from_iter_flushcache.dax_iomap_actor.iomap_apply
      1.14 ± 12%      +1.0        2.09 ± 12%  perf-profile.calltrace.cycles-pp.__copy_user_flushcache._copy_from_iter_flushcache.dax_iomap_actor.iomap_apply.dax_iomap_rw
      1.16 ± 12%      +1.0        2.12 ± 12%  perf-profile.calltrace.cycles-pp._copy_from_iter_flushcache.dax_iomap_actor.iomap_apply.dax_iomap_rw.ext4_file_write_iter
      2.20 ±  8%      +1.7        3.86 ±  7%  perf-profile.calltrace.cycles-pp.dax_iomap_actor.iomap_apply.dax_iomap_rw.ext4_file_write_iter.aio_write
      2.18 ± 12%      +1.7        3.90 ±  7%  perf-profile.calltrace.cycles-pp._raw_read_lock.jbd2_transaction_committed.ext4_set_iomap.ext4_iomap_begin.iomap_apply
      2.17 ± 12%      +1.7        3.89 ±  7%  perf-profile.calltrace.cycles-pp.ext4_journal_check_start.__ext4_journal_start_sb.ext4_iomap_begin.iomap_apply.dax_iomap_rw
      2.19 ± 12%      +1.7        3.93 ±  7%  perf-profile.calltrace.cycles-pp.__ext4_journal_start_sb.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_write_iter
      3.46 ± 10%      +1.9        5.32 ± 11%  perf-profile.calltrace.cycles-pp._raw_read_lock.start_this_handle.jbd2__journal_start.ext4_iomap_begin.iomap_apply
      4.08 ± 10%      +2.6        6.69 ±  7%  perf-profile.calltrace.cycles-pp.add_transaction_credits.start_this_handle.jbd2__journal_start.ext4_iomap_begin.iomap_apply
      9.21 ±  9%      +4.5       13.74 ± 12%  perf-profile.calltrace.cycles-pp.jbd2_journal_stop.__ext4_journal_stop.ext4_iomap_begin.iomap_apply.dax_iomap_rw
      5.56 ± 12%      +4.6       10.18 ±  7%  perf-profile.calltrace.cycles-pp.jbd2_transaction_committed.ext4_set_iomap.ext4_iomap_begin.iomap_apply.dax_iomap_rw
      5.59 ± 12%      +4.7       10.25 ±  8%  perf-profile.calltrace.cycles-pp.ext4_set_iomap.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_write_iter
      9.32 ±  9%      +4.7       14.00 ± 12%  perf-profile.calltrace.cycles-pp.__ext4_journal_stop.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_write_iter
     16.80 ± 10%     +10.3       27.15 ± 10%  perf-profile.calltrace.cycles-pp.start_this_handle.jbd2__journal_start.ext4_iomap_begin.iomap_apply.dax_iomap_rw
     16.89 ± 10%     +10.4       27.30 ± 10%  perf-profile.calltrace.cycles-pp.jbd2__journal_start.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_write_iter
     34.20 ± 10%     +21.7       55.87 ± 10%  perf-profile.calltrace.cycles-pp.ext4_iomap_begin.iomap_apply.dax_iomap_rw.ext4_file_write_iter.aio_write
     24.52 ±  9%     -24.5        0.00        perf-profile.children.cycles-pp.ext4_iomap_end
     34.47 ± 10%      -6.7       27.80 ± 10%  perf-profile.children.cycles-pp.jbd2__journal_start
     34.30 ± 10%      -6.7       27.65 ± 10%  perf-profile.children.cycles-pp.start_this_handle
      8.29 ± 11%      -1.5        6.81 ±  7%  perf-profile.children.cycles-pp.add_transaction_credits
      0.05 ±  8%      +0.0        0.07 ± 12%  perf-profile.children.cycles-pp._cond_resched
      0.07 ±  7%      +0.0        0.10 ± 13%  perf-profile.children.cycles-pp.selinux_file_permission
      0.08 ± 10%      +0.0        0.12 ±  7%  perf-profile.children.cycles-pp.__might_sleep
      0.07 ± 14%      +0.0        0.11 ± 11%  perf-profile.children.cycles-pp.security_file_permission
      0.03 ±100%      +0.0        0.07 ± 12%  perf-profile.children.cycles-pp.__fget
      0.06 ± 14%      +0.0        0.10 ±  7%  perf-profile.children.cycles-pp._copy_to_user
      0.07 ±  7%      +0.0        0.11 ±  6%  perf-profile.children.cycles-pp.copy_user_enhanced_fast_string
      0.06 ± 13%      +0.0        0.11 ±  7%  perf-profile.children.cycles-pp.__get_user_4
      0.04 ± 57%      +0.0        0.09 ±  8%  perf-profile.children.cycles-pp.aio_prep_rw
      0.07 ±  7%      +0.0        0.11 ±  9%  perf-profile.children.cycles-pp.entry_SYSCALL_64
      0.03 ±100%      +0.1        0.08 ± 14%  perf-profile.children.cycles-pp.__sb_start_write
      0.00            +0.1        0.05 ±  9%  perf-profile.children.cycles-pp.put_reqs_available
      0.00            +0.1        0.05 ±  9%  perf-profile.children.cycles-pp._raw_spin_unlock_irqrestore
      0.04 ± 58%      +0.1        0.10 ± 18%  perf-profile.children.cycles-pp.ext4_meta_trans_blocks
      0.01 ±173%      +0.1        0.07 ± 12%  perf-profile.children.cycles-pp.__check_object_size
      0.07 ± 11%      +0.1        0.13 ±  9%  perf-profile.children.cycles-pp._copy_from_user
      0.09 ± 13%      +0.1        0.14 ± 13%  perf-profile.children.cycles-pp.syscall_return_via_sysret
      0.07 ± 17%      +0.1        0.12 ± 13%  perf-profile.children.cycles-pp.ext4_es_lookup_extent
      0.12 ± 11%      +0.1        0.18 ±  7%  perf-profile.children.cycles-pp.___might_sleep
      0.00            +0.1        0.06 ± 13%  perf-profile.children.cycles-pp.mutex_lock
      0.10 ± 11%      +0.1        0.17 ±  9%  perf-profile.children.cycles-pp.__might_fault
      0.11 ± 17%      +0.1        0.21 ± 17%  perf-profile.children.cycles-pp.ext4_map_blocks
      0.18 ± 13%      +0.1        0.30 ±  9%  perf-profile.children.cycles-pp.lookup_ioctx
      0.22 ±  9%      +0.2        0.38 ±  8%  perf-profile.children.cycles-pp.aio_read_events
      0.25 ±  9%      +0.2        0.42 ±  8%  perf-profile.children.cycles-pp.read_events
      0.35 ±  9%      +0.2        0.58 ±  8%  perf-profile.children.cycles-pp.do_io_getevents
      0.37 ± 10%      +0.3        0.63 ±  8%  perf-profile.children.cycles-pp.__x64_sys_io_getevents
      0.97 ±  5%      +0.6        1.60 ± 22%  perf-profile.children.cycles-pp.__srcu_read_unlock
      1.14 ± 12%      +1.0        2.09 ± 12%  perf-profile.children.cycles-pp.__copy_user_flushcache
      1.13 ± 12%      +1.0        2.08 ± 12%  perf-profile.children.cycles-pp.__copy_user_nocache
      1.16 ± 12%      +1.0        2.12 ± 12%  perf-profile.children.cycles-pp._copy_from_iter_flushcache
      2.20 ±  8%      +1.7        3.87 ±  7%  perf-profile.children.cycles-pp.dax_iomap_actor
      5.56 ± 12%      +4.6       10.19 ±  7%  perf-profile.children.cycles-pp.jbd2_transaction_committed
      5.59 ± 12%      +4.7       10.26 ±  8%  perf-profile.children.cycles-pp.ext4_set_iomap
     34.20 ± 10%     +21.7       55.87 ± 10%  perf-profile.children.cycles-pp.ext4_iomap_begin
      8.26 ± 10%      -1.5        6.79 ±  7%  perf-profile.self.cycles-pp.add_transaction_credits
      0.08 ±  6%      +0.0        0.11 ±  4%  perf-profile.self.cycles-pp.__might_sleep
      0.05            +0.0        0.09 ±  9%  perf-profile.self.cycles-pp.iomap_apply
      0.06 ± 13%      +0.0        0.10 ± 10%  perf-profile.self.cycles-pp.__get_user_4
      0.03 ±100%      +0.0        0.07 ± 12%  perf-profile.self.cycles-pp.__fget
      0.07 ±  7%      +0.0        0.11 ±  6%  perf-profile.self.cycles-pp.copy_user_enhanced_fast_string
      0.03 ±100%      +0.0        0.07 ± 14%  perf-profile.self.cycles-pp.__x64_sys_io_submit
      0.05 ±  9%      +0.0        0.10 ±  7%  perf-profile.self.cycles-pp.aio_read_events
      0.08 ± 13%      +0.0        0.13 ± 12%  perf-profile.self.cycles-pp.lookup_ioctx
      0.04 ± 57%      +0.0        0.09 ±  8%  perf-profile.self.cycles-pp.aio_prep_rw
      0.07 ±  7%      +0.0        0.11 ±  9%  perf-profile.self.cycles-pp.entry_SYSCALL_64
      0.00            +0.1        0.05 ±  9%  perf-profile.self.cycles-pp.aio_write
      0.00            +0.1        0.05 ±  9%  perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.00            +0.1        0.05 ±  9%  perf-profile.self.cycles-pp.put_reqs_available
      0.09 ± 13%      +0.1        0.14 ± 13%  perf-profile.self.cycles-pp.syscall_return_via_sysret
      0.12 ± 11%      +0.1        0.18 ±  8%  perf-profile.self.cycles-pp.___might_sleep
      0.01 ±173%      +0.1        0.07 ± 14%  perf-profile.self.cycles-pp.ext4_iomap_begin
      0.11 ± 10%      +0.1        0.18 ±  8%  perf-profile.self.cycles-pp.io_submit_one
      0.03 ±102%      +0.1        0.11 ± 17%  perf-profile.self.cycles-pp.ext4_file_write_iter
      0.97 ±  5%      +0.6        1.59 ± 22%  perf-profile.self.cycles-pp.__srcu_read_unlock
      1.13 ± 12%      +0.9        2.07 ± 12%  perf-profile.self.cycles-pp.__copy_user_nocache
      3.37 ± 12%      +2.9        6.25 ±  8%  perf-profile.self.cycles-pp.jbd2_transaction_committed
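
A note on reading the %change column in the tables above: rows whose metric is itself a percentage (the perf-profile cycles-pp rows, fio.latency_*%) appear to report the absolute difference in percentage points, while the other rows report a relative change with a trailing "%". This is inferred only from the rows shown here, not from the LKP source, and the helper names below are hypothetical. A minimal sketch that reproduces a few of the rows:

```python
def relative_change_pct(base, new):
    """Relative change in percent, as in rows such as fio.write_bw_MBps."""
    return (new - base) / base * 100.0


def point_change(base, new):
    """Absolute difference, as in the perf-profile cycles-pp rows."""
    return new - base


# Values copied from the randwrite table above.
bw_change = round(relative_change_pct(6231, 10516), 1)        # fio.write_bw_MBps
clat_change = round(relative_change_pct(933218, 553062), 1)   # fio.write_clat_mean_us
begin_delta = round(point_change(34.20, 55.87), 1)            # children.ext4_iomap_begin
end_delta = round(point_change(24.52, 0.00), 1)               # children.ext4_iomap_end

print(bw_change, clat_change, begin_delta, end_delta)
```

Rows computed from rounded means can differ from the report in the last digit (for example perf-stat.i.cpi), since the report presumably works from unrounded per-run values.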



***************************************************************************************************
lkp-hsw-d01: 8 threads Intel(R) Core(TM) i7-4770 CPU @ 3.40GHz with 8G memory
=========================================================================================
compiler/cpufreq_governor/disk/fs/kconfig/rootfs/tbox_group/test/testcase/ucode:
  gcc-7/performance/1HDD/ext4/x86_64-rhel-7.6/debian-x86_64-2019-11-14.cgz/lkp-hsw-d01/fivestreamreaddirect.f/filebench/0x27

commit: 
  b1b4705d54 ("ext4: introduce direct I/O read using iomap infrastructure")
  569342dc24 ("ext4: move inode extension/truncate code out from ->iomap_end() callback")

b1b4705d54abedfd 569342dc2485392e95b6a626281 
---------------- --------------------------- 
       fail:runs  %reproduction    fail:runs
           |             |             |    
          1:4          -25%            :2     dmesg.WARNING:at#for_ip_swapgs_restore_regs_and_return_to_usermode/0x
          1:4          -25%            :2     dmesg.WARNING:stack_recursion
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:da
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:db
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:dc
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:dd
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:de
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:df
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:ea
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:eb
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:ec
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:ed
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:ee
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:ef
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:fa
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:fb
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:fc
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:fd
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:fe
          1:4          -25%            :2     kmsg.addr:#vm_flags:#anon_vma:#mapping:#index:ff
          1:4          -25%            :2     kmsg.file:libnss_files-#so_fault:filemap_fault_mmap:generic_file_mmap_readpage:simple_readpage
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#a07fffffa0ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#a17fffffa1ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#a27fffffa2ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#a37fffffa3ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#a47fffffa4ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#a57fffffa5ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#a67fffffa6ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#a77fffffa7ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#a7fffff8aff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#a7fffff9aff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#a87fffffa8ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#a97fffffa9ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#aa7fffffaaff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#ab7fffffabff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#ac7fffffacff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#ad7fffffadff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#ae7fffffaeff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#af7fffffafff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#b07fffffb0ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#b17fffffb1ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#b27fffffb2ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#b37fffffb3ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#b47fffffb4ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#b57fffffb5ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#b67fffffb6ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#b77fffffb7ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#b7fffff8bff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#b7fffff9bff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#b87fffffb8ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#b97fffffb9ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#ba7fffffbaff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#bb7fffffbbff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#bc7fffffbcff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#bd7fffffbdff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#be7fffffbeff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#bf7fffffbfff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#c07fffffc0ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#c17fffffc1ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#c27fffffc2ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#c37fffffc3ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#c47fffffc4ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#c57fffffc5ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#c67fffffc6ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#c77fffffc7ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#c7fffff8cff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#c7fffff9cff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#c87fffffc8ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#c97fffffc9ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#ca7fffffcaff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#cb7fffffcbff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#cc7fffffccff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#cd7fffffcdff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#ce7fffffceff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#cf7fffffcfff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#d07fffffd0ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#d17fffffd1ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#d27fffffd2ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#d37fffffd3ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#d47fffffd4ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#d57fffffd5ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#d67fffffd6ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#d77fffffd7ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#d7fffff8dff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#d7fffff9dff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#d87fffffd8ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#d97fffffd9ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#da7fffffdaff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#db7fffffdbff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#dc7fffffdcff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#dd7fffffddff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#de7fffffdeff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#df7fffffdfff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#e07fffffe0ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#e17fffffe1ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#e27fffffe2ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#e37fffffe3ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#e47fffffe4ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#e57fffffe5ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#e67fffffe6ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#e77fffffe7ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#e7fffff8eff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#e7fffff9eff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#e87fffffe8ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#e97fffffe9ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#ea7fffffeaff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#eb7fffffebff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#ec7fffffecff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#ed7fffffedff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#ee7fffffeeff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#ef7fffffefff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#f07ffffff0ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#f17ffffff1ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#f27ffffff2ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#f37ffffff3ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#f47ffffff4ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#f57ffffff5ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#f67ffffff6ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#f77ffffff7ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#f7fffff8fff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#f7fffff9fff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#f87ffffff8ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#f97ffffff9ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fa7ffffffaff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fb7ffffffbff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fc7ffffffcff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fd7ffffffdff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fe7ffffffeff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#ff7fffffffff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff80ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff81ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff82ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff83ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff84ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff85ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff86ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff87ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff88ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff89ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff90ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff91ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff92ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff93ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff94ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff95ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff96ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff97ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff98ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffff99ff
          1:4          -25%            :2     kmsg.swap_info_get:Bad_swap_file_entry#fffffffffff
         %stddev     %change         %stddev
             \          |                \  
    219770 ± 63%     +70.4%     374538 ±  3%  cpuidle.C7s.usage
    219674 ± 63%     +70.5%     374616 ±  3%  turbostat.C7s
     49.50 ±  3%      +9.1%      54.00        vmstat.cpu.id
     47.75 ±  2%      -7.9%      44.00 ±  2%  vmstat.cpu.wa
     50.18 ±  3%      +8.5%      54.43        iostat.cpu.idle
     48.35 ±  2%      -7.1%      44.91 ±  2%  iostat.cpu.iowait
      0.87 ±137%     -79.7%       0.18        iostat.cpu.user
      6679 ± 14%     -29.6%       4701 ±  4%  slabinfo.anon_vma_chain.active_objs
      7047 ± 12%     -23.5%       5387 ±  2%  slabinfo.anon_vma_chain.num_objs
      2794 ±  4%      +9.1%       3049 ±  2%  slabinfo.filp.active_objs
    332426 ± 16%     +21.2%     402873 ±  3%  interrupts.CPU0.LOC:Local_timer_interrupts
    169.50 ± 15%    +134.8%     398.00 ± 45%  interrupts.CPU0.NMI:Non-maskable_interrupts
    169.50 ± 15%    +134.8%     398.00 ± 45%  interrupts.CPU0.PMI:Performance_monitoring_interrupts
    204.75 ± 21%     -48.7%     105.00        interrupts.CPU0.RES:Rescheduling_interrupts
    307421 ±  3%     +13.9%     350098 ±  2%  interrupts.CPU2.LOC:Local_timer_interrupts
    158.25 ± 54%    +137.6%     376.00 ± 61%  interrupts.CPU4.RES:Rescheduling_interrupts
    786.00 ± 30%     +26.4%     993.50 ±  2%  interrupts.CPU5.CAL:Function_call_interrupts
      1588 ± 14%     +30.7%       2076        interrupts.RES:Rescheduling_interrupts
    331.32 ± 13%     -27.3%     240.78        sched_debug.cfs_rq:/.exec_clock.min
      4972 ±  5%     -13.2%       4317 ±  3%  sched_debug.cfs_rq:/.min_vruntime.min
      0.45 ± 17%     -51.7%       0.22 ± 42%  sched_debug.cfs_rq:/.nr_spread_over.avg
      2.00           -50.0%       1.00        sched_debug.cfs_rq:/.nr_spread_over.max
      0.68 ±  4%     -44.4%       0.38 ± 12%  sched_debug.cfs_rq:/.nr_spread_over.stddev
    564589 ±  4%      +4.8%     591417 ±  3%  sched_debug.cpu.avg_idle.avg
     13.12 ± 19%     +21.9%      16.00 ±  9%  sched_debug.cpu.nr_uninterruptible.max
    133870 ±  6%     -16.3%     112088 ± 10%  sched_debug.cpu.ttwu_count.max
     15029 ±  8%     -33.4%      10005 ±  7%  sched_debug.cpu.ttwu_count.min
    132827 ±  6%     -16.0%     111524 ± 10%  sched_debug.cpu.ttwu_local.max
     10470 ±  9%     -27.5%       7589 ± 14%  sched_debug.cpu.ttwu_local.min
     10.17 ±  2%      +0.3       10.48        perf-stat.i.cache-miss-rate%
    558901 ±  2%      +6.4%     594670 ±  3%  perf-stat.i.cache-misses
   5172535            +1.8%    5265269        perf-stat.i.cache-references
      2.40            +3.8%       2.49 ±  2%  perf-stat.i.cpi
     14.90 ±  6%      +9.7%      16.35 ±  3%  perf-stat.i.cpu-migrations
 1.115e+08 ±  2%      +3.3%  1.152e+08        perf-stat.i.dTLB-loads
     62749 ±  3%      +8.0%      67738 ±  4%  perf-stat.i.dTLB-store-misses
     10.81 ±  2%      +0.5       11.29 ±  3%  perf-stat.overall.cache-miss-rate%
      1.82 ±  2%      +4.2%       1.89        perf-stat.overall.cpi
      0.07 ±  3%      +0.0        0.08 ±  3%  perf-stat.overall.dTLB-store-miss-rate%
      0.55 ±  2%      -4.1%       0.53        perf-stat.overall.ipc
    553610 ±  2%      +6.4%     589043 ±  3%  perf-stat.ps.cache-misses
   5123552            +1.8%    5215383        perf-stat.ps.cache-references
     14.76 ±  6%      +9.7%      16.20 ±  3%  perf-stat.ps.cpu-migrations
 1.104e+08 ±  2%      +3.3%  1.141e+08        perf-stat.ps.dTLB-loads
     62154 ±  3%      +8.0%      67096 ±  4%  perf-stat.ps.dTLB-store-misses
      0.68 ±104%      +0.8        1.53 ± 26%  perf-profile.calltrace.cycles-pp.free_unref_page_list.release_pages.__pagevec_release.truncate_inode_pages_range.ext4_evict_inode
      0.12 ± 17%      -0.1        0.04 ±100%  perf-profile.children.cycles-pp.__delete_from_page_cache
      0.07 ± 14%      +0.0        0.11 ± 18%  perf-profile.children.cycles-pp.irq_work_needs_cpu
      0.01 ±173%      +0.1        0.07 ±  7%  perf-profile.children.cycles-pp.kmem_cache_alloc
      0.01 ±173%      +0.1        0.07 ±  7%  perf-profile.children.cycles-pp.__unwind_start
      0.00            +0.1        0.07 ±  7%  perf-profile.children.cycles-pp.truncate_exceptional_pvec_entries
      0.00            +0.1        0.07 ±  7%  perf-profile.children.cycles-pp.rcu_core_si
      0.04 ±106%      +0.1        0.11 ± 18%  perf-profile.children.cycles-pp.drm_fb_helper_dirty
      0.04 ±102%      +0.1        0.11 ± 18%  perf-profile.children.cycles-pp.ext4_invalidatepage
      0.04 ±102%      +0.1        0.11 ± 18%  perf-profile.children.cycles-pp.xas_find
      0.02 ±173%      +0.1        0.10 ± 36%  perf-profile.children.cycles-pp.__wake_up_common
      0.02 ±173%      +0.1        0.10 ± 30%  perf-profile.children.cycles-pp.intel_pmu_disable_all
      0.00            +0.1        0.08 ± 12%  perf-profile.children.cycles-pp.timerqueue_iterate_next
      0.04 ±110%      +0.1        0.13        perf-profile.children.cycles-pp.free_unref_page_prepare
      0.04 ±102%      +0.1        0.13        perf-profile.children.cycles-pp.xas_start
      0.00            +0.1        0.10 ± 36%  perf-profile.children.cycles-pp.autoremove_wake_function
      0.17 ± 51%      +0.1        0.28 ± 20%  perf-profile.children.cycles-pp.__blk_mq_delay_run_hw_queue
      0.11 ± 44%      +0.1        0.21 ±  4%  perf-profile.children.cycles-pp.ext4_io_submit
      0.03 ±173%      +0.1        0.15 ± 37%  perf-profile.children.cycles-pp.update_rt_rq_load_avg
      0.22 ± 45%      +0.1        0.34 ±  2%  perf-profile.children.cycles-pp.rb_next
      0.11 ± 81%      +0.1        0.23 ± 17%  perf-profile.children.cycles-pp.cpuidle_reflect
      0.32 ± 11%      +0.1        0.46        perf-profile.children.cycles-pp.__x86_indirect_thunk_rax
      0.46 ± 25%      +0.2        0.62 ± 15%  perf-profile.children.cycles-pp._raw_spin_lock
      0.14 ± 93%      +0.2        0.31 ± 48%  perf-profile.children.cycles-pp.workingset_update_node
      0.44 ± 23%      +0.2        0.61 ± 18%  perf-profile.children.cycles-pp.sched_clock
      0.09 ± 59%      +0.2        0.26 ±  3%  perf-profile.children.cycles-pp.__mod_zone_page_state
      0.20 ± 19%      +0.2        0.37 ± 45%  perf-profile.children.cycles-pp.pm_qos_read_value
      0.41 ± 24%      +0.2        0.58 ± 24%  perf-profile.children.cycles-pp.native_sched_clock
      0.15 ± 53%      +0.2        0.34 ± 35%  perf-profile.children.cycles-pp.unaccount_page_cache_page
      0.20 ± 42%      +0.2        0.42 ± 20%  perf-profile.children.cycles-pp.nr_iowait_cpu
      0.25 ± 55%      +0.3        0.52 ± 22%  perf-profile.children.cycles-pp.update_ts_time_stats
      0.56 ± 16%      +0.3        0.83 ± 20%  perf-profile.children.cycles-pp.sched_clock_cpu
      0.95 ± 58%      +0.7        1.61 ± 27%  perf-profile.children.cycles-pp.free_unref_page_list
      0.16 ± 23%      +0.0        0.21 ±  4%  perf-profile.self.cycles-pp.menu_reflect
      0.05 ± 60%      +0.1        0.10 ± 30%  perf-profile.self.cycles-pp.leave_mm
      0.04 ±110%      +0.1        0.10 ± 30%  perf-profile.self.cycles-pp.free_unref_page_prepare
      0.01 ±173%      +0.1        0.08 ± 12%  perf-profile.self.cycles-pp.jbd2_journal_try_to_free_buffers
      0.00            +0.1        0.07 ±  7%  perf-profile.self.cycles-pp.timerqueue_iterate_next
      0.00            +0.1        0.07 ±  7%  perf-profile.self.cycles-pp.truncate_exceptional_pvec_entries
      0.00            +0.1        0.07 ±  7%  perf-profile.self.cycles-pp.rcu_core_si
      0.04 ±102%      +0.1        0.11 ± 18%  perf-profile.self.cycles-pp.ext4_invalidatepage
      0.15 ± 24%      +0.1        0.24 ± 10%  perf-profile.self.cycles-pp.get_next_timer_interrupt
      0.09 ± 67%      +0.1        0.20 ± 35%  perf-profile.self.cycles-pp.try_to_free_buffers
      0.00            +0.1        0.11 ± 18%  perf-profile.self.cycles-pp.xas_start
      0.03 ±173%      +0.1        0.15 ± 37%  perf-profile.self.cycles-pp.update_rt_rq_load_avg
      0.11 ± 81%      +0.1        0.23 ± 17%  perf-profile.self.cycles-pp.cpuidle_reflect
      0.26 ± 20%      +0.1        0.39 ± 14%  perf-profile.self.cycles-pp.__x86_indirect_thunk_rax
      0.39 ± 22%      +0.1        0.52 ± 15%  perf-profile.self.cycles-pp.native_sched_clock
      0.10 ±109%      +0.1        0.23 ± 17%  perf-profile.self.cycles-pp.update_ts_time_stats
      0.10 ± 57%      +0.1        0.24 ± 10%  perf-profile.self.cycles-pp.sched_clock_cpu
      0.07 ± 66%      +0.2        0.22 ± 25%  perf-profile.self.cycles-pp.free_unref_page_commit
      0.46 ± 25%      +0.2        0.62 ± 15%  perf-profile.self.cycles-pp._raw_spin_lock
      0.03 ±173%      +0.2        0.20 ± 35%  perf-profile.self.cycles-pp.unaccount_page_cache_page
      0.09 ± 59%      +0.2        0.26 ±  3%  perf-profile.self.cycles-pp.__mod_zone_page_state
      0.20 ± 19%      +0.2        0.37 ± 45%  perf-profile.self.cycles-pp.pm_qos_read_value
      0.18 ± 40%      +0.2        0.37 ± 27%  perf-profile.self.cycles-pp.timerqueue_del
      0.38 ± 35%      +0.2        0.58 ± 13%  perf-profile.self.cycles-pp.__hrtimer_run_queues
      0.19 ± 36%      +0.2        0.42 ± 20%  perf-profile.self.cycles-pp.nr_iowait_cpu
      0.00            +0.2        0.23 ± 43%  perf-profile.self.cycles-pp.free_unref_page_list





Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


Thanks,
Rong Chen


View attachment "config-5.4.0-rc3-00048-g569342dc24853" of type "text/plain" (200550 bytes)

View attachment "job-script" of type "text/plain" (8266 bytes)

View attachment "job.yaml" of type "text/plain" (5796 bytes)

View attachment "reproduce" of type "text/plain" (900 bytes)
