lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <CAL3q7H4T+AjDsVScdWs6FJd2CNjbVPwfbZyrQphWew10wbBnZQ@mail.gmail.com>
Date:   Mon, 23 Aug 2021 10:51:42 +0100
From:   Filipe Manana <fdmanana@...il.com>
To:     kernel test robot <oliver.sang@...el.com>
Cc:     Filipe Manana <fdmanana@...e.com>, David Sterba <dsterba@...e.com>,
        LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
        kbuild test robot <lkp@...el.com>,
        Huang Ying <ying.huang@...el.com>, feng.tang@...el.com,
        zhengjun.xing@...ux.intel.com,
        linux-btrfs <linux-btrfs@...r.kernel.org>
Subject: Re: [btrfs] ecc64fab7d: stress-ng.link.ops_per_sec -81.7% regression

On Sun, Aug 8, 2021 at 6:36 PM Filipe Manana <fdmanana@...il.com> wrote:
>
> On Sun, Aug 8, 2021 at 2:30 PM kernel test robot <oliver.sang@...el.com> wrote:
> >
> >
> >
> > Greeting,
> >
> > FYI, we noticed a -81.7% regression of stress-ng.link.ops_per_sec due to commit:
> >
> >
> > commit: ecc64fab7d49c678e70bd4c35fe64d2ab3e3d212 ("btrfs: fix lost inode on log replay after mix of fsync, rename and inode eviction")
> > https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
>
> The following patch, which was part of the same patchset, but it's not
> yet in Linus' tree, helps on restoring most of the performance:
>
> https://lore.kernel.org/linux-btrfs/307aaa44d39ad115e299bfe7d1f7e3eb4e991374.1627379796.git.fdmanana@suse.com/

Ok, so yesterday's report for that patch confirms that it more than
restores the performance drop from the bug fix:

https://lore.kernel.org/linux-btrfs/20210822150411.GA29963@xsang-OptiPlex-9020/

Thanks.


>
> There will be two more to reduce logging work during link and rename
> operations, but I'll only send them out after coming back from
> vacations.
>
> Thanks.
>
> >
> >
> > in testcase: stress-ng
> > on test machine: 96 threads 2 sockets Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 512G memory
> > with following parameters:
> >
> >         nr_threads: 10%
> >         disk: 1HDD
> >         testtime: 60s
> >         fs: btrfs
> >         class: filesystem
> >         test: link
> >         cpufreq_governor: performance
> >         ucode: 0x5003006
> >
> >
> >
> >
> > If you fix the issue, kindly add following tag
> > Reported-by: kernel test robot <oliver.sang@...el.com>
> >
> >
> > Details are as below:
> > -------------------------------------------------------------------------------------------------->
> >
> >
> > To reproduce:
> >
> >         git clone https://github.com/intel/lkp-tests.git
> >         cd lkp-tests
> >         bin/lkp install                job.yaml  # job file is attached in this email
> >         bin/lkp split-job --compatible job.yaml  # generate the yaml file for lkp run
> >         bin/lkp run                    generated-yaml-file
> >
> > =========================================================================================
> > class/compiler/cpufreq_governor/disk/fs/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime/ucode:
> >   filesystem/gcc-9/performance/1HDD/btrfs/x86_64-rhel-8.3/10%/debian-10.4-x86_64-20200603.cgz/lkp-csl-2sp7/link/stress-ng/60s/0x5003006
> >
> > commit:
> >   240246f6b9 ("btrfs: mark compressed range uptodate only if all bio succeed")
> >   ecc64fab7d ("btrfs: fix lost inode on log replay after mix of fsync, rename and inode eviction")
> >
> > 240246f6b913b0c2 ecc64fab7d49c678e70bd4c35fe
> > ---------------- ---------------------------
> >          %stddev     %change         %stddev
> >              \          |                \
> >     227.57 ą  2%     -81.9%      41.14 ą 10%  stress-ng.link.ops
> >       3.75 ą  2%     -81.7%       0.69 ą 10%  stress-ng.link.ops_per_sec
> >     949188 ą  2%   +4871.8%   47192178 ą  6%  stress-ng.time.file_system_outputs
> >    8737302 ą  3%     +18.5%   10352864 ą  5%  stress-ng.time.voluntary_context_switches
> >       6.96            -4.9%       6.62        iostat.cpu.system
> >       4659 ą 27%     -73.6%       1228 ą 73%  numa-meminfo.node0.Dirty
> >       3435 ą 39%     -69.5%       1048 ą 95%  numa-meminfo.node1.Dirty
> >      11008 ą  7%     -10.6%       9835 ą  6%  softirqs.CPU4.SCHED
> >      11489 ą  5%     -14.1%       9871 ą 10%  softirqs.CPU49.SCHED
> >       4721 ą  5%     -87.1%     610.86 ą 23%  vmstat.io.bo
> >    2817765           +21.3%    3416713        vmstat.memory.cache
> >       2663            +3.5%       2757        turbostat.Bzy_MHz
> >    8407573 ą  4%     +15.4%    9703225 ą  6%  turbostat.C1
> >     158.73            +4.0%     165.10        turbostat.PkgWatt
> >       9517 ą  6%     -88.6%       1088 ą 19%  interrupts.315:PCI-MSI.376832-edge.ahci[0000:00:17.0]
> >       5890 ą 49%     -68.2%       1875 ą 68%  interrupts.CPU21.CAL:Function_call_interrupts
> >     471.14 ą 57%    +124.1%       1056 ą 41%  interrupts.CPU75.NMI:Non-maskable_interrupts
> >     471.14 ą 57%    +124.1%       1056 ą 41%  interrupts.CPU75.PMI:Performance_monitoring_interrupts
> >       0.29 ą  5%      -0.2        0.04 ą 20%  mpstat.cpu.all.iowait%
> >       0.90 ą  8%      -0.2        0.74 ą  3%  mpstat.cpu.all.irq%
> >       0.07 ą  5%      -0.0        0.06 ą  2%  mpstat.cpu.all.soft%
> >       0.18 ą  2%      -0.0        0.14 ą  2%  mpstat.cpu.all.usr%
> >     292111 ą  2%    +207.8%     898999        meminfo.Active
> >     285313 ą  2%    +212.7%     892049        meminfo.Active(file)
> >    2689046           +22.4%    3291841        meminfo.Cached
> >       8096 ą  2%     -72.1%       2259 ą  6%  meminfo.Dirty
> >       7347 ą  5%     -46.4%       3938 ą 40%  meminfo.Inactive(file)
> >    4474242           +13.7%    5086026        meminfo.Memused
> >    4500587           +14.5%    5152537        meminfo.max_used_kB
> >      47198 ą 34%   +3147.3%    1532689 ą 78%  numa-vmstat.node0.nr_dirtied
> >       1163 ą 27%     -73.6%     306.86 ą 73%  numa-vmstat.node0.nr_dirty
> >      13644 ą 34%     -82.6%       2373 ą 58%  numa-vmstat.node0.nr_written
> >       1192 ą 29%     -74.2%     307.71 ą 74%  numa-vmstat.node0.nr_zone_write_pending
> >     858.29 ą 39%     -69.6%     260.71 ą 96%  numa-vmstat.node1.nr_dirty
> >       8345 ą 58%     -81.8%       1514 ą101%  numa-vmstat.node1.nr_written
> >     870.29 ą 38%     -69.8%     262.71 ą 95%  numa-vmstat.node1.nr_zone_write_pending
> >      71331 ą  2%    +212.7%     223017        proc-vmstat.nr_active_file
> >     159852 ą  2%   +3593.0%    5903385 ą  6%  proc-vmstat.nr_dirtied
> >       2023 ą  2%     -72.0%     566.00 ą  7%  proc-vmstat.nr_dirty
> >     672525           +22.4%     823223        proc-vmstat.nr_file_pages
> >       1836 ą  5%     -46.5%     983.00 ą 40%  proc-vmstat.nr_inactive_file
> >      43756 ą  5%     -87.1%       5658 ą 24%  proc-vmstat.nr_written
> >      71331 ą  2%    +212.7%     223017        proc-vmstat.nr_zone_active_file
> >       1836 ą  5%     -46.5%     983.00 ą 40%  proc-vmstat.nr_zone_inactive_file
> >       2055 ą  3%     -72.3%     570.00 ą  6%  proc-vmstat.nr_zone_write_pending
> >     617463           +17.3%     724326        proc-vmstat.numa_hit
> >     530828           +20.1%     637670        proc-vmstat.numa_local
> >      49518 ą  2%     +47.9%      73219 ą 16%  proc-vmstat.pgactivate
> >     648782           +15.4%     748934        proc-vmstat.pgalloc_normal
> >     358760            -3.4%     346576        proc-vmstat.pgfree
> >     308332 ą  6%     -87.5%      38626 ą 24%  proc-vmstat.pgpgout
> >  2.142e+09           -15.7%  1.806e+09 ą  2%  perf-stat.i.branch-instructions
> >   18232988 ą  3%     -17.5%   15041745 ą  3%  perf-stat.i.branch-misses
> >       2.02 ą  3%     +14.5%       2.31        perf-stat.i.cpi
> >  2.129e+10            -4.8%  2.026e+10        perf-stat.i.cpu-cycles
> >       0.02 ą 36%      -0.0        0.00 ą 34%  perf-stat.i.dTLB-load-miss-rate%
> >     385230 ą 32%     -72.5%     105867 ą 28%  perf-stat.i.dTLB-load-misses
> >  2.742e+09           -17.8%  2.253e+09        perf-stat.i.dTLB-loads
> >   1.11e+09 ą  2%     -23.2%  8.532e+08 ą  2%  perf-stat.i.dTLB-stores
> >      32.05            -3.7       28.32        perf-stat.i.iTLB-load-miss-rate%
> >    3835461 ą  2%     -26.6%    2816654 ą  3%  perf-stat.i.iTLB-load-misses
> >    8289183 ą  3%     -12.0%    7292203 ą  3%  perf-stat.i.iTLB-loads
> >  1.069e+10           -16.6%  8.923e+09        perf-stat.i.instructions
> >       2913 ą  2%     +13.8%       3316 ą  5%  perf-stat.i.instructions-per-iTLB-miss
> >       0.52 ą  3%     -12.9%       0.45        perf-stat.i.ipc
> >       0.22            -4.8%       0.21        perf-stat.i.metric.GHz
> >      62.64           -18.1%      51.27        perf-stat.i.metric.M/sec
> >      92.36           -23.1       69.23 ą  6%  perf-stat.i.node-store-miss-rate%
> >      98872 ą  6%    +764.1%     854357 ą 10%  perf-stat.i.node-stores
> >      22849           -71.6%       6482 ą  6%  perf-stat.i.page-faults
> >       1.99 ą  2%     +14.0%       2.27        perf-stat.overall.cpi
> >       0.01 ą 32%      -0.0        0.00 ą 27%  perf-stat.overall.dTLB-load-miss-rate%
> >      31.64            -3.8       27.87        perf-stat.overall.iTLB-load-miss-rate%
> >       2789 ą  2%     +13.7%       3172 ą  4%  perf-stat.overall.instructions-per-iTLB-miss
> >       0.50 ą  2%     -12.3%       0.44        perf-stat.overall.ipc
> >      95.08           -26.8       68.28 ą  6%  perf-stat.overall.node-store-miss-rate%
> >  2.108e+09           -15.7%  1.777e+09 ą  2%  perf-stat.ps.branch-instructions
> >   17956331 ą  3%     -17.5%   14821238 ą  3%  perf-stat.ps.branch-misses
> >  2.095e+10            -4.8%  1.994e+10        perf-stat.ps.cpu-cycles
> >     379273 ą 32%     -72.5%     104229 ą 28%  perf-stat.ps.dTLB-load-misses
> >  2.698e+09           -17.8%  2.218e+09        perf-stat.ps.dTLB-loads
> >  1.093e+09 ą  2%     -23.1%  8.398e+08 ą  2%  perf-stat.ps.dTLB-stores
> >    3774726 ą  2%     -26.6%    2772418 ą  3%  perf-stat.ps.iTLB-load-misses
> >    8157708 ą  3%     -12.0%    7176458 ą  3%  perf-stat.ps.iTLB-loads
> >  1.052e+10           -16.6%  8.783e+09        perf-stat.ps.instructions
> >      97299 ą  6%    +763.6%     840325 ą 10%  perf-stat.ps.node-stores
> >      22491           -71.6%       6381 ą  6%  perf-stat.ps.page-faults
> >  6.702e+11           -17.1%  5.556e+11        perf-stat.total.instructions
> >       5395 ą  2%      -9.6%       4876 ą  3%  slabinfo.Acpi-State.active_objs
> >       5395 ą  2%      -9.6%       4876 ą  3%  slabinfo.Acpi-State.num_objs
> >     485.57 ą  5%     -55.8%     214.86 ą  8%  slabinfo.biovec-128.active_objs
> >     485.57 ą  5%     -55.8%     214.86 ą  8%  slabinfo.biovec-128.num_objs
> >     516.43 ą 12%     -23.6%     394.43 ą  9%  slabinfo.biovec-64.active_objs
> >     516.43 ą 12%     -23.6%     394.43 ą  9%  slabinfo.biovec-64.num_objs
> >     392.71 ą  8%     -49.5%     198.43 ą 12%  slabinfo.biovec-max.active_objs
> >     407.86 ą  7%     -49.3%     206.71 ą 11%  slabinfo.biovec-max.num_objs
> >       7325 ą  3%     -55.1%       3287 ą  6%  slabinfo.btrfs_delayed_tree_ref.active_objs
> >     187.43 ą  3%     -55.3%      83.86 ą  6%  slabinfo.btrfs_delayed_tree_ref.active_slabs
> >       7325 ą  3%     -55.1%       3287 ą  6%  slabinfo.btrfs_delayed_tree_ref.num_objs
> >     187.43 ą  3%     -55.3%      83.86 ą  6%  slabinfo.btrfs_delayed_tree_ref.num_slabs
> >       8208           -51.2%       4004 ą  8%  slabinfo.btrfs_extent_map.active_objs
> >       8208           -51.2%       4004 ą  8%  slabinfo.btrfs_extent_map.num_objs
> >     903.14 ą  6%     -77.0%     207.29 ą 33%  slabinfo.btrfs_ordered_extent.active_objs
> >     903.14 ą  6%     -77.0%     207.29 ą 33%  slabinfo.btrfs_ordered_extent.num_objs
> >       9693 ą  3%     -17.3%       8017 ą  7%  slabinfo.fsnotify_mark_connector.active_objs
> >       9693 ą  3%     -17.3%       8017 ą  7%  slabinfo.fsnotify_mark_connector.num_objs
> >      13523 ą  6%     -37.0%       8525 ą  6%  slabinfo.kmalloc-128.active_objs
> >     425.29 ą  6%     -36.6%     269.57 ą  6%  slabinfo.kmalloc-128.active_slabs
> >      13629 ą  6%     -36.6%       8642 ą  6%  slabinfo.kmalloc-128.num_objs
> >     425.29 ą  6%     -36.6%     269.57 ą  6%  slabinfo.kmalloc-128.num_slabs
> >      17162 ą  6%     -33.9%      11351 ą  5%  slabinfo.kmalloc-192.active_objs
> >     411.71 ą  6%     -34.0%     271.71 ą  5%  slabinfo.kmalloc-192.active_slabs
> >      17307 ą  6%     -34.0%      11430 ą  5%  slabinfo.kmalloc-192.num_objs
> >     411.71 ą  6%     -34.0%     271.71 ą  5%  slabinfo.kmalloc-192.num_slabs
> >      23093 ą  9%     -64.0%       8304 ą  6%  slabinfo.kmalloc-rcl-96.active_objs
> >     549.43 ą  9%     -64.1%     197.29 ą  6%  slabinfo.kmalloc-rcl-96.active_slabs
> >      23095 ą  9%     -64.0%       8304 ą  6%  slabinfo.kmalloc-rcl-96.num_objs
> >     549.43 ą  9%     -64.1%     197.29 ą  6%  slabinfo.kmalloc-rcl-96.num_slabs
> >       2224 ą  4%     -31.9%       1514 ą  7%  slabinfo.mnt_cache.active_objs
> >       2224 ą  4%     -31.9%       1514 ą  7%  slabinfo.mnt_cache.num_objs
> >       8352 ą  2%     +79.5%      14992 ą  6%  slabinfo.pool_workqueue.active_objs
> >     261.71 ą  2%     +79.3%     469.14 ą  6%  slabinfo.pool_workqueue.active_slabs
> >       8381 ą  2%     +79.2%      15022 ą  6%  slabinfo.pool_workqueue.num_objs
> >     261.71 ą  2%     +79.3%     469.14 ą  6%  slabinfo.pool_workqueue.num_slabs
> >      29474           +13.4%      33417 ą  2%  slabinfo.radix_tree_node.active_objs
> >      29474           +13.4%      33429 ą  2%  slabinfo.radix_tree_node.num_objs
> >     612.57 ą  9%     -35.3%     396.43 ą 11%  slabinfo.skbuff_fclone_cache.active_objs
> >     612.57 ą  9%     -35.3%     396.43 ą 11%  slabinfo.skbuff_fclone_cache.num_objs
> >      30.52 ą  3%     -28.4        2.16 ą158%  perf-profile.calltrace.cycles-pp.do_unlinkat.do_syscall_64.entry_SYSCALL_64_after_hwframe
> >      30.35 ą  3%     -28.2        2.15 ą158%  perf-profile.calltrace.cycles-pp.vfs_unlink.do_unlinkat.do_syscall_64.entry_SYSCALL_64_after_hwframe
> >      30.28 ą  3%     -28.1        2.15 ą158%  perf-profile.calltrace.cycles-pp.btrfs_unlink.vfs_unlink.do_unlinkat.do_syscall_64.entry_SYSCALL_64_after_hwframe
> >      29.86 ą  3%     -27.7        2.11 ą158%  perf-profile.calltrace.cycles-pp.btrfs_unlink_inode.btrfs_unlink.vfs_unlink.do_unlinkat.do_syscall_64
> >      29.86 ą  3%     -27.7        2.11 ą158%  perf-profile.calltrace.cycles-pp.__btrfs_unlink_inode.btrfs_unlink_inode.btrfs_unlink.vfs_unlink.do_unlinkat
> >      19.65 ą  4%     -18.3        1.36 ą158%  perf-profile.calltrace.cycles-pp.btrfs_add_link.btrfs_link.vfs_link.do_linkat.__x64_sys_link
> >      18.68 ą  3%     -17.4        1.26 ą158%  perf-profile.calltrace.cycles-pp.btrfs_del_inode_ref.__btrfs_unlink_inode.btrfs_unlink_inode.btrfs_unlink.vfs_unlink
> >      17.74 ą  3%     -16.5        1.20 ą158%  perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_del_inode_ref.__btrfs_unlink_inode.btrfs_unlink_inode.btrfs_unlink
> >      15.15 ą  3%     -14.1        1.03 ą158%  perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.btrfs_del_inode_ref.__btrfs_unlink_inode.btrfs_unlink_inode
> >      15.10 ą  3%     -14.1        1.02 ą158%  perf-profile.calltrace.cycles-pp.__btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_del_inode_ref.__btrfs_unlink_inode
> >      14.92 ą  3%     -13.9        1.01 ą158%  perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.__btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_del_inode_ref
> >      12.51 ą  4%     -11.6        0.88 ą158%  perf-profile.calltrace.cycles-pp.btrfs_insert_inode_ref.btrfs_add_link.btrfs_link.vfs_link.do_linkat
> >      10.73 ą  4%     -10.0        0.76 ą158%  perf-profile.calltrace.cycles-pp.btrfs_insert_empty_items.btrfs_insert_inode_ref.btrfs_add_link.btrfs_link.vfs_link
> >      10.23 ą  4%      -9.5        0.72 ą158%  perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_insert_empty_items.btrfs_insert_inode_ref.btrfs_add_link.btrfs_link
> >       9.38 ą  4%      -8.8        0.62 ą158%  perf-profile.calltrace.cycles-pp.btrfs_lookup_dir_item.__btrfs_unlink_inode.btrfs_unlink_inode.btrfs_unlink.vfs_unlink
> >       9.34 ą  4%      -8.7        0.62 ą158%  perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_lookup_dir_item.__btrfs_unlink_inode.btrfs_unlink_inode.btrfs_unlink
> >       7.85 ą  4%      -7.3        0.52 ą158%  perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item.__btrfs_unlink_inode.btrfs_unlink_inode
> >       7.82 ą  4%      -7.3        0.51 ą158%  perf-profile.calltrace.cycles-pp.__btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item.__btrfs_unlink_inode
> >       7.72 ą  4%      -7.2        0.51 ą158%  perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.__btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item
> >       7.65 ą  4%      -7.1        0.52 ą158%  perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.btrfs_insert_empty_items.btrfs_insert_inode_ref.btrfs_add_link
> >       7.62 ą  4%      -7.1        0.52 ą158%  perf-profile.calltrace.cycles-pp.__btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_insert_empty_items.btrfs_insert_inode_ref
> >       6.93 ą  4%      -6.5        0.46 ą158%  perf-profile.calltrace.cycles-pp.btrfs_insert_dir_item.btrfs_add_link.btrfs_link.vfs_link.do_linkat
> >       6.16 ą  4%      -5.8        0.41 ą158%  perf-profile.calltrace.cycles-pp.insert_with_overflow.btrfs_insert_dir_item.btrfs_add_link.btrfs_link.vfs_link
> >       6.15 ą  4%      -5.7        0.41 ą159%  perf-profile.calltrace.cycles-pp.btrfs_insert_empty_items.insert_with_overflow.btrfs_insert_dir_item.btrfs_add_link.btrfs_link
> >       5.67 ą  4%      -5.3        0.38 ą159%  perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_insert_empty_items.insert_with_overflow.btrfs_insert_dir_item.btrfs_add_link
> >      28.95 ą  3%      -2.8       26.20 ą  6%  perf-profile.calltrace.cycles-pp.osq_lock.rwsem_down_write_slowpath.__btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot
> >       0.00            +0.6        0.58 ą  8%  perf-profile.calltrace.cycles-pp.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_insert_empty_items.copy_items.btrfs_log_inode
> >       0.00            +0.6        0.61 ą 11%  perf-profile.calltrace.cycles-pp.btrfs_search_forward.btrfs_log_inode.log_new_dir_dentries.btrfs_log_inode_parent.btrfs_log_new_name
> >       0.00            +0.7        0.69 ą 13%  perf-profile.calltrace.cycles-pp.setup_items_for_insert.btrfs_insert_empty_items.copy_items.btrfs_log_inode.log_new_dir_dentries
> >       0.00            +1.0        1.00 ą 17%  perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_log_inode.log_new_dir_dentries.btrfs_log_inode_parent.btrfs_log_new_name
> >       0.00            +1.2        1.21 ą 15%  perf-profile.calltrace.cycles-pp.btrfs_search_forward.log_new_dir_dentries.btrfs_log_inode_parent.btrfs_log_new_name.btrfs_link
> >       0.00            +1.6        1.60 ą 18%  perf-profile.calltrace.cycles-pp.btrfs_search_slot.overwrite_item.log_dir_items.log_directory_changes.btrfs_log_inode
> >       0.00            +2.0        2.01 ą 18%  perf-profile.calltrace.cycles-pp.overwrite_item.log_dir_items.log_directory_changes.btrfs_log_inode.btrfs_log_inode_parent
> >       0.00            +2.1        2.14 ą 18%  perf-profile.calltrace.cycles-pp.log_dir_items.log_directory_changes.btrfs_log_inode.btrfs_log_inode_parent.btrfs_log_new_name
> >       0.00            +2.1        2.14 ą 18%  perf-profile.calltrace.cycles-pp.log_directory_changes.btrfs_log_inode.btrfs_log_inode_parent.btrfs_log_new_name.btrfs_link
> >       0.00            +2.4        2.43 ą 17%  perf-profile.calltrace.cycles-pp.btrfs_log_inode.btrfs_log_inode_parent.btrfs_log_new_name.btrfs_link.vfs_link
> >       0.00            +2.7        2.66 ą 44%  perf-profile.calltrace.cycles-pp.split_leaf.btrfs_search_slot.btrfs_insert_empty_items.copy_items.btrfs_log_inode
> >       0.00            +2.9        2.85 ą 75%  perf-profile.calltrace.cycles-pp.btrfs_del_leaf.btrfs_del_items.drop_objectid_items.btrfs_log_inode.log_new_dir_dentries
> >       0.00            +3.8        3.83 ą 58%  perf-profile.calltrace.cycles-pp.btrfs_del_items.drop_objectid_items.btrfs_log_inode.log_new_dir_dentries.btrfs_log_inode_parent
> >      11.90 ą  5%      +4.3       16.20 ą  6%  perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.__btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_insert_empty_items
> >       0.00           +15.6       15.57 ą 13%  perf-profile.calltrace.cycles-pp.__btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_insert_empty_items.copy_items
> >       0.00           +15.6       15.62 ą 13%  perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.btrfs_insert_empty_items.copy_items.btrfs_log_inode
> >       0.00           +15.7       15.73 ą 14%  perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.__btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.drop_objectid_items
> >       0.00           +15.9       15.87 ą 14%  perf-profile.calltrace.cycles-pp.__btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.drop_objectid_items.btrfs_log_inode
> >       0.00           +15.9       15.93 ą 14%  perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.drop_objectid_items.btrfs_log_inode.log_new_dir_dentries
> >       0.00           +17.3       17.26 ą 14%  perf-profile.calltrace.cycles-pp.btrfs_search_slot.drop_objectid_items.btrfs_log_inode.log_new_dir_dentries.btrfs_log_inode_parent
> >       0.00           +20.1       20.15 ą 15%  perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_insert_empty_items.copy_items.btrfs_log_inode.log_new_dir_dentries
> >       0.00           +20.9       20.86 ą 14%  perf-profile.calltrace.cycles-pp.btrfs_insert_empty_items.copy_items.btrfs_log_inode.log_new_dir_dentries.btrfs_log_inode_parent
> >       0.00           +21.7       21.73 ą 16%  perf-profile.calltrace.cycles-pp.drop_objectid_items.btrfs_log_inode.log_new_dir_dentries.btrfs_log_inode_parent.btrfs_log_new_name
> >       0.00           +21.8       21.77 ą 14%  perf-profile.calltrace.cycles-pp.copy_items.btrfs_log_inode.log_new_dir_dentries.btrfs_log_inode_parent.btrfs_log_new_name
> >      21.17 ą  4%     +29.7       50.85 ą  8%  perf-profile.calltrace.cycles-pp.__x64_sys_link.do_syscall_64.entry_SYSCALL_64_after_hwframe
> >      21.17 ą  4%     +29.7       50.85 ą  8%  perf-profile.calltrace.cycles-pp.do_linkat.__x64_sys_link.do_syscall_64.entry_SYSCALL_64_after_hwframe
> >      20.22 ą  4%     +30.6       50.78 ą  8%  perf-profile.calltrace.cycles-pp.vfs_link.do_linkat.__x64_sys_link.do_syscall_64.entry_SYSCALL_64_after_hwframe
> >      20.17 ą  4%     +30.6       50.78 ą  8%  perf-profile.calltrace.cycles-pp.btrfs_link.vfs_link.do_linkat.__x64_sys_link.do_syscall_64
> >       0.00           +45.5       45.51 ą 14%  perf-profile.calltrace.cycles-pp.btrfs_log_inode.log_new_dir_dentries.btrfs_log_inode_parent.btrfs_log_new_name.btrfs_link
> >       0.00           +46.9       46.92 ą 13%  perf-profile.calltrace.cycles-pp.log_new_dir_dentries.btrfs_log_inode_parent.btrfs_log_new_name.btrfs_link.vfs_link
> >       0.00           +49.4       49.36 ą 12%  perf-profile.calltrace.cycles-pp.btrfs_log_new_name.btrfs_link.vfs_link.do_linkat.__x64_sys_link
> >       0.00           +49.4       49.36 ą 12%  perf-profile.calltrace.cycles-pp.btrfs_log_inode_parent.btrfs_log_new_name.btrfs_link.vfs_link.do_linkat
> >      30.52 ą  3%     -28.4        2.16 ą158%  perf-profile.children.cycles-pp.do_unlinkat
> >      30.35 ą  3%     -28.2        2.15 ą158%  perf-profile.children.cycles-pp.vfs_unlink
> >      30.28 ą  3%     -28.1        2.15 ą158%  perf-profile.children.cycles-pp.btrfs_unlink
> >      29.86 ą  3%     -27.8        2.11 ą158%  perf-profile.children.cycles-pp.btrfs_unlink_inode
> >      29.86 ą  3%     -27.7        2.11 ą158%  perf-profile.children.cycles-pp.__btrfs_unlink_inode
> >      19.65 ą  4%     -18.3        1.36 ą158%  perf-profile.children.cycles-pp.btrfs_add_link
> >      18.68 ą  3%     -17.4        1.31 ą158%  perf-profile.children.cycles-pp.btrfs_del_inode_ref
> >      12.52 ą  4%     -11.6        0.88 ą158%  perf-profile.children.cycles-pp.btrfs_insert_inode_ref
> >       9.38 ą  4%      -8.7        0.64 ą158%  perf-profile.children.cycles-pp.btrfs_lookup_dir_item
> >       6.93 ą  4%      -6.5        0.46 ą158%  perf-profile.children.cycles-pp.btrfs_insert_dir_item
> >       6.17 ą  4%      -5.8        0.41 ą158%  perf-profile.children.cycles-pp.insert_with_overflow
> >      29.64 ą  3%      -3.0       26.61 ą  6%  perf-profile.children.cycles-pp.osq_lock
> >       3.23 ą 10%      -2.9        0.30 ą117%  perf-profile.children.cycles-pp.ret_from_fork
> >       3.23 ą 10%      -2.9        0.30 ą117%  perf-profile.children.cycles-pp.kthread
> >       0.63 ą 11%      -0.4        0.20 ą 51%  perf-profile.children.cycles-pp.poll_idle
> >       0.73 ą  3%      -0.4        0.32 ą 26%  perf-profile.children.cycles-pp.btrfs_set_token_32
> >       0.50 ą  7%      -0.4        0.11 ą 55%  perf-profile.children.cycles-pp.__btrfs_release_delayed_node
> >       1.06 ą  5%      -0.4        0.68 ą 19%  perf-profile.children.cycles-pp.unlock_up
> >       0.65 ą  8%      -0.4        0.28 ą 28%  perf-profile.children.cycles-pp.btrfs_get_token_32
> >       0.50 ą 10%      -0.2        0.27 ą 24%  perf-profile.children.cycles-pp.check_setget_bounds
> >       1.02 ą  7%      -0.2        0.79 ą 17%  perf-profile.children.cycles-pp.setup_items_for_insert
> >       0.39 ą  6%      -0.2        0.19 ą 17%  perf-profile.children.cycles-pp.memmove
> >       0.45 ą  8%      -0.2        0.28 ą 24%  perf-profile.children.cycles-pp.down_write
> >       0.32 ą  8%      -0.2        0.15 ą 34%  perf-profile.children.cycles-pp.up_write
> >       0.31 ą  7%      -0.1        0.17 ą 14%  perf-profile.children.cycles-pp.memmove_extent_buffer
> >       0.21 ą  9%      -0.1        0.07 ą 61%  perf-profile.children.cycles-pp.mutex_lock
> >       0.18 ą 11%      -0.1        0.10 ą 26%  perf-profile.children.cycles-pp.__list_del_entry_valid
> >       0.17 ą  6%      -0.1        0.09 ą 27%  perf-profile.children.cycles-pp.kmem_cache_alloc
> >       0.14 ą 10%      -0.1        0.07 ą 29%  perf-profile.children.cycles-pp.__might_sleep
> >       0.15 ą 10%      -0.1        0.08 ą 25%  perf-profile.children.cycles-pp.___might_sleep
> >       0.12 ą 11%      -0.1        0.07 ą 15%  perf-profile.children.cycles-pp.btrfs_reserve_metadata_bytes
> >       0.12 ą 10%      -0.0        0.07 ą 13%  perf-profile.children.cycles-pp.__reserve_bytes
> >       0.07 ą  9%      +0.0        0.11 ą 11%  perf-profile.children.cycles-pp.leaf_space_used
> >       0.15 ą  8%      +0.1        0.20 ą 11%  perf-profile.children.cycles-pp.btrfs_get_32
> >       0.00            +0.1        0.06 ą 12%  perf-profile.children.cycles-pp.fill_inode_item
> >       0.00            +0.1        0.07 ą 19%  perf-profile.children.cycles-pp.btrfs_release_extent_buffer_pages
> >       0.00            +0.1        0.07 ą 26%  perf-profile.children.cycles-pp.btree_read_extent_buffer_pages
> >       0.00            +0.1        0.08 ą 27%  perf-profile.children.cycles-pp.insert_dir_log_key
> >       0.03 ą 86%      +0.1        0.11 ą 16%  perf-profile.children.cycles-pp.btrfs_buffer_uptodate
> >       0.10 ą 10%      +0.1        0.18 ą 16%  perf-profile.children.cycles-pp.btrfs_bin_search
> >       0.24 ą 10%      +0.1        0.32 ą  7%  perf-profile.children.cycles-pp.btrfs_comp_cpu_keys
> >       0.00            +0.1        0.09 ą 19%  perf-profile.children.cycles-pp.tree_search_offset
> >       0.00            +0.1        0.09 ą 18%  perf-profile.children.cycles-pp.btrfs_use_block_rsv
> >       0.00            +0.1        0.09 ą 25%  perf-profile.children.cycles-pp.xas_load
> >       0.00            +0.1        0.09 ą 13%  perf-profile.children.cycles-pp.btrfs_commit_inode_delayed_inode
> >       0.00            +0.1        0.10 ą 22%  perf-profile.children.cycles-pp.fixup_low_keys
> >       0.00            +0.1        0.11 ą 23%  perf-profile.children.cycles-pp.del_ptr
> >       0.00            +0.1        0.11 ą 25%  perf-profile.children.cycles-pp.insert_ptr
> >       0.00            +0.1        0.13 ą 23%  perf-profile.children.cycles-pp.try_merge_free_space
> >       0.09 ą 22%      +0.1        0.24 ą 12%  perf-profile.children.cycles-pp.release_extent_buffer
> >       0.00            +0.2        0.17 ą 23%  perf-profile.children.cycles-pp.btree_clear_page_dirty
> >       0.13 ą  8%      +0.2        0.31 ą 15%  perf-profile.children.cycles-pp.__radix_tree_lookup
> >       0.00            +0.2        0.19 ą 20%  perf-profile.children.cycles-pp.pagecache_get_page
> >       0.18 ą  9%      +0.2        0.37 ą 19%  perf-profile.children.cycles-pp.btrfs_get_64
> >       0.00            +0.2        0.20 ą 22%  perf-profile.children.cycles-pp.__set_page_dirty_nobuffers
> >       0.00            +0.2        0.23 ą 23%  perf-profile.children.cycles-pp.clear_extent_buffer_dirty
> >       0.10 ą  6%      +0.3        0.38 ą 31%  perf-profile.children.cycles-pp.__push_leaf_left
> >       0.07 ą 18%      +0.3        0.39 ą 18%  perf-profile.children.cycles-pp.set_extent_buffer_dirty
> >       0.10 ą 10%      +0.3        0.43 ą 17%  perf-profile.children.cycles-pp.btrfs_mark_buffer_dirty
> >       0.00            +0.4        0.35 ą 19%  perf-profile.children.cycles-pp.read_tree_block
> >       0.12 ą  7%      +0.4        0.55 ą 30%  perf-profile.children.cycles-pp.push_leaf_left
> >       0.09 ą 13%      +0.6        0.64 ą 16%  perf-profile.children.cycles-pp.alloc_extent_buffer
> >       0.00            +0.6        0.59 ą 30%  perf-profile.children.cycles-pp.copy_for_split
> >       0.00            +0.6        0.59 ą 14%  perf-profile.children.cycles-pp.btrfs_read_node_slot
> >       0.56 ą  8%      +0.6        1.17 ą 14%  perf-profile.children.cycles-pp.generic_bin_search
> >       0.00            +0.8        0.82 ą 20%  perf-profile.children.cycles-pp.copy_extent_buffer
> >       0.00            +0.9        0.91 ą 19%  perf-profile.children.cycles-pp.btrfs_unlock_up_safe
> >       0.09 ą 10%      +1.0        1.10 ą 16%  perf-profile.children.cycles-pp.read_extent_buffer
> >       0.00            +1.0        1.05 ą 99%  perf-profile.children.cycles-pp.find_free_extent
> >       0.00            +1.1        1.07 ą 98%  perf-profile.children.cycles-pp.btrfs_reserve_extent
> >       0.05 ą 43%      +1.6        1.67 ą 64%  perf-profile.children.cycles-pp.btrfs_alloc_tree_block
> >       0.49 ą  8%      +1.8        2.31 ą 83%  perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
> >       0.00            +1.8        1.83 ą 13%  perf-profile.children.cycles-pp.btrfs_search_forward
> >       0.00            +2.0        2.01 ą 18%  perf-profile.children.cycles-pp.overwrite_item
> >       0.00            +2.1        2.14 ą 18%  perf-profile.children.cycles-pp.log_directory_changes
> >       0.00            +2.1        2.14 ą 18%  perf-profile.children.cycles-pp.log_dir_items
> >       0.55 ą  5%      +2.2        2.72 ą 41%  perf-profile.children.cycles-pp.split_leaf
> >       0.25 ą 16%      +2.6        2.87 ą 75%  perf-profile.children.cycles-pp.btrfs_del_leaf
> >       1.14 ą  5%      +2.8        3.92 ą 54%  perf-profile.children.cycles-pp.btrfs_del_items
> >      17.66 ą  4%      +4.7       22.37 ą  6%  perf-profile.children.cycles-pp.btrfs_insert_empty_items
> >       0.00           +21.9       21.87 ą 16%  perf-profile.children.cycles-pp.drop_objectid_items
> >       0.00           +21.9       21.90 ą 14%  perf-profile.children.cycles-pp.copy_items
> >      21.17 ą  4%     +29.7       50.85 ą  8%  perf-profile.children.cycles-pp.__x64_sys_link
> >      21.17 ą  4%     +29.7       50.85 ą  8%  perf-profile.children.cycles-pp.do_linkat
> >      20.22 ą  4%     +30.6       50.78 ą  8%  perf-profile.children.cycles-pp.vfs_link
> >      20.17 ą  4%     +30.6       50.78 ą  8%  perf-profile.children.cycles-pp.btrfs_link
> >       0.00           +46.9       46.92 ą 13%  perf-profile.children.cycles-pp.log_new_dir_dentries
> >       0.00           +47.9       47.94 ą 13%  perf-profile.children.cycles-pp.btrfs_log_inode
> >       0.00           +49.4       49.36 ą 12%  perf-profile.children.cycles-pp.btrfs_log_new_name
> >       0.00           +49.4       49.36 ą 12%  perf-profile.children.cycles-pp.btrfs_log_inode_parent
> >      29.45 ą  3%      -3.0       26.45 ą  6%  perf-profile.self.cycles-pp.osq_lock
> >       0.60 ą 11%      -0.4        0.19 ą 54%  perf-profile.self.cycles-pp.poll_idle
> >       0.55 ą  4%      -0.3        0.24 ą 25%  perf-profile.self.cycles-pp.btrfs_set_token_32
> >       0.53 ą  8%      -0.3        0.22 ą 28%  perf-profile.self.cycles-pp.btrfs_get_token_32
> >       0.92 ą  5%      -0.3        0.62 ą  9%  perf-profile.self.cycles-pp._raw_spin_lock
> >       0.38 ą  6%      -0.2        0.18 ą 20%  perf-profile.self.cycles-pp.memmove
> >       0.34 ą  5%      -0.2        0.14 ą 36%  perf-profile.self.cycles-pp.find_extent_buffer_nolock
> >       0.41 ą  9%      -0.2        0.23 ą 25%  perf-profile.self.cycles-pp.check_setget_bounds
> >       0.31 ą  7%      -0.2        0.14 ą 35%  perf-profile.self.cycles-pp.up_write
> >       0.17 ą  9%      -0.1        0.05 ą 96%  perf-profile.self.cycles-pp.mutex_lock
> >       0.31 ą  6%      -0.1        0.20 ą 24%  perf-profile.self.cycles-pp.down_write
> >       0.18 ą  9%      -0.1        0.10 ą 25%  perf-profile.self.cycles-pp.__list_del_entry_valid
> >       0.14 ą 10%      -0.1        0.06 ą 47%  perf-profile.self.cycles-pp.__might_sleep
> >       0.14 ą  9%      -0.1        0.08 ą 25%  perf-profile.self.cycles-pp.___might_sleep
> >       0.04 ą 41%      +0.1        0.10 ą 10%  perf-profile.self.cycles-pp.unlock_up
> >       0.00            +0.1        0.07 ą 17%  perf-profile.self.cycles-pp.alloc_extent_buffer
> >       0.10 ą 10%      +0.1        0.18 ą 16%  perf-profile.self.cycles-pp.btrfs_bin_search
> >       0.00            +0.1        0.08 ą 12%  perf-profile.self.cycles-pp.btrfs_read_node_slot
> >       0.23 ą  9%      +0.1        0.31 ą  8%  perf-profile.self.cycles-pp.btrfs_comp_cpu_keys
> >       0.00            +0.1        0.08 ą 23%  perf-profile.self.cycles-pp.xas_load
> >       0.00            +0.1        0.09 ą 19%  perf-profile.self.cycles-pp.tree_search_offset
> >       0.00            +0.1        0.10 ą 22%  perf-profile.self.cycles-pp.btrfs_buffer_uptodate
> >       0.06 ą 15%      +0.1        0.18 ą 15%  perf-profile.self.cycles-pp.set_extent_buffer_dirty
> >       0.00            +0.1        0.13 ą 24%  perf-profile.self.cycles-pp.pagecache_get_page
> >       0.13 ą  5%      +0.1        0.28 ą 17%  perf-profile.self.cycles-pp.btrfs_get_64
> >       0.12 ą  9%      +0.2        0.30 ą 14%  perf-profile.self.cycles-pp.__radix_tree_lookup
> >       0.00            +0.2        0.18 ą 17%  perf-profile.self.cycles-pp.btrfs_search_forward
> >       0.34 ą 10%      +0.5        0.85 ą 17%  perf-profile.self.cycles-pp.generic_bin_search
> >       0.09 ą 11%      +1.0        1.09 ą 16%  perf-profile.self.cycles-pp.read_extent_buffer
> >       0.49 ą  8%      +1.8        2.27 ą 83%  perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
> >
> >
> >
> >                          stress-ng.time.file_system_outputs
> >
> >   6e+07 +-------------------------------------------------------------------+
> >         |               O                                                   |
> >   5e+07 |-+ O    O O  O        O     OO   O O  O     O                      |
> >         |      O            O                 O  O     OO                   |
> >         |    O       O    O  O     O    O          O                        |
> >   4e+07 |-O                      O                                          |
> >         |                                                                   |
> >   3e+07 |-+                                                                 |
> >         |                                                                   |
> >   2e+07 |-+                                                                 |
> >         |                                                                   |
> >         |                                                                   |
> >   1e+07 |-+                                                                 |
> >         |                                                                   |
> >       0 +-------------------------------------------------------------------+
> >
> >
> >                                 stress-ng.link.ops
> >
> >   250 +---------------------------------------------------------------------+
> >       |   +.++.+.+.+.+. +. .+.+. .++.+.+.+.+.+.+ .+.+.+.+.++.+. .+.+.++   +.|
> >       |                +  +     +               +              +            |
> >   200 |-+                                                                   |
> >       |                                                                     |
> >       |                                                                     |
> >   150 |-+                                                                   |
> >       |                                                                     |
> >   100 |-+                                                                   |
> >       |                                                                     |
> >       |                                                                     |
> >    50 |-+    O                    O    O     O      O O O                   |
> >       | O O O  O O O O OO O O O O  O O   O O   OO O                         |
> >       |                                                                     |
> >     0 +---------------------------------------------------------------------+
> >
> >
> >                             stress-ng.link.ops_per_sec
> >
> >   4.5 +---------------------------------------------------------------------+
> >       |                                                                     |
> >     4 |.+.+.++.+.+.+.+. +. .+.   .+ .+.+.+.+.      .+.+.+.++.+.     .+ .+.+.|
> >   3.5 |-+              +  +   +.+  +         +.++.+            +.+.+  +     |
> >       |                                                                     |
> >     3 |-+                                                                   |
> >   2.5 |-+                                                                   |
> >       |                                                                     |
> >     2 |-+                                                                   |
> >   1.5 |-+                                                                   |
> >       |                                                                     |
> >     1 |-+                                                                   |
> >   0.5 |-+ O OO O O O O OO O O O O OO O O O O O OO O O O O                   |
> >       | O                                                                   |
> >     0 +---------------------------------------------------------------------+
> >
> >
> > [*] bisect-good sample
> > [O] bisect-bad  sample
> >
> >
> >
> > Disclaimer:
> > Results have been estimated based on internal Intel analysis and are provided
> > for informational purposes only. Any difference in system hardware or software
> > design or configuration may affect actual performance.
> >
> >
> > ---
> > 0DAY/LKP+ Test Infrastructure                   Open Source Technology Center
> > https://lists.01.org/hyperkitty/list/lkp@lists.01.org       Intel Corporation
> >
> > Thanks,
> > Oliver Sang
> >
>
>
> --
> Filipe David Manana,
>
> “Whether you think you can, or you think you can't — you're right.”



-- 
Filipe David Manana,

“Whether you think you can, or you think you can't — you're right.”

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ