lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <202304292249.93ef60fb-oliver.sang@intel.com>
Date:   Sat, 29 Apr 2023 22:29:00 +0800
From:   kernel test robot <oliver.sang@...el.com>
To:     Boris Burkov <boris@....io>
CC:     <oe-lkp@...ts.linux.dev>, <lkp@...el.com>,
        <linux-kernel@...r.kernel.org>, David Sterba <dsterba@...e.com>,
        <linux-btrfs@...r.kernel.org>, <ying.huang@...el.com>,
        <feng.tang@...el.com>, <fengwei.yin@...el.com>,
        <oliver.sang@...el.com>
Subject: [linus:master] [btrfs]  52bb7a2166:  filebench.sum_operations/s
 -4.0% regression


Hello,

kernel test robot noticed a -4.0% regression of filebench.sum_operations/s on:


commit: 52bb7a2166af490317ce2cca1865b6630e86aca8 ("btrfs: introduce size class to block group allocator")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

testcase: filebench
test machine: 88 threads 2 sockets Intel(R) Xeon(R) Gold 6238M CPU @ 2.10GHz (Cascade Lake) with 128G memory
parameters:

	disk: 1HDD
	fs: btrfs
	test: filemicro_rwritedsync.f
	cpufreq_governor: performance


In addition to that, the commit also has significant impact on the following tests:

+------------------+----------------------------------------------------------------------------------------------------------+
| testcase: change | phoronix-test-suite: phoronix-test-suite.pgbench.BufferTest.SingleThread.ReadWrite.tps 32.8% improvement |
| test machine     | 96 threads 2 sockets Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz (Cascade Lake) with 512G memory            |
| test parameters  | cpufreq_governor=performance                                                                             |
|                  | option_a=Buffer Test                                                                                     |
|                  | option_b=Single Thread                                                                                   |
|                  | option_c=Read Write                                                                                      |
|                  | test=pgbench-1.9.1                                                                                       |
+------------------+----------------------------------------------------------------------------------------------------------+


If you fix the issue, kindly add following tag
| Reported-by: kernel test robot <oliver.sang@...el.com>
| Link: https://lore.kernel.org/oe-lkp/202304292249.93ef60fb-oliver.sang@intel.com


Details are as below:
-------------------------------------------------------------------------------------------------->


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        sudo bin/lkp install job.yaml           # job file is attached in this email
        bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
        sudo bin/lkp run generated-yaml-file

        # if come across any failure that blocks the test,
        # please remove ~/.lkp and /lkp dir to run from a clean state.

=========================================================================================
compiler/cpufreq_governor/disk/fs/kconfig/rootfs/tbox_group/test/testcase:
  gcc-11/performance/1HDD/btrfs/x86_64-rhel-8.3/debian-11.1-x86_64-20220510.cgz/lkp-csl-2sp9/filemicro_rwritedsync.f/filebench

commit: 
  854c2f365d ("btrfs: add more find_free_extent tracepoints")
  52bb7a2166 ("btrfs: introduce size class to block group allocator")

854c2f365d7e0b5b 52bb7a2166af490317ce2cca186 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
      2124            -4.0%       2040        filebench.sum_operations
     35.40            -4.0%      34.00        filebench.sum_operations/s
     28.24            +4.1%      29.40        filebench.sum_time_ms/op
     35.33            -3.8%      34.00        filebench.sum_writes/s
      9459            -2.1%       9260        proc-vmstat.nr_active_file
      9459            -2.1%       9260        proc-vmstat.nr_zone_active_file
     11420            +6.5%      12164        proc-vmstat.pgactivate
    534598 ± 33%     +52.7%     816136 ± 13%  numa-vmstat.node0.nr_file_pages
    440964 ± 41%     -64.1%     158411 ± 70%  numa-vmstat.node1.nr_file_pages
    257210 ±118%     -92.8%      18575 ±139%  numa-vmstat.node1.nr_unevictable
    257210 ±118%     -92.8%      18575 ±139%  numa-vmstat.node1.nr_zone_unevictable
   2135884 ± 33%     +52.8%    3264540 ± 13%  numa-meminfo.node0.FilePages
   3103648 ± 26%     +37.0%    4252443 ± 10%  numa-meminfo.node0.MemUsed
   1761338 ± 41%     -64.0%     633638 ± 70%  numa-meminfo.node1.FilePages
   2522612 ± 32%     -45.6%    1372106 ± 33%  numa-meminfo.node1.MemUsed
   1028842 ±118%     -92.8%      74305 ±139%  numa-meminfo.node1.Unevictable
      9.01 ± 80%      -7.9        1.11 ±223%  perf-profile.calltrace.cycles-pp.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
      7.62 ± 72%      -7.1        0.56 ±223%  perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
      7.62 ± 72%      -7.1        0.56 ±223%  perf-profile.calltrace.cycles-pp.asm_exc_page_fault
      7.62 ± 72%      -7.1        0.56 ±223%  perf-profile.calltrace.cycles-pp.exc_page_fault.asm_exc_page_fault
      7.24 ±104%      -6.7        0.56 ±223%  perf-profile.calltrace.cycles-pp.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault
      5.38 ± 79%      -4.8        0.56 ±223%  perf-profile.calltrace.cycles-pp.do_read_fault.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault
      4.55 ±103%      -4.0        0.56 ±223%  perf-profile.calltrace.cycles-pp.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
      3.76 ±105%      -2.5        1.28 ±223%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.getdents64
      3.76 ±105%      -2.5        1.28 ±223%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64
      3.76 ±105%      -2.5        1.28 ±223%  perf-profile.calltrace.cycles-pp.__x64_sys_getdents64.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64
      3.76 ±105%      -2.5        1.28 ±223%  perf-profile.calltrace.cycles-pp.iterate_dir.__x64_sys_getdents64.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64
      3.76 ±105%      -2.5        1.28 ±223%  perf-profile.calltrace.cycles-pp.getdents64
     12.08 ± 61%      -9.7        2.39 ±141%  perf-profile.children.cycles-pp.do_user_addr_fault
     12.08 ± 61%      -9.7        2.39 ±141%  perf-profile.children.cycles-pp.exc_page_fault
      9.01 ± 80%      -7.9        1.11 ±223%  perf-profile.children.cycles-pp.__handle_mm_fault
      7.24 ±104%      -6.7        0.56 ±223%  perf-profile.children.cycles-pp.do_fault
      9.01 ± 80%      -6.6        2.39 ±141%  perf-profile.children.cycles-pp.handle_mm_fault
      5.38 ± 79%      -4.8        0.56 ±223%  perf-profile.children.cycles-pp.do_read_fault
      5.90 ± 74%      -3.7        2.22 ±141%  perf-profile.children.cycles-pp.__schedule
      3.76 ±105%      -2.5        1.28 ±223%  perf-profile.children.cycles-pp.__x64_sys_getdents64
      3.76 ±105%      -2.5        1.28 ±223%  perf-profile.children.cycles-pp.iterate_dir
      3.76 ±105%      -2.5        1.28 ±223%  perf-profile.children.cycles-pp.getdents64


***************************************************************************************************
lkp-csl-2sp7: 96 threads 2 sockets Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz (Cascade Lake) with 512G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/option_a/option_b/option_c/rootfs/tbox_group/test/testcase:
  gcc-11/performance/x86_64-rhel-8.3/Buffer Test/Single Thread/Read Write/debian-x86_64-phoronix/lkp-csl-2sp7/pgbench-1.9.1/phoronix-test-suite

commit: 
  854c2f365d ("btrfs: add more find_free_extent tracepoints")
  52bb7a2166 ("btrfs: introduce size class to block group allocator")

854c2f365d7e0b5b 52bb7a2166af490317ce2cca186 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
     26.51           +32.8%      35.21 ±  9%  phoronix-test-suite.pgbench.BufferTest.SingleThread.ReadWrite.tps
    117794            +3.2%     121555        phoronix-test-suite.time.voluntary_context_switches
      0.54 ± 11%      -0.1        0.44 ± 10%  perf-profile.children.cycles-pp.get_next_timer_interrupt
      0.10 ± 10%      -0.0        0.06 ± 47%  perf-profile.children.cycles-pp.fault_in_iov_iter_readable
      0.09 ± 10%      -0.0        0.06 ± 47%  perf-profile.children.cycles-pp.fault_in_readable
      0.05 ± 55%      +0.0        0.09 ± 11%  perf-profile.children.cycles-pp.update_rt_rq_load_avg
      0.03 ±122%      +0.0        0.08 ± 28%  perf-profile.children.cycles-pp.lock_and_cleanup_extent_if_need
      0.18 ± 50%      +0.1        0.31 ± 27%  perf-profile.children.cycles-pp.ttwu_do_wakeup
      3.29 ±  4%      -0.3        3.02 ±  4%  perf-profile.self.cycles-pp.cpuidle_enter_state
      0.25 ±  8%      -0.1        0.18 ± 20%  perf-profile.self.cycles-pp.do_idle
      0.08 ± 11%      -0.0        0.04 ± 71%  perf-profile.self.cycles-pp.fault_in_readable
      0.11 ± 14%      +0.0        0.14 ± 14%  perf-profile.self.cycles-pp.load_balance
      0.05 ± 55%      +0.0        0.09 ± 11%  perf-profile.self.cycles-pp.update_rt_rq_load_avg





Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests



View attachment "config-6.2.0-rc8-00021-g52bb7a2166af" of type "text/plain" (156271 bytes)

View attachment "job-script" of type "text/plain" (8239 bytes)

View attachment "job.yaml" of type "text/plain" (5964 bytes)

View attachment "reproduce" of type "text/plain" (505 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ