lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Message-ID: <202311211612.33647109-oliver.sang@intel.com>
Date:   Tue, 21 Nov 2023 16:38:34 +0800
From:   kernel test robot <oliver.sang@...el.com>
To:     Yosry Ahmed <yosryahmed@...gle.com>
CC:     <oe-lkp@...ts.linux.dev>, <lkp@...el.com>,
        <linux-kernel@...r.kernel.org>,
        Andrew Morton <akpm@...ux-foundation.org>,
        Johannes Weiner <hannes@...xchg.org>,
        Michal Hocko <mhocko@...nel.org>,
        Michal Koutný <mkoutny@...e.com>,
        Muchun Song <muchun.song@...ux.dev>,
        Roman Gushchin <roman.gushchin@...ux.dev>,
        "Shakeel Butt" <shakeelb@...gle.com>, <cgroups@...r.kernel.org>,
        <linux-mm@...ck.org>, <ying.huang@...el.com>,
        <feng.tang@...el.com>, <fengwei.yin@...el.com>,
        <oliver.sang@...el.com>
Subject: [linus:master] [mm]  7bd5bc3ce9:  aim7.jobs-per-min 4.4% improvement



Hello,

kernel test robot noticed a 4.4% improvement of aim7.jobs-per-min on:


commit: 7bd5bc3ce9632aefd0eed33a19212a2e55c0f873 ("mm: memcg: normalize the value passed into memcg_rstat_updated()")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

testcase: aim7
test machine: 128 threads 2 sockets Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz (Ice Lake) with 256G memory
parameters:

	disk: 1BRD_48G
	fs: ext4
	test: disk_rr
	load: 3000
	cpufreq_governor: performance


In addition to that, the commit also has significant impact on the following tests:

+------------------+--------------------------------------------------------------------------------------------+
| testcase: change | aim7: aim7.jobs-per-min 2.2% improvement                                                   |
| test machine     | 128 threads 2 sockets Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz (Ice Lake) with 256G memory |
| test parameters  | cpufreq_governor=performance                                                               |
|                  | disk=1BRD_48G                                                                              |
|                  | fs=ext4                                                                                    |
|                  | load=3000                                                                                  |
|                  | test=disk_cp                                                                               |
+------------------+--------------------------------------------------------------------------------------------+




Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20231121/202311211612.33647109-oliver.sang@intel.com

=========================================================================================
compiler/cpufreq_governor/disk/fs/kconfig/load/rootfs/tbox_group/test/testcase:
  gcc-12/performance/1BRD_48G/ext4/x86_64-rhel-8.3/3000/debian-11.1-x86_64-20220510.cgz/lkp-icl-2sp2/disk_rr/aim7

commit: 
  ff841a06c8 ("mm: memcg: refactor page state unit helpers")
  7bd5bc3ce9 ("mm: memcg: normalize the value passed into memcg_rstat_updated()")

ff841a06c844b055 7bd5bc3ce9632aefd0eed33a192 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
    838.50 ± 27%     -47.7%     438.67 ± 50%  perf-sched.wait_and_delay.count.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
     21.24 ± 22%     +36.9%      29.06 ± 20%  sched_debug.cfs_rq:/.util_est_enqueued.avg
    620370            +4.4%     647378        aim7.jobs-per-min
    153681 ±  6%      -8.1%     141236 ±  4%  aim7.time.involuntary_context_switches
      2286            -6.8%       2132        aim7.time.system_time
      2.79            -4.6%       2.66        perf-stat.overall.cpi
      0.01 ± 19%      -0.0        0.01 ± 13%  perf-stat.overall.dTLB-load-miss-rate%
      0.36            +4.9%       0.38        perf-stat.overall.ipc
     21.96            -0.8       21.14        perf-profile.calltrace.cycles-pp.folio_batch_move_lru.folio_add_lru.filemap_add_folio.__filemap_get_folio.ext4_da_write_begin
     22.01            -0.8       21.19        perf-profile.calltrace.cycles-pp.folio_add_lru.filemap_add_folio.__filemap_get_folio.ext4_da_write_begin.generic_perform_write
     21.40            -0.8       20.58        perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.folio_add_lru
     21.42            -0.8       20.61        perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.folio_batch_move_lru.folio_add_lru.filemap_add_folio.__filemap_get_folio
     21.41            -0.8       20.60        perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.folio_add_lru.filemap_add_folio
     23.15            -0.8       22.37        perf-profile.calltrace.cycles-pp.filemap_add_folio.__filemap_get_folio.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter
     24.45            -0.7       23.76        perf-profile.calltrace.cycles-pp.__filemap_get_folio.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter.vfs_write
     24.40            -0.7       23.75        perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.release_pages.__folio_batch_release.truncate_inode_pages_range.ext4_evict_inode
     24.38            -0.7       23.72        perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.release_pages.__folio_batch_release
     24.40            -0.7       23.74        perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.release_pages.__folio_batch_release.truncate_inode_pages_range
     27.97            -0.6       27.32        perf-profile.calltrace.cycles-pp.__folio_batch_release.truncate_inode_pages_range.ext4_evict_inode.evict.__dentry_kill
     25.20            -0.6       24.56        perf-profile.calltrace.cycles-pp.release_pages.__folio_batch_release.truncate_inode_pages_range.ext4_evict_inode.evict
     30.21            -0.5       29.69        perf-profile.calltrace.cycles-pp.truncate_inode_pages_range.ext4_evict_inode.evict.__dentry_kill.dentry_kill
     30.32            -0.5       29.81        perf-profile.calltrace.cycles-pp.evict.__dentry_kill.dentry_kill.dput.__fput
     30.31            -0.5       29.80        perf-profile.calltrace.cycles-pp.ext4_evict_inode.evict.__dentry_kill.dentry_kill.dput
     30.36            -0.5       29.85        perf-profile.calltrace.cycles-pp.__fput.__x64_sys_close.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close
     30.34            -0.5       29.83        perf-profile.calltrace.cycles-pp.dentry_kill.dput.__fput.__x64_sys_close.do_syscall_64
     30.38            -0.5       29.87        perf-profile.calltrace.cycles-pp.__close
     30.35            -0.5       29.84        perf-profile.calltrace.cycles-pp.dput.__fput.__x64_sys_close.do_syscall_64.entry_SYSCALL_64_after_hwframe
     30.37            -0.5       29.87        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__close
     30.37            -0.5       29.87        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close
     30.36            -0.5       29.86        perf-profile.calltrace.cycles-pp.__x64_sys_close.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close
     30.33            -0.5       29.83        perf-profile.calltrace.cycles-pp.__dentry_kill.dentry_kill.dput.__fput.__x64_sys_close
     27.52            -0.5       27.03        perf-profile.calltrace.cycles-pp.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write
     32.56            -0.5       32.09        perf-profile.calltrace.cycles-pp.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write.do_syscall_64
     33.62            -0.4       33.22        perf-profile.calltrace.cycles-pp.ext4_buffered_write_iter.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
     34.80            -0.3       34.52        perf-profile.calltrace.cycles-pp.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
      0.76            -0.2        0.54        perf-profile.calltrace.cycles-pp.do_flush_stats.mem_cgroup_wb_stats.balance_dirty_pages.balance_dirty_pages_ratelimited_flags.generic_perform_write
      0.76            -0.2        0.54        perf-profile.calltrace.cycles-pp.cgroup_rstat_flush.do_flush_stats.mem_cgroup_wb_stats.balance_dirty_pages.balance_dirty_pages_ratelimited_flags
      0.78            -0.2        0.56        perf-profile.calltrace.cycles-pp.mem_cgroup_wb_stats.balance_dirty_pages.balance_dirty_pages_ratelimited_flags.generic_perform_write.ext4_buffered_write_iter
      0.80            -0.2        0.59        perf-profile.calltrace.cycles-pp.balance_dirty_pages.balance_dirty_pages_ratelimited_flags.generic_perform_write.ext4_buffered_write_iter.vfs_write
      0.75            -0.2        0.54        perf-profile.calltrace.cycles-pp.cgroup_rstat_flush_locked.cgroup_rstat_flush.do_flush_stats.mem_cgroup_wb_stats.balance_dirty_pages
      1.16            -0.2        0.98        perf-profile.calltrace.cycles-pp.balance_dirty_pages_ratelimited_flags.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write
      0.56 ±  2%      +0.0        0.58        perf-profile.calltrace.cycles-pp.workingset_age_nonresident.workingset_activation.folio_mark_accessed.filemap_read.vfs_read
      0.74 ±  2%      +0.0        0.77        perf-profile.calltrace.cycles-pp.workingset_activation.folio_mark_accessed.filemap_read.vfs_read.ksys_read
      1.10            +0.0        1.14        perf-profile.calltrace.cycles-pp.__filemap_add_folio.filemap_add_folio.__filemap_get_folio.ext4_da_write_begin.generic_perform_write
      0.64            +0.0        0.68        perf-profile.calltrace.cycles-pp.copyin.copy_page_from_iter_atomic.generic_perform_write.ext4_buffered_write_iter.vfs_write
      0.54            +0.0        0.58        perf-profile.calltrace.cycles-pp.folio_alloc.__filemap_get_folio.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter
      0.98            +0.0        1.03        perf-profile.calltrace.cycles-pp.zero_user_segments.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter
      0.58            +0.0        0.63        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.llseek
      0.95            +0.1        1.00        perf-profile.calltrace.cycles-pp.memset_orig.zero_user_segments.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write
      0.84            +0.1        0.89        perf-profile.calltrace.cycles-pp.copy_page_from_iter_atomic.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write
      0.75            +0.1        0.81        perf-profile.calltrace.cycles-pp.filemap_get_read_batch.filemap_get_pages.filemap_read.vfs_read.ksys_read
      0.71            +0.1        0.77        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.llseek
      0.90            +0.1        0.97        perf-profile.calltrace.cycles-pp.filemap_get_pages.filemap_read.vfs_read.ksys_read.do_syscall_64
      0.91            +0.1        0.98 ±  2%  perf-profile.calltrace.cycles-pp.ext4_da_map_blocks.ext4_da_get_block_prep.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write
      0.93            +0.1        1.00 ±  2%  perf-profile.calltrace.cycles-pp.ext4_da_get_block_prep.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter
      2.17            +0.1        2.25        perf-profile.calltrace.cycles-pp.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write
      1.63            +0.1        1.72        perf-profile.calltrace.cycles-pp.truncate_cleanup_folio.truncate_inode_pages_range.ext4_evict_inode.evict.__dentry_kill
      1.58            +0.1        1.69        perf-profile.calltrace.cycles-pp.copyout._copy_to_iter.copy_page_to_iter.filemap_read.vfs_read
      1.42            +0.1        1.53        perf-profile.calltrace.cycles-pp.llseek
      1.71            +0.1        1.83        perf-profile.calltrace.cycles-pp._copy_to_iter.copy_page_to_iter.filemap_read.vfs_read.ksys_read
      1.81            +0.1        1.93        perf-profile.calltrace.cycles-pp.copy_page_to_iter.filemap_read.vfs_read.ksys_read.do_syscall_64
      2.74            +0.2        2.90        perf-profile.calltrace.cycles-pp.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter.vfs_write
     29.25            +0.3       29.56        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
     29.38            +0.3       29.71        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.read
     30.13            +0.4       30.51        perf-profile.calltrace.cycles-pp.read
     70.96            -1.7       69.29        perf-profile.children.cycles-pp._raw_spin_lock_irqsave
     70.80            -1.7       69.14        perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
     70.75            -1.6       69.12        perf-profile.children.cycles-pp.folio_lruvec_lock_irqsave
     47.66            -1.0       46.66        perf-profile.children.cycles-pp.folio_batch_move_lru
     22.05            -0.8       21.23        perf-profile.children.cycles-pp.folio_add_lru
     23.15            -0.8       22.38        perf-profile.children.cycles-pp.filemap_add_folio
     24.49            -0.7       23.80        perf-profile.children.cycles-pp.__filemap_get_folio
     27.97            -0.6       27.32        perf-profile.children.cycles-pp.__folio_batch_release
     25.44            -0.6       24.83        perf-profile.children.cycles-pp.release_pages
     30.21            -0.5       29.70        perf-profile.children.cycles-pp.truncate_inode_pages_range
     30.32            -0.5       29.81        perf-profile.children.cycles-pp.evict
     30.34            -0.5       29.83        perf-profile.children.cycles-pp.__dentry_kill
     30.31            -0.5       29.80        perf-profile.children.cycles-pp.ext4_evict_inode
     30.35            -0.5       29.84        perf-profile.children.cycles-pp.dput
     30.36            -0.5       29.85        perf-profile.children.cycles-pp.__fput
     30.34            -0.5       29.83        perf-profile.children.cycles-pp.dentry_kill
     30.36            -0.5       29.86        perf-profile.children.cycles-pp.__x64_sys_close
     30.38            -0.5       29.87        perf-profile.children.cycles-pp.__close
     27.56            -0.5       27.07        perf-profile.children.cycles-pp.ext4_da_write_begin
     32.68            -0.5       32.20        perf-profile.children.cycles-pp.generic_perform_write
     33.67            -0.4       33.28        perf-profile.children.cycles-pp.ext4_buffered_write_iter
     34.90            -0.3       34.61        perf-profile.children.cycles-pp.vfs_write
      0.95 ±  4%      -0.2        0.72 ±  4%  perf-profile.children.cycles-pp.cgroup_rstat_updated
      0.76            -0.2        0.54        perf-profile.children.cycles-pp.do_flush_stats
     97.02            -0.2       96.80        perf-profile.children.cycles-pp.do_syscall_64
      0.76            -0.2        0.54        perf-profile.children.cycles-pp.cgroup_rstat_flush
      0.78            -0.2        0.56        perf-profile.children.cycles-pp.mem_cgroup_wb_stats
      0.75            -0.2        0.54        perf-profile.children.cycles-pp.cgroup_rstat_flush_locked
      0.80            -0.2        0.59        perf-profile.children.cycles-pp.balance_dirty_pages
     97.31            -0.2       97.12        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
      1.19            -0.2        1.02        perf-profile.children.cycles-pp.balance_dirty_pages_ratelimited_flags
      0.56            -0.2        0.40        perf-profile.children.cycles-pp.mem_cgroup_css_rstat_flush
      1.00 ±  2%      -0.1        0.87 ±  2%  perf-profile.children.cycles-pp.__mod_memcg_lruvec_state
      1.20            -0.1        1.14 ±  2%  perf-profile.children.cycles-pp.__mod_lruvec_page_state
      0.29 ±  3%      -0.0        0.24 ±  3%  perf-profile.children.cycles-pp.__count_memcg_events
      0.33 ±  3%      -0.0        0.30        perf-profile.children.cycles-pp.charge_memcg
      0.40            -0.0        0.37        perf-profile.children.cycles-pp.__mem_cgroup_charge
      0.08 ±  9%      -0.0        0.05 ±  8%  perf-profile.children.cycles-pp.cgroup_base_stat_flush
      0.14            +0.0        0.15        perf-profile.children.cycles-pp.filemap_unaccount_folio
      0.05            +0.0        0.06        perf-profile.children.cycles-pp.generic_file_read_iter
      0.12            +0.0        0.13        perf-profile.children.cycles-pp.syscall_exit_to_user_mode_prepare
      0.08            +0.0        0.09        perf-profile.children.cycles-pp.timestamp_truncate
      0.06            +0.0        0.07        perf-profile.children.cycles-pp.try_charge_memcg
      0.14 ±  2%      +0.0        0.15        perf-profile.children.cycles-pp.up_write
      0.11            +0.0        0.12 ±  3%  perf-profile.children.cycles-pp.node_dirty_ok
      0.10 ±  5%      +0.0        0.11        perf-profile.children.cycles-pp.ext4_dirty_inode
      0.21 ±  2%      +0.0        0.23 ±  3%  perf-profile.children.cycles-pp.ext4_generic_write_checks
      0.19            +0.0        0.21 ±  2%  perf-profile.children.cycles-pp.syscall_enter_from_user_mode
      0.16 ±  2%      +0.0        0.18 ±  2%  perf-profile.children.cycles-pp.xas_start
      0.19 ±  2%      +0.0        0.21 ±  2%  perf-profile.children.cycles-pp.__mod_node_page_state
      0.12 ±  4%      +0.0        0.14 ±  2%  perf-profile.children.cycles-pp.tlb_finish_mmu
      0.12 ±  4%      +0.0        0.14 ±  2%  perf-profile.children.cycles-pp.tlb_batch_pages_flush
      0.26            +0.0        0.27 ±  2%  perf-profile.children.cycles-pp.__mod_lruvec_state
      0.25            +0.0        0.27        perf-profile.children.cycles-pp.inode_needs_update_time
      0.23 ±  2%      +0.0        0.25 ±  2%  perf-profile.children.cycles-pp.__list_del_entry_valid_or_report
      0.16 ±  4%      +0.0        0.18 ±  2%  perf-profile.children.cycles-pp.__mark_inode_dirty
      0.24            +0.0        0.26 ±  2%  perf-profile.children.cycles-pp.ext4_da_reserve_space
      0.56 ±  2%      +0.0        0.58        perf-profile.children.cycles-pp.workingset_age_nonresident
      0.12 ±  6%      +0.0        0.14 ±  3%  perf-profile.children.cycles-pp.generic_update_time
      0.44            +0.0        0.46        perf-profile.children.cycles-pp.filemap_get_entry
      0.15 ±  3%      +0.0        0.18 ±  2%  perf-profile.children.cycles-pp.__mmput
      0.15 ±  3%      +0.0        0.18 ±  2%  perf-profile.children.cycles-pp.exit_mmap
      0.30            +0.0        0.32 ±  2%  perf-profile.children.cycles-pp.disk_rr
      0.15 ±  3%      +0.0        0.17 ±  2%  perf-profile.children.cycles-pp.exit_mm
      0.38 ±  2%      +0.0        0.40 ±  2%  perf-profile.children.cycles-pp.get_page_from_freelist
      0.38            +0.0        0.41 ±  2%  perf-profile.children.cycles-pp.fault_in_readable
      0.16 ±  3%      +0.0        0.18 ±  3%  perf-profile.children.cycles-pp.__x64_sys_exit_group
      0.16 ±  3%      +0.0        0.18 ±  3%  perf-profile.children.cycles-pp.do_group_exit
      0.16 ±  3%      +0.0        0.18 ±  3%  perf-profile.children.cycles-pp.do_exit
      0.35            +0.0        0.38 ±  2%  perf-profile.children.cycles-pp.atime_needs_update
      0.35            +0.0        0.38 ±  2%  perf-profile.children.cycles-pp.__cond_resched
      0.44            +0.0        0.46        perf-profile.children.cycles-pp.fault_in_iov_iter_readable
      0.40            +0.0        0.42        perf-profile.children.cycles-pp.delete_from_page_cache_batch
      0.41 ±  2%      +0.0        0.44        perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      0.28            +0.0        0.32        perf-profile.children.cycles-pp.xas_store
      0.37 ±  2%      +0.0        0.40 ±  2%  perf-profile.children.cycles-pp.ksys_lseek
      0.46            +0.0        0.50        perf-profile.children.cycles-pp.__alloc_pages
      0.42            +0.0        0.45 ±  2%  perf-profile.children.cycles-pp.touch_atime
      0.20 ±  3%      +0.0        0.24 ±  5%  perf-profile.children.cycles-pp.ext4_file_read_iter
      0.54            +0.0        0.58        perf-profile.children.cycles-pp.folio_alloc
      0.20 ±  2%      +0.0        0.23 ±  5%  perf-profile.children.cycles-pp.ext4_file_write_iter
      1.12            +0.0        1.16        perf-profile.children.cycles-pp.__filemap_add_folio
      0.69            +0.0        0.73        perf-profile.children.cycles-pp.copyin
      0.42 ±  3%      +0.0        0.46 ±  2%  perf-profile.children.cycles-pp.__fget_light
      0.40 ±  2%      +0.0        0.44        perf-profile.children.cycles-pp.apparmor_file_permission
      0.46            +0.0        0.50        perf-profile.children.cycles-pp.file_modified
      0.62            +0.0        0.66        perf-profile.children.cycles-pp.xas_load
      0.32 ±  2%      +0.0        0.36 ±  4%  perf-profile.children.cycles-pp.percpu_counter_add_batch
      0.52            +0.0        0.56        perf-profile.children.cycles-pp.security_file_permission
      0.98            +0.0        1.03        perf-profile.children.cycles-pp.memset_orig
      0.98            +0.0        1.03        perf-profile.children.cycles-pp.zero_user_segments
      0.90            +0.1        0.95        perf-profile.children.cycles-pp.copy_page_from_iter_atomic
      0.59 ±  2%      +0.1        0.64 ±  2%  perf-profile.children.cycles-pp.__fdget_pos
      0.80            +0.1        0.85        perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.77            +0.1        0.83        perf-profile.children.cycles-pp.filemap_get_read_batch
      0.92            +0.1        0.99 ±  2%  perf-profile.children.cycles-pp.ext4_da_map_blocks
      0.91            +0.1        0.98        perf-profile.children.cycles-pp.filemap_get_pages
      0.94            +0.1        1.01 ±  2%  perf-profile.children.cycles-pp.ext4_da_get_block_prep
      1.01            +0.1        1.08        perf-profile.children.cycles-pp.__entry_text_start
      1.64            +0.1        1.72        perf-profile.children.cycles-pp.truncate_cleanup_folio
      1.63            +0.1        1.74        perf-profile.children.cycles-pp.copyout
      1.73            +0.1        1.84        perf-profile.children.cycles-pp._copy_to_iter
      1.82            +0.1        1.95        perf-profile.children.cycles-pp.copy_page_to_iter
      1.68            +0.1        1.81        perf-profile.children.cycles-pp.llseek
      2.75            +0.2        2.92        perf-profile.children.cycles-pp.ext4_block_write_begin
     30.38            +0.4       30.79        perf-profile.children.cycles-pp.read
     70.80            -1.7       69.14        perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
      0.81 ±  5%      -0.2        0.61 ±  5%  perf-profile.self.cycles-pp.cgroup_rstat_updated
      0.54            -0.2        0.39        perf-profile.self.cycles-pp.mem_cgroup_css_rstat_flush
      0.08 ±  4%      -0.0        0.05 ±  8%  perf-profile.self.cycles-pp.cgroup_rstat_flush_locked
      0.08 ±  9%      -0.0        0.05 ±  8%  perf-profile.self.cycles-pp.cgroup_base_stat_flush
      0.19 ±  2%      -0.0        0.17 ±  2%  perf-profile.self.cycles-pp._raw_spin_lock_irqsave
      0.19 ±  2%      +0.0        0.20 ±  2%  perf-profile.self.cycles-pp.ext4_block_write_begin
      0.18            +0.0        0.19        perf-profile.self.cycles-pp.__mod_node_page_state
      0.20            +0.0        0.21        perf-profile.self.cycles-pp.block_invalidate_folio
      0.23            +0.0        0.24        perf-profile.self.cycles-pp.ext4_da_do_write_end
      0.17            +0.0        0.18 ±  2%  perf-profile.self.cycles-pp.__fdget_pos
      0.26            +0.0        0.27        perf-profile.self.cycles-pp.generic_perform_write
      0.21 ±  2%      +0.0        0.23        perf-profile.self.cycles-pp.xas_load
      0.21            +0.0        0.22 ±  2%  perf-profile.self.cycles-pp.__cond_resched
      0.15 ±  3%      +0.0        0.17 ±  2%  perf-profile.self.cycles-pp.xas_store
      0.28            +0.0        0.30        perf-profile.self.cycles-pp.__filemap_get_folio
      0.17 ±  2%      +0.0        0.18 ±  2%  perf-profile.self.cycles-pp.folio_mark_accessed
      0.37            +0.0        0.39        perf-profile.self.cycles-pp.fault_in_readable
      0.24            +0.0        0.26 ±  2%  perf-profile.self.cycles-pp.syscall_return_via_sysret
      0.29            +0.0        0.31 ±  2%  perf-profile.self.cycles-pp.read
      0.21            +0.0        0.23 ±  2%  perf-profile.self.cycles-pp.do_syscall_64
      0.56 ±  2%      +0.0        0.58        perf-profile.self.cycles-pp.workingset_age_nonresident
      0.27            +0.0        0.29        perf-profile.self.cycles-pp.llseek
      0.27            +0.0        0.29        perf-profile.self.cycles-pp.__entry_text_start
      0.22 ±  2%      +0.0        0.25 ±  3%  perf-profile.self.cycles-pp.ext4_da_write_begin
      0.31            +0.0        0.33        perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.28 ±  2%      +0.0        0.31        perf-profile.self.cycles-pp.apparmor_file_permission
      0.46            +0.0        0.49        perf-profile.self.cycles-pp.vfs_read
      0.27 ±  3%      +0.0        0.30 ±  3%  perf-profile.self.cycles-pp.balance_dirty_pages_ratelimited_flags
      0.40 ±  3%      +0.0        0.43 ±  3%  perf-profile.self.cycles-pp.__fget_light
      0.19 ±  3%      +0.0        0.23 ±  4%  perf-profile.self.cycles-pp.ext4_file_read_iter
      0.19 ±  2%      +0.0        0.22 ±  5%  perf-profile.self.cycles-pp.ext4_file_write_iter
      0.67            +0.0        0.72        perf-profile.self.cycles-pp.__block_commit_write
      0.55            +0.0        0.59        perf-profile.self.cycles-pp.vfs_write
      0.28 ±  2%      +0.0        0.32 ±  4%  perf-profile.self.cycles-pp.percpu_counter_add_batch
      0.68            +0.0        0.72        perf-profile.self.cycles-pp.copyin
      0.50            +0.0        0.54        perf-profile.self.cycles-pp.filemap_get_read_batch
      0.98            +0.0        1.02        perf-profile.self.cycles-pp.memset_orig
      0.78            +0.0        0.82        perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.64            +0.0        0.68        perf-profile.self.cycles-pp.filemap_read
      0.29 ±  5%      +0.0        0.34 ±  4%  perf-profile.self.cycles-pp.__mod_memcg_lruvec_state
      0.00            +0.1        0.05        perf-profile.self.cycles-pp.try_charge_memcg
      1.61            +0.1        1.72        perf-profile.self.cycles-pp.copyout


***************************************************************************************************
lkp-icl-2sp2: 128 threads 2 sockets Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz (Ice Lake) with 256G memory
=========================================================================================
compiler/cpufreq_governor/disk/fs/kconfig/load/rootfs/tbox_group/test/testcase:
  gcc-12/performance/1BRD_48G/ext4/x86_64-rhel-8.3/3000/debian-11.1-x86_64-20220510.cgz/lkp-icl-2sp2/disk_cp/aim7

commit: 
  ff841a06c8 ("mm: memcg: refactor page state unit helpers")
  7bd5bc3ce9 ("mm: memcg: normalize the value passed into memcg_rstat_updated()")

ff841a06c844b055 7bd5bc3ce9632aefd0eed33a192 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
      1.19 ±  5%      +9.1%       1.30 ±  3%  iostat.cpu.user
      1.25 ±  5%      +0.1        1.38 ±  4%  mpstat.cpu.all.usr%
     11.57 ± 62%     -65.1%       4.04 ± 65%  perf-sched.wait_and_delay.avg.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
     39.78 ±  4%     +12.9       52.64 ±  9%  turbostat.PKG_%
    895233            +2.2%     915336        aim7.jobs-per-min
     48750 ±  7%     -21.3%      38348 ±  9%  aim7.time.involuntary_context_switches
      1176            -5.7%       1109        aim7.time.system_time
      8891 ± 32%     -78.2%       1936 ± 16%  sched_debug.cfs_rq:/.load.avg
    886484 ± 42%     -96.6%      30376 ± 20%  sched_debug.cfs_rq:/.load.max
     78636 ± 40%     -92.5%       5891 ± 12%  sched_debug.cfs_rq:/.load.stddev
      2.18 ±  2%      -4.5%       2.09        perf-stat.i.cpi
      0.54 ±  4%      +8.8%       0.59        perf-stat.i.ipc
      1.90            -4.3%       1.82 ±  2%  perf-stat.overall.MPKI
      1.75            -5.1%       1.66        perf-stat.overall.cpi
      0.57            +5.4%       0.60        perf-stat.overall.ipc
     62989 ± 18%     -38.7%      38627 ± 33%  numa-vmstat.node0.nr_anon_pages
     63952 ± 18%     -38.0%      39673 ± 32%  numa-vmstat.node0.nr_inactive_anon
     63952 ± 18%     -38.0%      39673 ± 32%  numa-vmstat.node0.nr_zone_inactive_anon
     64470 ± 18%     +39.2%      89730 ± 14%  numa-vmstat.node1.nr_anon_pages
     66787 ± 17%     +36.8%      91333 ± 14%  numa-vmstat.node1.nr_inactive_anon
     11264 ±  5%    +216.1%      35608 ± 48%  numa-vmstat.node1.nr_kernel_stack
    834.42 ± 17%   +2010.2%      17608 ± 67%  numa-vmstat.node1.nr_page_table_pages
     66787 ± 17%     +36.8%      91333 ± 14%  numa-vmstat.node1.nr_zone_inactive_anon
    251945 ± 18%     -38.7%     154531 ± 33%  numa-meminfo.node0.AnonPages
    305057 ± 15%     -28.9%     216862 ± 22%  numa-meminfo.node0.AnonPages.max
     14698 ±  9%     -13.9%      12660 ±  7%  numa-meminfo.node0.Dirty
    271685 ± 17%     -36.2%     173249 ± 29%  numa-meminfo.node0.Inactive
    255806 ± 18%     -38.0%     158709 ± 32%  numa-meminfo.node0.Inactive(anon)
    291694 ± 10%     -25.8%     216523 ± 19%  numa-meminfo.node0.Slab
    257857 ± 18%     +39.2%     358935 ± 14%  numa-meminfo.node1.AnonPages
    282059 ± 16%     +34.8%     380109 ± 13%  numa-meminfo.node1.Inactive
    267122 ± 17%     +36.8%     365352 ± 14%  numa-meminfo.node1.Inactive(anon)
     11264 ±  5%    +216.2%      35619 ± 48%  numa-meminfo.node1.KernelStack
      3337 ± 17%   +2011.4%      70463 ± 67%  numa-meminfo.node1.PageTables
     24.42            -2.3       22.14 ±  2%  perf-profile.calltrace.cycles-pp.__folio_batch_release.truncate_inode_pages_range.ext4_evict_inode.evict.__dentry_kill
     21.77            -2.2       19.59 ±  2%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.release_pages.__folio_batch_release
     21.80            -2.2       19.62 ±  2%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.release_pages.__folio_batch_release.truncate_inode_pages_range
     21.81            -2.2       19.64 ±  2%  perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.release_pages.__folio_batch_release.truncate_inode_pages_range.ext4_evict_inode
     23.12            -2.2       20.96        perf-profile.calltrace.cycles-pp.release_pages.__folio_batch_release.truncate_inode_pages_range.ext4_evict_inode.evict
     28.82            -2.0       26.78        perf-profile.calltrace.cycles-pp.truncate_inode_pages_range.ext4_evict_inode.evict.__dentry_kill.dentry_kill
     29.06            -2.0       27.04        perf-profile.calltrace.cycles-pp.evict.__dentry_kill.dentry_kill.dput.__fput
     29.03            -2.0       27.01        perf-profile.calltrace.cycles-pp.ext4_evict_inode.evict.__dentry_kill.dentry_kill.dput
     29.08            -2.0       27.06        perf-profile.calltrace.cycles-pp.__dentry_kill.dentry_kill.dput.__fput.__x64_sys_close
     29.08            -2.0       27.06        perf-profile.calltrace.cycles-pp.dentry_kill.dput.__fput.__x64_sys_close.do_syscall_64
     29.09            -2.0       27.07        perf-profile.calltrace.cycles-pp.dput.__fput.__x64_sys_close.do_syscall_64.entry_SYSCALL_64_after_hwframe
     29.14            -2.0       27.12        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__close
     29.14            -2.0       27.12        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close
     29.14            -2.0       27.12        perf-profile.calltrace.cycles-pp.__close
     29.12            -2.0       27.11        perf-profile.calltrace.cycles-pp.__x64_sys_close.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close
     29.11            -2.0       27.09        perf-profile.calltrace.cycles-pp.__fput.__x64_sys_close.do_syscall_64.entry_SYSCALL_64_after_hwframe.__close
     20.28            -1.3       18.93        perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.folio_batch_move_lru.folio_add_lru.filemap_add_folio.__filemap_get_folio
     23.94            -1.3       22.60        perf-profile.calltrace.cycles-pp.filemap_add_folio.__filemap_get_folio.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter
     20.23            -1.3       18.89        perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.folio_add_lru
     20.26            -1.3       18.92        perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.folio_add_lru.filemap_add_folio
     21.38            -1.3       20.05        perf-profile.calltrace.cycles-pp.folio_add_lru.filemap_add_folio.__filemap_get_folio.ext4_da_write_begin.generic_perform_write
     21.29            -1.3       19.96        perf-profile.calltrace.cycles-pp.folio_batch_move_lru.folio_add_lru.filemap_add_folio.__filemap_get_folio.ext4_da_write_begin
     26.64            -1.2       25.46        perf-profile.calltrace.cycles-pp.__filemap_get_folio.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter.vfs_write
     33.56            -0.7       32.84        perf-profile.calltrace.cycles-pp.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write
      0.80            -0.2        0.61        perf-profile.calltrace.cycles-pp.do_flush_stats.mem_cgroup_wb_stats.balance_dirty_pages.balance_dirty_pages_ratelimited_flags.generic_perform_write
      0.79            -0.2        0.61        perf-profile.calltrace.cycles-pp.cgroup_rstat_flush.do_flush_stats.mem_cgroup_wb_stats.balance_dirty_pages.balance_dirty_pages_ratelimited_flags
      0.83            -0.2        0.66 ±  2%  perf-profile.calltrace.cycles-pp.mem_cgroup_wb_stats.balance_dirty_pages.balance_dirty_pages_ratelimited_flags.generic_perform_write.ext4_buffered_write_iter
      0.78            -0.2        0.61 ±  2%  perf-profile.calltrace.cycles-pp.cgroup_rstat_flush_locked.cgroup_rstat_flush.do_flush_stats.mem_cgroup_wb_stats.balance_dirty_pages
      0.88            -0.2        0.71 ±  2%  perf-profile.calltrace.cycles-pp.balance_dirty_pages.balance_dirty_pages_ratelimited_flags.generic_perform_write.ext4_buffered_write_iter.vfs_write
      1.24 ±  2%      -0.1        1.12 ±  2%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.lru_add_drain_cpu
      1.24 ±  2%      -0.1        1.12 ±  2%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.lru_add_drain_cpu.__folio_batch_release
      1.24 ±  2%      -0.1        1.12 ±  2%  perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.folio_batch_move_lru.lru_add_drain_cpu.__folio_batch_release.truncate_inode_pages_range
      1.26            -0.1        1.15 ±  2%  perf-profile.calltrace.cycles-pp.lru_add_drain_cpu.__folio_batch_release.truncate_inode_pages_range.ext4_evict_inode.evict
      1.26 ±  2%      -0.1        1.14 ±  2%  perf-profile.calltrace.cycles-pp.folio_batch_move_lru.lru_add_drain_cpu.__folio_batch_release.truncate_inode_pages_range.ext4_evict_inode
      1.88            -0.1        1.80        perf-profile.calltrace.cycles-pp.balance_dirty_pages_ratelimited_flags.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write
      0.74 ±  3%      -0.1        0.68 ±  2%  perf-profile.calltrace.cycles-pp.charge_memcg.__mem_cgroup_charge.__filemap_add_folio.filemap_add_folio.__filemap_get_folio
      0.92            -0.1        0.87        perf-profile.calltrace.cycles-pp.__mem_cgroup_charge.__filemap_add_folio.filemap_add_folio.__filemap_get_folio.ext4_da_write_begin
      0.84 ±  2%      -0.1        0.79        perf-profile.calltrace.cycles-pp.__mod_lruvec_page_state.folio_account_dirtied.__folio_mark_dirty.mark_buffer_dirty.__block_commit_write
      0.72 ±  2%      -0.0        0.68        perf-profile.calltrace.cycles-pp.__mod_lruvec_page_state.folio_account_cleaned.__folio_cancel_dirty.try_to_free_buffers.truncate_cleanup_folio
      1.40            +0.0        1.44        perf-profile.calltrace.cycles-pp.folio_account_dirtied.__folio_mark_dirty.mark_buffer_dirty.__block_commit_write.block_write_end
      0.52 ±  2%      +0.0        0.55        perf-profile.calltrace.cycles-pp.security_file_permission.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.15            +0.0        1.19        perf-profile.calltrace.cycles-pp.folio_create_empty_buffers.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter
      0.80            +0.0        0.85        perf-profile.calltrace.cycles-pp.delete_from_page_cache_batch.truncate_inode_pages_range.ext4_evict_inode.evict.__dentry_kill
      0.86            +0.0        0.91 ±  2%  perf-profile.calltrace.cycles-pp.get_page_from_freelist.__alloc_pages.folio_alloc.__filemap_get_folio.ext4_da_write_begin
      0.83            +0.1        0.88        perf-profile.calltrace.cycles-pp.fault_in_iov_iter_readable.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write
      0.80            +0.1        0.86        perf-profile.calltrace.cycles-pp.filemap_get_entry.__filemap_get_folio.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter
      1.73            +0.1        1.79        perf-profile.calltrace.cycles-pp.__folio_mark_dirty.mark_buffer_dirty.__block_commit_write.block_write_end.ext4_da_do_write_end
      0.73 ±  3%      +0.1        0.79 ±  2%  perf-profile.calltrace.cycles-pp.atime_needs_update.touch_atime.filemap_read.vfs_read.ksys_read
      1.04            +0.1        1.10        perf-profile.calltrace.cycles-pp.__alloc_pages.folio_alloc.__filemap_get_folio.ext4_da_write_begin.generic_perform_write
      0.82 ±  3%      +0.1        0.88 ±  2%  perf-profile.calltrace.cycles-pp.touch_atime.filemap_read.vfs_read.ksys_read.do_syscall_64
      0.78 ±  3%      +0.1        0.84 ±  2%  perf-profile.calltrace.cycles-pp.ext4_file_read_iter.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.21            +0.1        1.28        perf-profile.calltrace.cycles-pp.folio_alloc.__filemap_get_folio.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter
      1.40            +0.1        1.48        perf-profile.calltrace.cycles-pp.copyin.copy_page_from_iter_atomic.generic_perform_write.ext4_buffered_write_iter.vfs_write
      2.25            +0.1        2.34        perf-profile.calltrace.cycles-pp.mark_buffer_dirty.__block_commit_write.block_write_end.ext4_da_do_write_end.generic_perform_write
      2.21            +0.1        2.30        perf-profile.calltrace.cycles-pp.try_to_free_buffers.truncate_cleanup_folio.truncate_inode_pages_range.ext4_evict_inode.evict
      1.74 ±  2%      +0.1        1.84        perf-profile.calltrace.cycles-pp.copyout._copy_to_iter.copy_page_to_iter.filemap_read.vfs_read
      1.13 ±  3%      +0.1        1.24 ±  2%  perf-profile.calltrace.cycles-pp.file_modified.ext4_buffered_write_iter.vfs_write.ksys_write.do_syscall_64
      1.82            +0.1        1.93        perf-profile.calltrace.cycles-pp.copy_page_from_iter_atomic.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write
      2.01 ±  2%      +0.1        2.13        perf-profile.calltrace.cycles-pp._copy_to_iter.copy_page_to_iter.filemap_read.vfs_read.ksys_read
      2.38            +0.1        2.52        perf-profile.calltrace.cycles-pp.memset_orig.zero_user_segments.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write
      2.44            +0.1        2.58        perf-profile.calltrace.cycles-pp.zero_user_segments.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter
      2.22 ±  2%      +0.1        2.36        perf-profile.calltrace.cycles-pp.copy_page_to_iter.filemap_read.vfs_read.ksys_read.do_syscall_64
      1.98 ±  3%      +0.1        2.13 ±  2%  perf-profile.calltrace.cycles-pp.ext4_da_map_blocks.ext4_da_get_block_prep.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write
      2.03 ±  3%      +0.2        2.18 ±  2%  perf-profile.calltrace.cycles-pp.ext4_da_get_block_prep.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter
      3.12            +0.2        3.28        perf-profile.calltrace.cycles-pp.truncate_cleanup_folio.truncate_inode_pages_range.ext4_evict_inode.evict.__dentry_kill
      3.92            +0.2        4.10        perf-profile.calltrace.cycles-pp.__block_commit_write.block_write_end.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
      4.12            +0.2        4.32        perf-profile.calltrace.cycles-pp.block_write_end.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write
      0.34 ± 70%      +0.2        0.55 ±  2%  perf-profile.calltrace.cycles-pp.inode_needs_update_time.file_modified.ext4_buffered_write_iter.vfs_write.ksys_write
      4.84            +0.2        5.08        perf-profile.calltrace.cycles-pp.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write
      6.05            +0.4        6.40        perf-profile.calltrace.cycles-pp.ext4_block_write_begin.ext4_da_write_begin.generic_perform_write.ext4_buffered_write_iter.vfs_write
      0.68            +0.4        1.09 ± 22%  perf-profile.calltrace.cycles-pp.__entry_text_start.write
      0.08 ±223%      +0.4        0.53 ±  2%  perf-profile.calltrace.cycles-pp.ext4_da_reserve_space.ext4_da_map_blocks.ext4_da_get_block_prep.ext4_block_write_begin.ext4_da_write_begin
      0.68            +0.6        1.23 ±  2%  perf-profile.calltrace.cycles-pp.__entry_text_start.read
      8.05 ±  4%      +0.8        8.84 ±  4%  perf-profile.calltrace.cycles-pp.filemap_read.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe
     10.85 ±  3%      +1.0       11.87 ±  3%  perf-profile.calltrace.cycles-pp.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
     11.49 ±  3%      +1.1       12.55 ±  3%  perf-profile.calltrace.cycles-pp.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
     12.03 ±  2%      +1.1       13.12 ±  3%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
     12.29 ±  2%      +1.1       13.40 ±  3%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.read
     13.83 ±  2%      +1.2       15.02 ±  2%  perf-profile.calltrace.cycles-pp.read
     43.88            -3.7       40.20        perf-profile.children.cycles-pp._raw_spin_lock_irqsave
     43.63            -3.7       39.96 ±  2%  perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
     43.54            -3.6       39.91 ±  2%  perf-profile.children.cycles-pp.folio_lruvec_lock_irqsave
     24.42            -2.3       22.14 ±  2%  perf-profile.children.cycles-pp.__folio_batch_release
     23.44            -2.2       21.28        perf-profile.children.cycles-pp.release_pages
     28.83            -2.0       26.79        perf-profile.children.cycles-pp.truncate_inode_pages_range
     29.06            -2.0       27.04        perf-profile.children.cycles-pp.evict
     29.03            -2.0       27.01        perf-profile.children.cycles-pp.ext4_evict_inode
     29.08            -2.0       27.06        perf-profile.children.cycles-pp.__dentry_kill
     29.08            -2.0       27.06        perf-profile.children.cycles-pp.dentry_kill
     29.10            -2.0       27.08        perf-profile.children.cycles-pp.dput
     29.14            -2.0       27.12        perf-profile.children.cycles-pp.__close
     29.12            -2.0       27.11        perf-profile.children.cycles-pp.__x64_sys_close
     29.11            -2.0       27.09        perf-profile.children.cycles-pp.__fput
     22.60            -1.5       21.15        perf-profile.children.cycles-pp.folio_batch_move_lru
     23.95            -1.3       22.61        perf-profile.children.cycles-pp.filemap_add_folio
     21.41            -1.3       20.07        perf-profile.children.cycles-pp.folio_add_lru
     26.72            -1.2       25.56        perf-profile.children.cycles-pp.__filemap_get_folio
     33.65            -0.7       32.94        perf-profile.children.cycles-pp.ext4_da_write_begin
      1.79 ±  3%      -0.4        1.42 ±  2%  perf-profile.children.cycles-pp.cgroup_rstat_updated
      1.78 ±  2%      -0.2        1.58        perf-profile.children.cycles-pp.__mod_memcg_lruvec_state
      0.80            -0.2        0.61 ±  2%  perf-profile.children.cycles-pp.do_flush_stats
      0.79            -0.2        0.61        perf-profile.children.cycles-pp.cgroup_rstat_flush
      0.78            -0.2        0.61        perf-profile.children.cycles-pp.cgroup_rstat_flush_locked
      0.83            -0.2        0.66 ±  2%  perf-profile.children.cycles-pp.mem_cgroup_wb_stats
      0.88            -0.2        0.71 ±  2%  perf-profile.children.cycles-pp.balance_dirty_pages
      0.59            -0.1        0.45 ±  2%  perf-profile.children.cycles-pp.mem_cgroup_css_rstat_flush
      1.29 ±  2%      -0.1        1.17 ±  2%  perf-profile.children.cycles-pp.lru_add_drain_cpu
      2.65            -0.1        2.56        perf-profile.children.cycles-pp.__mod_lruvec_page_state
      0.58 ±  3%      -0.1        0.49 ±  2%  perf-profile.children.cycles-pp.__count_memcg_events
      1.96            -0.1        1.88        perf-profile.children.cycles-pp.balance_dirty_pages_ratelimited_flags
      0.76 ±  3%      -0.1        0.70 ±  2%  perf-profile.children.cycles-pp.charge_memcg
      0.94 ±  2%      -0.1        0.88        perf-profile.children.cycles-pp.__mem_cgroup_charge
      0.07            +0.0        0.08        perf-profile.children.cycles-pp.bdev_getblk
      0.09            +0.0        0.10 ±  3%  perf-profile.children.cycles-pp.ext4_fill_raw_inode
      0.13 ±  2%      +0.0        0.14 ±  3%  perf-profile.children.cycles-pp.ext4_do_update_inode
      0.29 ±  2%      +0.0        0.31        perf-profile.children.cycles-pp.up_write
      0.09 ±  6%      +0.0        0.10 ±  4%  perf-profile.children.cycles-pp.entry_SYSCALL_64_safe_stack
      0.15 ±  3%      +0.0        0.16 ±  3%  perf-profile.children.cycles-pp.jbd2_journal_grab_journal_head
      0.15 ±  3%      +0.0        0.17 ±  2%  perf-profile.children.cycles-pp.ext4_mark_iloc_dirty
      0.14 ±  4%      +0.0        0.16 ±  3%  perf-profile.children.cycles-pp.rcu_core
      0.36            +0.0        0.38        perf-profile.children.cycles-pp.generic_write_checks
      0.19 ±  2%      +0.0        0.20 ±  2%  perf-profile.children.cycles-pp.jbd2_journal_try_to_free_buffers
      0.11 ±  4%      +0.0        0.12 ±  4%  perf-profile.children.cycles-pp.folio_memcg_lock
      0.24 ±  2%      +0.0        0.26        perf-profile.children.cycles-pp.aa_file_perm
      0.29            +0.0        0.31 ±  2%  perf-profile.children.cycles-pp.folio_unlock
      0.47 ±  2%      +0.0        0.49        perf-profile.children.cycles-pp.ext4_es_insert_delayed_block
      0.28 ±  3%      +0.0        0.30        perf-profile.children.cycles-pp.__slab_free
      0.21 ±  2%      +0.0        0.23 ±  2%  perf-profile.children.cycles-pp.__ext4_mark_inode_dirty
      0.38 ±  2%      +0.0        0.40 ±  3%  perf-profile.children.cycles-pp.syscall_return_via_sysret
      0.13 ±  3%      +0.0        0.15 ±  2%  perf-profile.children.cycles-pp.try_charge_memcg
      0.30 ±  2%      +0.0        0.33 ±  2%  perf-profile.children.cycles-pp.disk_cp
      0.58            +0.0        0.61 ±  2%  perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      0.37 ±  3%      +0.0        0.40 ±  3%  perf-profile.children.cycles-pp.ext4_es_lookup_extent
      0.42            +0.0        0.46 ±  2%  perf-profile.children.cycles-pp.__mod_lruvec_state
      0.02 ±141%      +0.0        0.06 ±  9%  perf-profile.children.cycles-pp.update_load_avg
      0.55 ±  2%      +0.0        0.59 ±  3%  perf-profile.children.cycles-pp.xas_descend
      0.26 ±  7%      +0.0        0.30 ±  3%  perf-profile.children.cycles-pp.ext4_dirty_inode
      1.42            +0.0        1.46        perf-profile.children.cycles-pp.folio_account_dirtied
      1.16            +0.0        1.20        perf-profile.children.cycles-pp.folio_create_empty_buffers
      0.58            +0.0        0.62 ±  2%  perf-profile.children.cycles-pp.xas_store
      0.75            +0.0        0.79 ±  2%  perf-profile.children.cycles-pp.fault_in_readable
      0.51 ±  3%      +0.0        0.55 ±  2%  perf-profile.children.cycles-pp.ext4_da_reserve_space
      0.82            +0.0        0.86        perf-profile.children.cycles-pp.delete_from_page_cache_batch
      0.88            +0.0        0.92 ±  2%  perf-profile.children.cycles-pp.get_page_from_freelist
      0.56 ±  2%      +0.0        0.61 ±  2%  perf-profile.children.cycles-pp.inode_needs_update_time
      0.29 ±  3%      +0.0        0.34 ±  7%  perf-profile.children.cycles-pp.ext4_nonda_switch
      0.86            +0.1        0.92 ±  2%  perf-profile.children.cycles-pp.fault_in_iov_iter_readable
      1.04            +0.1        1.10        perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.72            +0.1        0.77 ±  2%  perf-profile.children.cycles-pp.__cond_resched
      1.75            +0.1        1.80        perf-profile.children.cycles-pp.__folio_mark_dirty
      0.32 ±  7%      +0.1        0.38 ±  4%  perf-profile.children.cycles-pp.generic_update_time
      0.84            +0.1        0.90        perf-profile.children.cycles-pp.filemap_get_entry
      0.78 ±  3%      +0.1        0.85 ±  2%  perf-profile.children.cycles-pp.atime_needs_update
      0.43 ±  6%      +0.1        0.50 ±  3%  perf-profile.children.cycles-pp.__mark_inode_dirty
      1.06            +0.1        1.12        perf-profile.children.cycles-pp.__alloc_pages
      0.79 ±  3%      +0.1        0.86        perf-profile.children.cycles-pp.ext4_file_read_iter
      1.22            +0.1        1.29        perf-profile.children.cycles-pp.folio_alloc
      0.84 ±  3%      +0.1        0.91 ±  2%  perf-profile.children.cycles-pp.touch_atime
      0.89 ±  2%      +0.1        0.96 ±  2%  perf-profile.children.cycles-pp.apparmor_file_permission
      1.24            +0.1        1.32        perf-profile.children.cycles-pp.xas_load
      2.28            +0.1        2.37        perf-profile.children.cycles-pp.mark_buffer_dirty
      1.49            +0.1        1.58        perf-profile.children.cycles-pp.copyin
      1.13 ±  2%      +0.1        1.22 ±  2%  perf-profile.children.cycles-pp.security_file_permission
      2.23            +0.1        2.32        perf-profile.children.cycles-pp.try_to_free_buffers
      1.84 ±  2%      +0.1        1.94        perf-profile.children.cycles-pp.copyout
      1.37            +0.1        1.47        perf-profile.children.cycles-pp.__entry_text_start
      1.17 ±  3%      +0.1        1.28 ±  2%  perf-profile.children.cycles-pp.file_modified
      1.92            +0.1        2.04        perf-profile.children.cycles-pp.copy_page_from_iter_atomic
      2.04 ±  2%      +0.1        2.16        perf-profile.children.cycles-pp._copy_to_iter
      2.45            +0.1        2.58        perf-profile.children.cycles-pp.memset_orig
      2.45            +0.1        2.58        perf-profile.children.cycles-pp.zero_user_segments
      2.25 ±  2%      +0.1        2.39        perf-profile.children.cycles-pp.copy_page_to_iter
      2.04 ±  3%      +0.2        2.19 ±  2%  perf-profile.children.cycles-pp.ext4_da_get_block_prep
      0.86 ±  3%      +0.2        1.01 ±  2%  perf-profile.children.cycles-pp.percpu_counter_add_batch
      2.00 ±  3%      +0.2        2.16 ±  2%  perf-profile.children.cycles-pp.ext4_da_map_blocks
      3.13            +0.2        3.28        perf-profile.children.cycles-pp.truncate_cleanup_folio
      3.95            +0.2        4.14        perf-profile.children.cycles-pp.__block_commit_write
      4.17            +0.2        4.37        perf-profile.children.cycles-pp.block_write_end
      4.89            +0.2        5.14        perf-profile.children.cycles-pp.ext4_da_do_write_end
      6.08            +0.3        6.43        perf-profile.children.cycles-pp.ext4_block_write_begin
      8.12 ±  4%      +0.8        8.92 ±  4%  perf-profile.children.cycles-pp.filemap_read
     10.92 ±  3%      +1.0       11.96 ±  3%  perf-profile.children.cycles-pp.vfs_read
     11.55 ±  3%      +1.1       12.61 ±  3%  perf-profile.children.cycles-pp.ksys_read
     14.32 ±  2%      +1.2       15.55 ±  2%  perf-profile.children.cycles-pp.read
     43.63            -3.7       39.96 ±  2%  perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
      1.56 ±  4%      -0.3        1.24 ±  2%  perf-profile.self.cycles-pp.cgroup_rstat_updated
      0.57            -0.1        0.44 ±  3%  perf-profile.self.cycles-pp.mem_cgroup_css_rstat_flush
      0.10            -0.0        0.07 ±  6%  perf-profile.self.cycles-pp.cgroup_rstat_flush_locked
      0.12 ±  3%      +0.0        0.14 ±  3%  perf-profile.self.cycles-pp.folio_account_cleaned
      0.22 ±  2%      +0.0        0.24 ±  3%  perf-profile.self.cycles-pp.mod_objcg_state
      0.16 ±  2%      +0.0        0.18 ±  2%  perf-profile.self.cycles-pp.drop_buffers
      0.20            +0.0        0.22 ±  2%  perf-profile.self.cycles-pp.aa_file_perm
      0.17 ±  2%      +0.0        0.19 ±  3%  perf-profile.self.cycles-pp.folio_create_empty_buffers
      0.34 ±  2%      +0.0        0.36 ±  2%  perf-profile.self.cycles-pp.block_invalidate_folio
      0.23 ±  2%      +0.0        0.25 ±  3%  perf-profile.self.cycles-pp.__fdget_pos
      0.28            +0.0        0.29        perf-profile.self.cycles-pp.folio_unlock
      0.09 ±  5%      +0.0        0.10 ±  7%  perf-profile.self.cycles-pp.try_charge_memcg
      0.20 ±  5%      +0.0        0.22 ±  3%  perf-profile.self.cycles-pp.kmem_cache_free
      0.28 ±  3%      +0.0        0.30        perf-profile.self.cycles-pp.__slab_free
      0.31 ±  2%      +0.0        0.33 ±  3%  perf-profile.self.cycles-pp.xas_store
      0.37 ±  2%      +0.0        0.40 ±  2%  perf-profile.self.cycles-pp.syscall_return_via_sysret
      0.44            +0.0        0.47 ±  2%  perf-profile.self.cycles-pp.__cond_resched
      0.29 ±  2%      +0.0        0.32 ±  3%  perf-profile.self.cycles-pp.do_syscall_64
      0.36            +0.0        0.39 ±  2%  perf-profile.self.cycles-pp.filemap_get_entry
      0.42            +0.0        0.46 ±  2%  perf-profile.self.cycles-pp.copy_page_from_iter_atomic
      0.42            +0.0        0.45 ±  2%  perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.54            +0.0        0.57 ±  2%  perf-profile.self.cycles-pp.generic_perform_write
      0.42 ±  2%      +0.0        0.46 ±  2%  perf-profile.self.cycles-pp.xas_load
      0.47 ±  2%      +0.0        0.51        perf-profile.self.cycles-pp.ext4_da_do_write_end
      0.47 ±  2%      +0.0        0.51 ±  3%  perf-profile.self.cycles-pp.xas_descend
      0.72            +0.0        0.77 ±  2%  perf-profile.self.cycles-pp.fault_in_readable
      0.56            +0.0        0.60 ±  2%  perf-profile.self.cycles-pp.__filemap_get_folio
      0.28 ±  2%      +0.0        0.32 ±  7%  perf-profile.self.cycles-pp.ext4_nonda_switch
      1.01            +0.0        1.06        perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.40            +0.0        0.44 ±  2%  perf-profile.self.cycles-pp.__entry_text_start
      0.40 ±  4%      +0.1        0.45 ±  4%  perf-profile.self.cycles-pp.atime_needs_update
      0.63 ±  3%      +0.1        0.68 ±  4%  perf-profile.self.cycles-pp.apparmor_file_permission
      0.56 ±  2%      +0.1        0.62 ±  2%  perf-profile.self.cycles-pp.ext4_da_write_begin
      0.50 ±  6%      +0.1        0.56 ±  5%  perf-profile.self.cycles-pp.__mod_memcg_lruvec_state
      0.77 ±  3%      +0.1        0.83 ±  2%  perf-profile.self.cycles-pp.ext4_file_read_iter
      0.92            +0.1        0.99 ±  2%  perf-profile.self.cycles-pp.vfs_read
      1.46            +0.1        1.54        perf-profile.self.cycles-pp.copyin
      1.53            +0.1        1.62        perf-profile.self.cycles-pp.__block_commit_write
      1.81 ±  2%      +0.1        1.90        perf-profile.self.cycles-pp.copyout
      2.42            +0.1        2.56        perf-profile.self.cycles-pp.memset_orig
      0.78 ±  2%      +0.1        0.91 ±  2%  perf-profile.self.cycles-pp.percpu_counter_add_batch
      1.31 ±  2%      +0.1        1.45        perf-profile.self.cycles-pp.vfs_write
      1.73 ±  4%      +0.2        1.90 ±  5%  perf-profile.self.cycles-pp.filemap_read





Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ