lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <ZHcJUF4f9VcnyGVt@xsang-OptiPlex-9020>
Date: Wed, 31 May 2023 16:46:08 +0800
From: Oliver Sang <oliver.sang@...el.com>
To: Shakeel Butt <shakeelb@...gle.com>
CC: Zhang Cathy <cathy.zhang@...el.com>, Yin Fengwei <fengwei.yin@...el.com>,
	Feng Tang <feng.tang@...el.com>, Eric Dumazet <edumazet@...gle.com>, Linux MM
	<linux-mm@...ck.org>, Cgroups <cgroups@...r.kernel.org>, Paolo Abeni
	<pabeni@...hat.com>, "davem@...emloft.net" <davem@...emloft.net>,
	"kuba@...nel.org" <kuba@...nel.org>, Brandeburg Jesse
	<jesse.brandeburg@...el.com>, Srinivas Suresh <suresh.srinivas@...el.com>,
	Chen Tim C <tim.c.chen@...el.com>, You Lizhen <lizhen.you@...el.com>,
	"eric.dumazet@...il.com" <eric.dumazet@...il.com>, "netdev@...r.kernel.org"
	<netdev@...r.kernel.org>, <philip.li@...el.com>, <yujie.liu@...el.com>,
	<oliver.sang@...el.com>
Subject: Re: [PATCH net-next 1/2] net: Keep sk->sk_forward_alloc as a proper
 size

hi, Shakeel,

On Wed, May 17, 2023 at 04:24:47PM +0000, Shakeel Butt wrote:
> On Tue, May 16, 2023 at 01:46:55PM +0800, Oliver Sang wrote:
> > hi Shakeel,
> > 
> > On Mon, May 15, 2023 at 12:50:31PM -0700, Shakeel Butt wrote:
> > > +Feng, Yin and Oliver
> > > 
> > > >
> > > > > Thanks a lot Cathy for testing. Do you see any performance improvement for
> > > > > the memcached benchmark with the patch?
> > > >
> > > > Yep, absolutely :- ) RPS (with/without patch) = +1.74
> > > 
> > > Thanks a lot Cathy.
> > > 
> > > Feng/Yin/Oliver, can you please test the patch at [1] with other
> > > workloads used by the test robot? Basically I wanted to know if it has
> > > any positive or negative impact on other perf benchmarks.
> > 
> > is it possible for you to resend patch with Signed-off-by?
> > without it, test robot will regard the patch as informal, then it cannot feed
> > into auto test process.
> > and could you tell us the base of this patch? it will help us apply it
> > correctly.
> > 
> > on the other hand, due to resource restraint, we normally cannot support
> > this type of on-demand test upon a single patch, patch set, or a branch.
> > instead, we try to merge them into so-called hourly-kernels, then distribute
> > tests and auto-bisects to various platforms.
> > after we applying your patch and merging it to hourly-kernels sccussfully,
> > if it really causes some performance changes, the test robot could spot out
> > this patch as 'fbc' and we will send report to you. this could happen within
> > several weeks after applying.
> > but due to the complexity of whole process (also limited resourse, such like
> > we cannot run all tests on all platforms), we cannot guanrantee capture all
> > possible performance impacts of this patch. and it's hard for us to provide
> > a big picture like what's the general performance impact of this patch.
> > this maybe is not exactly what you want. is it ok for you?
> > 
> > 
> 
> Yes, that is fine and thanks for the help. The patch is below:

we applied below patch upon v6.4-rc2, so far, we didn't spot out performance
impacts of it to other tests.

but we found -7.6% regression of netperf.Throughput_Mbps

testcase: netperf
test machine: 128 threads 4 sockets Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz (Ice Lake) with 256G memory
parameters:

	ip: ipv4
	runtime: 300s
	nr_threads: 50%
	cluster: cs-localhost
	send_size: 10K
	test: TCP_SENDFILE
	cpufreq_governor: performance


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        sudo bin/lkp install job.yaml           # job file is attached in this email
        bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
        sudo bin/lkp run generated-yaml-file

        # if come across any failure that blocks the test,
        # please remove ~/.lkp and /lkp dir to run from a clean state.


=========================================================================================
cluster/compiler/cpufreq_governor/ip/kconfig/nr_threads/rootfs/runtime/send_size/tbox_group/test/testcase:
  cs-localhost/gcc-11/performance/ipv4/x86_64-rhel-8.3/50%/debian-11.1-x86_64-20220510.cgz/300s/10K/lkp-icl-2sp2/TCP_SENDFILE/netperf

commit: 
  v6.4-rc2
  5e32037c50 ("memcg: skip stock refill in irq context")

        v6.4-rc2 5e32037c5065d2058264d41cd4c 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
     23165            -7.6%      21414        netperf.Throughput_Mbps
   1482569            -7.6%    1370534        netperf.Throughput_total_Mbps

detail data as below [1]


at the same time, we tested Cathy's patch upon same test, found
a 29.4% improvement of netperf.Throughput_Mbps
just FYI


=========================================================================================
cluster/compiler/cpufreq_governor/ip/kconfig/nr_threads/rootfs/runtime/send_size/tbox_group/test/testcase:
  cs-localhost/gcc-11/performance/ipv4/x86_64-rhel-8.3/50%/debian-11.1-x86_64-20220510.cgz/300s/10K/lkp-icl-2sp2/TCP_SENDFILE/netperf

commit: 
  ed23734c23 ("Merge tag 'net-6.4-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net")
  05d72a8bed ("net: Keep sk->sk_forward_alloc as a proper size")

ed23734c23d2fc1e 05d72a8bedfacfc46f300ab38e0 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
     23218           +29.4%      30043        netperf.Throughput_Mbps
   1485996           +29.4%    1922763        netperf.Throughput_total_Mbps

detail data as below [2]


[1]

=========================================================================================
cluster/compiler/cpufreq_governor/ip/kconfig/nr_threads/rootfs/runtime/send_size/tbox_group/test/testcase:
  cs-localhost/gcc-11/performance/ipv4/x86_64-rhel-8.3/50%/debian-11.1-x86_64-20220510.cgz/300s/10K/lkp-icl-2sp2/TCP_SENDFILE/netperf

commit: 
  v6.4-rc2
  5e32037c50 ("memcg: skip stock refill in irq context")

        v6.4-rc2 5e32037c5065d2058264d41cd4c 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
   5106608            -1.3%    5040930        vmstat.system.cs
    246222 ±  4%     -21.9%     192291 ±  8%  sched_debug.cpu.avg_idle.avg
    269582 ±  6%     -24.9%     202436 ± 13%  sched_debug.cpu.avg_idle.stddev
      2556            +0.9%       2579        turbostat.Bzy_MHz
     15.01            +0.8       15.76        turbostat.C1%
     30.63            +4.2%      31.90 ±  2%  turbostat.RAMWatt
     23165            -7.6%      21414        netperf.Throughput_Mbps
   1482569            -7.6%    1370534        netperf.Throughput_total_Mbps
    670.10           -11.8%     591.36        netperf.time.user_time
 5.429e+09            -7.6%  5.019e+09        netperf.workload
      6.93            +6.4%       7.38        perf-stat.i.MPKI
 4.404e+10            -5.4%  4.167e+10        perf-stat.i.branch-instructions
      0.88            +0.0        0.90        perf-stat.i.branch-miss-rate%
 3.823e+08            -2.7%  3.721e+08        perf-stat.i.branch-misses
      6.54 ±  2%      +0.4        6.90 ±  3%  perf-stat.i.cache-miss-rate%
  1.05e+08 ±  3%      +6.3%  1.117e+08 ±  3%  perf-stat.i.cache-misses
      1.29            +5.8%       1.37        perf-stat.i.cpi
     27150 ±  6%     +14.9%      31203 ±  5%  perf-stat.i.cpu-migrations
      2897 ±  3%      -5.7%       2733 ±  3%  perf-stat.i.cycles-between-cache-misses
      0.01 ± 12%      +0.0        0.01        perf-stat.i.dTLB-load-miss-rate%
   6712601 ± 12%      +7.8%    7237514        perf-stat.i.dTLB-load-misses
 6.874e+10            -5.4%  6.505e+10        perf-stat.i.dTLB-loads
      0.00 ±  5%      +0.0        0.00 ±  5%  perf-stat.i.dTLB-store-miss-rate%
    940096 ±  5%     +15.3%    1083508 ±  5%  perf-stat.i.dTLB-store-misses
 3.753e+10            -5.5%  3.547e+10        perf-stat.i.dTLB-stores
 2.332e+11            -5.4%  2.207e+11        perf-stat.i.instructions
      0.77            -5.4%       0.73        perf-stat.i.ipc
      1186            -5.3%       1123        perf-stat.i.metric.M/sec
    706578 ±  8%     +33.2%     941322 ±  5%  perf-stat.i.node-loads
   2812685 ±  8%     +15.6%    3250382 ± 10%  perf-stat.i.node-stores
      6.93            +6.4%       7.37        perf-stat.overall.MPKI
      0.87            +0.0        0.89        perf-stat.overall.branch-miss-rate%
      6.50 ±  2%      +0.4        6.86 ±  3%  perf-stat.overall.cache-miss-rate%
      1.29            +5.8%       1.37        perf-stat.overall.cpi
      2878 ±  3%      -5.8%       2711 ±  3%  perf-stat.overall.cycles-between-cache-misses
      0.01 ± 12%      +0.0        0.01        perf-stat.overall.dTLB-load-miss-rate%
      0.00 ±  5%      +0.0        0.00 ±  5%  perf-stat.overall.dTLB-store-miss-rate%
      0.77            -5.5%       0.73        perf-stat.overall.ipc
     12903            +2.4%      13208        perf-stat.overall.path-length
  4.39e+10            -5.4%  4.154e+10        perf-stat.ps.branch-instructions
  3.81e+08            -2.7%  3.708e+08        perf-stat.ps.branch-misses
 1.047e+08 ±  3%      +6.3%  1.113e+08 ±  3%  perf-stat.ps.cache-misses
     27021 ±  6%     +14.9%      31054 ±  5%  perf-stat.ps.cpu-migrations
   6672234 ± 12%      +7.8%    7195318        perf-stat.ps.dTLB-load-misses
 6.852e+10            -5.4%  6.484e+10        perf-stat.ps.dTLB-loads
    935167 ±  5%     +15.3%    1077856 ±  5%  perf-stat.ps.dTLB-store-misses
 3.741e+10            -5.5%  3.536e+10        perf-stat.ps.dTLB-stores
 2.324e+11            -5.4%  2.199e+11        perf-stat.ps.instructions
    704145 ±  8%     +33.2%     938240 ±  5%  perf-stat.ps.node-loads
   2802795 ±  8%     +15.5%    3238090 ± 10%  perf-stat.ps.node-stores
 7.006e+13            -5.4%  6.629e+13        perf-stat.total.instructions
     11.29            -0.9       10.42        perf-profile.calltrace.cycles-pp.skb_copy_datagram_iter.tcp_recvmsg_locked.tcp_recvmsg.inet_recvmsg.sock_recvmsg
     11.22            -0.9       10.35        perf-profile.calltrace.cycles-pp.__skb_datagram_iter.skb_copy_datagram_iter.tcp_recvmsg_locked.tcp_recvmsg.inet_recvmsg
     29.43            -0.7       28.74        perf-profile.calltrace.cycles-pp.tcp_recvmsg_locked.tcp_recvmsg.inet_recvmsg.sock_recvmsg.__sys_recvfrom
      7.04            -0.5        6.51        perf-profile.calltrace.cycles-pp._copy_to_iter.__skb_datagram_iter.skb_copy_datagram_iter.tcp_recvmsg_locked.tcp_recvmsg
      7.36            -0.5        6.86        perf-profile.calltrace.cycles-pp.generic_file_splice_read.splice_direct_to_actor.do_splice_direct.do_sendfile.__x64_sys_sendfile64
      6.56            -0.5        6.06        perf-profile.calltrace.cycles-pp.copyout._copy_to_iter.__skb_datagram_iter.skb_copy_datagram_iter.tcp_recvmsg_locked
      6.45            -0.4        6.03        perf-profile.calltrace.cycles-pp.filemap_read.generic_file_splice_read.splice_direct_to_actor.do_splice_direct.do_sendfile
      2.95            -0.3        2.61 ±  7%  perf-profile.calltrace.cycles-pp.__check_object_size.simple_copy_to_iter.__skb_datagram_iter.skb_copy_datagram_iter.tcp_recvmsg_locked
      2.58 ±  2%      -0.3        2.29 ±  7%  perf-profile.calltrace.cycles-pp.check_heap_object.__check_object_size.simple_copy_to_iter.__skb_datagram_iter.skb_copy_datagram_iter
      3.22            -0.3        2.93        perf-profile.calltrace.cycles-pp.simple_copy_to_iter.__skb_datagram_iter.skb_copy_datagram_iter.tcp_recvmsg_locked.tcp_recvmsg
     10.00            -0.3        9.75        perf-profile.calltrace.cycles-pp.__dev_queue_xmit.ip_finish_output2.__ip_queue_xmit.__tcp_transmit_skb.tcp_recvmsg_locked
     10.15            -0.2        9.91        perf-profile.calltrace.cycles-pp.ip_finish_output2.__ip_queue_xmit.__tcp_transmit_skb.tcp_recvmsg_locked.tcp_recvmsg
      2.89            -0.2        2.66        perf-profile.calltrace.cycles-pp.filemap_get_read_batch.filemap_get_pages.filemap_read.generic_file_splice_read.splice_direct_to_actor
      3.12            -0.2        2.90        perf-profile.calltrace.cycles-pp.filemap_get_pages.filemap_read.generic_file_splice_read.splice_direct_to_actor.do_splice_direct
     10.47            -0.2       10.25        perf-profile.calltrace.cycles-pp.__ip_queue_xmit.__tcp_transmit_skb.tcp_recvmsg_locked.tcp_recvmsg.inet_recvmsg
      2.66            -0.2        2.44        perf-profile.calltrace.cycles-pp.tcp_write_xmit.do_tcp_sendpages.tcp_sendpage.inet_sendpage.kernel_sendpage
      2.42            -0.2        2.22        perf-profile.calltrace.cycles-pp.__tcp_transmit_skb.tcp_write_xmit.do_tcp_sendpages.tcp_sendpage.inet_sendpage
      2.48            -0.2        2.29 ±  7%  perf-profile.calltrace.cycles-pp.__tcp_push_pending_frames.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv.ip_protocol_deliver_rcu
      2.46            -0.2        2.27 ±  7%  perf-profile.calltrace.cycles-pp.tcp_write_xmit.__tcp_push_pending_frames.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv
      2.23            -0.2        2.05        perf-profile.calltrace.cycles-pp.__ip_queue_xmit.__tcp_transmit_skb.tcp_write_xmit.do_tcp_sendpages.tcp_sendpage
      2.14            -0.2        1.96        perf-profile.calltrace.cycles-pp.ip_finish_output2.__ip_queue_xmit.__tcp_transmit_skb.tcp_write_xmit.do_tcp_sendpages
      1.27            -0.1        1.17        perf-profile.calltrace.cycles-pp.tcp_send_mss.do_tcp_sendpages.tcp_sendpage.inet_sendpage.kernel_sendpage
      1.17            -0.1        1.09        perf-profile.calltrace.cycles-pp.__tcp_push_pending_frames.do_tcp_sendpages.tcp_sendpage.inet_sendpage.kernel_sendpage
      1.10            -0.1        1.02        perf-profile.calltrace.cycles-pp.tcp_write_xmit.__tcp_push_pending_frames.do_tcp_sendpages.tcp_sendpage.inet_sendpage
      0.91            -0.1        0.84        perf-profile.calltrace.cycles-pp.tcp_current_mss.tcp_send_mss.do_tcp_sendpages.tcp_sendpage.inet_sendpage
      1.29            -0.1        1.23        perf-profile.calltrace.cycles-pp.copy_page_to_iter_pipe.filemap_read.generic_file_splice_read.splice_direct_to_actor.do_splice_direct
      0.77            -0.0        0.73        perf-profile.calltrace.cycles-pp.tcp_stream_alloc_skb.tcp_build_frag.do_tcp_sendpages.tcp_sendpage.inet_sendpage
      0.81            -0.0        0.77        perf-profile.calltrace.cycles-pp.activate_task.ttwu_do_activate.sched_ttwu_pending.__sysvec_call_function_single.sysvec_call_function_single
      0.78            -0.0        0.74        perf-profile.calltrace.cycles-pp.enqueue_task_fair.activate_task.ttwu_do_activate.sched_ttwu_pending.__sysvec_call_function_single
      0.55            -0.0        0.53        perf-profile.calltrace.cycles-pp.enqueue_entity.enqueue_task_fair.activate_task.ttwu_do_activate.sched_ttwu_pending
      0.93            +0.0        0.96        perf-profile.calltrace.cycles-pp.try_to_wake_up.__wake_up_common.__wake_up_common_lock.sock_def_readable.tcp_data_queue
      1.05            +0.0        1.08        perf-profile.calltrace.cycles-pp.__wake_up_common.__wake_up_common_lock.sock_def_readable.tcp_data_queue.tcp_rcv_established
      1.10            +0.0        1.13        perf-profile.calltrace.cycles-pp.__wake_up_common_lock.sock_def_readable.tcp_data_queue.tcp_rcv_established.tcp_v4_do_rcv
      1.20            +0.0        1.24        perf-profile.calltrace.cycles-pp.sock_def_readable.tcp_data_queue.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv
     15.73            +0.2       15.97        perf-profile.calltrace.cycles-pp.__do_softirq.do_softirq.__local_bh_enable_ip.__dev_queue_xmit.ip_finish_output2
     15.13            +0.3       15.38        perf-profile.calltrace.cycles-pp.net_rx_action.__do_softirq.do_softirq.__local_bh_enable_ip.__dev_queue_xmit
     13.50            +0.3       13.82        perf-profile.calltrace.cycles-pp.__napi_poll.net_rx_action.__do_softirq.do_softirq.__local_bh_enable_ip
     13.45            +0.3       13.77        perf-profile.calltrace.cycles-pp.process_backlog.__napi_poll.net_rx_action.__do_softirq.do_softirq
     13.06            +0.3       13.38        perf-profile.calltrace.cycles-pp.__netif_receive_skb_one_core.process_backlog.__napi_poll.net_rx_action.__do_softirq
      2.23 ±  2%      +0.4        2.60 ±  3%  perf-profile.calltrace.cycles-pp.release_sock.tcp_recvmsg.inet_recvmsg.sock_recvmsg.__sys_recvfrom
     12.08            +0.4       12.46        perf-profile.calltrace.cycles-pp.ip_local_deliver_finish.__netif_receive_skb_one_core.process_backlog.__napi_poll.net_rx_action
     12.02            +0.4       12.41        perf-profile.calltrace.cycles-pp.ip_protocol_deliver_rcu.ip_local_deliver_finish.__netif_receive_skb_one_core.process_backlog.__napi_poll
      1.12            +0.4        1.51 ±  3%  perf-profile.calltrace.cycles-pp.tcp_clean_rtx_queue.tcp_ack.tcp_rcv_established.tcp_v4_do_rcv.__release_sock
      1.31            +0.4        1.71 ±  3%  perf-profile.calltrace.cycles-pp.tcp_ack.tcp_rcv_established.tcp_v4_do_rcv.__release_sock.release_sock
     11.73            +0.4       12.14        perf-profile.calltrace.cycles-pp.tcp_v4_rcv.ip_protocol_deliver_rcu.ip_local_deliver_finish.__netif_receive_skb_one_core.process_backlog
      1.34 ± 13%      +0.4        1.76 ±  6%  perf-profile.calltrace.cycles-pp.__sk_mem_reduce_allocated.tcp_recvmsg_locked.tcp_recvmsg.inet_recvmsg.sock_recvmsg
      1.73 ± 14%      +0.5        2.19 ±  7%  perf-profile.calltrace.cycles-pp.tcp_rcv_established.tcp_v4_do_rcv.__release_sock.release_sock.tcp_recvmsg
      1.38 ± 14%      +0.5        1.85 ±  7%  perf-profile.calltrace.cycles-pp.tcp_data_queue.tcp_rcv_established.tcp_v4_do_rcv.__release_sock.release_sock
      5.62            +0.5        6.11        perf-profile.calltrace.cycles-pp.tcp_rcv_established.tcp_v4_do_rcv.__release_sock.release_sock.tcp_sendpage
      5.61            +0.5        6.10        perf-profile.calltrace.cycles-pp.tcp_v4_do_rcv.__release_sock.release_sock.tcp_sendpage.inet_sendpage
      8.89            +0.5        9.40        perf-profile.calltrace.cycles-pp.tcp_v4_do_rcv.tcp_v4_rcv.ip_protocol_deliver_rcu.ip_local_deliver_finish.__netif_receive_skb_one_core
      8.74            +0.5        9.26        perf-profile.calltrace.cycles-pp.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv.ip_protocol_deliver_rcu.ip_local_deliver_finish
      2.86            +0.6        3.46 ±  3%  perf-profile.calltrace.cycles-pp.tcp_data_queue.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv.ip_protocol_deliver_rcu
      0.58 ±  3%      +0.6        1.19 ±  9%  perf-profile.calltrace.cycles-pp.mem_cgroup_charge_skmem.tcp_data_queue.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv
      1.29 ± 15%      +0.6        1.94 ±  8%  perf-profile.calltrace.cycles-pp.__sk_mem_reduce_allocated.tcp_clean_rtx_queue.tcp_ack.tcp_rcv_established.tcp_v4_do_rcv
      7.18 ±  2%      +0.7        7.87 ±  2%  perf-profile.calltrace.cycles-pp.__tcp_transmit_skb.tcp_write_xmit.__tcp_push_pending_frames.tcp_rcv_established.tcp_v4_do_rcv
      6.06            +0.7        6.76 ±  2%  perf-profile.calltrace.cycles-pp.__ip_queue_xmit.__tcp_transmit_skb.tcp_write_xmit.__tcp_push_pending_frames.tcp_rcv_established
      0.35 ± 70%      +0.7        1.07 ± 32%  perf-profile.calltrace.cycles-pp.refill_stock.__sk_mem_reduce_allocated.tcp_clean_rtx_queue.tcp_ack.tcp_rcv_established
      6.02            +0.7        6.75 ±  2%  perf-profile.calltrace.cycles-pp.tcp_write_xmit.__tcp_push_pending_frames.tcp_rcv_established.tcp_v4_do_rcv.__release_sock
      6.05            +0.7        6.78 ±  2%  perf-profile.calltrace.cycles-pp.__tcp_push_pending_frames.tcp_rcv_established.tcp_v4_do_rcv.__release_sock.release_sock
      0.39 ± 70%      +0.8        1.20 ± 22%  perf-profile.calltrace.cycles-pp.page_counter_try_charge.try_charge_memcg.mem_cgroup_charge_skmem.tcp_data_queue.tcp_rcv_established
     16.80            +0.8       17.62        perf-profile.calltrace.cycles-pp.do_tcp_sendpages.tcp_sendpage.inet_sendpage.kernel_sendpage.sock_sendpage
     46.63            +0.9       47.53        perf-profile.calltrace.cycles-pp.do_splice_direct.do_sendfile.__x64_sys_sendfile64.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.53 ±  4%      +0.9        1.46 ±  9%  perf-profile.calltrace.cycles-pp.page_counter_try_charge.try_charge_memcg.mem_cgroup_charge_skmem.__sk_mem_raise_allocated.__sk_mem_schedule
     46.04            +1.0       47.00        perf-profile.calltrace.cycles-pp.splice_direct_to_actor.do_splice_direct.do_sendfile.__x64_sys_sendfile64.do_syscall_64
      0.00            +1.0        0.98 ± 33%  perf-profile.calltrace.cycles-pp.page_counter_uncharge.drain_stock.refill_stock.__sk_mem_reduce_allocated.tcp_clean_rtx_queue
      0.00            +1.0        0.99 ± 33%  perf-profile.calltrace.cycles-pp.drain_stock.refill_stock.__sk_mem_reduce_allocated.tcp_clean_rtx_queue.tcp_ack
      9.51            +1.2       10.67 ±  2%  perf-profile.calltrace.cycles-pp.release_sock.tcp_sendpage.inet_sendpage.kernel_sendpage.sock_sendpage
      8.17            +1.2        9.34 ±  2%  perf-profile.calltrace.cycles-pp.__release_sock.release_sock.tcp_sendpage.inet_sendpage.kernel_sendpage
     10.68            +1.3       11.98        perf-profile.calltrace.cycles-pp.tcp_build_frag.do_tcp_sendpages.tcp_sendpage.inet_sendpage.kernel_sendpage
      0.96 ± 15%      +1.4        2.34 ± 11%  perf-profile.calltrace.cycles-pp.try_charge_memcg.mem_cgroup_charge_skmem.tcp_data_queue.tcp_rcv_established.tcp_v4_do_rcv
      7.84            +1.5        9.30        perf-profile.calltrace.cycles-pp.tcp_wmem_schedule.tcp_build_frag.do_tcp_sendpages.tcp_sendpage.inet_sendpage
      7.60            +1.5        9.08        perf-profile.calltrace.cycles-pp.__sk_mem_schedule.tcp_wmem_schedule.tcp_build_frag.do_tcp_sendpages.tcp_sendpage
     36.91            +1.5       38.40        perf-profile.calltrace.cycles-pp.generic_splice_sendpage.direct_splice_actor.splice_direct_to_actor.do_splice_direct.do_sendfile
     37.04            +1.5       38.53        perf-profile.calltrace.cycles-pp.direct_splice_actor.splice_direct_to_actor.do_splice_direct.do_sendfile.__x64_sys_sendfile64
      7.41            +1.5        8.91        perf-profile.calltrace.cycles-pp.__sk_mem_raise_allocated.__sk_mem_schedule.tcp_wmem_schedule.tcp_build_frag.do_tcp_sendpages
     36.49            +1.5       38.02        perf-profile.calltrace.cycles-pp.__splice_from_pipe.generic_splice_sendpage.direct_splice_actor.splice_direct_to_actor.do_splice_direct
      1.47 ±  3%      +1.6        3.11 ±  7%  perf-profile.calltrace.cycles-pp.try_charge_memcg.mem_cgroup_charge_skmem.__sk_mem_raise_allocated.__sk_mem_schedule.tcp_wmem_schedule
     34.61            +1.7       36.26        perf-profile.calltrace.cycles-pp.pipe_to_sendpage.__splice_from_pipe.generic_splice_sendpage.direct_splice_actor.splice_direct_to_actor
     34.29            +1.7       35.97        perf-profile.calltrace.cycles-pp.sock_sendpage.pipe_to_sendpage.__splice_from_pipe.generic_splice_sendpage.direct_splice_actor
     34.10            +1.7       35.79        perf-profile.calltrace.cycles-pp.kernel_sendpage.sock_sendpage.pipe_to_sendpage.__splice_from_pipe.generic_splice_sendpage
     33.73            +1.7       35.46        perf-profile.calltrace.cycles-pp.inet_sendpage.kernel_sendpage.sock_sendpage.pipe_to_sendpage.__splice_from_pipe
     33.24            +1.8       35.02        perf-profile.calltrace.cycles-pp.tcp_sendpage.inet_sendpage.kernel_sendpage.sock_sendpage.pipe_to_sendpage
      4.46 ±  2%      +2.0        6.42 ±  2%  perf-profile.calltrace.cycles-pp.mem_cgroup_charge_skmem.__sk_mem_raise_allocated.__sk_mem_schedule.tcp_wmem_schedule.tcp_build_frag
     11.28            -0.9       10.40        perf-profile.children.cycles-pp.__skb_datagram_iter
     11.30            -0.9       10.42        perf-profile.children.cycles-pp.skb_copy_datagram_iter
     29.47            -0.7       28.77        perf-profile.children.cycles-pp.tcp_recvmsg_locked
      7.09            -0.5        6.56        perf-profile.children.cycles-pp._copy_to_iter
      6.70            -0.5        6.20        perf-profile.children.cycles-pp.copyout
      7.44            -0.5        6.94        perf-profile.children.cycles-pp.generic_file_splice_read
      6.58            -0.4        6.15        perf-profile.children.cycles-pp.filemap_read
      3.26            -0.3        2.97        perf-profile.children.cycles-pp.simple_copy_to_iter
      3.16            -0.3        2.88        perf-profile.children.cycles-pp.__check_object_size
      2.93            -0.2        2.70        perf-profile.children.cycles-pp.filemap_get_read_batch
      2.65 ±  2%      -0.2        2.42        perf-profile.children.cycles-pp.check_heap_object
      3.16            -0.2        2.93        perf-profile.children.cycles-pp.filemap_get_pages
      1.32            -0.1        1.22        perf-profile.children.cycles-pp.tcp_send_mss
      1.33            -0.1        1.23        perf-profile.children.cycles-pp.touch_atime
      1.22 ±  2%      -0.1        1.12        perf-profile.children.cycles-pp.security_file_permission
      5.62            -0.1        5.53        perf-profile.children.cycles-pp.lock_sock_nested
      1.08            -0.1        1.00        perf-profile.children.cycles-pp.atime_needs_update
      1.08            -0.1        1.00        perf-profile.children.cycles-pp.tcp_current_mss
      0.96 ±  3%      -0.1        0.88        perf-profile.children.cycles-pp.apparmor_file_permission
      1.35            -0.1        1.28        perf-profile.children.cycles-pp.copy_page_to_iter_pipe
      0.57 ±  3%      -0.1        0.51        perf-profile.children.cycles-pp._copy_from_user
      0.52            -0.1        0.46 ±  2%  perf-profile.children.cycles-pp.__fsnotify_parent
      1.06            -0.1        1.01        perf-profile.children.cycles-pp.__inet_lookup_established
      0.41            -0.0        0.36        perf-profile.children.cycles-pp.tcp_rate_check_app_limited
      0.52 ±  2%      -0.0        0.48 ±  2%  perf-profile.children.cycles-pp.netperf_sendfile
      0.74            -0.0        0.70        perf-profile.children.cycles-pp.__cond_resched
      0.48            -0.0        0.43        perf-profile.children.cycles-pp.tcp_event_new_data_sent
      0.64            -0.0        0.60        perf-profile.children.cycles-pp.__fget_light
      0.97            -0.0        0.93        perf-profile.children.cycles-pp.__alloc_skb
      0.60 ±  3%      -0.0        0.55 ±  3%  perf-profile.children.cycles-pp.ip_rcv
      0.78            -0.0        0.74        perf-profile.children.cycles-pp.tcp_stream_alloc_skb
      0.38            -0.0        0.34 ±  2%  perf-profile.children.cycles-pp.page_cache_pipe_buf_confirm
      0.59 ±  2%      -0.0        0.55 ±  2%  perf-profile.children.cycles-pp.__entry_text_start
      0.23 ±  5%      -0.0        0.20 ±  2%  perf-profile.children.cycles-pp.xas_load
      0.48            -0.0        0.44        perf-profile.children.cycles-pp.sk_reset_timer
      0.42            -0.0        0.39        perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.74 ±  2%      -0.0        0.71        perf-profile.children.cycles-pp.__kfree_skb
      0.69            -0.0        0.65        perf-profile.children.cycles-pp.read_tsc
      0.45            -0.0        0.42 ±  2%  perf-profile.children.cycles-pp.current_time
      0.57            -0.0        0.54        perf-profile.children.cycles-pp.kmem_cache_alloc_node
      0.40 ±  2%      -0.0        0.38 ±  2%  perf-profile.children.cycles-pp.__virt_addr_valid
      0.81            -0.0        0.78        perf-profile.children.cycles-pp.enqueue_task_fair
      0.43            -0.0        0.40        perf-profile.children.cycles-pp.__mod_timer
      0.38            -0.0        0.36        perf-profile.children.cycles-pp.tcp_established_options
      0.21 ±  2%      -0.0        0.18 ±  2%  perf-profile.children.cycles-pp.sockfd_lookup_light
      0.35            -0.0        0.32 ±  2%  perf-profile.children.cycles-pp.__put_user_8
      0.30 ±  3%      -0.0        0.27        perf-profile.children.cycles-pp.aa_file_perm
      0.48            -0.0        0.46 ±  2%  perf-profile.children.cycles-pp.__tcp_send_ack
      0.49 ±  2%      -0.0        0.47        perf-profile.children.cycles-pp.kmem_cache_free
      0.28 ±  3%      -0.0        0.26 ±  4%  perf-profile.children.cycles-pp.ip_rcv_finish_core
      0.11 ±  6%      -0.0        0.09 ±  5%  perf-profile.children.cycles-pp.xas_start
      0.24            -0.0        0.22 ±  3%  perf-profile.children.cycles-pp.tcp_tso_segs
      0.25            -0.0        0.23        perf-profile.children.cycles-pp.copy_page_to_iter
      0.30            -0.0        0.28 ±  2%  perf-profile.children.cycles-pp.__netif_receive_skb_core
      0.24            -0.0        0.22 ±  2%  perf-profile.children.cycles-pp.sanity
      0.78            -0.0        0.76        perf-profile.children.cycles-pp.page_cache_pipe_buf_release
      0.28 ±  3%      -0.0        0.26        perf-profile.children.cycles-pp.tcp_schedule_loss_probe
      0.27            -0.0        0.26        perf-profile.children.cycles-pp.rcu_all_qs
      0.30            -0.0        0.28        perf-profile.children.cycles-pp.syscall_return_via_sysret
      0.23            -0.0        0.22 ±  2%  perf-profile.children.cycles-pp.set_next_entity
      0.16 ±  3%      -0.0        0.15 ±  5%  perf-profile.children.cycles-pp.skb_release_head_state
      0.15 ±  2%      -0.0        0.14 ±  2%  perf-profile.children.cycles-pp.folio_mark_accessed
      0.08            -0.0        0.07 ±  5%  perf-profile.children.cycles-pp.aa_sk_perm
      0.20 ±  2%      -0.0        0.18 ±  2%  perf-profile.children.cycles-pp._raw_spin_unlock_bh
      0.07            -0.0        0.06        perf-profile.children.cycles-pp.rb_next
      0.05            +0.0        0.06        perf-profile.children.cycles-pp.skb_push
      0.07            +0.0        0.08        perf-profile.children.cycles-pp.cpuidle_governor_latency_req
      0.33            +0.0        0.34        perf-profile.children.cycles-pp.prepare_task_switch
      0.07            +0.0        0.08 ±  5%  perf-profile.children.cycles-pp.switch_fpu_return
      0.11 ±  6%      +0.0        0.12 ±  4%  perf-profile.children.cycles-pp.resched_curr
      0.14 ±  3%      +0.0        0.15 ±  3%  perf-profile.children.cycles-pp.check_preempt_curr
      0.21            +0.0        0.23 ±  2%  perf-profile.children.cycles-pp.ip_output
      0.49 ±  2%      +0.0        0.51 ±  2%  perf-profile.children.cycles-pp._raw_spin_lock
      0.59            +0.0        0.62        perf-profile.children.cycles-pp._raw_spin_lock_irqsave
      0.76 ±  3%      +0.1        0.90 ±  4%  perf-profile.children.cycles-pp.mem_cgroup_uncharge_skmem
      0.31 ±  2%      +0.2        0.47 ± 10%  perf-profile.children.cycles-pp.propagate_protected_usage
     84.35            +0.2       84.55        perf-profile.children.cycles-pp.do_syscall_64
     16.48            +0.2       16.68        perf-profile.children.cycles-pp.__local_bh_enable_ip
     15.96            +0.2       16.20        perf-profile.children.cycles-pp.do_softirq
     15.84            +0.2       16.09        perf-profile.children.cycles-pp.__do_softirq
     15.20            +0.3       15.46        perf-profile.children.cycles-pp.net_rx_action
     17.63            +0.3       17.89        perf-profile.children.cycles-pp.__dev_queue_xmit
     18.00            +0.3       18.29        perf-profile.children.cycles-pp.ip_finish_output2
     18.93            +0.3       19.22        perf-profile.children.cycles-pp.__ip_queue_xmit
     20.12            +0.3       20.43        perf-profile.children.cycles-pp.__tcp_transmit_skb
     12.38            +0.3       12.69        perf-profile.children.cycles-pp.tcp_write_xmit
     13.56            +0.3       13.87        perf-profile.children.cycles-pp.__napi_poll
     13.51            +0.3       13.83        perf-profile.children.cycles-pp.process_backlog
     13.12            +0.3       13.44        perf-profile.children.cycles-pp.__netif_receive_skb_one_core
     12.12            +0.4       12.51        perf-profile.children.cycles-pp.ip_local_deliver_finish
     12.08            +0.4       12.47        perf-profile.children.cycles-pp.ip_protocol_deliver_rcu
     11.84            +0.4       12.24        perf-profile.children.cycles-pp.tcp_v4_rcv
      3.87            +0.5        4.34        perf-profile.children.cycles-pp.tcp_ack
      2.89            +0.5        3.40 ±  2%  perf-profile.children.cycles-pp.tcp_clean_rtx_queue
      9.78            +0.5       10.31        perf-profile.children.cycles-pp.__tcp_push_pending_frames
      1.54 ±  4%      +0.7        2.26 ±  7%  perf-profile.children.cycles-pp.refill_stock
      1.26 ±  5%      +0.7        1.99 ±  8%  perf-profile.children.cycles-pp.drain_stock
      1.24 ±  5%      +0.7        1.96 ±  8%  perf-profile.children.cycles-pp.page_counter_uncharge
     17.03            +0.8       17.85        perf-profile.children.cycles-pp.do_tcp_sendpages
     46.66            +0.9       47.56        perf-profile.children.cycles-pp.do_splice_direct
      2.92 ±  2%      +0.9        3.86 ±  3%  perf-profile.children.cycles-pp.__sk_mem_reduce_allocated
     46.08            +0.9       47.03        perf-profile.children.cycles-pp.splice_direct_to_actor
      4.41            +1.0        5.43 ±  4%  perf-profile.children.cycles-pp.tcp_data_queue
     10.88            +1.3       12.18        perf-profile.children.cycles-pp.tcp_build_frag
     16.59            +1.4       17.98        perf-profile.children.cycles-pp.tcp_v4_do_rcv
     16.36            +1.4       17.77        perf-profile.children.cycles-pp.tcp_rcv_established
      7.93            +1.5        9.40        perf-profile.children.cycles-pp.tcp_wmem_schedule
      1.52 ±  4%      +1.5        2.98 ±  8%  perf-profile.children.cycles-pp.page_counter_try_charge
     36.96            +1.5       38.45        perf-profile.children.cycles-pp.generic_splice_sendpage
     37.07            +1.5       38.56        perf-profile.children.cycles-pp.direct_splice_actor
      7.75            +1.5        9.24        perf-profile.children.cycles-pp.__sk_mem_schedule
      7.59            +1.5        9.10        perf-profile.children.cycles-pp.__sk_mem_raise_allocated
     36.59            +1.5       38.12        perf-profile.children.cycles-pp.__splice_from_pipe
     11.95            +1.5       13.48 ±  2%  perf-profile.children.cycles-pp.release_sock
     10.33            +1.6       11.89 ±  2%  perf-profile.children.cycles-pp.__release_sock
     34.67            +1.7       36.32        perf-profile.children.cycles-pp.pipe_to_sendpage
     34.34            +1.7       36.02        perf-profile.children.cycles-pp.sock_sendpage
     34.15            +1.7       35.84        perf-profile.children.cycles-pp.kernel_sendpage
     33.84            +1.7       35.56        perf-profile.children.cycles-pp.inet_sendpage
     33.40            +1.8       35.16        perf-profile.children.cycles-pp.tcp_sendpage
      3.31 ±  4%      +2.6        5.93 ±  7%  perf-profile.children.cycles-pp.try_charge_memcg
      6.82            +3.0        9.82 ±  3%  perf-profile.children.cycles-pp.mem_cgroup_charge_skmem
      6.66            -0.5        6.15        perf-profile.self.cycles-pp.copyout
      2.88            -0.4        2.44 ±  2%  perf-profile.self.cycles-pp.__sk_mem_raise_allocated
      2.69            -0.2        2.50        perf-profile.self.cycles-pp.filemap_get_read_batch
      2.14 ±  2%      -0.2        1.95 ±  2%  perf-profile.self.cycles-pp.check_heap_object
      2.01            -0.1        1.88        perf-profile.self.cycles-pp.tcp_build_frag
      1.30            -0.1        1.22        perf-profile.self.cycles-pp.filemap_read
      1.04            -0.1        0.96        perf-profile.self.cycles-pp.do_sendfile
      0.70            -0.1        0.63 ±  2%  perf-profile.self.cycles-pp.__splice_from_pipe
      0.52            -0.1        0.46 ±  2%  perf-profile.self.cycles-pp.sendfile_tcp_stream
      0.75            -0.1        0.70        perf-profile.self.cycles-pp.do_tcp_sendpages
      0.55 ±  2%      -0.1        0.50 ±  2%  perf-profile.self.cycles-pp._copy_from_user
      0.42 ±  4%      -0.1        0.36 ±  2%  perf-profile.self.cycles-pp.sendfile
      0.67 ±  3%      -0.1        0.62 ±  2%  perf-profile.self.cycles-pp.apparmor_file_permission
      1.11            -0.0        1.06        perf-profile.self.cycles-pp.copy_page_to_iter_pipe
      0.54 ±  2%      -0.0        0.49        perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.48            -0.0        0.43 ±  2%  perf-profile.self.cycles-pp.__fsnotify_parent
      0.80 ±  2%      -0.0        0.75        perf-profile.self.cycles-pp.__skb_datagram_iter
      0.81            -0.0        0.76        perf-profile.self.cycles-pp.tcp_write_xmit
      0.95            -0.0        0.91        perf-profile.self.cycles-pp.__inet_lookup_established
      0.62            -0.0        0.58        perf-profile.self.cycles-pp.__fget_light
      0.36            -0.0        0.32        perf-profile.self.cycles-pp.tcp_rate_check_app_limited
      0.34            -0.0        0.30        perf-profile.self.cycles-pp.inet_sendpage
      0.47            -0.0        0.43 ±  2%  perf-profile.self.cycles-pp.netperf_sendfile
      0.49 ±  5%      -0.0        0.45        perf-profile.self.cycles-pp.net_rx_action
      0.48            -0.0        0.44        perf-profile.self.cycles-pp.atime_needs_update
      0.67            -0.0        0.63        perf-profile.self.cycles-pp.tcp_v4_rcv
      0.43 ±  3%      -0.0        0.40        perf-profile.self.cycles-pp.do_syscall_64
      0.41            -0.0        0.37        perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.36 ±  2%      -0.0        0.32 ±  2%  perf-profile.self.cycles-pp.page_cache_pipe_buf_confirm
      0.46            -0.0        0.42        perf-profile.self.cycles-pp.__local_bh_enable_ip
      0.48 ±  2%      -0.0        0.45        perf-profile.self.cycles-pp.tcp_sendpage
      0.45            -0.0        0.42        perf-profile.self.cycles-pp.tcp_current_mss
      0.31 ±  2%      -0.0        0.28        perf-profile.self.cycles-pp.kernel_sendpage
      0.34            -0.0        0.31 ±  2%  perf-profile.self.cycles-pp.__put_user_8
      0.66            -0.0        0.63        perf-profile.self.cycles-pp.read_tsc
      0.40            -0.0        0.37 ±  2%  perf-profile.self.cycles-pp.__check_object_size
      0.33            -0.0        0.30        perf-profile.self.cycles-pp.generic_splice_sendpage
      0.31            -0.0        0.28 ±  2%  perf-profile.self.cycles-pp.tcp_send_mss
      0.66            -0.0        0.64        perf-profile.self.cycles-pp.tcp_ack
      0.28 ±  2%      -0.0        0.25 ±  2%  perf-profile.self.cycles-pp.__sys_recvfrom
      0.44            -0.0        0.42 ±  2%  perf-profile.self.cycles-pp.__cond_resched
      0.39            -0.0        0.36        perf-profile.self.cycles-pp._copy_to_iter
      0.34 ±  2%      -0.0        0.32 ±  2%  perf-profile.self.cycles-pp.tcp_established_options
      0.24 ±  2%      -0.0        0.21 ±  4%  perf-profile.self.cycles-pp.tcp_wmem_schedule
      0.48 ±  2%      -0.0        0.46        perf-profile.self.cycles-pp.kmem_cache_free
      0.33            -0.0        0.31        perf-profile.self.cycles-pp.pipe_to_sendpage
      0.11 ±  6%      -0.0        0.09        perf-profile.self.cycles-pp.check_stack_object
      0.36            -0.0        0.34 ±  3%  perf-profile.self.cycles-pp.release_sock
      0.26            -0.0        0.24 ±  2%  perf-profile.self.cycles-pp.security_file_permission
      0.23            -0.0        0.21 ±  3%  perf-profile.self.cycles-pp.tcp_tso_segs
      0.44            -0.0        0.42        perf-profile.self.cycles-pp.kmem_cache_alloc_node
      0.31            -0.0        0.29 ±  2%  perf-profile.self.cycles-pp.current_time
      0.20 ±  3%      -0.0        0.18 ±  2%  perf-profile.self.cycles-pp.do_splice_direct
      0.25 ±  4%      -0.0        0.23 ±  2%  perf-profile.self.cycles-pp.aa_file_perm
      0.25 ±  3%      -0.0        0.23 ±  2%  perf-profile.self.cycles-pp.touch_atime
      0.19            -0.0        0.17 ±  2%  perf-profile.self.cycles-pp.process_backlog
      0.11 ±  6%      -0.0        0.09 ±  5%  perf-profile.self.cycles-pp.__get_task_ioprio
      0.21            -0.0        0.19        perf-profile.self.cycles-pp.sanity
      0.06            -0.0        0.04 ± 44%  perf-profile.self.cycles-pp.aa_sk_perm
      0.33 ±  2%      -0.0        0.31 ±  2%  perf-profile.self.cycles-pp.splice_direct_to_actor
      0.30            -0.0        0.28        perf-profile.self.cycles-pp.syscall_return_via_sysret
      0.22            -0.0        0.20 ±  2%  perf-profile.self.cycles-pp.copy_page_to_iter
      0.16 ±  2%      -0.0        0.14 ±  3%  perf-profile.self.cycles-pp.tcp_stream_alloc_skb
      0.11 ±  3%      -0.0        0.10 ±  5%  perf-profile.self.cycles-pp.ip_protocol_deliver_rcu
      0.09 ±  6%      -0.0        0.07 ±  6%  perf-profile.self.cycles-pp.xas_start
      0.15 ±  2%      -0.0        0.14 ±  3%  perf-profile.self.cycles-pp.__sk_mem_schedule
      0.65            -0.0        0.63        perf-profile.self.cycles-pp.tcp_rcv_established
      0.25            -0.0        0.23        perf-profile.self.cycles-pp.__mod_timer
      0.15            -0.0        0.14 ±  3%  perf-profile.self.cycles-pp.tcp_tx_timestamp
      0.30            -0.0        0.28 ±  2%  perf-profile.self.cycles-pp.__netif_receive_skb_core
      0.75            -0.0        0.74        perf-profile.self.cycles-pp.page_cache_pipe_buf_release
      0.18 ±  2%      -0.0        0.17 ±  2%  perf-profile.self.cycles-pp.sock_sendpage
      0.13 ±  2%      -0.0        0.12        perf-profile.self.cycles-pp._raw_spin_unlock_bh
      0.12 ±  3%      -0.0        0.11        perf-profile.self.cycles-pp.folio_mark_accessed
      0.12            -0.0        0.11 ±  3%  perf-profile.self.cycles-pp.simple_copy_to_iter
      0.06            -0.0        0.05        perf-profile.self.cycles-pp.splice_from_pipe_next
      0.11            -0.0        0.10        perf-profile.self.cycles-pp.exit_to_user_mode_prepare
      0.25            +0.0        0.26        perf-profile.self.cycles-pp.__switch_to
      0.06 ±  8%      +0.0        0.07 ±  6%  perf-profile.self.cycles-pp.switch_fpu_return
      0.44 ±  2%      +0.0        0.46        perf-profile.self.cycles-pp._raw_spin_lock
      0.33            +0.0        0.36 ±  2%  perf-profile.self.cycles-pp.__schedule
      0.58            +0.0        0.61        perf-profile.self.cycles-pp._raw_spin_lock_irqsave
      0.34 ±  3%      +0.0        0.38        perf-profile.self.cycles-pp.__x64_sys_sendfile64
      0.16 ±  8%      +0.0        0.20 ±  2%  perf-profile.self.cycles-pp.do_splice_to
      0.65 ±  2%      +0.1        0.73 ±  2%  perf-profile.self.cycles-pp.__sk_mem_reduce_allocated
      0.71 ±  3%      +0.1        0.84 ±  5%  perf-profile.self.cycles-pp.mem_cgroup_uncharge_skmem
      0.30 ±  2%      +0.2        0.47 ± 10%  perf-profile.self.cycles-pp.propagate_protected_usage
      3.34 ±  3%      +0.4        3.72 ±  5%  perf-profile.self.cycles-pp.mem_cgroup_charge_skmem
      1.08 ±  6%      +0.7        1.74 ±  8%  perf-profile.self.cycles-pp.page_counter_uncharge
      1.72 ±  3%      +1.2        2.87 ±  7%  perf-profile.self.cycles-pp.try_charge_memcg
      1.36 ±  5%      +1.4        2.73 ±  8%  perf-profile.self.cycles-pp.page_counter_try_charge



[2]

=========================================================================================
cluster/compiler/cpufreq_governor/ip/kconfig/nr_threads/rootfs/runtime/send_size/tbox_group/test/testcase:
  cs-localhost/gcc-11/performance/ipv4/x86_64-rhel-8.3/50%/debian-11.1-x86_64-20220510.cgz/300s/10K/lkp-icl-2sp2/TCP_SENDFILE/netperf

commit: 
  ed23734c23 ("Merge tag 'net-6.4-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net")
  05d72a8bed ("net: Keep sk->sk_forward_alloc as a proper size")

ed23734c23d2fc1e 05d72a8bedfacfc46f300ab38e0 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
  5.95e+09           -12.7%  5.193e+09        cpuidle..time
      3328 ± 22%     +96.7%       6547 ± 21%  numa-vmstat.node2.nr_slab_reclaimable
     13.95            -2.0       11.93        mpstat.cpu.all.idle%
      2.69            +0.6        3.31        mpstat.cpu.all.usr%
   5106176            -6.6%    4769081        vmstat.system.cs
   2629481            -7.3%    2436543        vmstat.system.in
  11284480 ±  9%     +23.7%   13957802 ± 11%  meminfo.DirectMap2M
   1726173 ±  2%     -17.6%    1422506 ±  2%  meminfo.Mapped
   7247621           +11.2%    8061423        meminfo.Shmem
     13314 ± 22%     +96.7%      26192 ± 21%  numa-meminfo.node2.KReclaimable
     13314 ± 22%     +96.7%      26192 ± 21%  numa-meminfo.node2.SReclaimable
     71128 ±  5%     +28.0%      91013 ±  8%  numa-meminfo.node2.Slab
     15.26            -1.9       13.33        turbostat.C1%
     10.41           -15.8%       8.77        turbostat.CPU%c1
      0.26           +11.5%       0.29        turbostat.IPC
     30.71            -3.2%      29.72        turbostat.RAMWatt
   7854382 ±  2%     +10.3%    8664074 ±  2%  sched_debug.cfs_rq:/.min_vruntime.min
    708120 ±  2%     -15.5%     598098 ±  3%  sched_debug.cfs_rq:/.min_vruntime.stddev
    708203 ±  2%     -15.5%     598191 ±  3%  sched_debug.cfs_rq:/.spread0.stddev
      5317 ±  2%     -11.2%       4722 ±  5%  sched_debug.cpu.avg_idle.min
  10037310 ±  3%     -15.9%    8440803 ±  2%  sched_debug.cpu.nr_switches.max
   1290083 ±  2%     -22.0%    1006686 ±  3%  sched_debug.cpu.nr_switches.stddev
     23218           +29.4%      30043        netperf.Throughput_Mbps
   1485996           +29.4%    1922763        netperf.Throughput_total_Mbps
    160215 ±  3%    +107.9%     333022 ± 15%  netperf.time.involuntary_context_switches
      5567            +2.5%       5707        netperf.time.percent_of_cpu_this_job_got
     16093            +1.2%      16286        netperf.time.system_time
    669.70           +34.0%     897.24        netperf.time.user_time
     35419 ±  3%    +160.8%      92374 ±  5%  netperf.time.voluntary_context_switches
 5.442e+09           +29.4%  7.041e+09        netperf.workload
   2481590            +8.1%    2681600        proc-vmstat.nr_file_pages
   1892119           +10.6%    2092306        proc-vmstat.nr_inactive_anon
    431915 ±  2%     -17.9%     354649 ±  2%  proc-vmstat.nr_mapped
      3064            -4.5%       2927        proc-vmstat.nr_page_table_pages
   1813072           +11.0%    2013082        proc-vmstat.nr_shmem
     35384            +1.3%      35861        proc-vmstat.nr_slab_reclaimable
   1892119           +10.6%    2092306        proc-vmstat.nr_zone_inactive_anon
    491137 ±  2%     -20.0%     393067 ± 17%  proc-vmstat.numa_hint_faults_local
   5593417           +10.7%    6193714        proc-vmstat.numa_hit
   5431644           +10.5%    6001135        proc-vmstat.numa_local
     44132 ±  3%     +18.1%      52128 ±  6%  proc-vmstat.pgactivate
   5733229            +9.9%    6302633        proc-vmstat.pgalloc_normal
      7.00           -22.1%       5.45        perf-stat.i.MPKI
 4.405e+10           +13.7%  5.007e+10        perf-stat.i.branch-instructions
      0.87            -0.1        0.78        perf-stat.i.branch-miss-rate%
 3.795e+08            +1.6%  3.854e+08        perf-stat.i.branch-misses
      6.39            -3.3        3.09 ±  7%  perf-stat.i.cache-miss-rate%
 1.038e+08 ±  2%     -57.7%   43877506 ±  7%  perf-stat.i.cache-misses
 1.633e+09           -12.0%  1.438e+09        perf-stat.i.cache-references
   5163294            -6.8%    4814691        perf-stat.i.context-switches
      1.29           -10.0%       1.16        perf-stat.i.cpi
 3.016e+11            +1.8%  3.072e+11        perf-stat.i.cpu-cycles
     27516 ±  3%     -34.8%      17931        perf-stat.i.cpu-migrations
      2930 ±  2%    +153.5%       7428 ±  7%  perf-stat.i.cycles-between-cache-misses
      0.01            -0.0        0.01 ± 13%  perf-stat.i.dTLB-load-miss-rate%
   7226907           -11.0%    6428694 ± 13%  perf-stat.i.dTLB-load-misses
 6.872e+10           +13.4%  7.791e+10        perf-stat.i.dTLB-loads
      0.00 ±  3%      -0.0        0.00 ±  2%  perf-stat.i.dTLB-store-miss-rate%
    954320 ±  3%     -33.0%     639153 ±  2%  perf-stat.i.dTLB-store-misses
 3.753e+10           +12.5%  4.221e+10        perf-stat.i.dTLB-stores
 2.332e+11           +13.2%  2.639e+11        perf-stat.i.instructions
      0.78           +11.1%       0.86        perf-stat.i.ipc
      2.36            +1.8%       2.40        perf-stat.i.metric.GHz
    263.06 ±  2%     -45.6%     143.14 ±  5%  perf-stat.i.metric.K/sec
      1186           +13.0%       1340        perf-stat.i.metric.M/sec
     95.18            +2.5       97.70        perf-stat.i.node-load-miss-rate%
  15047143 ±  3%     -50.7%    7421607 ±  7%  perf-stat.i.node-load-misses
    736992 ±  4%     -79.2%     153436 ±  5%  perf-stat.i.node-loads
     76.94           -13.8       63.13 ±  5%  perf-stat.i.node-store-miss-rate%
   8866276           -61.9%    3375324 ±  7%  perf-stat.i.node-store-misses
   2808107 ±  7%     -34.1%    1851536 ± 14%  perf-stat.i.node-stores
      7.00           -22.2%       5.45        perf-stat.overall.MPKI
      0.86            -0.1        0.77        perf-stat.overall.branch-miss-rate%
      6.36            -3.3        3.05 ±  7%  perf-stat.overall.cache-miss-rate%
      1.29           -10.0%       1.16        perf-stat.overall.cpi
      2907 ±  2%    +142.1%       7040 ±  7%  perf-stat.overall.cycles-between-cache-misses
      0.01            -0.0        0.01 ± 13%  perf-stat.overall.dTLB-load-miss-rate%
      0.00 ±  3%      -0.0        0.00 ±  2%  perf-stat.overall.dTLB-store-miss-rate%
      0.77           +11.1%       0.86        perf-stat.overall.ipc
     95.33            +2.6       97.97        perf-stat.overall.node-load-miss-rate%
     75.97           -11.3       64.69 ±  4%  perf-stat.overall.node-store-miss-rate%
     12891           -12.6%      11262        perf-stat.overall.path-length
  4.39e+10           +13.7%   4.99e+10        perf-stat.ps.branch-instructions
 3.782e+08            +1.6%  3.841e+08        perf-stat.ps.branch-misses
 1.034e+08 ±  2%     -57.7%   43735005 ±  7%  perf-stat.ps.cache-misses
 1.627e+09           -11.9%  1.433e+09        perf-stat.ps.cache-references
   5145798            -6.8%    4798160        perf-stat.ps.context-switches
 3.006e+11            +1.8%  3.062e+11        perf-stat.ps.cpu-cycles
     27426 ±  3%     -34.8%      17883        perf-stat.ps.cpu-migrations
   7190273           -11.0%    6397079 ± 13%  perf-stat.ps.dTLB-load-misses
 6.849e+10           +13.4%  7.765e+10        perf-stat.ps.dTLB-loads
    950808 ±  3%     -33.0%     637446 ±  2%  perf-stat.ps.dTLB-store-misses
 3.741e+10           +12.5%  4.207e+10        perf-stat.ps.dTLB-stores
 2.324e+11           +13.2%   2.63e+11        perf-stat.ps.instructions
  14992384 ±  3%     -50.7%    7391904 ±  7%  perf-stat.ps.node-load-misses
    734606 ±  4%     -79.2%     153010 ±  5%  perf-stat.ps.node-loads
   8837267           -61.9%    3364441 ±  7%  perf-stat.ps.node-store-misses
   2799494 ±  7%     -34.1%    1845425 ± 14%  perf-stat.ps.node-stores
 7.015e+13           +13.0%   7.93e+13        perf-stat.total.instructions
      7.88            -6.8        1.06 ±  2%  perf-profile.calltrace.cycles-pp.tcp_wmem_schedule.tcp_build_frag.do_tcp_sendpages.tcp_sendpage.inet_sendpage
      7.64            -6.8        0.84        perf-profile.calltrace.cycles-pp.__sk_mem_schedule.tcp_wmem_schedule.tcp_build_frag.do_tcp_sendpages.tcp_sendpage
      7.45            -6.7        0.76 ±  2%  perf-profile.calltrace.cycles-pp.__sk_mem_raise_allocated.__sk_mem_schedule.tcp_wmem_schedule.tcp_build_frag.do_tcp_sendpages
     10.74            -6.3        4.41        perf-profile.calltrace.cycles-pp.tcp_build_frag.do_tcp_sendpages.tcp_sendpage.inet_sendpage.kernel_sendpage
     33.39            -6.1       27.33        perf-profile.calltrace.cycles-pp.tcp_sendpage.inet_sendpage.kernel_sendpage.sock_sendpage.pipe_to_sendpage
     33.88            -6.0       27.93        perf-profile.calltrace.cycles-pp.inet_sendpage.kernel_sendpage.sock_sendpage.pipe_to_sendpage.__splice_from_pipe
     34.25            -5.9       28.39        perf-profile.calltrace.cycles-pp.kernel_sendpage.sock_sendpage.pipe_to_sendpage.__splice_from_pipe.generic_splice_sendpage
     34.43            -5.8       28.61        perf-profile.calltrace.cycles-pp.sock_sendpage.pipe_to_sendpage.__splice_from_pipe.generic_splice_sendpage.direct_splice_actor
     34.75            -5.8       29.00        perf-profile.calltrace.cycles-pp.pipe_to_sendpage.__splice_from_pipe.generic_splice_sendpage.direct_splice_actor.splice_direct_to_actor
     36.66            -5.3       31.34        perf-profile.calltrace.cycles-pp.__splice_from_pipe.generic_splice_sendpage.direct_splice_actor.splice_direct_to_actor.do_splice_direct
     37.08            -5.2       31.85        perf-profile.calltrace.cycles-pp.generic_splice_sendpage.direct_splice_actor.splice_direct_to_actor.do_splice_direct.do_sendfile
     37.20            -5.2       32.00        perf-profile.calltrace.cycles-pp.direct_splice_actor.splice_direct_to_actor.do_splice_direct.do_sendfile.__x64_sys_sendfile64
     16.95            -5.1       11.89        perf-profile.calltrace.cycles-pp.do_tcp_sendpages.tcp_sendpage.inet_sendpage.kernel_sendpage.sock_sendpage
      8.23            -2.6        5.67 ±  2%  perf-profile.calltrace.cycles-pp.__release_sock.release_sock.tcp_sendpage.inet_sendpage.kernel_sendpage
     46.36            -2.5       43.86        perf-profile.calltrace.cycles-pp.splice_direct_to_actor.do_splice_direct.do_sendfile.__x64_sys_sendfile64.do_syscall_64
     46.96            -2.4       44.58        perf-profile.calltrace.cycles-pp.do_splice_direct.do_sendfile.__x64_sys_sendfile64.do_syscall_64.entry_SYSCALL_64_after_hwframe
      9.59            -2.3        7.24 ±  2%  perf-profile.calltrace.cycles-pp.release_sock.tcp_sendpage.inet_sendpage.kernel_sendpage.sock_sendpage
      2.87            -2.1        0.76        perf-profile.calltrace.cycles-pp.tcp_data_queue.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv.ip_protocol_deliver_rcu
     51.58            -2.0       49.62        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.sendfile.sendfile_tcp_stream.main.__libc_start_main
     49.43            -1.9       47.48        perf-profile.calltrace.cycles-pp.do_sendfile.__x64_sys_sendfile64.do_syscall_64.entry_SYSCALL_64_after_hwframe.sendfile
     51.31            -1.9       49.37        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.sendfile.sendfile_tcp_stream.main
      6.07            -1.8        4.22 ±  2%  perf-profile.calltrace.cycles-pp.__tcp_push_pending_frames.tcp_rcv_established.tcp_v4_do_rcv.__release_sock.release_sock
      6.04            -1.8        4.20 ±  2%  perf-profile.calltrace.cycles-pp.tcp_write_xmit.__tcp_push_pending_frames.tcp_rcv_established.tcp_v4_do_rcv.__release_sock
     52.41            -1.8       50.64        perf-profile.calltrace.cycles-pp.sendfile.sendfile_tcp_stream.main.__libc_start_main
     50.66            -1.7       48.91        perf-profile.calltrace.cycles-pp.__x64_sys_sendfile64.do_syscall_64.entry_SYSCALL_64_after_hwframe.sendfile.sendfile_tcp_stream
      1.99            -1.5        0.48 ± 44%  perf-profile.calltrace.cycles-pp.tcp_v4_do_rcv.__release_sock.release_sock.tcp_recvmsg.inet_recvmsg
     53.77            -1.5       52.28        perf-profile.calltrace.cycles-pp.sendfile_tcp_stream.main.__libc_start_main
      1.88 ±  2%      -1.5        0.42 ± 44%  perf-profile.calltrace.cycles-pp.tcp_rcv_established.tcp_v4_do_rcv.__release_sock.release_sock.tcp_recvmsg
      5.64            -1.5        4.19 ±  2%  perf-profile.calltrace.cycles-pp.ip_finish_output2.__ip_queue_xmit.__tcp_transmit_skb.tcp_write_xmit.__tcp_push_pending_frames
      6.14            -1.4        4.71 ±  2%  perf-profile.calltrace.cycles-pp.__ip_queue_xmit.__tcp_transmit_skb.tcp_write_xmit.__tcp_push_pending_frames.tcp_rcv_established
      5.67            -1.4        4.24 ±  2%  perf-profile.calltrace.cycles-pp.tcp_rcv_established.tcp_v4_do_rcv.__release_sock.release_sock.tcp_sendpage
      2.08            -1.4        0.68 ±  8%  perf-profile.calltrace.cycles-pp.__release_sock.release_sock.tcp_recvmsg.inet_recvmsg.sock_recvmsg
      5.66            -1.4        4.28 ±  2%  perf-profile.calltrace.cycles-pp.tcp_v4_do_rcv.__release_sock.release_sock.tcp_sendpage.inet_sendpage
      2.22            -1.4        0.84 ±  8%  perf-profile.calltrace.cycles-pp.release_sock.tcp_recvmsg.inet_recvmsg.sock_recvmsg.__sys_recvfrom
      7.37            -1.3        6.07 ±  3%  perf-profile.calltrace.cycles-pp.__tcp_transmit_skb.tcp_write_xmit.__tcp_push_pending_frames.tcp_rcv_established.tcp_v4_do_rcv
     12.84            -1.2       11.64        perf-profile.calltrace.cycles-pp.asm_sysvec_call_function_single.acpi_safe_halt.acpi_idle_enter.cpuidle_enter_state.cpuidle_enter
      7.52            -1.1        6.41 ±  2%  perf-profile.calltrace.cycles-pp.__dev_queue_xmit.ip_finish_output2.__ip_queue_xmit.__tcp_transmit_skb.tcp_write_xmit
     11.36            -1.0       10.31        perf-profile.calltrace.cycles-pp.start_secondary.secondary_startup_64_no_verify
     11.35            -1.0       10.30        perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
     11.47            -1.0       10.43        perf-profile.calltrace.cycles-pp.secondary_startup_64_no_verify
     11.32            -1.0       10.28        perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
      9.96            -0.9        9.02        perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
      9.10            -0.9        8.24        perf-profile.calltrace.cycles-pp.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary
      9.03            -0.9        8.18        perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry
      8.78            -0.8        7.95        perf-profile.calltrace.cycles-pp.acpi_idle_enter.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
      1.03            -0.6        0.43 ± 44%  perf-profile.calltrace.cycles-pp.__wake_up_common.__wake_up_common_lock.sock_def_readable.tcp_data_queue.tcp_rcv_established
      1.19            -0.6        0.59 ±  2%  perf-profile.calltrace.cycles-pp.sock_def_readable.tcp_data_queue.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv
      1.32            -0.6        0.75 ±  2%  perf-profile.calltrace.cycles-pp.tcp_ack.tcp_rcv_established.tcp_v4_do_rcv.__release_sock.release_sock
      1.08            -0.5        0.54        perf-profile.calltrace.cycles-pp.__wake_up_common_lock.sock_def_readable.tcp_data_queue.tcp_rcv_established.tcp_v4_do_rcv
      1.12            -0.5        0.59 ±  3%  perf-profile.calltrace.cycles-pp.tcp_clean_rtx_queue.tcp_ack.tcp_rcv_established.tcp_v4_do_rcv.__release_sock
      2.46            -0.5        2.00 ±  8%  perf-profile.calltrace.cycles-pp.wait_woken.sk_wait_data.tcp_recvmsg_locked.tcp_recvmsg.inet_recvmsg
      2.24            -0.4        1.80 ±  8%  perf-profile.calltrace.cycles-pp.schedule_timeout.wait_woken.sk_wait_data.tcp_recvmsg_locked.tcp_recvmsg
      2.19            -0.4        1.75 ±  8%  perf-profile.calltrace.cycles-pp.schedule.schedule_timeout.wait_woken.sk_wait_data.tcp_recvmsg_locked
      2.08            -0.4        1.65 ±  7%  perf-profile.calltrace.cycles-pp.__schedule.schedule.schedule_timeout.wait_woken.sk_wait_data
      3.07            -0.4        2.65        perf-profile.calltrace.cycles-pp.sk_wait_data.tcp_recvmsg_locked.tcp_recvmsg.inet_recvmsg.sock_recvmsg
      1.69            -0.4        1.32 ±  8%  perf-profile.calltrace.cycles-pp.tcp_clean_rtx_queue.tcp_ack.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv
      3.56            -0.3        3.27        perf-profile.calltrace.cycles-pp.acpi_safe_halt.acpi_idle_enter.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call
      8.87            -0.3        8.62        perf-profile.calltrace.cycles-pp.tcp_v4_do_rcv.tcp_v4_rcv.ip_protocol_deliver_rcu.ip_local_deliver_finish.__netif_receive_skb_one_core
      2.17            -0.2        1.96 ±  8%  perf-profile.calltrace.cycles-pp.tcp_ack.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv.ip_protocol_deliver_rcu
      8.73            -0.2        8.51        perf-profile.calltrace.cycles-pp.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv.ip_protocol_deliver_rcu.ip_local_deliver_finish
      0.69            -0.2        0.53 ± 44%  perf-profile.calltrace.cycles-pp.dequeue_task_fair.__schedule.schedule.schedule_timeout.wait_woken
      0.60            -0.1        0.46 ± 44%  perf-profile.calltrace.cycles-pp.dequeue_entity.dequeue_task_fair.__schedule.schedule.schedule_timeout
      2.34            -0.1        2.23        perf-profile.calltrace.cycles-pp.sysvec_call_function_single.asm_sysvec_call_function_single.acpi_safe_halt.acpi_idle_enter.cpuidle_enter_state
      0.99            -0.1        0.92        perf-profile.calltrace.cycles-pp.schedule_idle.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
      1.78            -0.1        1.70        perf-profile.calltrace.cycles-pp.__sysvec_call_function_single.sysvec_call_function_single.asm_sysvec_call_function_single.acpi_safe_halt.acpi_idle_enter
      0.93            -0.1        0.85        perf-profile.calltrace.cycles-pp.__schedule.schedule_idle.do_idle.cpu_startup_entry.start_secondary
      0.60            -0.1        0.54        perf-profile.calltrace.cycles-pp.menu_select.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary
      1.21            -0.1        1.16        perf-profile.calltrace.cycles-pp.sched_ttwu_pending.__sysvec_call_function_single.sysvec_call_function_single.asm_sysvec_call_function_single.acpi_safe_halt
      0.95            -0.0        0.90        perf-profile.calltrace.cycles-pp.ttwu_do_activate.sched_ttwu_pending.__sysvec_call_function_single.sysvec_call_function_single.asm_sysvec_call_function_single
      0.69            -0.0        0.66        perf-profile.calltrace.cycles-pp.napi_consume_skb.net_rx_action.__do_softirq.do_softirq.__local_bh_enable_ip
      0.78            -0.0        0.75        perf-profile.calltrace.cycles-pp.enqueue_task_fair.activate_task.ttwu_do_activate.sched_ttwu_pending.__sysvec_call_function_single
      0.53            +0.0        0.55        perf-profile.calltrace.cycles-pp.enqueue_entity.enqueue_task_fair.activate_task.ttwu_do_activate.sched_ttwu_pending
      0.58            +0.0        0.61 ±  2%  perf-profile.calltrace.cycles-pp.__alloc_skb.tcp_stream_alloc_skb.tcp_build_frag.do_tcp_sendpages.tcp_sendpage
      0.79            +0.1        0.90 ±  2%  perf-profile.calltrace.cycles-pp.tcp_stream_alloc_skb.tcp_build_frag.do_tcp_sendpages.tcp_sendpage.inet_sendpage
      0.77            +0.1        0.90 ±  3%  perf-profile.calltrace.cycles-pp.page_cache_pipe_buf_release.__splice_from_pipe.generic_splice_sendpage.direct_splice_actor.splice_direct_to_actor
      1.04            +0.1        1.18 ±  2%  perf-profile.calltrace.cycles-pp._raw_spin_lock_bh.release_sock.tcp_sendpage.inet_sendpage.kernel_sendpage
      0.96            +0.2        1.12 ±  2%  perf-profile.calltrace.cycles-pp.tcp_current_mss.tcp_send_mss.do_tcp_sendpages.tcp_sendpage.inet_sendpage
      0.71            +0.2        0.89        perf-profile.calltrace.cycles-pp.do_splice_to.splice_direct_to_actor.do_splice_direct.do_sendfile.__x64_sys_sendfile64
      0.41 ± 50%      +0.2        0.64 ±  2%  perf-profile.calltrace.cycles-pp._copy_from_user.__x64_sys_sendfile64.do_syscall_64.entry_SYSCALL_64_after_hwframe.sendfile
      0.41 ± 50%      +0.2        0.65        perf-profile.calltrace.cycles-pp.security_file_permission.do_sendfile.__x64_sys_sendfile64.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.32            +0.2        1.56        perf-profile.calltrace.cycles-pp.tcp_send_mss.do_tcp_sendpages.tcp_sendpage.inet_sendpage.kernel_sendpage
     15.63            +0.3       15.90        perf-profile.calltrace.cycles-pp.__do_softirq.do_softirq.__local_bh_enable_ip.__dev_queue_xmit.ip_finish_output2
      1.10            +0.3        1.37 ±  2%  perf-profile.calltrace.cycles-pp.tcp_write_xmit.__tcp_push_pending_frames.do_tcp_sendpages.tcp_sendpage.inet_sendpage
     15.79            +0.3       16.06        perf-profile.calltrace.cycles-pp.do_softirq.__local_bh_enable_ip.__dev_queue_xmit.ip_finish_output2.__ip_queue_xmit
      1.18            +0.3        1.45 ±  2%  perf-profile.calltrace.cycles-pp.__tcp_push_pending_frames.do_tcp_sendpages.tcp_sendpage.inet_sendpage.kernel_sendpage
     15.88            +0.3       16.16        perf-profile.calltrace.cycles-pp.__local_bh_enable_ip.__dev_queue_xmit.ip_finish_output2.__ip_queue_xmit.__tcp_transmit_skb
      0.31 ± 81%      +0.3        0.60 ±  2%  perf-profile.calltrace.cycles-pp.touch_atime.splice_direct_to_actor.do_splice_direct.do_sendfile.__x64_sys_sendfile64
      1.29            +0.3        1.60        perf-profile.calltrace.cycles-pp.copy_page_to_iter_pipe.filemap_read.generic_file_splice_read.splice_direct_to_actor.do_splice_direct
      2.14            +0.3        2.48 ±  2%  perf-profile.calltrace.cycles-pp.ip_finish_output2.__ip_queue_xmit.__tcp_transmit_skb.tcp_write_xmit.do_tcp_sendpages
      2.23            +0.4        2.60        perf-profile.calltrace.cycles-pp.__ip_queue_xmit.__tcp_transmit_skb.tcp_write_xmit.do_tcp_sendpages.tcp_sendpage
      2.42            +0.4        2.86 ±  2%  perf-profile.calltrace.cycles-pp.__tcp_transmit_skb.tcp_write_xmit.do_tcp_sendpages.tcp_sendpage.inet_sendpage
      2.66            +0.5        3.20 ±  2%  perf-profile.calltrace.cycles-pp.tcp_write_xmit.do_tcp_sendpages.tcp_sendpage.inet_sendpage.kernel_sendpage
      0.00            +0.5        0.54 ±  2%  perf-profile.calltrace.cycles-pp.__fget_light.do_sendfile.__x64_sys_sendfile64.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.00            +0.6        0.56 ±  2%  perf-profile.calltrace.cycles-pp.__entry_text_start.sendfile.sendfile_tcp_stream.main.__libc_start_main
      4.35            +0.6        4.96 ±  2%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_bh.lock_sock_nested.tcp_sendpage.inet_sendpage
      0.00            +0.7        0.74 ±  3%  perf-profile.calltrace.cycles-pp.try_to_wake_up.__wake_up_common.__wake_up_common_lock.sock_def_readable.tcp_rcv_established
      5.15            +0.8        5.93 ±  2%  perf-profile.calltrace.cycles-pp._raw_spin_lock_bh.lock_sock_nested.tcp_sendpage.inet_sendpage.kernel_sendpage
      0.00            +0.8        0.84 ±  3%  perf-profile.calltrace.cycles-pp.__wake_up_common.__wake_up_common_lock.sock_def_readable.tcp_rcv_established.tcp_v4_do_rcv
      2.47            +0.8        3.31        perf-profile.calltrace.cycles-pp.tcp_write_xmit.__tcp_push_pending_frames.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv
      2.49            +0.8        3.34        perf-profile.calltrace.cycles-pp.__tcp_push_pending_frames.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv.ip_protocol_deliver_rcu
      5.49            +0.9        6.34 ±  2%  perf-profile.calltrace.cycles-pp.lock_sock_nested.tcp_sendpage.inet_sendpage.kernel_sendpage.sock_sendpage
      0.00            +0.9        0.88 ±  3%  perf-profile.calltrace.cycles-pp.__wake_up_common_lock.sock_def_readable.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv
      2.61 ±  2%      +0.9        3.53 ±  3%  perf-profile.calltrace.cycles-pp.check_heap_object.__check_object_size.simple_copy_to_iter.__skb_datagram_iter.skb_copy_datagram_iter
      0.00            +0.9        0.94 ±  2%  perf-profile.calltrace.cycles-pp.sock_def_readable.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv.ip_protocol_deliver_rcu
      2.98            +1.0        4.00 ±  2%  perf-profile.calltrace.cycles-pp.__check_object_size.simple_copy_to_iter.__skb_datagram_iter.skb_copy_datagram_iter.tcp_recvmsg_locked
      2.91            +1.0        3.94        perf-profile.calltrace.cycles-pp.filemap_get_read_batch.filemap_get_pages.filemap_read.generic_file_splice_read.splice_direct_to_actor
     10.13            +1.0       11.17        perf-profile.calltrace.cycles-pp.__tcp_transmit_skb.tcp_recvmsg_locked.tcp_recvmsg.inet_recvmsg.sock_recvmsg
      3.14            +1.1        4.21        perf-profile.calltrace.cycles-pp.filemap_get_pages.filemap_read.generic_file_splice_read.splice_direct_to_actor.do_splice_direct
      3.24            +1.1        4.32 ±  2%  perf-profile.calltrace.cycles-pp.simple_copy_to_iter.__skb_datagram_iter.skb_copy_datagram_iter.tcp_recvmsg_locked.tcp_recvmsg
     10.38            +1.3       11.66        perf-profile.calltrace.cycles-pp.__ip_queue_xmit.__tcp_transmit_skb.tcp_recvmsg_locked.tcp_recvmsg.inet_recvmsg
     10.07            +1.3       11.41        perf-profile.calltrace.cycles-pp.ip_finish_output2.__ip_queue_xmit.__tcp_transmit_skb.tcp_recvmsg_locked.tcp_recvmsg
      9.94            +1.3       11.28        perf-profile.calltrace.cycles-pp.__dev_queue_xmit.ip_finish_output2.__ip_queue_xmit.__tcp_transmit_skb.tcp_recvmsg_locked
      6.53            +1.6        8.18        perf-profile.calltrace.cycles-pp.copyout._copy_to_iter.__skb_datagram_iter.skb_copy_datagram_iter.tcp_recvmsg_locked
      7.02            +1.8        8.79        perf-profile.calltrace.cycles-pp._copy_to_iter.__skb_datagram_iter.skb_copy_datagram_iter.tcp_recvmsg_locked.tcp_recvmsg
     31.73            +1.9       33.63        perf-profile.calltrace.cycles-pp.tcp_recvmsg.inet_recvmsg.sock_recvmsg.__sys_recvfrom.__x64_sys_recvfrom
     31.85            +1.9       33.77        perf-profile.calltrace.cycles-pp.inet_recvmsg.sock_recvmsg.__sys_recvfrom.__x64_sys_recvfrom.do_syscall_64
      6.52            +1.9        8.44        perf-profile.calltrace.cycles-pp.filemap_read.generic_file_splice_read.splice_direct_to_actor.do_splice_direct.do_sendfile
     32.06            +1.9       34.00        perf-profile.calltrace.cycles-pp.sock_recvmsg.__sys_recvfrom.__x64_sys_recvfrom.do_syscall_64.entry_SYSCALL_64_after_hwframe
     32.54            +2.0       34.54        perf-profile.calltrace.cycles-pp.__sys_recvfrom.__x64_sys_recvfrom.do_syscall_64.entry_SYSCALL_64_after_hwframe.recv
     32.63            +2.0       34.64        perf-profile.calltrace.cycles-pp.__x64_sys_recvfrom.do_syscall_64.entry_SYSCALL_64_after_hwframe.recv.process_requests
     33.81            +2.0       35.82        perf-profile.calltrace.cycles-pp.recv.process_requests.spawn_child.accept_connection.accept_connections
     32.95            +2.0       34.96        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.recv.process_requests.spawn_child
     33.11            +2.0       35.14        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.recv.process_requests.spawn_child.accept_connection
      7.44            +2.1        9.57        perf-profile.calltrace.cycles-pp.generic_file_splice_read.splice_direct_to_actor.do_splice_direct.do_sendfile.__x64_sys_sendfile64
     11.23            +3.1       14.38        perf-profile.calltrace.cycles-pp.__skb_datagram_iter.skb_copy_datagram_iter.tcp_recvmsg_locked.tcp_recvmsg.inet_recvmsg
     11.30            +3.2       14.48        perf-profile.calltrace.cycles-pp.skb_copy_datagram_iter.tcp_recvmsg_locked.tcp_recvmsg.inet_recvmsg.sock_recvmsg
     29.26            +3.2       32.47        perf-profile.calltrace.cycles-pp.tcp_recvmsg_locked.tcp_recvmsg.inet_recvmsg.sock_recvmsg.__sys_recvfrom
      7.77            -6.8        0.94        perf-profile.children.cycles-pp.__sk_mem_schedule
      7.95            -6.8        1.12        perf-profile.children.cycles-pp.tcp_wmem_schedule
      7.62            -6.7        0.88        perf-profile.children.cycles-pp.__sk_mem_raise_allocated
     10.92            -6.3        4.63        perf-profile.children.cycles-pp.tcp_build_frag
      6.86 ±  2%      -6.2        0.62 ±  2%  perf-profile.children.cycles-pp.mem_cgroup_charge_skmem
     33.63            -5.9       27.72        perf-profile.children.cycles-pp.tcp_sendpage
     34.07            -5.8       28.26        perf-profile.children.cycles-pp.inet_sendpage
     34.39            -5.7       28.65        perf-profile.children.cycles-pp.kernel_sendpage
     34.58            -5.7       28.88        perf-profile.children.cycles-pp.sock_sendpage
     34.90            -5.6       29.28        perf-profile.children.cycles-pp.pipe_to_sendpage
     36.86            -5.2       31.69        perf-profile.children.cycles-pp.__splice_from_pipe
     37.23            -5.1       32.14        perf-profile.children.cycles-pp.generic_splice_sendpage
     37.33            -5.1       32.26        perf-profile.children.cycles-pp.direct_splice_actor
     17.14            -4.9       12.22        perf-profile.children.cycles-pp.do_tcp_sendpages
     10.36            -3.9        6.49        perf-profile.children.cycles-pp.__release_sock
      4.40            -3.6        0.78        perf-profile.children.cycles-pp.tcp_data_queue
     11.99            -3.6        8.40        perf-profile.children.cycles-pp.release_sock
      3.34 ±  4%      -3.1        0.26 ±  2%  perf-profile.children.cycles-pp.try_charge_memcg
     16.59            -3.0       13.62        perf-profile.children.cycles-pp.tcp_v4_do_rcv
     16.37            -2.9       13.46        perf-profile.children.cycles-pp.tcp_rcv_established
     46.40            -2.5       43.91        perf-profile.children.cycles-pp.splice_direct_to_actor
      2.93            -2.5        0.46        perf-profile.children.cycles-pp.__sk_mem_reduce_allocated
     46.99            -2.4       44.62        perf-profile.children.cycles-pp.do_splice_direct
     49.52            -1.9       47.59        perf-profile.children.cycles-pp.do_sendfile
     50.71            -1.7       48.97        perf-profile.children.cycles-pp.__x64_sys_sendfile64
      1.54 ±  5%      -1.5        0.06 ±  6%  perf-profile.children.cycles-pp.page_counter_try_charge
      1.56 ±  3%      -1.4        0.16 ±  2%  perf-profile.children.cycles-pp.refill_stock
      1.29 ±  4%      -1.2        0.06        perf-profile.children.cycles-pp.drain_stock
      1.26 ±  4%      -1.2        0.05        perf-profile.children.cycles-pp.page_counter_uncharge
     52.89            -1.2       51.68        perf-profile.children.cycles-pp.sendfile
     11.36            -1.0       10.31        perf-profile.children.cycles-pp.start_secondary
     11.47            -1.0       10.43        perf-profile.children.cycles-pp.secondary_startup_64_no_verify
     11.47            -1.0       10.43        perf-profile.children.cycles-pp.cpu_startup_entry
     11.45            -1.0       10.41        perf-profile.children.cycles-pp.do_idle
     53.93            -1.0       52.92        perf-profile.children.cycles-pp.sendfile_tcp_stream
      3.85            -0.9        2.91        perf-profile.children.cycles-pp.tcp_ack
     10.07            -0.9        9.13        perf-profile.children.cycles-pp.cpuidle_idle_call
      9.19            -0.9        8.34        perf-profile.children.cycles-pp.cpuidle_enter
      9.13            -0.8        8.28        perf-profile.children.cycles-pp.cpuidle_enter_state
      2.87            -0.8        2.03        perf-profile.children.cycles-pp.tcp_clean_rtx_queue
      8.84            -0.8        8.02        perf-profile.children.cycles-pp.acpi_safe_halt
      8.87            -0.8        8.04        perf-profile.children.cycles-pp.acpi_idle_enter
      7.77            -0.7        7.11        perf-profile.children.cycles-pp.asm_sysvec_call_function_single
      9.79            -0.6        9.14        perf-profile.children.cycles-pp.__tcp_push_pending_frames
      0.75 ±  4%      -0.6        0.14 ±  3%  perf-profile.children.cycles-pp.mem_cgroup_uncharge_skmem
      3.07            -0.4        2.63        perf-profile.children.cycles-pp.__schedule
      3.09            -0.4        2.67        perf-profile.children.cycles-pp.sk_wait_data
      2.47            -0.4        2.08        perf-profile.children.cycles-pp.wait_woken
      2.25            -0.4        1.87        perf-profile.children.cycles-pp.schedule_timeout
      2.20            -0.4        1.83        perf-profile.children.cycles-pp.schedule
      1.10 ±  2%      -0.3        0.82 ±  3%  perf-profile.children.cycles-pp.pick_next_task_fair
      0.73 ±  4%      -0.2        0.48 ±  6%  perf-profile.children.cycles-pp.newidle_balance
      0.28 ± 12%      -0.2        0.09 ±  5%  perf-profile.children.cycles-pp.cgroup_rstat_updated
      2.39            -0.1        2.28        perf-profile.children.cycles-pp.sysvec_call_function_single
      0.30 ±  4%      -0.1        0.20 ±  5%  perf-profile.children.cycles-pp.load_balance
      1.01            -0.1        0.93        perf-profile.children.cycles-pp.schedule_idle
      1.68            -0.1        1.60        perf-profile.children.cycles-pp.sock_def_readable
      1.82            -0.1        1.74        perf-profile.children.cycles-pp.__sysvec_call_function_single
      0.22 ±  5%      -0.1        0.14 ±  5%  perf-profile.children.cycles-pp.find_busiest_group
      1.51            -0.1        1.44        perf-profile.children.cycles-pp.__wake_up_common_lock
      0.20 ±  6%      -0.1        0.13 ±  5%  perf-profile.children.cycles-pp.update_sd_lb_stats
      1.27            -0.1        1.21        perf-profile.children.cycles-pp.try_to_wake_up
      1.43            -0.1        1.37        perf-profile.children.cycles-pp.__wake_up_common
      0.14 ±  3%      -0.1        0.09 ±  7%  perf-profile.children.cycles-pp.update_blocked_averages
      0.61            -0.1        0.56        perf-profile.children.cycles-pp.menu_select
      0.70            -0.1        0.64        perf-profile.children.cycles-pp.dequeue_task_fair
      0.15 ±  4%      -0.1        0.10 ±  5%  perf-profile.children.cycles-pp.update_sg_lb_stats
      0.63            -0.0        0.58        perf-profile.children.cycles-pp.dequeue_entity
      1.25            -0.0        1.20        perf-profile.children.cycles-pp.sched_ttwu_pending
      0.24 ±  2%      -0.0        0.19 ±  3%  perf-profile.children.cycles-pp.tcp_check_space
      0.98            -0.0        0.94        perf-profile.children.cycles-pp.ttwu_do_activate
      0.06            -0.0        0.02 ± 99%  perf-profile.children.cycles-pp.irqentry_exit
      0.30            -0.0        0.27 ±  2%  perf-profile.children.cycles-pp.native_irq_return_iret
      0.52            -0.0        0.48        perf-profile.children.cycles-pp.ttwu_queue_wakelist
      0.43            -0.0        0.40        perf-profile.children.cycles-pp.native_sched_clock
      0.08 ±  5%      -0.0        0.06 ±  9%  perf-profile.children.cycles-pp.raw_spin_rq_lock_nested
      0.22            -0.0        0.20 ±  2%  perf-profile.children.cycles-pp.__switch_to_asm
      0.48            -0.0        0.45        perf-profile.children.cycles-pp.sched_clock_cpu
      0.27            -0.0        0.24        perf-profile.children.cycles-pp.__switch_to
      0.21 ±  2%      -0.0        0.18 ±  4%  perf-profile.children.cycles-pp.___perf_sw_event
      0.11 ±  3%      -0.0        0.09        perf-profile.children.cycles-pp.ct_kernel_exit_state
      0.19 ±  2%      -0.0        0.17 ±  2%  perf-profile.children.cycles-pp.native_apic_msr_eoi_write
      0.29            -0.0        0.27        perf-profile.children.cycles-pp.update_curr
      0.06            -0.0        0.04 ± 44%  perf-profile.children.cycles-pp.update_irq_load_avg
      0.14 ±  2%      -0.0        0.12        perf-profile.children.cycles-pp.update_rq_clock_task
      0.11 ±  4%      -0.0        0.09 ±  7%  perf-profile.children.cycles-pp.resched_curr
      0.13 ±  4%      -0.0        0.11 ±  4%  perf-profile.children.cycles-pp.check_preempt_curr
      0.17 ±  2%      -0.0        0.15 ±  2%  perf-profile.children.cycles-pp.__x2apic_send_IPI_dest
      0.17 ±  2%      -0.0        0.15 ±  3%  perf-profile.children.cycles-pp.__update_load_avg_se
      0.12 ±  4%      -0.0        0.10 ±  3%  perf-profile.children.cycles-pp.finish_task_switch
      0.25            -0.0        0.23 ±  2%  perf-profile.children.cycles-pp.set_next_entity
      0.09            -0.0        0.08        perf-profile.children.cycles-pp.__wrgsbase_inactive
      0.06            -0.0        0.05        perf-profile.children.cycles-pp.ct_idle_exit
      0.10            +0.0        0.11        perf-profile.children.cycles-pp.tcp_chrono_stop
      0.07 ±  5%      +0.0        0.08        perf-profile.children.cycles-pp.rb_next
      0.05 ±  7%      +0.0        0.06 ±  7%  perf-profile.children.cycles-pp.__fdget
      0.08 ±  5%      +0.0        0.09 ±  4%  perf-profile.children.cycles-pp.tcp_rearm_rto
      0.06 ±  8%      +0.0        0.07        perf-profile.children.cycles-pp.rb_first
      1.08            +0.0        1.10        perf-profile.children.cycles-pp.dev_hard_start_xmit
      0.11 ±  4%      +0.0        0.13 ±  2%  perf-profile.children.cycles-pp.inet_ehashfn
      0.07 ±  6%      +0.0        0.09 ±  4%  perf-profile.children.cycles-pp.demo_interval_tick
      0.12 ±  3%      +0.0        0.14 ±  3%  perf-profile.children.cycles-pp.netif_skb_features
      0.28 ±  2%      +0.0        0.30        perf-profile.children.cycles-pp.ip_local_out
      0.09            +0.0        0.10 ±  4%  perf-profile.children.cycles-pp.tcp_queue_rcv
      0.05            +0.0        0.06 ±  7%  perf-profile.children.cycles-pp.__tcp_ack_snd_check
      0.16 ±  3%      +0.0        0.18 ±  2%  perf-profile.children.cycles-pp.ip_send_check
      0.07 ±  7%      +0.0        0.08 ±  4%  perf-profile.children.cycles-pp.tcp_rtt_estimator
      0.06 ±  8%      +0.0        0.07 ±  5%  perf-profile.children.cycles-pp.iov_iter_pipe
      0.24 ±  3%      +0.0        0.26        perf-profile.children.cycles-pp.tcp_rcv_space_adjust
      0.25            +0.0        0.26        perf-profile.children.cycles-pp.__update_load_avg_cfs_rq
      0.15            +0.0        0.17 ±  5%  perf-profile.children.cycles-pp.ipv4_dst_check
      0.06 ±  7%      +0.0        0.08 ±  5%  perf-profile.children.cycles-pp.splice_from_pipe_next
      0.12 ±  3%      +0.0        0.14 ±  3%  perf-profile.children.cycles-pp.tcp_update_skb_after_send
      0.60            +0.0        0.62        perf-profile.children.cycles-pp._raw_spin_lock_irqsave
      0.08            +0.0        0.10        perf-profile.children.cycles-pp.__list_add_valid
      0.11 ±  3%      +0.0        0.13 ±  2%  perf-profile.children.cycles-pp.__get_task_ioprio
      0.36            +0.0        0.38        perf-profile.children.cycles-pp.enqueue_to_backlog
      0.11            +0.0        0.13 ±  2%  perf-profile.children.cycles-pp.syscall_enter_from_user_mode
      0.12            +0.0        0.14 ±  2%  perf-profile.children.cycles-pp.tcp_push
      0.21            +0.0        0.23 ±  3%  perf-profile.children.cycles-pp.exit_to_user_mode_prepare
      0.10 ±  5%      +0.0        0.12 ±  5%  perf-profile.children.cycles-pp.xas_start
      0.10            +0.0        0.12 ±  3%  perf-profile.children.cycles-pp.tcp_update_pacing_rate
      0.06            +0.0        0.08 ±  5%  perf-profile.children.cycles-pp.tcp_event_data_recv
      0.12 ±  4%      +0.0        0.14        perf-profile.children.cycles-pp.tcp_downgrade_zcopy_pure
      0.17 ±  3%      +0.0        0.20 ±  2%  perf-profile.children.cycles-pp.syscall_exit_to_user_mode_prepare
      0.20 ±  2%      +0.0        0.22 ±  4%  perf-profile.children.cycles-pp.sockfd_lookup_light
      0.10 ±  4%      +0.0        0.13        perf-profile.children.cycles-pp.is_vmalloc_addr
      0.10 ±  4%      +0.0        0.13 ±  6%  perf-profile.children.cycles-pp.make_vfsgid
      0.10 ±  3%      +0.0        0.13 ±  2%  perf-profile.children.cycles-pp.make_vfsuid
      0.39            +0.0        0.42        perf-profile.children.cycles-pp.netif_rx_internal
      0.28 ±  2%      +0.0        0.30 ±  3%  perf-profile.children.cycles-pp.recv_tcp_stream
      0.13            +0.0        0.16 ±  4%  perf-profile.children.cycles-pp.check_stack_object
      0.13 ±  3%      +0.0        0.16 ±  2%  perf-profile.children.cycles-pp.tcp_release_cb
      0.12 ±  3%      +0.0        0.15 ±  2%  perf-profile.children.cycles-pp.demo_stream_interval
      0.26 ±  2%      +0.0        0.29 ±  2%  perf-profile.children.cycles-pp.tcp_add_backlog
      0.11 ±  3%      +0.0        0.14 ±  2%  perf-profile.children.cycles-pp.tcp_ack_update_rtt
      0.21            +0.0        0.24 ±  2%  perf-profile.children.cycles-pp.ip_rcv_core
      0.18 ±  2%      +0.0        0.21 ±  3%  perf-profile.children.cycles-pp.__sk_dst_check
      0.07            +0.0        0.10 ±  3%  perf-profile.children.cycles-pp.__tcp_cleanup_rbuf
      0.41            +0.0        0.44        perf-profile.children.cycles-pp.__netif_rx
      0.17 ±  2%      +0.0        0.20        perf-profile.children.cycles-pp.__tcp_select_window
      0.14 ±  3%      +0.0        0.17 ±  3%  perf-profile.children.cycles-pp.tcp_mtu_probe
      0.34            +0.0        0.37        perf-profile.children.cycles-pp.kmalloc_reserve
      0.09 ±  4%      +0.0        0.12 ±  4%  perf-profile.children.cycles-pp.lock_timer_base
      0.17 ±  2%      +0.0        0.21        perf-profile.children.cycles-pp.tcp_tx_timestamp
      0.15            +0.0        0.19 ±  3%  perf-profile.children.cycles-pp.folio_mark_accessed
      0.20 ±  2%      +0.0        0.24        perf-profile.children.cycles-pp._raw_spin_unlock_bh
      0.40            +0.0        0.44        perf-profile.children.cycles-pp.tcp_mstamp_refresh
      0.15 ±  3%      +0.0        0.20 ±  5%  perf-profile.children.cycles-pp.inet_send_prepare
      0.37            +0.0        0.42        perf-profile.children.cycles-pp.__skb_clone
      0.14 ±  2%      +0.0        0.18 ±  5%  perf-profile.children.cycles-pp.ktime_get_coarse_real_ts64
      0.27            +0.0        0.32        perf-profile.children.cycles-pp.validate_xmit_skb
      0.18            +0.0        0.22 ±  2%  perf-profile.children.cycles-pp.fsnotify_perm
      0.17 ±  3%      +0.0        0.22 ±  2%  perf-profile.children.cycles-pp.skb_clone
      0.19 ±  2%      +0.0        0.24 ±  2%  perf-profile.children.cycles-pp.rw_verify_area
      0.23 ±  2%      +0.1        0.28 ±  2%  perf-profile.children.cycles-pp.xas_load
      0.00            +0.1        0.05 ±  8%  perf-profile.children.cycles-pp.tcp_rbtree_insert
      0.28 ±  2%      +0.1        0.34        perf-profile.children.cycles-pp.tcp_schedule_loss_probe
      0.24            +0.1        0.30        perf-profile.children.cycles-pp.sanity
      0.32 ±  2%      +0.1        0.38 ±  2%  perf-profile.children.cycles-pp.dst_release
      0.58            +0.1        0.65        perf-profile.children.cycles-pp.kmem_cache_alloc_node
      0.31            +0.1        0.37        perf-profile.children.cycles-pp.syscall_return_via_sysret
      0.24            +0.1        0.31        perf-profile.children.cycles-pp.tcp_tso_segs
      0.25            +0.1        0.32        perf-profile.children.cycles-pp.copy_page_to_iter
      0.48            +0.1        0.55        perf-profile.children.cycles-pp._raw_spin_lock
      0.28            +0.1        0.35        perf-profile.children.cycles-pp.rcu_all_qs
      0.32 ±  4%      +0.1        0.39 ±  2%  perf-profile.children.cycles-pp.sock_put
      0.50            +0.1        0.57        perf-profile.children.cycles-pp.kmem_cache_free
      0.34 ±  2%      +0.1        0.42        perf-profile.children.cycles-pp.__put_user_8
      0.29 ±  2%      +0.1        0.37 ±  2%  perf-profile.children.cycles-pp.aa_file_perm
      0.49            +0.1        0.57        perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      0.69            +0.1        0.77        perf-profile.children.cycles-pp.read_tsc
      0.16 ±  4%      +0.1        0.25 ±  4%  perf-profile.children.cycles-pp.skb_release_head_state
      0.38            +0.1        0.47        perf-profile.children.cycles-pp.tcp_established_options
      0.41            +0.1        0.50 ±  3%  perf-profile.children.cycles-pp.__virt_addr_valid
      0.48            +0.1        0.58        perf-profile.children.cycles-pp.__tcp_send_ack
      0.42            +0.1        0.52        perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.99            +0.1        1.09        perf-profile.children.cycles-pp.__alloc_skb
      0.52            +0.1        0.64        perf-profile.children.cycles-pp.netperf_sendfile
      0.43            +0.1        0.55        perf-profile.children.cycles-pp.__mod_timer
      0.46            +0.1        0.58        perf-profile.children.cycles-pp.tcp_event_new_data_sent
      0.80            +0.1        0.92        perf-profile.children.cycles-pp.tcp_stream_alloc_skb
      0.47            +0.1        0.60        perf-profile.children.cycles-pp.sk_reset_timer
      0.46 ±  2%      +0.1        0.58 ±  2%  perf-profile.children.cycles-pp.current_time
      0.51            +0.1        0.64        perf-profile.children.cycles-pp.__fsnotify_parent
      0.59            +0.1        0.73        perf-profile.children.cycles-pp.__entry_text_start
      0.54            +0.1        0.68        perf-profile.children.cycles-pp._copy_from_user
      0.41            +0.1        0.54        perf-profile.children.cycles-pp.tcp_rate_check_app_limited
      0.39 ±  2%      +0.1        0.52        perf-profile.children.cycles-pp.page_cache_pipe_buf_confirm
      0.62            +0.1        0.76        perf-profile.children.cycles-pp.__fget_light
      0.79            +0.1        0.94 ±  2%  perf-profile.children.cycles-pp.page_cache_pipe_buf_release
      1.02            +0.2        1.19 ±  4%  perf-profile.children.cycles-pp.ktime_get
      0.97            +0.2        1.14        perf-profile.children.cycles-pp.napi_consume_skb
      0.78            +0.2        0.95        perf-profile.children.cycles-pp.__cond_resched
      1.14            +0.2        1.32        perf-profile.children.cycles-pp.tcp_current_mss
      0.74            +0.2        0.93        perf-profile.children.cycles-pp.do_splice_to
      0.76            +0.2        0.96        perf-profile.children.cycles-pp.__kfree_skb
      0.94 ±  2%      +0.3        1.19        perf-profile.children.cycles-pp.apparmor_file_permission
      1.38            +0.3        1.65        perf-profile.children.cycles-pp.tcp_send_mss
      1.09 ±  2%      +0.3        1.36        perf-profile.children.cycles-pp.atime_needs_update
      1.44            +0.3        1.72        perf-profile.children.cycles-pp.skb_release_data
     15.09            +0.3       15.40        perf-profile.children.cycles-pp.net_rx_action
      1.20            +0.3        1.51        perf-profile.children.cycles-pp.security_file_permission
      1.34            +0.3        1.67        perf-profile.children.cycles-pp.touch_atime
      1.35            +0.3        1.69        perf-profile.children.cycles-pp.copy_page_to_iter_pipe
     15.73            +0.3       16.08        perf-profile.children.cycles-pp.__do_softirq
     17.54            +0.4       17.89        perf-profile.children.cycles-pp.__dev_queue_xmit
     15.84            +0.4       16.20        perf-profile.children.cycles-pp.do_softirq
     17.91            +0.4       18.27        perf-profile.children.cycles-pp.ip_finish_output2
     18.82            +0.4       19.20        perf-profile.children.cycles-pp.__ip_queue_xmit
     20.03            +0.4       20.41        perf-profile.children.cycles-pp.__tcp_transmit_skb
     84.38            +0.4       84.81        perf-profile.children.cycles-pp.do_syscall_64
     16.35            +0.5       16.81        perf-profile.children.cycles-pp.__local_bh_enable_ip
     84.91            +0.5       85.42        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
      4.52            +0.7        5.21        perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
      2.68            +0.9        3.62 ±  3%  perf-profile.children.cycles-pp.check_heap_object
      5.69            +1.0        6.64        perf-profile.children.cycles-pp.lock_sock_nested
      6.77            +1.0        7.80        perf-profile.children.cycles-pp._raw_spin_lock_bh
      3.19            +1.1        4.26 ±  2%  perf-profile.children.cycles-pp.__check_object_size
      2.94            +1.1        4.01        perf-profile.children.cycles-pp.filemap_get_read_batch
      3.29            +1.1        4.38 ±  2%  perf-profile.children.cycles-pp.simple_copy_to_iter
      3.16            +1.1        4.29        perf-profile.children.cycles-pp.filemap_get_pages
      6.68            +1.7        8.36        perf-profile.children.cycles-pp.copyout
      7.06            +1.8        8.85        perf-profile.children.cycles-pp._copy_to_iter
     31.77            +1.9       33.68        perf-profile.children.cycles-pp.tcp_recvmsg
     31.86            +1.9       33.78        perf-profile.children.cycles-pp.inet_recvmsg
     32.07            +1.9       34.01        perf-profile.children.cycles-pp.sock_recvmsg
     32.56            +2.0       34.56        perf-profile.children.cycles-pp.__sys_recvfrom
     32.65            +2.0       34.65        perf-profile.children.cycles-pp.__x64_sys_recvfrom
     33.95            +2.0       35.97        perf-profile.children.cycles-pp.recv
      6.63            +2.0        8.66        perf-profile.children.cycles-pp.filemap_read
     34.18            +2.0       36.23        perf-profile.children.cycles-pp.accept_connections
     34.18            +2.0       36.23        perf-profile.children.cycles-pp.accept_connection
     34.18            +2.0       36.23        perf-profile.children.cycles-pp.spawn_child
     34.18            +2.0       36.23        perf-profile.children.cycles-pp.process_requests
      7.51            +2.2        9.74        perf-profile.children.cycles-pp.generic_file_splice_read
     11.31            +3.2       14.48        perf-profile.children.cycles-pp.skb_copy_datagram_iter
     11.29            +3.2       14.46        perf-profile.children.cycles-pp.__skb_datagram_iter
     29.29            +3.2       32.51        perf-profile.children.cycles-pp.tcp_recvmsg_locked
      3.33 ±  3%      -3.0        0.32 ±  2%  perf-profile.self.cycles-pp.mem_cgroup_charge_skmem
      2.89            -2.6        0.29        perf-profile.self.cycles-pp.__sk_mem_raise_allocated
      1.72 ±  4%      -1.5        0.18 ±  3%  perf-profile.self.cycles-pp.try_charge_memcg
      1.38 ±  5%      -1.3        0.05 ±  7%  perf-profile.self.cycles-pp.page_counter_try_charge
      1.10 ±  4%      -1.1        0.04 ± 44%  perf-profile.self.cycles-pp.page_counter_uncharge
      5.95            -0.6        5.30        perf-profile.self.cycles-pp.acpi_safe_halt
      0.69 ±  4%      -0.6        0.12 ±  3%  perf-profile.self.cycles-pp.mem_cgroup_uncharge_skmem
      0.64 ±  2%      -0.5        0.16 ±  3%  perf-profile.self.cycles-pp.__sk_mem_reduce_allocated
      0.25 ± 13%      -0.2        0.08 ±  8%  perf-profile.self.cycles-pp.cgroup_rstat_updated
      0.27 ±  2%      -0.2        0.10 ±  3%  perf-profile.self.cycles-pp.refill_stock
      0.67 ±  2%      -0.1        0.55        perf-profile.self.cycles-pp.tcp_ack
      0.15 ±  3%      -0.1        0.05        perf-profile.self.cycles-pp.__sk_mem_schedule
      0.28 ±  6%      -0.1        0.20 ±  5%  perf-profile.self.cycles-pp.newidle_balance
      0.22            -0.0        0.17 ±  2%  perf-profile.self.cycles-pp.tcp_check_space
      0.24 ±  3%      -0.0        0.20 ±  2%  perf-profile.self.cycles-pp.enqueue_task_fair
      0.12 ±  3%      -0.0        0.08 ±  8%  perf-profile.self.cycles-pp.update_sg_lb_stats
      0.06            -0.0        0.02 ± 99%  perf-profile.self.cycles-pp.update_irq_load_avg
      0.30            -0.0        0.27 ±  2%  perf-profile.self.cycles-pp.native_irq_return_iret
      0.34            -0.0        0.30 ±  2%  perf-profile.self.cycles-pp.__schedule
      0.11            -0.0        0.08 ±  4%  perf-profile.self.cycles-pp.ct_kernel_exit_state
      0.22 ±  3%      -0.0        0.19        perf-profile.self.cycles-pp.__switch_to_asm
      0.41            -0.0        0.38        perf-profile.self.cycles-pp.native_sched_clock
      0.19 ±  2%      -0.0        0.16 ±  2%  perf-profile.self.cycles-pp.native_apic_msr_eoi_write
      0.22 ±  2%      -0.0        0.19        perf-profile.self.cycles-pp.menu_select
      0.26            -0.0        0.23 ±  2%  perf-profile.self.cycles-pp.__switch_to
      0.22 ±  2%      -0.0        0.20 ±  3%  perf-profile.self.cycles-pp.loopback_xmit
      0.18 ±  3%      -0.0        0.16 ±  4%  perf-profile.self.cycles-pp.___perf_sw_event
      0.11 ±  4%      -0.0        0.09 ±  7%  perf-profile.self.cycles-pp.resched_curr
      0.13            -0.0        0.11 ±  4%  perf-profile.self.cycles-pp.do_idle
      0.08 ±  6%      -0.0        0.06        perf-profile.self.cycles-pp.pick_next_task_fair
      0.17 ±  2%      -0.0        0.15 ±  2%  perf-profile.self.cycles-pp.__x2apic_send_IPI_dest
      0.14 ±  3%      -0.0        0.13        perf-profile.self.cycles-pp.__release_sock
      0.15 ±  3%      -0.0        0.13 ±  3%  perf-profile.self.cycles-pp.__update_load_avg_se
      0.10 ±  3%      -0.0        0.09        perf-profile.self.cycles-pp.dequeue_entity
      0.08 ±  4%      -0.0        0.07        perf-profile.self.cycles-pp.cpuidle_idle_call
      0.17            -0.0        0.16 ±  2%  perf-profile.self.cycles-pp.sock_def_readable
      0.07            -0.0        0.06        perf-profile.self.cycles-pp.cpuidle_enter_state
      0.07            -0.0        0.06        perf-profile.self.cycles-pp.__sock_wfree
      0.11            -0.0        0.10        perf-profile.self.cycles-pp.ttwu_queue_wakelist
      0.10            -0.0        0.09        perf-profile.self.cycles-pp.asm_sysvec_call_function_single
      0.09            -0.0        0.08        perf-profile.self.cycles-pp.update_rq_clock_task
      0.09            -0.0        0.08        perf-profile.self.cycles-pp.__wrgsbase_inactive
      0.08            -0.0        0.07        perf-profile.self.cycles-pp.finish_task_switch
      0.06            -0.0        0.05        perf-profile.self.cycles-pp.cpuidle_enter
      0.14            +0.0        0.15        perf-profile.self.cycles-pp.enqueue_to_backlog
      0.07            +0.0        0.08        perf-profile.self.cycles-pp.tcp_v4_fill_cb
      0.05            +0.0        0.06        perf-profile.self.cycles-pp.iov_iter_pipe
      0.06            +0.0        0.07        perf-profile.self.cycles-pp.__sk_dst_check
      0.06            +0.0        0.07 ±  5%  perf-profile.self.cycles-pp.demo_interval_tick
      0.06            +0.0        0.07 ±  5%  perf-profile.self.cycles-pp.rb_next
      0.07 ±  5%      +0.0        0.08        perf-profile.self.cycles-pp.tcp_rearm_rto
      0.12 ±  3%      +0.0        0.13        perf-profile.self.cycles-pp.tcp_wfree
      0.18 ±  2%      +0.0        0.20 ±  3%  perf-profile.self.cycles-pp.process_backlog
      0.07 ±  5%      +0.0        0.08 ±  5%  perf-profile.self.cycles-pp.tcp_chrono_stop
      0.05            +0.0        0.06 ±  7%  perf-profile.self.cycles-pp.sk_filter_trim_cap
      0.23            +0.0        0.24        perf-profile.self.cycles-pp.__update_load_avg_cfs_rq
      0.06 ±  6%      +0.0        0.07 ±  5%  perf-profile.self.cycles-pp.splice_from_pipe_next
      0.08 ±  5%      +0.0        0.10 ±  6%  perf-profile.self.cycles-pp.tcp_event_new_data_sent
      0.11 ±  6%      +0.0        0.13 ±  5%  perf-profile.self.cycles-pp.exit_to_user_mode_prepare
      0.15 ±  4%      +0.0        0.16 ±  3%  perf-profile.self.cycles-pp.syscall_exit_to_user_mode_prepare
      0.10 ±  4%      +0.0        0.12        perf-profile.self.cycles-pp.tcp_push
      0.10            +0.0        0.12 ±  4%  perf-profile.self.cycles-pp.direct_splice_actor
      0.10            +0.0        0.12 ±  4%  perf-profile.self.cycles-pp.inet_ehashfn
      0.19            +0.0        0.21 ±  2%  perf-profile.self.cycles-pp.recv
      0.07            +0.0        0.09 ±  5%  perf-profile.self.cycles-pp.demo_stream_interval
      0.14 ±  3%      +0.0        0.16 ±  4%  perf-profile.self.cycles-pp.tcp_add_backlog
      0.15 ±  2%      +0.0        0.17 ±  3%  perf-profile.self.cycles-pp.ip_send_check
      0.14            +0.0        0.16 ±  5%  perf-profile.self.cycles-pp.ipv4_dst_check
      0.08            +0.0        0.10 ±  3%  perf-profile.self.cycles-pp.make_vfsuid
      0.06            +0.0        0.08 ±  4%  perf-profile.self.cycles-pp.tcp_rtt_estimator
      0.10 ±  4%      +0.0        0.12 ±  4%  perf-profile.self.cycles-pp.syscall_enter_from_user_mode
      0.10 ±  4%      +0.0        0.12 ±  4%  perf-profile.self.cycles-pp.__get_task_ioprio
      0.09 ±  4%      +0.0        0.11 ±  4%  perf-profile.self.cycles-pp.inet_recvmsg
      0.10 ±  6%      +0.0        0.12        perf-profile.self.cycles-pp.tcp_schedule_loss_probe
      0.04 ± 50%      +0.0        0.06        perf-profile.self.cycles-pp.rb_first
      0.07            +0.0        0.09        perf-profile.self.cycles-pp.__list_add_valid
      0.08 ±  5%      +0.0        0.10 ±  6%  perf-profile.self.cycles-pp.xas_start
      0.08 ±  5%      +0.0        0.10 ±  3%  perf-profile.self.cycles-pp.make_vfsgid
      0.06 ±  6%      +0.0        0.08 ±  4%  perf-profile.self.cycles-pp.tcp_event_data_recv
      0.13 ±  3%      +0.0        0.16 ±  3%  perf-profile.self.cycles-pp.tcp_recvmsg
      0.10 ±  4%      +0.0        0.12 ±  4%  perf-profile.self.cycles-pp.check_stack_object
      0.08 ±  5%      +0.0        0.10        perf-profile.self.cycles-pp.is_vmalloc_addr
      0.09 ±  5%      +0.0        0.11 ±  3%  perf-profile.self.cycles-pp.tcp_downgrade_zcopy_pure
      0.10 ±  3%      +0.0        0.12 ±  4%  perf-profile.self.cycles-pp.tcp_release_cb
      0.22 ±  2%      +0.0        0.24        perf-profile.self.cycles-pp.do_splice_direct
      0.10 ±  7%      +0.0        0.12 ±  3%  perf-profile.self.cycles-pp.ip_protocol_deliver_rcu
      0.10 ±  4%      +0.0        0.12 ±  3%  perf-profile.self.cycles-pp.tcp_update_pacing_rate
      0.30            +0.0        0.32 ±  2%  perf-profile.self.cycles-pp.__alloc_skb
      0.58            +0.0        0.61        perf-profile.self.cycles-pp._raw_spin_lock_irqsave
      0.23 ±  2%      +0.0        0.26 ±  2%  perf-profile.self.cycles-pp.recv_tcp_stream
      0.13            +0.0        0.16 ±  3%  perf-profile.self.cycles-pp._raw_spin_unlock_bh
      0.10 ±  4%      +0.0        0.13 ±  8%  perf-profile.self.cycles-pp.inet_send_prepare
      0.20            +0.0        0.23 ±  3%  perf-profile.self.cycles-pp.ip_rcv_core
      0.13 ±  3%      +0.0        0.15 ±  6%  perf-profile.self.cycles-pp.tcp_mtu_probe
      0.13 ±  3%      +0.0        0.16 ±  2%  perf-profile.self.cycles-pp.xas_load
      0.06 ±  7%      +0.0        0.09 ±  5%  perf-profile.self.cycles-pp.__tcp_cleanup_rbuf
      0.13 ±  3%      +0.0        0.16 ±  2%  perf-profile.self.cycles-pp.validate_xmit_skb
      0.12            +0.0        0.15 ±  3%  perf-profile.self.cycles-pp.folio_mark_accessed
      0.16            +0.0        0.19 ±  3%  perf-profile.self.cycles-pp.__tcp_select_window
      0.27            +0.0        0.30        perf-profile.self.cycles-pp.__sys_recvfrom
      0.15 ±  2%      +0.0        0.18 ±  2%  perf-profile.self.cycles-pp.tcp_tx_timestamp
      0.15 ±  2%      +0.0        0.18 ±  2%  perf-profile.self.cycles-pp.do_splice_to
      0.31            +0.0        0.34        perf-profile.self.cycles-pp.__skb_clone
      0.12            +0.0        0.15 ±  3%  perf-profile.self.cycles-pp.simple_copy_to_iter
      0.14 ±  3%      +0.0        0.18 ±  2%  perf-profile.self.cycles-pp.rw_verify_area
      0.18            +0.0        0.22 ±  2%  perf-profile.self.cycles-pp.sock_sendpage
      0.12 ±  3%      +0.0        0.16        perf-profile.self.cycles-pp.syscall_exit_to_user_mode
      0.24            +0.0        0.28 ±  2%  perf-profile.self.cycles-pp.__mod_timer
      0.09 ±  5%      +0.0        0.12 ±  6%  perf-profile.self.cycles-pp.__tcp_send_ack
      0.16            +0.0        0.20        perf-profile.self.cycles-pp.fsnotify_perm
      0.11            +0.0        0.15 ±  7%  perf-profile.self.cycles-pp.ktime_get_coarse_real_ts64
      0.17 ±  2%      +0.0        0.21        perf-profile.self.cycles-pp.skb_clone
      0.20 ±  2%      +0.0        0.24 ±  5%  perf-profile.self.cycles-pp.__entry_text_start
      0.39 ±  2%      +0.0        0.44        perf-profile.self.cycles-pp.__ip_queue_xmit
      0.17 ±  2%      +0.0        0.22 ±  3%  perf-profile.self.cycles-pp.generic_file_splice_read
      0.18 ±  3%      +0.0        0.23 ±  6%  perf-profile.self.cycles-pp.lock_sock_nested
      0.47 ±  2%      +0.0        0.52 ±  2%  perf-profile.self.cycles-pp.tcp_recvmsg_locked
      0.22 ±  2%      +0.0        0.27        perf-profile.self.cycles-pp.filemap_get_pages
      0.00            +0.1        0.05        perf-profile.self.cycles-pp.tcp_options_write
      0.00            +0.1        0.05        perf-profile.self.cycles-pp.tcp_rbtree_insert
      0.00            +0.1        0.05        perf-profile.self.cycles-pp.skb_network_protocol
      0.20            +0.1        0.25        perf-profile.self.cycles-pp.rcu_all_qs
      0.00            +0.1        0.05 ±  7%  perf-profile.self.cycles-pp.__tcp_ack_snd_check
      0.43            +0.1        0.48        perf-profile.self.cycles-pp._raw_spin_lock
      0.25            +0.1        0.30 ±  2%  perf-profile.self.cycles-pp.touch_atime
      0.46            +0.1        0.52 ±  2%  perf-profile.self.cycles-pp.net_rx_action
      0.16 ±  2%      +0.1        0.22 ±  3%  perf-profile.self.cycles-pp.tcp_stream_alloc_skb
      0.23 ±  2%      +0.1        0.28        perf-profile.self.cycles-pp.copy_page_to_iter
      0.33 ±  2%      +0.1        0.39        perf-profile.self.cycles-pp.splice_direct_to_actor
      0.31            +0.1        0.37 ±  2%  perf-profile.self.cycles-pp.dst_release
      0.21            +0.1        0.27        perf-profile.self.cycles-pp.sanity
      0.26            +0.1        0.32        perf-profile.self.cycles-pp.security_file_permission
      0.25 ±  2%      +0.1        0.31        perf-profile.self.cycles-pp.aa_file_perm
      1.04            +0.1        1.11 ±  3%  perf-profile.self.cycles-pp.do_sendfile
      0.43 ±  2%      +0.1        0.50        perf-profile.self.cycles-pp.kmem_cache_alloc_node
      0.40            +0.1        0.47        perf-profile.self.cycles-pp.do_syscall_64
      0.30 ±  2%      +0.1        0.37 ±  2%  perf-profile.self.cycles-pp.syscall_return_via_sysret
      0.31 ±  4%      +0.1        0.38 ±  2%  perf-profile.self.cycles-pp.sock_put
      0.49            +0.1        0.56        perf-profile.self.cycles-pp.kmem_cache_free
      0.22            +0.1        0.29        perf-profile.self.cycles-pp.tcp_tso_segs
      0.64            +0.1        0.71        perf-profile.self.cycles-pp.tcp_v4_rcv
      0.34 ±  2%      +0.1        0.41        perf-profile.self.cycles-pp.generic_splice_sendpage
      0.32 ±  2%      +0.1        0.39        perf-profile.self.cycles-pp.kernel_sendpage
      0.33            +0.1        0.40        perf-profile.self.cycles-pp.__put_user_8
      0.57            +0.1        0.64        perf-profile.self.cycles-pp.tcp_clean_rtx_queue
      0.34            +0.1        0.42        perf-profile.self.cycles-pp.inet_sendpage
      0.67            +0.1        0.74        perf-profile.self.cycles-pp.read_tsc
      0.34            +0.1        0.42        perf-profile.self.cycles-pp.tcp_established_options
      0.33            +0.1        0.41        perf-profile.self.cycles-pp.pipe_to_sendpage
      0.31            +0.1        0.38        perf-profile.self.cycles-pp.tcp_send_mss
      0.32 ±  3%      +0.1        0.40        perf-profile.self.cycles-pp.current_time
      0.33 ±  2%      +0.1        0.42 ± 12%  perf-profile.self.cycles-pp.ktime_get
      0.36 ±  2%      +0.1        0.45 ±  2%  perf-profile.self.cycles-pp.release_sock
      0.38            +0.1        0.46 ±  2%  perf-profile.self.cycles-pp.__virt_addr_valid
      0.55            +0.1        0.64        perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.71            +0.1        0.80        perf-profile.self.cycles-pp.__dev_queue_xmit
      0.41            +0.1        0.50        perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.45            +0.1        0.54        perf-profile.self.cycles-pp.__local_bh_enable_ip
      0.64            +0.1        0.74        perf-profile.self.cycles-pp.tcp_rcv_established
      0.41            +0.1        0.50        perf-profile.self.cycles-pp.__check_object_size
      0.46            +0.1        0.56        perf-profile.self.cycles-pp.netperf_sendfile
      0.47            +0.1        0.57        perf-profile.self.cycles-pp.__cond_resched
      0.39            +0.1        0.49        perf-profile.self.cycles-pp._copy_to_iter
      0.51            +0.1        0.62 ±  2%  perf-profile.self.cycles-pp.sendfile_tcp_stream
      0.42            +0.1        0.53        perf-profile.self.cycles-pp.sendfile
      0.48 ±  2%      +0.1        0.58        perf-profile.self.cycles-pp.atime_needs_update
      0.46            +0.1        0.57        perf-profile.self.cycles-pp.tcp_current_mss
      0.49 ±  2%      +0.1        0.60        perf-profile.self.cycles-pp.tcp_sendpage
      0.95            +0.1        1.06 ±  2%  perf-profile.self.cycles-pp.__tcp_transmit_skb
      0.35            +0.1        0.48        perf-profile.self.cycles-pp.tcp_rate_check_app_limited
      0.47            +0.1        0.60        perf-profile.self.cycles-pp.__fsnotify_parent
      0.60            +0.1        0.74        perf-profile.self.cycles-pp.__fget_light
      0.36            +0.1        0.49        perf-profile.self.cycles-pp.page_cache_pipe_buf_confirm
      0.77            +0.1        0.90 ±  2%  perf-profile.self.cycles-pp.page_cache_pipe_buf_release
      0.53            +0.1        0.66        perf-profile.self.cycles-pp._copy_from_user
      0.71            +0.2        0.86        perf-profile.self.cycles-pp.__splice_from_pipe
      0.65 ±  2%      +0.2        0.83        perf-profile.self.cycles-pp.apparmor_file_permission
      0.81            +0.2        1.00        perf-profile.self.cycles-pp.tcp_write_xmit
      0.77            +0.2        0.97        perf-profile.self.cycles-pp.do_tcp_sendpages
      0.81            +0.2        1.05        perf-profile.self.cycles-pp.__skb_datagram_iter
      1.00            +0.2        1.25        perf-profile.self.cycles-pp.skb_release_data
      1.11            +0.3        1.38        perf-profile.self.cycles-pp.copy_page_to_iter_pipe
      2.20            +0.3        2.52        perf-profile.self.cycles-pp._raw_spin_lock_bh
      1.34            +0.4        1.69        perf-profile.self.cycles-pp.filemap_read
      2.01            +0.4        2.40        perf-profile.self.cycles-pp.tcp_build_frag
      4.49            +0.7        5.18        perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
      2.17 ±  2%      +0.8        2.99 ±  3%  perf-profile.self.cycles-pp.check_heap_object
      2.71            +1.0        3.73        perf-profile.self.cycles-pp.filemap_get_read_batch
      6.63            +1.7        8.29        perf-profile.self.cycles-pp.copyout



Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki


> 
> 
> From 93b3b4c5f356a5090551519522cfd5740ae7e774 Mon Sep 17 00:00:00 2001
> From: Shakeel Butt <shakeelb@...gle.com>
> Date: Tue, 16 May 2023 20:30:26 +0000
> Subject: [PATCH] memcg: skip stock refill in irq context
> 
> The linux kernel processes incoming packets in softirq on a given CPU
> and those packets may belong to different jobs. This is very normal on
> large systems running multiple workloads. With memcg enabled, network
> memory for such packets is charged to the corresponding memcgs of the
> jobs.
> 
> Memcg charging can be a costly operation and the memcg code implements
> a per-cpu memcg charge caching optimization to reduce the cost of
> charging. More specifically, the kernel charges the given memcg for more
> memory than requested and keep the remaining charge in a local per-cpu
> cache. The insight behind this heuristic is that there will be more
> charge requests for that memcg in near future. This optimization works
> well when a specific job runs on a CPU for long time and majority of the
> charging requests happen in process context. However the kernel's
> incoming packet processing does not work well with this optimization.
> 
> Recently Cathy Zhang has shown [1] that memcg charge flushing within the
> memcg charge path can become a performance bottleneck for the memcg
> charging of network traffic.
> 
> Perf profile:
> 
> 8.98%  mc-worker        [kernel.vmlinux]          [k] page_counter_cancel
>     |
>      --8.97%--page_counter_cancel
> 	       |
> 		--8.97%--page_counter_uncharge
> 			  drain_stock
> 			  __refill_stock
> 			  refill_stock
> 			  |
> 			   --8.91%--try_charge_memcg
> 				     mem_cgroup_charge_skmem
> 				     |
> 				      --8.91%--__sk_mem_raise_allocated
> 						__sk_mem_schedule
> 						|
> 						|--5.41%--tcp_try_rmem_schedule
> 						|          tcp_data_queue
> 						|          tcp_rcv_established
> 						|          tcp_v4_do_rcv
> 						|          tcp_v4_rcv
> 
> The simplest way to solve this issue is to not refill the memcg charge
> stock in the irq context. Since networking is the main source of memcg
> charging in the irq context, other users will not be impacted. In
> addition, this will preseve the memcg charge cache of the application
> running on that CPU.
> 
> There are also potential side effects. What if all the packets belong to
> the same application and memcg? More specifically, users can use Receive
> Flow Steering (RFS) to make sure the kernel process the packets of the
> application on the CPU where the application is running. This change may
> cause the kernel to do slowpath memcg charging more often in irq
> context.
> 
> Link: https://lore.kernel.org/all/IA0PR11MB73557DEAB912737FD61D2873FC749@IA0PR11MB7355.namprd11.prod.outlook.com [1]
> Signed-off-by: Shakeel Butt <shakeelb@...gle.com>
> ---
>  mm/memcontrol.c | 8 ++++++++
>  1 file changed, 8 insertions(+)
> 
> diff --git a/mm/memcontrol.c b/mm/memcontrol.c
> index 5abffe6f8389..2635aae82b3e 100644
> --- a/mm/memcontrol.c
> +++ b/mm/memcontrol.c
> @@ -2652,6 +2652,14 @@ static int try_charge_memcg(struct mem_cgroup *memcg, gfp_t gfp_mask,
>  	bool raised_max_event = false;
>  	unsigned long pflags;
>  
> +	/*
> +	 * Skip the refill in irq context as it may flush the charge cache of
> +	 * the process running on the CPUs or the kernel may have to process
> +	 * incoming packets for different memcgs.
> +	 */
> +	if (!in_task())
> +		batch = nr_pages;
> +
>  retry:
>  	if (consume_stock(memcg, nr_pages))
>  		return 0;
> -- 
> 2.40.1.606.ga4b1b128d6-goog
> 

View attachment "config-6.4.0-rc2-00001-g5e32037c5065" of type "text/plain" (158487 bytes)

View attachment "job-script" of type "text/plain" (8789 bytes)

View attachment "job.yaml" of type "text/plain" (6124 bytes)

View attachment "reproduce" of type "text/plain" (4499 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ