lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20160817064208.GA5622@aaronlu.sh.intel.com>
Date:	Wed, 17 Aug 2016 14:42:08 +0800
From:	Aaron Lu <aaron.lu@...el.com>
To:	Xin Long <lucien.xin@...il.com>
Cc:	kernel test robot <xiaolong.ye@...el.com>,
	Stephen Rothwell <sfr@...b.auug.org.au>, lkp@...org,
	"David S. Miller" <davem@...emloft.net>,
	LKML <linux-kernel@...r.kernel.org>
Subject: Re: [LKP] [lkp] [sctp] a6c2f79287: netperf.Throughput_Mbps -37.2%
 regression

On Wed, Aug 17, 2016 at 02:37:19PM +0800, Aaron Lu wrote:
> On Wed, Aug 17, 2016 at 02:14:05PM +0800, Aaron Lu wrote:
> > On Wed, Aug 17, 2016 at 01:41:04PM +0800, Xin Long wrote:
> > > > The perf-profile data for the two commits are attached(for the case of
> > > > prsctp_enable=1, the perf-profile data doesn't get collected for the 0
> > > > case for some reason, I'm checking the problem now).
> > > >
> > > > The CPU gets much more idle time in the bisected commit a6c2f79287:
> > > >
> > > >     68.89%     0.70%  [kernel.kallsyms]   [k] entry_SYSCALL_64_fastpath
> > > >     49.32%     0.12%  [kernel.kallsyms]   [k] sys_sendmsg
> > > >     49.17%     0.12%  [kernel.kallsyms]   [k] __sys_sendmsg
> > > >     48.58%     0.22%  [kernel.kallsyms]   [k] ___sys_sendmsg
> > > >     46.69%     0.06%  [kernel.kallsyms]   [k] sock_sendmsg
> > > >     46.31%     0.16%  [kernel.kallsyms]   [k] inet_sendmsg
> > > >     45.90%     0.98%  [kernel.kallsyms]   [k] sctp_sendmsg
> > > >     29.66%     0.45%  [kernel.kallsyms]   [k] sctp_do_sm
> > > >     29.54%     0.23%  [kernel.kallsyms]   [k] cpu_startup_entry
> > > >     28.81%     0.68%  [kernel.kallsyms]   [k] sctp_cmd_interpreter.isra.24
> > > >     26.20%     0.00%  [kernel.kallsyms]   [k] start_secondary
> > > >     23.04%     0.09%  [kernel.kallsyms]   [k] sctp_inq_push
> > > >     23.03%     0.08%  [kernel.kallsyms]   [k] call_cpuidle
> > > >     22.94%     0.00%  [kernel.kallsyms]   [k] cpuidle_enter
> > > >     22.60%     0.18%  [kernel.kallsyms]   [k] cpuidle_enter_state
> > > >     21.99%    21.99%  [kernel.kallsyms]   [k] intel_idle
> > > > ... ...
> > > >
> > > > While its immediate parent commit 826d253d57 is mostly busy working:
> > > >
> > > >     98.53%     0.83%  [kernel.kallsyms]   [k] entry_SYSCALL_64_fastpath
> > > >     78.13%     0.12%  [kernel.kallsyms]   [k] sys_sendmsg
> > > >     78.03%     0.16%  [kernel.kallsyms]   [k] __sys_sendmsg
> > > >     77.08%     0.28%  [kernel.kallsyms]   [k] ___sys_sendmsg
> > > >     74.44%     0.08%  [kernel.kallsyms]   [k] sock_sendmsg
> > > >     73.82%     0.13%  [kernel.kallsyms]   [k] inet_sendmsg
> > > >     73.34%     1.44%  [kernel.kallsyms]   [k] sctp_sendmsg
> > > >     47.52%     0.75%  [kernel.kallsyms]   [k] sctp_do_sm
> > > >     46.19%     0.90%  [kernel.kallsyms]   [k] sctp_cmd_interpreter.isra.24
> > > >     37.17%     1.43%  [kernel.kallsyms]   [k] sctp_outq_flush
> > > >     36.93%     0.08%  [kernel.kallsyms]   [k] sctp_outq_uncork
> > > >     34.24%     0.15%  [kernel.kallsyms]   [k] sctp_inq_push
> > > > ... ...
> > > > No idle related function above 1%.
> > > >
> > > > Will the bisected commit make the idle possible?
> > > No, not at all. :)
> > > 
> > > pls help to debug as I said in the last reply.
> > 
> > OK, will see how to do that.
> > 
> > In the meantime, I just tried to reproduce on my own desktop:
> > Sandybridge i7-2600 CPU @ 3.40GHz and it reproduced:
> > $ cat 4.7.0-rc6-01198-ga6c2f792873a/0/netperf.json
> > {
> >   "netperf.Throughput_Mbps": [
> >    752.9450000000002
> >   ]
> > }
> > $ cat 4.7.0-rc6-01197-g826d253d57b1/0/netperf.json
> > {
> >   "netperf.Throughput_Mbps": [
> >    1068.5556249999997
> >   ]
> > }
> 
> On top of
> commit 826d253d57b1 ("sctp: add SCTP_PR_ASSOC_STATUS on sctp sockopt")
> I applied the below commit:
> 
> From 98dd2532b14e29dcc2ab40a7348755531afa79e4 Mon Sep 17 00:00:00 2001
> From: Aaron Lu <aaron.lu@...el.com>
> Date: Wed, 17 Aug 2016 14:20:00 +0800
> Subject: [PATCH] sctp: test
> 
> ---
>  include/net/sctp/structs.h | 3 +++
>  1 file changed, 3 insertions(+)
> 
> diff --git a/include/net/sctp/structs.h b/include/net/sctp/structs.h
> index d8e464aacb20..932f2780d3a4 100644
> --- a/include/net/sctp/structs.h
> +++ b/include/net/sctp/structs.h
> @@ -602,6 +602,9 @@ struct sctp_chunk {
>  	/* This needs to be recoverable for SCTP_SEND_FAILED events. */
>  	struct sctp_sndrcvinfo sinfo;
>  
> +	unsigned long prsctp_param;
> +	int sent_count;
> +
>  	/* Which association does this belong to?  */
>  	struct sctp_association *asoc;
>  
> -- 
> 2.5.5
> 
> Then the performance dropped to the same as the bisected commit
> a6c2f792873a:
> $ cat 4.7.0-rc6-01198-g98dd2532b14e/0/netperf.json
> {
>   "netperf.Throughput_Mbps": [
>    754.494375
>   ]
> }
> 
> I think this agrees with the perf data in that the newly added function

Actually, I mean the modified functions like sctp_chunk_abandoned and
__sctp_packet_append_chunk, etc.

> doesn't show up in the perf-profile but still, the performance drops.
> So the only possible reason is the newly added fields to the sctp_chunk
> structure.
> 
> Is this expected?
> 
> Thanks,
> Aaron

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ