lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Message-ID: <alpine.DEB.2.11.1412151220330.16494@nanos>
Date:	Mon, 15 Dec 2014 15:25:24 +0100 (CET)
From:	Thomas Gleixner <tglx@...utronix.de>
To:	Huang Ying <ying.huang@...el.com>
cc:	LKML <linux-kernel@...r.kernel.org>, LKP ML <lkp@...org>,
	Rick Jones <netperf-feedback@...perf.org>,
	Peter Zijlstra <peterz@...radead.org>, netdev@...r.kernel.org
Subject: Re: [LKP] [genirq] c291ee62216:

On Mon, 15 Dec 2014, Huang Ying wrote:
> FYI, we noticed the below changes on
> 
> git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip.git irq/urgent
> commit c291ee622165cb2c8d4e7af63fffd499354a23be ("genirq: Prevent proc race against freeing of irq descriptors")
> 
> testbox/testcase/testparams: lkp-nex04/netperf/performance-300s-200%-SCTP_STREAM

>                            time.voluntary_context_switches
> 
>   100500 ++-----------------------------------------------------------------+
>          O       O                    O          O                          |
>          |  O                            O          O                       |
>   100000 ++           O  O  O O     O          O                            |
>          |    O                  O          O          O                    |
>          |          O                                                       |
>    99500 ++                                                                 |
>          |                                                                  |
>    99000 ++                                                                 |
>          |                                                                  |
>          |                                                                  |
>    98500 ++        .*.*..                          .*.. .*..    .*..*..     |
>          *..*.*..*.      *..*.*..*..*.*..*..*..*.*.    *    *..*       *.*..*
>          |                                                                  |
>    98000 ++-----------------------------------------------------------------+
> 
> 
> 	[*] bisect-good sample
> 	[O] bisect-bad  sample

Cute. Looking at netperf source it seems to do a high frequency
readout of /proc/stat from all involved threads. Which of course
explains that the number of context switches is going up as the stuff
is going to content on the sparse_irq_mutex.

While its possible to fix^W band aid that case, I'm really not too
happy to do so just to please a wreckaged use case. High frequency
polling of /proc/stat is just asking for trouble and on larger
machines it's a complete scalability fail. Especially the interrupt
part is amazingly horrible

      for_each_irq_nr()
	for_each_possible_cpu()

Is it really required for netperf to do that stat poll in a loop or
can it be made smarter?

Btw, in that test scenario runs netserver and the test threads on the
same machine. So the utilization data is pretty useless anyway because
all threads will read more or less the same data which cannot be
correlated to a particular instance.

Thanks,

	tglx

--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ