lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <002601d525ae$a2c00c90$e84025b0$@net>
Date:   Tue, 18 Jun 2019 01:20:01 -0700
From:   "Doug Smythies" <dsmythies@...us.net>
To:     "'Rafael J. Wysocki'" <rafael@...nel.org>,
        "'Pavel Machek'" <pavel@....cz>
Cc:     "'kernel list'" <linux-kernel@...r.kernel.org>,
        "'ACPI Devel Maling List'" <linux-acpi@...r.kernel.org>,
        "'Zhang, Rui'" <rui.zhang@...el.com>,
        "'Rafael J. Wysocki'" <rjw@...ysocki.net>,
        "'Viresh Kumar'" <viresh.kumar@...aro.org>,
        "'Linux PM'" <linux-pm@...r.kernel.org>,
        "'Thomas Gleixner'" <tglx@...utronix.de>,
        "'Ingo Molnar'" <mingo@...hat.com>,
        "'Borislav Petkov'" <bp@...en8.de>,
        "'H. Peter Anvin'" <hpa@...or.com>,
        "'the arch/x86 maintainers'" <x86@...nel.org>
Subject: RE: 5.2-rc2: low framerate in flightgear, cpu not running at full speed, thermal related?

On 2019.06.13 01:53 Rafael J. Wysocki wrote:

> I personally doubt that any thermal throttling is involved here.

In earlier e-mails on this thread, Pavel showed his core and package
temperatures as 97 and 98 degrees. If thermal throttling is not
involved, it should be. The description of the observed CPU
frequencies also matched my experiences with thermal throttling
for the intel-pstate/powersave example. Myself, I can not determine
if throttling is involved for the acpi-cpufreq/ondemand
case, just from the clock frequencies, because,
at least on my system, it uses the kidle_inject method
instead of the pstate method.

I continued doing experiments, enabling thermald (normally
disabled on my system) and forcing thermal throttling on
my test server. My system never gets hot enough, so I used
a low trip point of 55 degrees.

The intel_pstate/powersave and intel_cpufreq/ondemand,
which both used the pstate method, outperformed the
acpi-cpufreq/ondemand by 30%, in a constant thermal
throttling mode. This seems the opposite of Pavel's
results, if indeed his system is thermal throttling.
(I can write these experiments up in more detail
if desired.)

On 2019.06.13 Pavel Machek wrote:

> But it seems that your assumptions are incorrect for my workload.

Fair enough. I don't have the appropriate hardware for this, and
am trying to guess at a similar workflow for testing.
Perhaps, just mudding things here rather than helping.

>
> flightgear is single-threaded, and in my configuration saturates the
> CPU, because it would like to achieve higher framerate than my system
> is capable of.

Are you sure? Use turbostat and observe.
Example 1:
intel_cpufreq/ondemand, kernel 5.2-rc3, two 100% loads:

doug@s15:~$ sudo turbostat --quiet --hide IRQ,Avg_MHz,SMI,\
> GFXMHz,TSC_MHz,CorWatt,CoreTmp,GFX%rc6,Pkg%pc2,Pkg%pc3,\
> Pkg%pc6,POLL,C1,C1E,C3,C6,C1%,C1E%,C3%,C6% \
> --interval 5
Core    CPU     Busy%   Bzy_MHz POLL%   CPU%c1  CPU%c3  CPU%c6  CPU%c7  PkgTmp  PkgWatt GFXWatt
-       -       25.24   2780    0.00    25.70   0.01    49.05   0.00    56      23.82   0.12
0       0       1.62    2781    0.00    1.90    0.02    96.46   0.00    56      23.82   0.12
0       4       0.05    2833    0.00    3.47
1       1       100.00  2780    0.00    0.00    0.00    0.00    0.00
1       5       0.03    2893    0.00    99.97
2       2       0.03    2906    0.00    99.97   0.00    0.00    0.00
2       6       100.00  2780    0.00    0.00
3       3       0.07    2797    0.00    0.18    0.01    99.74   0.00
3       7       0.10    2834    0.00    0.14

Example 2:
acpi_cpufreq/ondemand, kernel 5.2-rc3, two 100% loads:

doug@s15:~$ sudo turbostat --quiet --hide IRQ,Avg_MHz,SMI,\
> GFXMHz,TSC_MHz,CorWatt,CoreTmp,GFX%rc6,Pkg%pc2,Pkg%pc3,\
> Pkg%pc6,POLL,C1,C1E,C3,C6,C1%,C1E%,C3%,C6% \
> --interval 5
Core    CPU     Busy%   Bzy_MHz POLL%   CPU%c1  CPU%c3  CPU%c6  CPU%c7  PkgTmp  PkgWatt GFXWatt
-       -       18.06   3069    0.00    18.45   0.01    63.47   0.00    55      21.50   0.12
0       0       15.55   3049    0.00    1.31    0.02    83.12   0.00    55      21.50   0.12
0       4       0.64    2651    0.00    16.22
1       1       62.43   3075    0.00    8.15    0.00    29.43   0.00
1       5       7.71    3068    0.00    62.81
2       2       50.56   3091    0.00    0.78    0.00    48.66   0.00
2       6       0.44    2346    0.00    50.89
3       3       2.30    2901    0.00    5.00    0.01    92.69   0.00
3       7       4.79    3016    0.00    2.49

With the additional C6 % coming from the kidle_inj tasks.
Observation 1: The loaded CPUs migrate much more often in this scenario.
Observation 2: While the package watts are different by over 2 watts,
the long term (done over many hours) average was the same for all methods.

... Doug


Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ