lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite for Android: free password hash cracker in your pocket
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20200625110006.q3iepcrh2uh4oizv@e107158-lin.cambridge.arm.com>
Date:   Thu, 25 Jun 2020 12:00:06 +0100
From:   Qais Yousef <qais.yousef@....com>
To:     Valentin Schneider <valentin.schneider@....com>
Cc:     Ingo Molnar <mingo@...hat.com>,
        Peter Zijlstra <peterz@...radead.org>,
        Juri Lelli <juri.lelli@...hat.com>,
        Vincent Guittot <vincent.guittot@...aro.org>,
        Dietmar Eggemann <dietmar.eggemann@....com>,
        Steven Rostedt <rostedt@...dmis.org>,
        Ben Segall <bsegall@...gle.com>, Mel Gorman <mgorman@...e.de>,
        Patrick Bellasi <patrick.bellasi@...bug.net>,
        Chris Redpath <chris.redpath@....com>,
        Lukasz Luba <lukasz.luba@....com>, linux-kernel@...r.kernel.org
Subject: Re: [PATCH v3 2/2] sched/uclamp: Protect uclamp fast path code with
 static key

Hi Valentin

On 06/25/20 01:16, Valentin Schneider wrote:
> 
> Hi Qais,
> 
> On 24/06/20 18:26, Qais Yousef wrote:
> [...]
> > ---
> >
> > This takes a different approach to PSI which introduces a config option
> >
> > ```
> >       CONFIG_PSI_DEFAULT_DISABLED
> >
> >         Require boot parameter to enable pressure stall information
> >         tracking (NEW)
> >
> >       boot param psi
> > ```
> >
> > via commit e0c274472d5d "psi: make disabling/enabling easier for vendor kernels"
> >
> > uclamp has a clearer points of entry when userspace would like to use it so we
> > can automatically flip the switch if the kernel is running on a userspace that
> > wants to user utilclamp without any extra userspace visible switches.
> >
> > I wanted to make this dependent on schedutil being the governor too, but beside
> > the complexity, uclamp is used for capacity awareness. We could certainly
> > construct a more complex condition, but I'm not sure it's worth it. Open to
> > hear more opinions and points of views on this :)
> >
> 
> I think the toggling conditions are good as they are. However, speaking of
> schedutil, doesn't this patch break the RT frequency boost? Mind you it
> might be too late for me to be thinking about this stuff.

Good catch. I did test RT, but I just realized the RT test ran after it
enabled uclamp again. So I missed the case when it wasn't enabled.

> 
> In schedutil_cpu_util(), when uclamp isn't compiled it, we have an explicit
> 'goto max'. When uclamp *is* compiled in, that's taken care of by the
> "natural" RT uclamp aggregation... Which doesn't happen until we flip the
> static key. 
> 
> It's yucky, but if you declare the key in the internal sched header, you
> could reuse it in the existing 'goto max' (or sysctl value, when we make
> that tweakable) path. 

Not sure if this is the best way forward. I need to think about it.
While I am not keen on enabling in kernel users of util clamp, but there was
already an attempt to do so. This approach will not allow us to implement
something in the future for that. Which maybe is what we want..

> 
> >
> >  kernel/sched/core.c | 54 +++++++++++++++++++++++++++++++++++++++++----
> >  1 file changed, 50 insertions(+), 4 deletions(-)
> >
> > diff --git a/kernel/sched/core.c b/kernel/sched/core.c
> > index 235b2cae00a0..44e03d4fd19d 100644
> > --- a/kernel/sched/core.c
> > +++ b/kernel/sched/core.c
> > @@ -794,6 +794,25 @@ unsigned int sysctl_sched_uclamp_util_max = SCHED_CAPACITY_SCALE;
> >  /* All clamps are required to be less or equal than these values */
> >  static struct uclamp_se uclamp_default[UCLAMP_CNT];
> >  
> > +/*
> > + * This static key is used to reduce the uclamp overhead in the fast path. It
> > + * only disables the call to uclamp_rq_{inc, dec}() in enqueue/dequeue_task().
> > + *
> > + * This allows users to continue to enable uclamp in their kernel config with
> > + * minimum uclamp overhead in the fast path.
> > + *
> > + * As soon as userspace modifies any of the uclamp knobs, the static key is
> > + * enabled, since we have an actual users that make use of uclamp
> > + * functionality.
> > + *
> > + * The knobs that would enable this static key are:
> > + *
> > + *   * A task modifying its uclamp value with sched_setattr().
> 
> That one makes it not just userspace, right? While the sched_setattr()
> stuff is expected to be unexported, it isn't ATM and we may expect some
> modules to ask for a uclamp API eventually.

This has already come up with another thread [1]. I think in-kernel users
shouldn't go through this path. I will propose something to give stronger
guarantees in this regard.

> > -	if (update_root_tg)
> > +	if (update_root_tg) {
> >  		uclamp_update_root_tg();
> > +		static_branch_enable(&sched_uclamp_used);
> 
> I don't think it matters ATM, but shouldn't we flip that *before* updating
> the TG's to avoid any future surprises? 

What sort of surprises are you thinking of?

Thanks

--
Qais Yousef

[1] https://lore.kernel.org/lkml/20200624175236.nblndmg6dfq2vr2u@e107158-lin.cambridge.arm.com/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ