Message-ID: <20190822092123.GL2349@hirez.programming.kicks-ass.net>
Date: Thu, 22 Aug 2019 11:21:23 +0200
From: Peter Zijlstra <peterz@...radead.org>
To: Valentin Schneider <valentin.schneider@....com>
Cc: linux-kernel@...r.kernel.org, mingo@...nel.org,
liangyan.peng@...ux.alibaba.com, shanpeic@...ux.alibaba.com,
xlpang@...ux.alibaba.com, pjt@...gle.com, stable@...r.kernel.org,
Ben Segall <bsegall@...gle.com>
Subject: Re: [PATCH] sched/fair: Add missing unthrottle_cfs_rq()
On Tue, Aug 20, 2019 at 11:54:20AM +0100, Valentin Schneider wrote:
> It turns out that a cfs_rq's ->runtime_remaining can become positive
> in assign_cfs_rq_runtime(), but this codepath has no call to
> unthrottle_cfs_rq().
>
> This can leave us in a situation where we have a throttled cfs_rq with
> positive ->runtime_remaining, which breaks the math in
> distribute_cfs_runtime(): that function expects a non-positive value,
> so that it can safely negate it into the (positive) amount of runtime
> to hand out.
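
To spell the breakage out, here is a toy userspace C model of that
arithmetic (not the kernel code itself; it just mirrors the
"runtime = -cfs_rq->runtime_remaining + 1" logic from the
distribute_cfs_runtime() hunk below):

	#include <stdio.h>

	/*
	 * "remaining" is the bandwidth pool being handed out; a
	 * throttled cfs_rq is expected to arrive here with
	 * runtime_remaining <= 0.
	 */
	static long long distribute(long long remaining,
				    long long runtime_remaining)
	{
		/* Minimum amount that returns the cfs_rq to positive quota. */
		long long runtime = -runtime_remaining + 1;

		if (runtime > remaining)
			runtime = remaining;
		remaining -= runtime;

		printf("runtime_remaining=%lld: handed out %lld, pool now %lld\n",
		       runtime_remaining, runtime, remaining);
		return remaining;
	}

	int main(void)
	{
		distribute(100, -30);	/* sane case: pool shrinks to 69 */
		distribute(100, 30);	/* broken case: "runtime" is -29, so
					 * the pool *grows* to 129 and we
					 * hand out more than we own */
		return 0;
	}
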
>
> Add the missing unthrottle_cfs_rq(). While at it, add a WARN_ON where
> we expect non-positive values, and pull in an explanatory comment from
> the mailing list that never made it into the tree [1].
>
> [1]: https://lkml.kernel.org/r/BANLkTi=NmCxKX6EbDQcJYDJ5kKyG2N1ssw@mail.gmail.com
>
> Cc: <stable@...r.kernel.org>
> Fixes: ec12cb7f31e2 ("sched: Accumulate per-cfs_rq cpu usage and charge against bandwidth")
> Reported-by: Liangyan <liangyan.peng@...ux.alibaba.com>
> Signed-off-by: Valentin Schneider <valentin.schneider@....com>
Thanks!
> ---
> kernel/sched/fair.c | 17 ++++++++++++-----
> 1 file changed, 12 insertions(+), 5 deletions(-)
>
> diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
> index 1054d2cf6aaa..219ff3f328e5 100644
> --- a/kernel/sched/fair.c
> +++ b/kernel/sched/fair.c
> @@ -4385,6 +4385,11 @@ static inline u64 cfs_rq_clock_task(struct cfs_rq *cfs_rq)
> return rq_clock_task(rq_of(cfs_rq)) - cfs_rq->throttled_clock_task_time;
> }
>
> +static inline int cfs_rq_throttled(struct cfs_rq *cfs_rq)
> +{
> + return cfs_bandwidth_used() && cfs_rq->throttled;
> +}
> +
> /* returns 0 on failure to allocate runtime */
> static int assign_cfs_rq_runtime(struct cfs_rq *cfs_rq)
> {
> @@ -4411,6 +4416,9 @@ static int assign_cfs_rq_runtime(struct cfs_rq *cfs_rq)
>
> cfs_rq->runtime_remaining += amount;
>
> + if (cfs_rq->runtime_remaining > 0 && cfs_rq_throttled(cfs_rq))
> + unthrottle_cfs_rq(cfs_rq);
> +
> return cfs_rq->runtime_remaining > 0;
> }
>
> @@ -4439,11 +4447,6 @@ void account_cfs_rq_runtime(struct cfs_rq *cfs_rq, u64 delta_exec)
> __account_cfs_rq_runtime(cfs_rq, delta_exec);
> }
>
> -static inline int cfs_rq_throttled(struct cfs_rq *cfs_rq)
> -{
> - return cfs_bandwidth_used() && cfs_rq->throttled;
> -}
> -
> /* check whether cfs_rq, or any parent, is throttled */
> static inline int throttled_hierarchy(struct cfs_rq *cfs_rq)
> {
> @@ -4628,6 +4631,10 @@ static u64 distribute_cfs_runtime(struct cfs_bandwidth *cfs_b, u64 remaining)
> if (!cfs_rq_throttled(cfs_rq))
> goto next;
>
> + /* By the above check, this should never be true */
> + WARN_ON(cfs_rq->runtime_remaining > 0);
> +
> + /* Pick the minimum amount to return to a positive quota state */
> runtime = -cfs_rq->runtime_remaining + 1;
> if (runtime > remaining)
> runtime = remaining;
> --
> 2.22.0
>
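
For posterity, a quick sanity sketch of the invariant the patch
restores -- a toy userspace C model (simplified fields, no locking,
not kernel code): a throttled cfs_rq must have non-positive
->runtime_remaining, so any path that tops the quota up above zero
must also unthrottle:

	#include <assert.h>
	#include <stdbool.h>

	/* Toy cfs_rq carrying only the two fields that matter here. */
	struct toy_cfs_rq {
		long long runtime_remaining;
		bool throttled;
	};

	/* Mirrors the fixed assign_cfs_rq_runtime(): a top-up that takes
	 * the quota positive must also lift the throttle. */
	static int toy_assign_runtime(struct toy_cfs_rq *cfs_rq,
				      long long amount)
	{
		cfs_rq->runtime_remaining += amount;

		if (cfs_rq->runtime_remaining > 0 && cfs_rq->throttled)
			cfs_rq->throttled = false;	/* unthrottle_cfs_rq() */

		return cfs_rq->runtime_remaining > 0;
	}

	/* Mirrors the WARN_ON() added to distribute_cfs_runtime(). */
	static void toy_check_invariant(const struct toy_cfs_rq *cfs_rq)
	{
		if (cfs_rq->throttled)
			assert(cfs_rq->runtime_remaining <= 0);
	}

	int main(void)
	{
		struct toy_cfs_rq rq = {
			.runtime_remaining = -10,
			.throttled = true,
		};

		toy_assign_runtime(&rq, 20);	/* goes positive, unthrottles */
		toy_check_invariant(&rq);	/* holds with the fix applied */
		return 0;
	}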