Message-ID: <87h7g62jxm.ffs@tglx>
Date: Wed, 04 Aug 2021 01:21:57 +0200
From: Thomas Gleixner <tglx@...utronix.de>
To: Waiman Long <longman@...hat.com>,
Johannes Weiner <hannes@...xchg.org>,
Michal Hocko <mhocko@...nel.org>,
Vladimir Davydov <vdavydov.dev@...il.com>,
Andrew Morton <akpm@...ux-foundation.org>,
Vlastimil Babka <vbabka@...e.cz>, Roman Gushchin <guro@...com>
Cc: linux-kernel@...r.kernel.org, cgroups@...r.kernel.org,
linux-mm@...ck.org, Shakeel Butt <shakeelb@...gle.com>,
Muchun Song <songmuchun@...edance.com>,
Luis Goncalves <lgoncalv@...hat.com>,
Waiman Long <longman@...hat.com>,
Sebastian Andrzej Siewior <bigeasy@...utronix.de>,
Daniel Bristot de Oliveira <bristot@...hat.com>,
Linus Torvalds <torvalds@...ux-foundation.org>
Subject: Re: [PATCH] mm/memcg: Disable task obj_stock for PREEMPT_RT

Waiman,

On Tue, Aug 03 2021 at 13:55, Waiman Long wrote:

Please Cc RT people on RT-related patches.

> For PREEMPT_RT kernel, preempt_disable() and local_irq_save()
> are typically converted to local_lock() and local_lock_irqsave()
> respectively.
That's just wrong. local_lock has a clear value even on !RT kernels. See
https://www.kernel.org/doc/html/latest/locking/locktypes.html#local-lock
> These two variants of local_lock() are essentially
> the same.
Only on RT kernels.
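
For illustration only (the struct and names below are made up, not from
memcontrol.c), this is roughly what the two variants boil down to
according to the documentation above:

	#include <linux/local_lock.h>
	#include <linux/percpu.h>

	struct foo_pcp {
		local_lock_t	lock;
		unsigned int	count;
	};
	static DEFINE_PER_CPU(struct foo_pcp, foo_pcp) = {
		.lock	= INIT_LOCAL_LOCK(lock),
	};

	static void foo_update(void)
	{
		unsigned long flags;

		/*
		 * !RT: local_lock_irqsave() disables interrupts like
		 *      local_irq_save() plus lockdep coverage, and plain
		 *      local_lock() maps to preempt_disable().
		 * RT:  both variants acquire the same per CPU sleeping
		 *      lock and disable migration; interrupts stay enabled.
		 */
		local_lock_irqsave(&foo_pcp.lock, flags);
		this_cpu_inc(foo_pcp.count);
		local_unlock_irqrestore(&foo_pcp.lock, flags);
	}
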
> + * For PREEMPT_RT kernel, preempt_disable() and local_irq_save() may have
> + * to be changed to variants of local_lock(). This eliminates the
> + * performance advantage of using preempt_disable(). Fall back to always
> + * use local_irq_save() and use only irq_obj for simplicity.
Instead of adding that comment, you could have just done the full
conversion, but see below.
> */
> +static inline bool use_task_obj_stock(void)
> +{
> +	return !IS_ENABLED(CONFIG_PREEMPT_RT) && likely(in_task());
> +}
> +
> static inline struct obj_stock *get_obj_stock(unsigned long *pflags)
> {
> 	struct memcg_stock_pcp *stock;
>
> -	if (likely(in_task())) {
> +	if (use_task_obj_stock()) {
> 		*pflags = 0UL;
> 		preempt_disable();
> 		stock = this_cpu_ptr(&memcg_stock);
This is clearly the kind of conditional locking which is rightfully
frowned upon.

So if we go to reenable memcg for RT, we end up with:

	if (use_task_obj_stock()) {
		preempt_disable();
	} else {
		local_lock_irqsave(&memcg_stock_lock, flags);
	}

and further down we end up with:
> @@ -2212,7 +2222,7 @@ static void drain_local_stock(struct work_struct *dummy)
>
> 	stock = this_cpu_ptr(&memcg_stock);
> 	drain_obj_stock(&stock->irq_obj);
> -	if (in_task())
> +	if (use_task_obj_stock())
> 		drain_obj_stock(&stock->task_obj);
> 	drain_stock(stock);
> 	clear_bit(FLUSHING_CACHED_CHARGE, &stock->flags);
	/*
	 * The only protection from memory hotplug vs. drain_stock races is
	 * that we always operate on local CPU stock here with IRQ disabled
	 */
-	local_irq_save(flags);
+	local_lock_irqsave(&memcg_stock_lock, flags);

	...

	if (use_task_obj_stock())
		drain_obj_stock(&stock->task_obj);

which is incomprehensible garbage.
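
Just to contrast, here is a sketch of what an unconditional conversion
of get_obj_stock() could look like (assuming a per CPU local_lock_t
named memcg_stock_lock as in the snippet above; not a finished patch):

	static inline struct obj_stock *get_obj_stock(unsigned long *pflags)
	{
		struct memcg_stock_pcp *stock;

		/* One locking rule for task and interrupt context, RT and !RT */
		local_lock_irqsave(&memcg_stock_lock, *pflags);
		stock = this_cpu_ptr(&memcg_stock);
		return &stock->irq_obj;
	}

But that only makes sense once the actual protection requirements are
analyzed and documented, see below.
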
The comment above the existing local_irq_save() is already garbage
today without any local lock conversion (and was even before the commit
which introduced stock::task_obj), simply because it does not explain
the why.
I can just assume that for stock->task_obj the IRQ protection is
completely irrelevant. If not, and _all_ members of stock have to be
protected against memory hotplug by disabling interrupts, then any
other function which just disables preemption is broken.
To complete the analysis of drain_local_stock(): AFAICT that function
can only be called from task context. So what is the purpose of the
in_task() conditional there?

	if (in_task())
		drain_obj_stock(&stock->task_obj);
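
For reference, the only ways I can see drain_local_stock() being invoked
are the per CPU work set up in mem_cgroup_init() and the direct call in
drain_all_stock(), both from process context (quoted from memory, so
details may be off):

	/* mem_cgroup_init() */
	for_each_possible_cpu(cpu)
		INIT_WORK(&per_cpu_ptr(&memcg_stock, cpu)->work,
			  drain_local_stock);

	/* drain_all_stock() */
	if (cpu == curcpu)
		drain_local_stock(&stock->work);
	else
		schedule_work_on(cpu, &stock->work);

So in_task() is always true there anyway.
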
I assume it's a mechanical conversion of:

-	drain_obj_stock(stock);
+	drain_obj_stock(&stock->irq_obj);
+	if (in_task())
+		drain_obj_stock(&stock->task_obj);

all over the place without actually looking at the surrounding code,
comments and call sites.
This patch is certainly in line with that approach, but it's just adding
more confusion.
Thanks,
tglx