[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20220117131304.pdc3mfdowkzovw6q@localhost.localdomain>
Date: Mon, 17 Jan 2022 14:13:04 +0100
From: Juri Lelli <juri.lelli@...hat.com>
To: Vlastimil Babka <vbabka@...e.cz>
Cc: Sebastian Andrzej Siewior <bigeasy@...utronix.de>,
Xin Long <lucien.xin@...il.com>,
Hyeonggon Yoo <42.hyeyoo@...il.com>,
LKML <linux-kernel@...r.kernel.org>, linux-mm@...ck.org,
Christoph Lameter <cl@...ux.com>,
Pekka Enberg <penberg@...nel.org>,
David Rientjes <rientjes@...gle.com>,
Joonsoo Kim <iamjoonsoo.kim@....com>,
Andrew Morton <akpm@...ux-foundation.org>,
Antoine Tenart <atenart@...nel.org>,
Clark Williams <williams@...hat.com>
Subject: Re: [PATCH] mm: slub: fix a deadlock warning in kmem_cache_destroy
Hi,
On 17/01/22 13:40, Vlastimil Babka wrote:
> +CC Clark
>
> On 1/17/22 10:33, Sebastian Andrzej Siewior wrote:
> > On 2022-01-17 16:32:46 [+0800], Xin Long wrote:
> >> another issue. From the code analysis, this issue does exist on the
> >> upstream kernel, though I couldn't build an upstream RT kernel for the
> >> testing.
> >
> > This should also reproduce in v5.16 since the commit in question is
> > there.
>
> Yeah. I remember we had some issues with the commit during development, but
> I'd hope those were resolved and the commit that's ultimately merged got the
> fixes, see this subthread:
>
> https://lore.kernel.org/all/0b36128c-3e12-77df-85fe-a153a714569b@quicinc.com/
>
> >> > > CPU0 CPU1
> >> > > ---- ----
> >> > > cpus_read_lock()
> >> > > kn->active++
> >> > > cpus_read_lock() [a]
> >> > > wait until kn->active == 0
> >> > >
> >> > > Although cpu_hotplug_lock is a RWSEM, [a] will not block in there. But as
> >> > > lockdep annotations are added for cpu_hotplug_lock, a deadlock warning
> >> > > would be detected:
> >
> > The cpu_hotplug_lock is a per-CPU RWSEM. The lock in [a] will block if
> > there is a writer pending.
> >
> >> > > ======================================================
> >> > > WARNING: possible circular locking dependency detected
> >> > > ------------------------------------------------------
> >> > > dmsetup/1832 is trying to acquire lock:
> >> > > ffff986f5a0f9f20 (kn->count#144){++++}-{0:0}, at: kernfs_remove+0x1d/0x30
> >> > >
> >> > > but task is already holding lock:
> >> > > ffffffffa43817c0 (slab_mutex){+.+.}-{3:3}, at: kmem_cache_destroy+0x2a/0x120
> >> > >
> >
> > I tried to create & destroy a cryptarget which creates/destroy a cache
> > via bio_put_slab(). Either the callchain is different or something else
> > is but I didn't see a lockdep warning.
>
> RHEL-8 kernel seems to be 4.18, unless RT uses a newer one. Could be some
> silently relevant backport is missing? How about e.g. 59450bbc12be ("mm,
> slab, slub: stop taking cpu hotplug lock") ?
Hummm, looks like we have backported commit 59450bbc12be in RHEL-8.
Xin Long, would you be able to check if you still see the lockdep splat
with latest upstream RT?
git://git.kernel.org/pub/scm/linux/kernel/git/rt/linux-rt-devel.git linux-5.16.y-rt
Thanks!
Juri
Powered by blists - more mailing lists