[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20190430132811.GB2589@hirez.programming.kicks-ass.net>
Date: Tue, 30 Apr 2019 15:28:11 +0200
From: Peter Zijlstra <peterz@...radead.org>
To: Sebastian Andrzej Siewior <bigeasy@...utronix.de>
Cc: linux-rt-users <linux-rt-users@...r.kernel.org>,
LKML <linux-kernel@...r.kernel.org>,
Thomas Gleixner <tglx@...utronix.de>,
Daniel Bristot de Oliveira <bristot@...hat.com>,
Clark Williams <williams@...hat.com>,
Juri Lelli <juri.lelli@...hat.com>,
Oleg Nesterov <oleg@...hat.com>, jack@...e.com
Subject: Re: [RT WARNING] DEBUG_LOCKS_WARN_ON(rt_mutex_owner(lock) !=
current) with fsfreeze (4.19.25-rt16)
On Tue, Apr 30, 2019 at 02:51:31PM +0200, Sebastian Andrzej Siewior wrote:
> On 2019-04-19 10:56:27 [+0200], Juri Lelli wrote:
> > On 26/03/19 10:34, Juri Lelli wrote:
> > > Hi,
> > >
> > > Running this reproducer on a 4.19.25-rt16 kernel (with lock debugging
> > > turned on) produces warning below.
> >
> > And I now think this might lead to an actual crash.
>
> Peter, could you please take a look at the thread:
> https://lkml.kernel.org/r/20190419085627.GI4742@localhost.localdomain
>
> I assumed that returning to userland with acquired locks is something we
> did not want…
Yeah, but AFAIK fs freezing code has a history of doing exactly that..
This is just the latest incarnation here.
So the immediate problem here is that the task doing thaw isn't the same
that did freeze, right? The thing is, I'm not seeing how that isn't a
problem with upstream either.
The freeze code seems to do: percpu_down_write() for the various states,
and then frobs lockdep state.
Thaw then does the reverse, frobs lockdep and then does: percpu_up_write().
percpu_down_write() directly relies on down_write(), and
percpu_up_write() on up_write(). And note how __up_write() has:
DEBUG_RWSEMS_WARN_ON(sem->owner != current, sem);
So why isn't this same code coming unstuck in mainline?
Powered by blists - more mailing lists