[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <87pmdqs5vr.ffs@tglx>
Date: Mon, 14 Nov 2022 02:04:56 +0100
From: Thomas Gleixner <tglx@...utronix.de>
To: Steven Rostedt <rostedt@...dmis.org>
Cc: linux-kernel@...r.kernel.org,
Linus Torvalds <torvalds@...ux-foundation.org>,
Stephen Boyd <sboyd@...nel.org>,
Guenter Roeck <linux@...ck-us.net>,
Anna-Maria Gleixner <anna-maria@...utronix.de>,
Andrew Morton <akpm@...ux-foundation.org>,
Julia Lawall <Julia.Lawall@...ia.fr>
Subject: Re: [PATCH v6 4/6] timers: Add timer_shutdown_sync() to be called
before freeing timers
On Sun, Nov 13 2022 at 19:11, Steven Rostedt wrote:
> On Sun, 13 Nov 2022 22:52:16 +0100
> Thomas Gleixner <tglx@...utronix.de> wrote:
>> > We are hitting a common bug were a timer is being triggered after it
>> > is
>>
>> We are hitting? Talking in pluralis majestatis by now?
>
> Should I say Chromebooks are hitting?
That would be at least more comprehensible than 'We', unless you (or
whoever is 'We') is a synomym for chromeborks.
>> > freed. This causes a corruption in the timer link list and crashes the
>> > kernel. Unfortunately it is not easy to know what timer it was that was
>>
>> Well, that's not entirely true. debugobjects can tell you exactly what
>> happens.
>
> Only if you have it enabled when it happens, and it has too much
> overhead to run in production. The full series changes debug object
> timers to report an issue if there's a timer not in the shutdown state
> when it is freed.
The series changes 'debug object timers' to report an issue?
Can you pretty please stop this completely nonsensical blurb? This
series has absolutely nothing to do with debugobjects at least not to
my knowledge. If the series expands the magics of debugobjects then
you fundamentaly failed to explain that.
> This catches potential issues similar to how lockdep can catch
> potential deadlocks without having to hit the deadlock.
By introducing new problems?
> The current debug object timers only catches it if the race condition
> is hit.
True. But most if not all of the mentioned issues have been reported
before via debugobject enabled kernels. So what's the actual benefit?
>> > + * @timer: The timer to be freed
>> > + *
>> > + * Shutdown the timer before freeing. This will return when all pending timers
>> > + * have finished and it is safe to free the timer.
>>
>> "_ALL_ pending timers have finished?"
>>
>> This is about exactly _ONE_ timer, i.e. the one which is handed in via
>> the @timer argument.
>>
>> You want to educate people to do the right thing and then you go and
>> provide them uncomprehensible documentation garbage. How is that
>> supposed to work?
>
> I don't know. Other people I showed this to appeared to understand it.
> But I'm all for updates.
Do I really need to explain to you what the diffference between 'all
pending timers' and the one which is subject of the function call is?
No, I'm not rewriting this for you and your peers who care obviously as
much about correctness as you do.
>> Can you please stop this frenzy and get your act together?
>
> What the hell. I'm just trying to get this in because it's a thorn in
> our side.
It's not a thorn in 'our' (who ever is our) side. It's a fundamental
problem of circular shutdown dependencies as I explained to you long
ago.
> Sorry I'm not up to par with your expectations. I'm willing to make
> changes, but let's leave out the insults. This work is being done on
> top of my day job.
Sure and because of that you are talking about this as a 'thorn on our
side'. If that's a thorn at (I assume) your employers side, which is
then related to your day job, then you should have the backing of that
company to spend company time on it and not inflict half baken changes
on the kernel which solve nothing.
Coming back to your claim that I'm insulting. Please point me to the
actual insult I commenced and I'm happy to apologize.
Thanks,
Thomas
Powered by blists - more mailing lists