[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20250704132518.GDaGfWPnAZI2FY8EnM@fat_crate.local>
Date: Fri, 4 Jul 2025 15:25:18 +0200
From: Borislav Petkov <bp@...en8.de>
To: Breno Leitao <leitao@...ian.org>
Cc: Len Brown <lenb@...nel.org>, James Morse <james.morse@....com>,
Jonathan Corbet <corbet@....net>, tony.luck@...el.com,
rafael@...nel.org, Alexei Starovoitov <ast@...nel.org>,
kbusch@...nel.org, rmikey@...a.com, kuba@...nel.org,
linux-edac@...r.kernel.org, mchehab@...nel.org,
linux-acpi@...r.kernel.org, linux-kernel@...r.kernel.org,
linux-doc@...r.kernel.org, kernel-team@...a.com
Subject: Re: [PATCH 0/2] panic: taint flag for recoverable hardware errors
On Fri, Jul 04, 2025 at 01:15:06PM +0100, Breno Leitao wrote:
> The information is not there to show correlation of broken hardware,
> but,
I didn't say that.
I say that users will misunderstand this taint. Like all the other things we
have issued wrt RAS - people jump to conclusions without even reading english
text. Not to even talk about taint flags.
You having to explain it here basically proves my point.
> For instance, reading from `cat /proc/sys/kernel/tainted` might be
> *way easier* than parsing *thousands* different RAS tools logs for you
> to find what is going on.
Thousands huh? I know of only two but maybe you will enlighten me.
And those I know can simply dump you an error log which you can check. It is
way easy already.
> Anyway, I am happy to add this information somewhere else if you think
> that taint is not the right place.
Documentation/admin-guide/kdump/vmcoreinfo.rst could be one place.
But again, this is redundant info which you can read out from logs which you
already *have* to collect anyway, in a large fleet.
IMO, you have everything already and this is not really needed.
--
Regards/Gruss,
Boris.
https://people.kernel.org/tglx/notes-about-netiquette
Powered by blists - more mailing lists