[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20220427234954.GA3204792@paulmck-ThinkPad-P17-Gen-1>
Date: Wed, 27 Apr 2022 16:49:54 -0700
From: "Paul E. McKenney" <paulmck@...nel.org>
To: "Guilherme G. Piccoli" <gpiccoli@...lia.com>
Cc: akpm@...ux-foundation.org, bhe@...hat.com, pmladek@...e.com,
kexec@...ts.infradead.org, linux-kernel@...r.kernel.org,
bcm-kernel-feedback-list@...adcom.com, coresight@...ts.linaro.org,
linuxppc-dev@...ts.ozlabs.org, linux-alpha@...r.kernel.org,
linux-arm-kernel@...ts.infradead.org, linux-edac@...r.kernel.org,
linux-hyperv@...r.kernel.org, linux-leds@...r.kernel.org,
linux-mips@...r.kernel.org, linux-parisc@...r.kernel.org,
linux-pm@...r.kernel.org, linux-remoteproc@...r.kernel.org,
linux-s390@...r.kernel.org, linux-tegra@...r.kernel.org,
linux-um@...ts.infradead.org, linux-xtensa@...ux-xtensa.org,
netdev@...r.kernel.org, openipmi-developer@...ts.sourceforge.net,
rcu@...r.kernel.org, sparclinux@...r.kernel.org,
xen-devel@...ts.xenproject.org, x86@...nel.org,
kernel-dev@...lia.com, kernel@...ccoli.net, halves@...onical.com,
fabiomirmar@...il.com, alejandro.j.jimenez@...cle.com,
andriy.shevchenko@...ux.intel.com, arnd@...db.de, bp@...en8.de,
corbet@....net, d.hatayama@...fujitsu.com,
dave.hansen@...ux.intel.com, dyoung@...hat.com,
feng.tang@...el.com, gregkh@...uxfoundation.org,
mikelley@...rosoft.com, hidehiro.kawai.ez@...achi.com,
jgross@...e.com, john.ogness@...utronix.de, keescook@...omium.org,
luto@...nel.org, mhiramat@...nel.org, mingo@...hat.com,
peterz@...radead.org, rostedt@...dmis.org,
senozhatsky@...omium.org, stern@...land.harvard.edu,
tglx@...utronix.de, vgoyal@...hat.com, vkuznets@...hat.com,
will@...nel.org, Benjamin Herrenschmidt <benh@...nel.crashing.org>,
Catalin Marinas <catalin.marinas@....com>,
Florian Fainelli <f.fainelli@...il.com>,
Frederic Weisbecker <frederic@...nel.org>,
"H. Peter Anvin" <hpa@...or.com>,
Hari Bathini <hbathini@...ux.ibm.com>,
Joel Fernandes <joel@...lfernandes.org>,
Jonathan Hunter <jonathanh@...dia.com>,
Josh Triplett <josh@...htriplett.org>,
Lai Jiangshan <jiangshanlai@...il.com>,
Leo Yan <leo.yan@...aro.org>,
Mathieu Desnoyers <mathieu.desnoyers@...icios.com>,
Mathieu Poirier <mathieu.poirier@...aro.org>,
Michael Ellerman <mpe@...erman.id.au>,
Mike Leach <mike.leach@...aro.org>,
Mikko Perttunen <mperttunen@...dia.com>,
Neeraj Upadhyay <quic_neeraju@...cinc.com>,
Nicholas Piggin <npiggin@...il.com>,
Paul Mackerras <paulus@...ba.org>,
Suzuki K Poulose <suzuki.poulose@....com>,
Thierry Reding <thierry.reding@...il.com>,
Thomas Bogendoerfer <tsbogend@...ha.franken.de>
Subject: Re: [PATCH 20/30] panic: Add the panic informational notifier list
On Wed, Apr 27, 2022 at 07:49:14PM -0300, Guilherme G. Piccoli wrote:
> The goal of this new panic notifier is to allow its users to
> register callbacks to run earlier in the panic path than they
> currently do. This aims at informational mechanisms, like dumping
> kernel offsets and showing device error data (in case it's simple
> registers reading, for example) as well as mechanisms to disable
> log flooding (like hung_task detector / RCU warnings) and the
> tracing dump_on_oops (when enabled).
>
> Any (non-invasive) information that should be provided before
> kmsg_dump() as well as log flooding preventing code should fit
> here, as long it offers relatively low risk for kdump.
>
> For now, the patch is almost a no-op, although it changes a bit
> the ordering in which some panic notifiers are executed - specially
> affected by this are the notifiers responsible for disabling the
> hung_task detector / RCU warnings, which now run first. In a
> subsequent patch, the panic path will be refactored, then the
> panic informational notifiers will effectively run earlier,
> before ksmg_dump() (and usually before kdump as well).
>
> We also defer documenting it all properly in the subsequent
> refactor patch. Finally, while at it, we removed some useless
> header inclusions too.
>
> Cc: Benjamin Herrenschmidt <benh@...nel.crashing.org>
> Cc: Catalin Marinas <catalin.marinas@....com>
> Cc: Florian Fainelli <f.fainelli@...il.com>
> Cc: Frederic Weisbecker <frederic@...nel.org>
> Cc: "H. Peter Anvin" <hpa@...or.com>
> Cc: Hari Bathini <hbathini@...ux.ibm.com>
> Cc: Joel Fernandes <joel@...lfernandes.org>
> Cc: Jonathan Hunter <jonathanh@...dia.com>
> Cc: Josh Triplett <josh@...htriplett.org>
> Cc: Lai Jiangshan <jiangshanlai@...il.com>
> Cc: Leo Yan <leo.yan@...aro.org>
> Cc: Mathieu Desnoyers <mathieu.desnoyers@...icios.com>
> Cc: Mathieu Poirier <mathieu.poirier@...aro.org>
> Cc: Michael Ellerman <mpe@...erman.id.au>
> Cc: Mike Leach <mike.leach@...aro.org>
> Cc: Mikko Perttunen <mperttunen@...dia.com>
> Cc: Neeraj Upadhyay <quic_neeraju@...cinc.com>
> Cc: Nicholas Piggin <npiggin@...il.com>
> Cc: Paul Mackerras <paulus@...ba.org>
> Cc: Suzuki K Poulose <suzuki.poulose@....com>
> Cc: Thierry Reding <thierry.reding@...il.com>
> Cc: Thomas Bogendoerfer <tsbogend@...ha.franken.de>
> Signed-off-by: Guilherme G. Piccoli <gpiccoli@...lia.com>
>From an RCU perspective:
Acked-by: Paul E. McKenney <paulmck@...nel.org>
> ---
> arch/arm64/kernel/setup.c | 2 +-
> arch/mips/kernel/relocate.c | 2 +-
> arch/powerpc/kernel/setup-common.c | 2 +-
> arch/x86/kernel/setup.c | 2 +-
> drivers/bus/brcmstb_gisb.c | 2 +-
> drivers/hwtracing/coresight/coresight-cpu-debug.c | 4 ++--
> drivers/soc/tegra/ari-tegra186.c | 3 ++-
> include/linux/panic_notifier.h | 1 +
> kernel/hung_task.c | 3 ++-
> kernel/panic.c | 4 ++++
> kernel/rcu/tree.c | 1 -
> kernel/rcu/tree_stall.h | 3 ++-
> kernel/trace/trace.c | 2 +-
> 13 files changed, 19 insertions(+), 12 deletions(-)
>
> diff --git a/arch/arm64/kernel/setup.c b/arch/arm64/kernel/setup.c
> index 3505789cf4bd..ac2c7e8c9c6a 100644
> --- a/arch/arm64/kernel/setup.c
> +++ b/arch/arm64/kernel/setup.c
> @@ -444,7 +444,7 @@ static struct notifier_block arm64_panic_block = {
>
> static int __init register_arm64_panic_block(void)
> {
> - atomic_notifier_chain_register(&panic_notifier_list,
> + atomic_notifier_chain_register(&panic_info_list,
> &arm64_panic_block);
> return 0;
> }
> diff --git a/arch/mips/kernel/relocate.c b/arch/mips/kernel/relocate.c
> index 56b51de2dc51..650811f2436a 100644
> --- a/arch/mips/kernel/relocate.c
> +++ b/arch/mips/kernel/relocate.c
> @@ -459,7 +459,7 @@ static struct notifier_block kernel_location_notifier = {
>
> static int __init register_kernel_offset_dumper(void)
> {
> - atomic_notifier_chain_register(&panic_notifier_list,
> + atomic_notifier_chain_register(&panic_info_list,
> &kernel_location_notifier);
> return 0;
> }
> diff --git a/arch/powerpc/kernel/setup-common.c b/arch/powerpc/kernel/setup-common.c
> index 1468c3937bf4..d04b8bf8dbc7 100644
> --- a/arch/powerpc/kernel/setup-common.c
> +++ b/arch/powerpc/kernel/setup-common.c
> @@ -757,7 +757,7 @@ void __init setup_panic(void)
> &ppc_fadump_block);
>
> if (IS_ENABLED(CONFIG_RANDOMIZE_BASE) && kaslr_offset() > 0)
> - atomic_notifier_chain_register(&panic_notifier_list,
> + atomic_notifier_chain_register(&panic_info_list,
> &kernel_offset_notifier);
>
> /* Low-level platform-specific routines that should run on panic */
> diff --git a/arch/x86/kernel/setup.c b/arch/x86/kernel/setup.c
> index c95b9ac5a457..599b25346964 100644
> --- a/arch/x86/kernel/setup.c
> +++ b/arch/x86/kernel/setup.c
> @@ -1266,7 +1266,7 @@ static struct notifier_block kernel_offset_notifier = {
>
> static int __init register_kernel_offset_dumper(void)
> {
> - atomic_notifier_chain_register(&panic_notifier_list,
> + atomic_notifier_chain_register(&panic_info_list,
> &kernel_offset_notifier);
> return 0;
> }
> diff --git a/drivers/bus/brcmstb_gisb.c b/drivers/bus/brcmstb_gisb.c
> index 1ea7b015e225..c64e087fba7a 100644
> --- a/drivers/bus/brcmstb_gisb.c
> +++ b/drivers/bus/brcmstb_gisb.c
> @@ -486,7 +486,7 @@ static int __init brcmstb_gisb_arb_probe(struct platform_device *pdev)
>
> if (list_is_singular(&brcmstb_gisb_arb_device_list)) {
> register_die_notifier(&gisb_die_notifier);
> - atomic_notifier_chain_register(&panic_notifier_list,
> + atomic_notifier_chain_register(&panic_info_list,
> &gisb_panic_notifier);
> }
>
> diff --git a/drivers/hwtracing/coresight/coresight-cpu-debug.c b/drivers/hwtracing/coresight/coresight-cpu-debug.c
> index 1874df7c6a73..7b1012454525 100644
> --- a/drivers/hwtracing/coresight/coresight-cpu-debug.c
> +++ b/drivers/hwtracing/coresight/coresight-cpu-debug.c
> @@ -535,7 +535,7 @@ static int debug_func_init(void)
> &debug_func_knob_fops);
>
> /* Register function to be called for panic */
> - ret = atomic_notifier_chain_register(&panic_notifier_list,
> + ret = atomic_notifier_chain_register(&panic_info_list,
> &debug_notifier);
> if (ret) {
> pr_err("%s: unable to register notifier: %d\n",
> @@ -552,7 +552,7 @@ static int debug_func_init(void)
>
> static void debug_func_exit(void)
> {
> - atomic_notifier_chain_unregister(&panic_notifier_list,
> + atomic_notifier_chain_unregister(&panic_info_list,
> &debug_notifier);
> debugfs_remove_recursive(debug_debugfs_dir);
> }
> diff --git a/drivers/soc/tegra/ari-tegra186.c b/drivers/soc/tegra/ari-tegra186.c
> index 02577853ec49..4ef05ebed739 100644
> --- a/drivers/soc/tegra/ari-tegra186.c
> +++ b/drivers/soc/tegra/ari-tegra186.c
> @@ -73,7 +73,8 @@ static struct notifier_block tegra186_ari_panic_nb = {
> static int __init tegra186_ari_init(void)
> {
> if (of_machine_is_compatible("nvidia,tegra186"))
> - atomic_notifier_chain_register(&panic_notifier_list, &tegra186_ari_panic_nb);
> + atomic_notifier_chain_register(&panic_info_list,
> + &tegra186_ari_panic_nb);
>
> return 0;
> }
> diff --git a/include/linux/panic_notifier.h b/include/linux/panic_notifier.h
> index 0bb9dc0dea04..7364a346bcb0 100644
> --- a/include/linux/panic_notifier.h
> +++ b/include/linux/panic_notifier.h
> @@ -7,6 +7,7 @@
>
> extern struct atomic_notifier_head panic_notifier_list;
> extern struct atomic_notifier_head panic_hypervisor_list;
> +extern struct atomic_notifier_head panic_info_list;
>
> extern bool crash_kexec_post_notifiers;
>
> diff --git a/kernel/hung_task.c b/kernel/hung_task.c
> index 52501e5f7655..1b2d7111d5ac 100644
> --- a/kernel/hung_task.c
> +++ b/kernel/hung_task.c
> @@ -85,6 +85,7 @@ hung_task_panic(struct notifier_block *this, unsigned long event, void *ptr)
>
> static struct notifier_block panic_block = {
> .notifier_call = hung_task_panic,
> + .priority = INT_MAX, /* run early to prevent potential log flood */
> };
>
> static void check_hung_task(struct task_struct *t, unsigned long timeout)
> @@ -378,7 +379,7 @@ static int watchdog(void *dummy)
>
> static int __init hung_task_init(void)
> {
> - atomic_notifier_chain_register(&panic_notifier_list, &panic_block);
> + atomic_notifier_chain_register(&panic_info_list, &panic_block);
>
> /* Disable hung task detector on suspend */
> pm_notifier(hungtask_pm_notify, 0);
> diff --git a/kernel/panic.c b/kernel/panic.c
> index ef76f3f9c44d..73ca1bc44e30 100644
> --- a/kernel/panic.c
> +++ b/kernel/panic.c
> @@ -76,6 +76,9 @@ EXPORT_SYMBOL(panic_notifier_list);
> ATOMIC_NOTIFIER_HEAD(panic_hypervisor_list);
> EXPORT_SYMBOL(panic_hypervisor_list);
>
> +ATOMIC_NOTIFIER_HEAD(panic_info_list);
> +EXPORT_SYMBOL(panic_info_list);
> +
> static long no_blink(int state)
> {
> return 0;
> @@ -291,6 +294,7 @@ void panic(const char *fmt, ...)
> * add information to the kmsg dump output.
> */
> atomic_notifier_call_chain(&panic_hypervisor_list, PANIC_NOTIFIER, buf);
> + atomic_notifier_call_chain(&panic_info_list, PANIC_NOTIFIER, buf);
> atomic_notifier_call_chain(&panic_notifier_list, PANIC_NOTIFIER, buf);
>
> panic_print_sys_info(false);
> diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c
> index a4b8189455d5..d5a2674ae81c 100644
> --- a/kernel/rcu/tree.c
> +++ b/kernel/rcu/tree.c
> @@ -35,7 +35,6 @@
> #include <linux/panic.h>
> #include <linux/panic_notifier.h>
> #include <linux/percpu.h>
> -#include <linux/notifier.h>
> #include <linux/cpu.h>
> #include <linux/mutex.h>
> #include <linux/time.h>
> diff --git a/kernel/rcu/tree_stall.h b/kernel/rcu/tree_stall.h
> index 0c5d8516516a..d8a5840aad5d 100644
> --- a/kernel/rcu/tree_stall.h
> +++ b/kernel/rcu/tree_stall.h
> @@ -97,11 +97,12 @@ static int rcu_panic(struct notifier_block *this, unsigned long ev, void *ptr)
>
> static struct notifier_block rcu_panic_block = {
> .notifier_call = rcu_panic,
> + .priority = INT_MAX, /* run early to prevent potential log flood */
> };
>
> static int __init check_cpu_stall_init(void)
> {
> - atomic_notifier_chain_register(&panic_notifier_list, &rcu_panic_block);
> + atomic_notifier_chain_register(&panic_info_list, &rcu_panic_block);
> return 0;
> }
> early_initcall(check_cpu_stall_init);
> diff --git a/kernel/trace/trace.c b/kernel/trace/trace.c
> index c1d8a3622ccc..7d02f7a66bb1 100644
> --- a/kernel/trace/trace.c
> +++ b/kernel/trace/trace.c
> @@ -10138,7 +10138,7 @@ __init static int tracer_alloc_buffers(void)
> /* All seems OK, enable tracing */
> tracing_disabled = 0;
>
> - atomic_notifier_chain_register(&panic_notifier_list,
> + atomic_notifier_chain_register(&panic_info_list,
> &trace_panic_notifier);
>
> register_die_notifier(&trace_die_notifier);
> --
> 2.36.0
>
Powered by blists - more mailing lists