[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <D5EEJIV08CS8.267DGCUDZLRR3@kernel.org>
Date: Tue, 05 Nov 2024 18:56:47 +0200
From: "Jarkko Sakkinen" <jarkko@...nel.org>
To: "Yazen Ghannam" <yazen.ghannam@....com>, "Shuai Xue"
<xueshuai@...ux.alibaba.com>
Cc: <mark.rutland@....com>, <catalin.marinas@....com>, <mingo@...hat.com>,
<robin.murphy@....com>, <Jonathan.Cameron@...wei.com>, <bp@...en8.de>,
<rafael@...nel.org>, <wangkefeng.wang@...wei.com>, <tanxiaofei@...wei.com>,
<mawupeng1@...wei.com>, <tony.luck@...el.com>, <linmiaohe@...wei.com>,
<naoya.horiguchi@....com>, <james.morse@....com>, <tongtiangen@...wei.com>,
<gregkh@...uxfoundation.org>, <will@...nel.org>,
<linux-acpi@...r.kernel.org>, <linux-mm@...ck.org>,
<linux-kernel@...r.kernel.org>, <akpm@...ux-foundation.org>,
<linux-edac@...r.kernel.org>, <x86@...nel.org>, <justin.he@....com>,
<ardb@...nel.org>, <ying.huang@...el.com>, <ashish.kalra@....com>,
<baolin.wang@...ux.alibaba.com>, <tglx@...utronix.de>,
<dave.hansen@...ux.intel.com>, <lenb@...nel.org>, <hpa@...or.com>,
<robert.moore@...el.com>, <lvying6@...wei.com>, <xiexiuqi@...wei.com>,
<zhuo.song@...ux.alibaba.com>
Subject: Re: [PATCH v16 1/3] ACPI: APEI: send SIGBUS to current task if
synchronous memory error not recovered
On Tue Nov 5, 2024 at 5:09 PM EET, Yazen Ghannam wrote:
> > diff --git a/drivers/acpi/apei/ghes.c b/drivers/acpi/apei/ghes.c
> > index ada93cfde9ba..af3339dd3817 100644
> > --- a/drivers/acpi/apei/ghes.c
> > +++ b/drivers/acpi/apei/ghes.c
> > @@ -801,6 +801,16 @@ static bool ghes_do_proc(struct ghes *ghes,
> > }
> > }
> >
> > + /*
> > + * If no memory failure work is queued for abnormal synchronous
> > + * errors, do a force kill.
> > + */
> > + if (sync && !queued) {
> > + pr_err(HW_ERR GHES_PFX "%s:%d: hardware memory corruption (SIGBUS)\n",
>
> Is this always a memory error? The code flow above implies that an
> unrecoverable ARM processor error can all be !queued. So should the
> message be more generic like "synchronous unrecoverable error" or
> similar?
>
> In any case, this is just a minor nit if this is not an issue in
> practice.
One minor thing that came to mind after reading your response: wouldn't
it be a better idea to use dev_err() against ghes->dev, rather than raw
pr_err()?
That would better context information and also I just (re-)checked the
file and also ghes_remove() is using dev_err().
>
> Reviewed-by: Yazen Ghannam <yazen.ghannam@....com>
>
> Thanks,
> Yazen
BR, Jarkko
Powered by blists - more mailing lists