Message-ID: <20210420154253.GA577572@agluck-desk2.amr.corp.intel.com>
Date: Tue, 20 Apr 2021 08:42:53 -0700
From: "Luck, Tony" <tony.luck@...el.com>
To: Jue Wang <juew@...gle.com>
Cc: nao.horiguchi@...il.com, akpm@...ux-foundation.org, bp@...en8.de,
david@...hat.com, linux-kernel@...r.kernel.org, linux-mm@...ck.org,
luto@...nel.org, naoya.horiguchi@....com, osalvador@...e.de,
yaoaili@...gsoft.com
Subject: Re: [PATCH v1 3/3] mm,hwpoison: add kill_accessing_process() to find
error virtual address

On Mon, Apr 19, 2021 at 06:49:15PM -0700, Jue Wang wrote:
> On Tue, 13 Apr 2021 07:43:20 +0900, Naoya Horiguchi wrote:
> ...
> > + * This function is intended to handle "Action Required" MCEs on already
> > + * hardware poisoned pages. They could happen, for example, when
> > + * memory_failure() failed to unmap the error page at the first call, or
> > + * when multiple Action Optional MCE events races on different CPUs with
> > + * Local MCE enabled.
>
> +Tony Luck
>
> Hey Tony, I thought SRAO MCEs are broadcasted to all cores in the system
> as they come without an execution context, is it correct?
>
> If Yes, Naoya, I think we might want to remove the comments about the
> "multiple Action Optional MCE racing" part.

Jue,

Correct. SRAO machine checks are broadcast. But rather than remove the
second part, just replace with "multiple local machine checks on different
CPUs".

-Tony