[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20210421010414.GA25579@hori.linux.bs1.fc.nec.co.jp>
Date: Wed, 21 Apr 2021 01:04:14 +0000
From: HORIGUCHI NAOYA(堀口 直也)
<naoya.horiguchi@....com>
To: "Luck, Tony" <tony.luck@...el.com>
CC: Jue Wang <juew@...gle.com>,
"nao.horiguchi@...il.com" <nao.horiguchi@...il.com>,
"akpm@...ux-foundation.org" <akpm@...ux-foundation.org>,
"bp@...en8.de" <bp@...en8.de>,
"david@...hat.com" <david@...hat.com>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
"linux-mm@...ck.org" <linux-mm@...ck.org>,
"luto@...nel.org" <luto@...nel.org>,
"osalvador@...e.de" <osalvador@...e.de>,
"yaoaili@...gsoft.com" <yaoaili@...gsoft.com>
Subject: Re: [PATCH v1 3/3] mm,hwpoison: add kill_accessing_process() to find
error virtual address
On Tue, Apr 20, 2021 at 08:42:53AM -0700, Luck, Tony wrote:
> On Mon, Apr 19, 2021 at 06:49:15PM -0700, Jue Wang wrote:
> > On Tue, 13 Apr 2021 07:43:20 +0900, Naoya Horiguchi wrote:
> > ...
> > > + * This function is intended to handle "Action Required" MCEs on already
> > > + * hardware poisoned pages. They could happen, for example, when
> > > + * memory_failure() failed to unmap the error page at the first call, or
> > > + * when multiple Action Optional MCE events races on different CPUs with
> > > + * Local MCE enabled.
> >
> > +Tony Luck
> >
> > Hey Tony, I thought SRAO MCEs are broadcasted to all cores in the system
> > as they come without an execution context, is it correct?
> >
> > If Yes, Naoya, I think we might want to remove the comments about the
> > "multiple Action Optional MCE racing" part.
>
> Jue,
>
> Correct. SRAO machine checks are broadcast. But rather than remove the
> second part, just replace with "multiple local machine checks on different
> CPUs".
This looks more precise, so I replaced as such in v3.
Thanks,
Naoya Horiguchi
Powered by blists - more mailing lists