[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20251007173851.GD310336@yaz-khff2.amd.com>
Date: Tue, 7 Oct 2025 13:38:51 -0400
From: Yazen Ghannam <yazen.ghannam@....com>
To: "Luck, Tony" <tony.luck@...el.com>
Cc: Borislav Petkov <bp@...en8.de>,
"linux-edac@...r.kernel.org" <linux-edac@...r.kernel.org>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
"avadhut.naik@....com" <avadhut.naik@....com>,
"john.allen@....com" <john.allen@....com>
Subject: Re: [PATCH] RAS/AMD/FMPM: Add option to ignore CEs
On Tue, Oct 07, 2025 at 04:52:55PM +0000, Luck, Tony wrote:
> > I think it's a common use case without FMPM.
> >
> > IOW, log correctable errors but don't offline memory because of them.
> >
> > Does that sounds better or about the same?
>
> Linux has /proc/sys/vm/enable_soft_offline toggle for that case.
>
Thanks, that's a good suggestion.
We would still need a check in fru_handle_mem_poison() to skip saving
records to persistent storage.
And we would need a code update in _retire_row_mi300() to use the
soft_offline path.
Thanks,
Yazen
Powered by blists - more mailing lists