[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <3908561D78D1C84285E8C5FCA982C28F3282054D@ORSMSX114.amr.corp.intel.com>
Date: Wed, 28 May 2014 17:21:57 +0000
From: "Luck, Tony" <tony.luck@...el.com>
To: "masbock@...ux.vnet.ibm.com" <masbock@...ux.vnet.ibm.com>,
Chen Yucong <slaoub@...il.com>
CC: Borislav Petkov <bp@...en8.de>,
LKML <linux-kernel@...r.kernel.org>,
linux-edac <linux-edac@...r.kernel.org>, X86 ML <x86@...nel.org>
Subject: RE: [RFC PATCH 0/3] RAS: Correctable Errors Collector thing
> A possible alternative would be to soft-offline the page. This is
> currently done in APEI code when corrected memory error thresholds are
> exceeded and reported by UEFI via a generic hardware error source
> (GHES).
+1
This is what the existing mcelog(8) daemon does when it sees an excessive
number of corrected errors on a page (using /sys/devices/system/memory/soft_offline_page
as the user->kernel interface to get to this function).
-Tony
Powered by blists - more mailing lists