[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <tip-680b6cfd3cee30a7d997d49430fb73af84523853@git.kernel.org>
Date: Thu, 17 Sep 2009 21:37:09 GMT
From: tip-bot for Hidetoshi Seto <seto.hidetoshi@...fujitsu.com>
To: linux-tip-commits@...r.kernel.org
Cc: linux-kernel@...r.kernel.org, hpa@...or.com, mingo@...hat.com,
jin.dongming@...css.fujitsu.com, seto.hidetoshi@...fujitsu.com,
ak@...ux.intel.com, tglx@...utronix.de, mingo@...e.hu
Subject: [tip:x86/pat] x86, mce: CE in last bank prevents panic by unknown MCE
Commit-ID: 680b6cfd3cee30a7d997d49430fb73af84523853
Gitweb: http://git.kernel.org/tip/680b6cfd3cee30a7d997d49430fb73af84523853
Author: Hidetoshi Seto <seto.hidetoshi@...fujitsu.com>
AuthorDate: Wed, 26 Aug 2009 16:20:36 +0900
Committer: Ingo Molnar <mingo@...e.hu>
CommitDate: Wed, 26 Aug 2009 20:21:11 +0200
x86, mce: CE in last bank prevents panic by unknown MCE
If MCE handler is called but none of mces_seen have machine
check event which might signal the MCE (i.e. event higher than
MCE_KEEP_SEVERITY), panic with "Machine check from unknown
source" will be taken since the MCE is assumed to be signaled
from external agent or so.
Usually mces_seen never point MCE_KEEP_SEVERITY event such as
CE. But it can happen because initial value of mces_seen is
accidentally modified by mce_no_way_out() - in case if
mce_no_way_out() run through all banks and the last bank has
the CE, mces_seen points the CE and the "panic by unknown" will
not be taken.
This patch fixes this undesired behavior, and clarifies the logic.
Signed-off-by: Hidetoshi Seto <seto.hidetoshi@...fujitsu.com>
Cc: H. Peter Anvin <hpa@...or.com>
Cc: Andi Kleen <ak@...ux.intel.com>
Cc: Jin Dongming <jin.dongming@...css.fujitsu.com>
LKML-Reference: <4A94E244.3020301@...fujitsu.com>
Signed-off-by: Ingo Molnar <mingo@...e.hu>
Reported-by: Jin Dongming <jin.dongming@...css.fujitsu.com>
---
arch/x86/kernel/cpu/mcheck/mce.c | 8 ++++----
1 files changed, 4 insertions(+), 4 deletions(-)
diff --git a/arch/x86/kernel/cpu/mcheck/mce.c b/arch/x86/kernel/cpu/mcheck/mce.c
index 54bd1b2..325559d 100644
--- a/arch/x86/kernel/cpu/mcheck/mce.c
+++ b/arch/x86/kernel/cpu/mcheck/mce.c
@@ -612,7 +612,7 @@ out:
* This way we prevent any potential data corruption in a unrecoverable case
* and also makes sure always all CPU's errors are examined.
*
- * Also this detects the case of an machine check event coming from outer
+ * Also this detects the case of a machine check event coming from outer
* space (not detected by any CPUs) In this case some external agent wants
* us to shut down, so panic too.
*
@@ -665,7 +665,7 @@ static void mce_reign(void)
* No machine check event found. Must be some external
* source or one CPU is hung. Panic.
*/
- if (!m && tolerant < 3)
+ if (global_worst <= MCE_KEEP_SEVERITY && tolerant < 3)
mce_panic("Machine check from unknown source", NULL, NULL);
/*
@@ -889,11 +889,11 @@ void do_machine_check(struct pt_regs *regs, long error_code)
mce_setup(&m);
m.mcgstatus = mce_rdmsrl(MSR_IA32_MCG_STATUS);
- no_way_out = mce_no_way_out(&m, &msg);
-
final = &__get_cpu_var(mces_seen);
*final = m;
+ no_way_out = mce_no_way_out(&m, &msg);
+
barrier();
/*
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists