Message-ID: <20141113212355.GH14070@pd.tnic>
Date: Thu, 13 Nov 2014 22:23:55 +0100
From: Borislav Petkov <bp@...en8.de>
To: "Luck, Tony" <tony.luck@...el.com>,
Andy Lutomirski <luto@...capital.net>
Cc: Oleg Nesterov <oleg@...hat.com>, X86 ML <x86@...nel.org>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
Peter Zijlstra <peterz@...radead.org>,
Andi Kleen <andi@...stfloor.org>, kvm ML <kvm@...r.kernel.org>
Subject: Re: [RFC PATCH] x86, entry: Switch stacks on a paranoid entry from
userspace
On Thu, Nov 13, 2014 at 11:59:37AM +0100, Borislav Petkov wrote:
> I've been thinking about it recently too - adding MCA functionality to
> qemu/kvm could be very useful, especially the thresholding stuff, for
> testing RAS kernel code.
Btw, the qemu monitor has an mce injection command with which I was able
to tickle some response from the guest kernel. I'll play more with it
tomorrow and try to tickle a response from the memory failure code, too.
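For reference, the monitor command in question takes the form
`mce [-b] cpu bank status mcgstatus addr misc`. A minimal sketch of how
such a command line could be assembled follows. The helper name
`hmp_mce_command` is hypothetical, and the argument values are read back
from the guest log in this mail (CPU 1, bank 4, MCG status 3, the
MC4_STATUS and MISC values). That they are exactly what was typed into
the monitor is an assumption, since the injection path may adjust
status bits on the way.

```python
def hmp_mce_command(cpu, bank, status, mcg_status, addr, misc, broadcast=False):
    """Build a qemu monitor (HMP) 'mce' command line as a string.

    Hypothetical helper for illustration; it only formats text and
    does not talk to a running qemu instance.
    """
    parts = ["mce"]
    if broadcast:
        # -b asks the monitor to broadcast the MCE to the other CPUs
        parts.append("-b")
    parts += [
        str(cpu),
        str(bank),
        hex(status),
        hex(mcg_status),
        hex(addr),
        hex(misc),
    ]
    return " ".join(parts)


# Values taken from the guest log (an assumption, not the literal input)
print(hmp_mce_command(1, 4, 0xFE002000001F012B, 0x3, 0x0, 0xD1D1DAD1DEADBEEF))
```

The resulting string would be pasted at the `(qemu)` monitor prompt.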
[ 195.328466] Disabling lock debugging due to kernel taint
[ 195.328466] [Hardware Error]: System Fatal error.
[ 195.328466] [Hardware Error]: CPU:1 (10:2:3) MC4_STATUS[Over|UE|MiscV|PCC|AddrV|UECC]: 0xfe002000001f012b
[ 195.328466] [Hardware Error]: MC4_ADDR: 0x0000000000000000
[ 195.328466] [Hardware Error]: MC4 Error (node 1): ECC Error in the Probe Filter directory.
[ 195.328466] [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: WR
[ 195.328466] mce: [Hardware Error]: CPU 1: Machine Check Exception: 3 Bank 4: fe002000001f012b
[ 195.328466] mce: [Hardware Error]: RIP 10:<ffffffff8100e0b5> {default_idle+0x25/0x240}
[ 195.328466] mce: [Hardware Error]: TSC b9e2f56f95 MISC d1d1dad1deadbeef
[ 195.328466] mce: [Hardware Error]: PROCESSOR 2:100f23 TIME 1415915466 SOCKET 1 APIC 1 microcode 1000065
[ 195.328466] [Hardware Error]: System Fatal error.
[ 195.328466] [Hardware Error]: CPU:1 (10:2:3) MC4_STATUS[Over|UE|MiscV|PCC|AddrV|UECC]: 0xfe002000001f012b
[ 195.328466] [Hardware Error]: MC4_ADDR: 0x0000000000000000
[ 195.328466] [Hardware Error]: MC4 Error (node 1): ECC Error in the Probe Filter directory.
[ 195.328466] [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: WR
[ 195.328466] mce: [Hardware Error]: Machine check: Invalid
[ 195.328466] Kernel panic - not syncing: Fatal machine check on current CPU
[ 195.328466] Kernel Offset: 0x0 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffff9fffffff)
[ 195.328466] ---[ end Kernel panic - not syncing: Fatal machine check on current CPU
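To make the MC4_STATUS line above easier to read, here is a rough
sketch of how 0xfe002000001f012b breaks down into the flags and fields
the kernel printed. The function name `decode_status` and the tables
are mine, not kernel code. The bit positions for CECC/UECC (46/45), the
5-bit extended error code at bits 20:16, and the LL/TT/RRRR split of
the low error code are assumptions based on the AMD family 0x10
northbridge bank layout. Note that the kernel prints the UC bit as "UE".

```python
# Architectural MCi_STATUS flag bits
FLAGS = [
    (63, "Val"),
    (62, "Over"),
    (61, "UC"),      # shown as "UE" in the decoded kernel line
    (60, "En"),
    (59, "MiscV"),
    (58, "AddrV"),
    (57, "PCC"),
]
# AMD-specific ECC bits (assumed family 0x10 layout)
AMD_FLAGS = [(46, "CECC"), (45, "UECC")]

# Memory-hierarchy error code fields (assumed encoding: 0000 0001 RRRR TTLL)
LL = ["RESV", "L1", "L2", "L3/GEN"]
TT = ["INSN", "DATA", "GEN", "RESV"]
RRRR = ["GEN", "RD", "WR", "DRD", "DWR", "IRD", "PRF", "EV", "SNP"]


def decode_status(status):
    """Split an MCi_STATUS value into flag names and error-code fields."""
    err = status & 0xFFFF
    rrrr = (err >> 4) & 0xF
    return {
        "flags": [n for bit, n in FLAGS + AMD_FLAGS if status & (1 << bit)],
        "ext_err_code": (status >> 16) & 0x1F,
        "err_code": err,
        "cache_level": LL[err & 0x3],
        "tx": TT[(err >> 2) & 0x3],
        # Values past the table are reported as "-" rather than guessed
        "mem_tx": RRRR[rrrr] if rrrr < len(RRRR) else "-",
    }


d = decode_status(0xFE002000001F012B)
print("|".join(d["flags"]))
print(hex(d["ext_err_code"]), hex(d["err_code"]))
print(d["cache_level"], d["tx"], d["mem_tx"])
```

Under these assumptions the fields line up with the log: Over, UC, MiscV,
PCC, AddrV and UECC are set, the extended error code is 0x1f, and the
low bits decode to L3/GEN, GEN, WR.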
--
Regards/Gruss,
Boris.
Sent from a fat crate under my desk. Formatting is fine.