[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <4DF1D0B1.1020302@jp.fujitsu.com>
Date: Fri, 10 Jun 2011 17:07:13 +0900
From: Hidetoshi Seto <seto.hidetoshi@...fujitsu.com>
To: "Luck, Tony" <tony.luck@...el.com>
CC: Ingo Molnar <mingo@...e.hu>, Borislav Petkov <bp@...64.org>,
linux-kernel@...r.kernel.org, "Huang, Ying" <ying.huang@...el.com>,
Avi Kivity <avi@...hat.com>
Subject: Re: [PATCH 05/10] MCE: Mask out address mask bits below address granuality
(2011/06/10 6:33), Luck, Tony wrote:
> From: Andi Kleen <andi@...stfloor.org>
>
> SER enabled systems report the address granuality for each
> reported address in a machine check. But the bits below
> the granuality are undefined. Mask them out before
> logging the machine check.
>
> Signed-off-by: Andi Kleen <ak@...ux.intel.com>
> Signed-off-by: Tony Luck <tony.luck@...el.com>
> ---
> arch/x86/kernel/cpu/mcheck/mce.c | 12 +++++++++++-
> 1 files changed, 11 insertions(+), 1 deletions(-)
>
> diff --git a/arch/x86/kernel/cpu/mcheck/mce.c b/arch/x86/kernel/cpu/mcheck/mce.c
> index 0349e87..ffc8d11 100644
> --- a/arch/x86/kernel/cpu/mcheck/mce.c
> +++ b/arch/x86/kernel/cpu/mcheck/mce.c
> @@ -539,8 +539,18 @@ static void mce_read_aux(struct mce *m, int i)
> {
> if (m->status & MCI_STATUS_MISCV)
> m->misc = mce_rdmsrl(MSR_IA32_MCx_MISC(i));
> - if (m->status & MCI_STATUS_ADDRV)
> + if (m->status & MCI_STATUS_ADDRV) {
> m->addr = mce_rdmsrl(MSR_IA32_MCx_ADDR(i));
> +
> + /*
> + * Mask the reported address by the reported granuality.
> + */
> + if (mce_ser && (m->status & MCI_STATUS_MISCV)) {
> + u8 shift = m->misc & 0x1f;
> + m->addr >>= shift;
> + m->addr <<= shift;
> + }
> + }
> }
>
> DEFINE_PER_CPU(unsigned, mce_poll_count);
Why do you have to mask it out in kernel, why not in user/logger?
One possible story is:
"... the brand-new Xeon XXXX has new MCx_***_VALID bit in ****
register, if it is set the lower bits of MCx_ADDR indicates
****, otherwise the bits are undefined ..."
So I think that kernel should convey the raw value from hardware to
userland. Even if it contains some noise on it, user can determine
whether it is useful or not. And more, since this is an error record,
there will be no second chance to retrieve the data afterward.
Thanks,
H.Seto
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists