[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <20070322195043.GA19732@cip.informatik.uni-erlangen.de>
Date: Thu, 22 Mar 2007 20:50:43 +0100
From: Thomas Glanzmann <thomas@...nzmann.de>
To: LKML <linux-kernel@...r.kernel.org>
Subject: MCE Erros
Hello,
I have two Dual Opteron Machines where I get two MCE errors on. The
first one is:
MCE 0
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 0 4 northbridge TSC edc587de6e99
ADDR 1001a0000
Northbridge GART error
bit61 = error uncorrected
TLB error 'generic transaction, level generic'
STATUS a40000000005001b MCGSTATUS 0
I see this error exactly 8 times. What does 'GART' mean?
And here is another one another box:
MCE 0
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 1 4 northbridge TSC f23151075b21d
ADDR b8898250
Northbridge Chipkill ECC error
Chipkill ECC syndrome = f858
bit32 = err cpu0
bit46 = corrected ecc error
bit62 = error overflow (multiple errors)
bus error 'local node origin, request didn't time out
generic read mem transaction
memory access, level generic'
STATUS d42c4001f8080813 MCGSTATUS 0
How do I identify the broken Memory Modules?
Thomas
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists