lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <21a09968-296b-5b21-8079-6d9d4e0769d4@molgen.mpg.de>
Date:   Wed, 12 Apr 2023 17:11:26 +0200
From:   Paul Menzel <pmenzel@...gen.mpg.de>
To:     Thomas Gleixner <tglx@...utronix.de>,
        Ingo Molnar <mingo@...hat.com>, Borislav Petkov <bp@...en8.de>,
        Dave Hansen <dave.hansen@...ux.intel.com>, x86@...nel.org
Cc:     LKML <linux-kernel@...r.kernel.org>
Subject: AMD EPYC 25 (19h): Hardware Error: Machine Check: 0 Bank 17:
 d42040000000011b

Dear Linux folks,


On a Dell PowerEdge R7525 with AMD EPYC 7763 64-Core Processor, Linux 
5.15.94 logs the machine check exceptions (MCE) below:

```
[5154053.127240] mce: [Hardware Error]: Machine check events logged
[5154053.133711] mce: [Hardware Error]: CPU 3: Machine Check: 0 Bank 17: 
d42040000000011b
[5154053.141948] mce: [Hardware Error]: TSC 0 ADDR b3cbdbbc0 PPIN 
2b615bef7f48098 SYND 6bd210000a801002 IPID 9600650f00
[5154053.152893] mce: [Hardware Error]: PROCESSOR 2:a00f11 TIME 
1679213602 SOCKET 0 APIC 6 microcode a001173
[5608214.292978] mce: [Hardware Error]: Machine check events logged
[5608214.299771] mce: [Hardware Error]: CPU 3: Machine Check: 0 Bank 17: 
d42040000000011b
[5608214.308066] mce: [Hardware Error]: TSC 0 ADDR b3cbdbbc0 PPIN 
2b615bef7f48098 SYND 6bd210000a801002 IPID 9600650f00
[5608214.319102] mce: [Hardware Error]: PROCESSOR 2:a00f11 TIME 
1679667766 SOCKET 0 APIC 6 microcode a001173
[5707500.646385] mce: [Hardware Error]: Machine check events logged
[5707500.652973] mce: [Hardware Error]: CPU 3: Machine Check: 0 Bank 17: 
d42041000000011b
[5707500.661238] mce: [Hardware Error]: TSC 0 ADDR b3cbdbbc0 PPIN 
2b615bef7f48098 SYND 6bd210000a801002 IPID 9600650f00
[5707500.672271] mce: [Hardware Error]: PROCESSOR 2:a00f11 TIME 
1679767053 SOCKET 0 APIC 6 microcode a001173
[5810063.788078] mce: [Hardware Error]: Machine check events logged
[5810063.794698] mce: [Hardware Error]: CPU 3: Machine Check: 0 Bank 17: 
d42041000000011b
[5810063.803126] mce: [Hardware Error]: TSC 0 ADDR b3cbdbbc0 PPIN 
2b615bef7f48098 SYND 6bd210000a801002 IPID 9600650f00
[5810063.814264] mce: [Hardware Error]: PROCESSOR 2:a00f11 TIME 
1679869617 SOCKET 0 APIC 6 microcode a001173
```

Does GNU/Linux offer a way to decode this automatically?


Kind regards,

Paul


PS:

```
$ lscpu
Architecture:            x86_64
   CPU op-mode(s):        32-bit, 64-bit
   Address sizes:         48 bits physical, 48 bits virtual
   Byte Order:            Little Endian
CPU(s):                  256
   On-line CPU(s) list:   0-255
Vendor ID:               AuthenticAMD
   Model name:            AMD EPYC 7763 64-Core Processor
     CPU family:          25
     Model:               1
     Thread(s) per core:  2
     Core(s) per socket:  64
     Socket(s):           2
     Stepping:            1
     Frequency boost:     enabled
     CPU max MHz:         3529.0520
     CPU min MHz:         1500.0000
     BogoMIPS:            4890.81
     Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep 
mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx 
mmxext fxsr_o
                          pt pdpe1gb rdtscp lm constant_tsc rep_good 
nopl nonstop_tsc cpuid extd_apicid aperfmperf rapl pni pclmulqdq monitor 
ssse3 fma
                           cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt 
aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm 
sse4a mi
                          salignsse 3dnowprefetch osvw ibs skinit wdt 
tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 
cdp_l3 in
                          vpcid_single hw_pstate ssbd mba ibrs ibpb 
stibp vmmcall fsgsbase bmi1 avx2 smep bmi2 invpcid cqm rdt_a rdseed adx 
smap clflus
                          hopt clwb sha_ni xsaveopt xsavec xgetbv1 
xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf 
xsaveerptr rd
                          pru wbnoinvd amd_ppin arat npt lbrv svm_lock 
nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter 
pfthreshold
                           v_vmsave_vmload vgif v_spec_ctrl umip pku 
ospke vaes vpclmulqdq rdpid overflow_recov succor smca
Virtualization features:
   Virtualization:        AMD-V
Caches (sum of all):
   L1d:                   4 MiB (128 instances)
   L1i:                   4 MiB (128 instances)
   L2:                    64 MiB (128 instances)
   L3:                    512 MiB (16 instances)
NUMA:
   NUMA node(s):          8
   NUMA node0 CPU(s):     0-15,128-143
   NUMA node1 CPU(s):     16-31,144-159
   NUMA node2 CPU(s):     32-47,160-175
   NUMA node3 CPU(s):     48-63,176-191
   NUMA node4 CPU(s):     64-79,192-207
   NUMA node5 CPU(s):     80-95,208-223
   NUMA node6 CPU(s):     96-111,224-239
   NUMA node7 CPU(s):     112-127,240-255
Vulnerabilities:
   Itlb multihit:         Not affected
   L1tf:                  Not affected
   Mds:                   Not affected
   Meltdown:              Not affected
   Mmio stale data:       Not affected
   Retbleed:              Not affected
   Spec store bypass:     Mitigation; Speculative Store Bypass disabled 
via prctl and seccomp
   Spectre v1:            Mitigation; usercopy/swapgs barriers and 
__user pointer sanitization
   Spectre v2:            Mitigation; Retpolines, IBPB conditional, 
IBRS_FW, STIBP always-on, RSB filling, PBRSB-eIBRS Not affected
   Srbds:                 Not affected
   Tsx async abort:       Not affected
```

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ