[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <20251205094510.4671-1-nnmlinux@linux.ibm.com>
Date: Fri, 5 Dec 2025 03:45:06 -0600
From: Narayana Murty N <nnmlinux@...ux.ibm.com>
To: mahesh@...ux.ibm.com, maddy@...ux.ibm.com, mpe@...erman.id.au,
christophe.leroy@...roup.eu, gregkh@...uxfoundation.org,
oohall@...il.com, npiggin@...il.com
Cc: linuxppc-dev@...ts.ozlabs.org, linux-kernel@...r.kernel.org,
tyreld@...ux.ibm.com, vaibhav@...ux.ibm.com, sbhat@...ux.ibm.com,
ganeshgr@...ux.ibm.com, sourabhjain@...ux.ibm.com
Subject: [PATCH 0/4] powerpc/pseries: Add full RTAS-based error injection support
The series introduces complete support for RTAS-based hardware error
injection on pseries platforms. The implementation replaces the legacy
MMIO-based approach with a full PAPR-compliant workflow built around the
RTAS services:
ibm,open-errinjct
ibm,errinjct
ibm,close-errinjct
The new pseries_eeh_err_inject() interface enables controlled injection
of synthetic PCI, memory, and cache/TLB faults for platform validation,
EEH testing, firmware diagnostics, and tooling (e.g., bpftrace-based
tracing).
Current testing scope:
At this stage, the feature can be triggered only from VFIO-passthrough
devices assigned to a guest, and from userspace-exposed VFIO devices
using the VFIO_EEH_PE_INJECT_ERR ioctl.
Key Highlights
Dynamic acquisition of all required RTAS tokens and correct
open/errinjct/close session handling as defined by PAPR. A 1KB
naturally-aligned, zero-initialized RTAS working buffer is populated
exactly per PAPR buffer definitions.
Support for a wide range of error types:
0x03 - recovered-special-event
0x04 - corrupted-page
0x07 - ioa-bus-error (32-bit)
0x0F - ioa-bus-error-64 (64-bit)
0x09 - corrupted-dcache-start
0x0A - corrupted-dcache-end
0x0B - corrupted-icache-start
0x0C - corrupted-icache-end
0x0D - corrupted-tlb-start
0x0E - corrupted-tlb-end
All RTAS parameters use proper big-endian formatting (cpu_to_be32()).
Robust status-handling, printk-based diagnostics, and thorough
validation for invalid or unsupported conditions.
Error-specific buffer population logic is factored into helpers for
clarity and maintainability.
Fully tested on PowerVM with firmware that supports RTAS error
injection, along with the companion QEMU support posted here:
https://lore.kernel.org/all/20251029150618.186803-1-nnmlinux@linux.ibm.com/
Signed-off-by: Narayana Murty N <nnmlinux@...ux.ibm.com>
---
Change Log:
RFC v0: https://lore.kernel.org/all/20251107091009.43034-1-nnmlinux@linux.ibm.com/
* Fixed PAPR ibm,open-errinjct output format (token,status order)
* Added pr_fmt handling for EEH subsystem compatibility
* Split into logical 4-patch series
* Addressed all review feedback on validation + rtas.c
Narayana Murty N (4):
powerpc/rtas: Handle special return format for
RTAS_FN_IBM_OPEN_ERRINJCT
powerpc/pseries: Add RTAS error injection buffer infrastructure
powerpc/pseries: Add RTAS error injection validation helpers
powerpc/pseries: Implement RTAS error injection via
pseries_eeh_err_inject
arch/powerpc/include/asm/rtas.h | 21 +
arch/powerpc/include/uapi/asm/eeh.h | 18 +
arch/powerpc/kernel/rtas.c | 42 +-
arch/powerpc/platforms/pseries/eeh_pseries.c | 428 ++++++++++++++++++-
4 files changed, 485 insertions(+), 24 deletions(-)
--
2.51.1
Powered by blists - more mailing lists