[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <20251117124159.3560-1-ankita@nvidia.com>
Date: Mon, 17 Nov 2025 12:41:53 +0000
From: <ankita@...dia.com>
To: <ankita@...dia.com>, <jgg@...pe.ca>, <yishaih@...dia.com>,
<skolothumtho@...dia.com>, <kevin.tian@...el.com>, <alex@...zbot.org>,
<aniketa@...dia.com>, <vsethi@...dia.com>, <mochs@...dia.com>
CC: <Yunxiang.Li@....com>, <yi.l.liu@...el.com>,
<zhangdongdong@...incomputing.com>, <avihaih@...dia.com>,
<bhelgaas@...gle.com>, <peterx@...hat.com>, <pstanner@...hat.com>,
<apopple@...dia.com>, <kvm@...r.kernel.org>, <linux-kernel@...r.kernel.org>,
<cjia@...dia.com>, <kwankhede@...dia.com>, <targupta@...dia.com>,
<zhiw@...dia.com>, <danw@...dia.com>, <dnigam@...dia.com>, <kjaju@...dia.com>
Subject: [PATCH v1 0/6] vfio/nvgrace-gpu: Support huge PFNMAP and wait
From: Ankit Agrawal <ankita@...dia.com>
NVIDIA's Grace based system have large GPU device memory. The device
memory is mapped as VM_PFNMAP in the VMM VMA. The nvgrace-gpu
module could make use of the huge PFNMAP support added in mm [1].
To achieve this, nvgrace-gpu module is updated to implement huge_fault ops.
The implementation establishes mapping according to the order request.
Note that if the PFN or the VMA address is unaligned to the order, the
mapping fallbacks to the PTE level.
Secondly, it is expected that the mapping not be re-established until
the GPU is ready post reset. Presence of the mappings during that time
could potentially leads to harmless corrected RAS events to be logged if
the CPU attempts to do speculative reads on the GPU memory.
Wait for the GPU to be ready on the first fault. The GPU readiness can
be checked through BAR0 registers as is already being done at the device
probe.
Patch 1 updates the mapping mechanism to be done through faults.
Patch 2 splits the code to map at the various levels.
Patch 3 implements support for huge pfnmap.
Path 4-6 intercepts reset request and ensures that the GP is ready
before re-establishing the mapping after reset.
Applied over 6.18-rc4.
Link: https://lore.kernel.org/all/20240826204353.2228736-1-peterx@redhat.com/ [1]
Signed-off-by: Ankit Agrawal <ankita@...dia.com>
Ankit Agrawal (6):
vfio/nvgrace-gpu: Use faults to map device memory
vfio: export function to map the VMA
vfio/nvgrace-gpu: Add support for huge pfnmap
vfio: export vfio_find_cap_start
vfio/nvgrace-gpu: split the code to wait for GPU ready
vfio/nvgrace-gpu: vfio/nvgrace-gpu: wait for the GPU mem to be ready
drivers/vfio/pci/nvgrace-gpu/main.c | 170 ++++++++++++++++++++++------
drivers/vfio/pci/vfio_pci_config.c | 3 +-
drivers/vfio/pci/vfio_pci_core.c | 46 +++++---
include/linux/vfio_pci_core.h | 3 +
4 files changed, 168 insertions(+), 54 deletions(-)
--
2.34.1
Powered by blists - more mailing lists