[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20241113013430.GC35230@nvidia.com>
Date: Tue, 12 Nov 2024 21:34:30 -0400
From: Jason Gunthorpe <jgg@...dia.com>
To: Nicolin Chen <nicolinc@...dia.com>, tglx@...utronix.de,
alex.williamson@...hat.com
Cc: Robin Murphy <robin.murphy@....com>, maz@...nel.org,
bhelgaas@...gle.com, leonro@...dia.com,
shameerali.kolothum.thodi@...wei.com, dlemoal@...nel.org,
kevin.tian@...el.com, smostafa@...gle.com,
andriy.shevchenko@...ux.intel.com, reinette.chatre@...el.com,
eric.auger@...hat.com, ddutile@...hat.com, yebin10@...wei.com,
brauner@...nel.org, apatel@...tanamicro.com,
shivamurthy.shastri@...utronix.de, anna-maria@...utronix.de,
nipun.gupta@....com, marek.vasut+renesas@...lbox.org,
linux-arm-kernel@...ts.infradead.org, linux-kernel@...r.kernel.org,
linux-pci@...r.kernel.org, kvm@...r.kernel.org
Subject: Re: [PATCH RFCv1 0/7] vfio: Allow userspace to specify the address
for each MSI vector
On Tue, Nov 12, 2024 at 01:54:58PM -0800, Nicolin Chen wrote:
> On Mon, Nov 11, 2024 at 01:09:20PM +0000, Robin Murphy wrote:
> > On 2024-11-09 5:48 am, Nicolin Chen wrote:
> > > To solve this problem the VMM should capture the MSI IOVA allocated by the
> > > guest kernel and relay it to the GIC driver in the host kernel, to program
> > > the correct MSI IOVA. And this requires a new ioctl via VFIO.
> >
> > Once VFIO has that information from userspace, though, do we really need
> > the whole complicated dance to push it right down into the irqchip layer
> > just so it can be passed back up again? AFAICS
> > vfio_msi_set_vector_signal() via VFIO_DEVICE_SET_IRQS already explicitly
> > rewrites MSI-X vectors, so it seems like it should be pretty
> > straightforward to override the message address in general at that
> > level, without the lower layers having to be aware at all, no?
>
> Didn't see that clearly!! It works with a simple following override:
> --------------------------------------------------------------------
> @@ -497,6 +497,10 @@ static int vfio_msi_set_vector_signal(struct vfio_pci_core_device *vdev,
> struct msi_msg msg;
>
> get_cached_msi_msg(irq, &msg);
> + if (vdev->msi_iovas) {
> + msg.address_lo = lower_32_bits(vdev->msi_iovas[vector]);
> + msg.address_hi = upper_32_bits(vdev->msi_iovas[vector]);
> + }
> pci_write_msi_msg(irq, &msg);
> }
>
> --------------------------------------------------------------------
>
> With that, I think we only need one VFIO change for this part :)
Wow, is that really OK from a layering perspective? The comment is
pretty clear on the intention that this is to resync the irq layer
view of the device with the physical HW.
Editing the msi_msg while doing that resync smells bad.
Also, this is only doing MSI-X, we should include normal MSI as
well. (it probably should have a resync too?)
I'd want Thomas/Marc/Alex to agree.. (please read the cover letter for
context)
I think there are many options here we just need to get a clearer
understanding what best fits the architecture of the interrupt
subsystem.
Jason
Powered by blists - more mailing lists