[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <BN9PR11MB5276AA639CD6C3D23C6C9CE48C272@BN9PR11MB5276.namprd11.prod.outlook.com>
Date: Fri, 8 Mar 2024 07:20:49 +0000
From: "Tian, Kevin" <kevin.tian@...el.com>
To: Alex Williamson <alex.williamson@...hat.com>
CC: "kvm@...r.kernel.org" <kvm@...r.kernel.org>, "eric.auger@...hat.com"
<eric.auger@...hat.com>, "clg@...hat.com" <clg@...hat.com>, "Chatre,
Reinette" <reinette.chatre@...el.com>, "linux-kernel@...r.kernel.org"
<linux-kernel@...r.kernel.org>
Subject: RE: [PATCH 1/7] vfio/pci: Disable auto-enable of exclusive INTx IRQ
> From: Alex Williamson <alex.williamson@...hat.com>
> Sent: Friday, March 8, 2024 4:17 AM
>
> On Thu, 7 Mar 2024 08:28:45 +0000
> "Tian, Kevin" <kevin.tian@...el.com> wrote:
>
> > > From: Alex Williamson <alex.williamson@...hat.com>
> > > Sent: Thursday, March 7, 2024 5:15 AM
> > >
> > > Currently for devices requiring masking at the irqchip for INTx, ie.
> > > devices without DisINTx support, the IRQ is enabled in request_irq()
> > > and subsequently disabled as necessary to align with the masked status
> > > flag. This presents a window where the interrupt could fire between
> > > these events, resulting in the IRQ incrementing the disable depth twice.
> >
> > Can you elaborate the last point about disable depth?
>
> Each irq_desc maintains a depth field, a disable increments the depth,
> an enable decrements. On the disable transition from 0 to 1 the IRQ
> chip is disabled, on the enable transition from 1 to 0 the IRQ chip is
> enabled.
>
> Therefore if an interrupt fires between request_irq() and
> disable_irq(), vfio_intx_handler() will disable the IRQ (depth 0->1).
> Note that masked is not tested here, the interrupt is expected to be
> exclusive for non-pci_2_3 devices. @masked would be redundantly set to
> true. The setup call path would increment depth to 2. The order these
> happen is not important so long as the interrupt is in-flight before
> the setup path disables the IRQ.
>
> > > This would be unrecoverable for a user since the masked flag prevents
> > > nested enables through vfio.
> >
> > What is 'nested enables'?
>
> In the case above we have masked true and disable depth 2. If the user
> now unmasks the interrupt then depth is reduced to 1, the IRQ is still
> disabled, and masked is false. The masked value is now out of sync
> with the IRQ line and prevents the user from unmasking again. The
> disable depth is stuck at 1.
>
> Nested enables would be if we allowed the user to unmask a line that we
> think is already unmasked.
Thanks! clear to me now.
>
> > > Instead, invert the logic using IRQF_NO_AUTOEN such that exclusive INTx
> > > is never auto-enabled, then unmask as required.
> > >
> > > Fixes: 89e1f7d4c66d ("vfio: Add PCI device driver")
> > > Signed-off-by: Alex Williamson <alex.williamson@...hat.com>
> >
> > But this patch looks good to me:
> >
> > Reviewed-by: Kevin Tian <kevin.tian@...el.com>
> >
> > with one nit...
> >
> > >
> > > + /*
> > > + * Devices without DisINTx support require an exclusive interrupt,
> > > + * IRQ masking is performed at the IRQ chip. The masked status is
> >
> > "exclusive interrupt, with IRQ masking performed at..."
>
> TBH, the difference is too subtle for me. With my version above you
> could replace the comma with a period, I think it has the same meaning.
> However, "...exclusive interrupt, with IRQ masking performed at..."
> almost suggests that we need a specific type of exclusive interrupt
> with this property. There's nothing unique about the exclusive
> interrupt, we could arbitrarily decide we want an exclusive interrupt
> for DisINTx masking if we wanted to frustrate a lot of users.
>
> Performing masking at the IRQ chip is actually what necessitates the
> exclusive interrupt here. Thanks,
>
make sense. and I saw you replaced the commaon with a period in patch4.
Powered by blists - more mailing lists