lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20210917165500.GA1723244@bjorn-Precision-5520>
Date:   Fri, 17 Sep 2021 11:55:00 -0500
From:   Bjorn Helgaas <helgaas@...nel.org>
To:     Kai-Heng Feng <kai.heng.feng@...onical.com>
Cc:     David Airlie <airlied@...ux.ie>, Daniel Vetter <daniel@...ll.ch>,
        Maarten Lankhorst <maarten.lankhorst@...ux.intel.com>,
        mripard@...nel.org, Thomas Zimmermann <tzimmermann@...e.de>,
        "Deucher, Alexander" <alexander.deucher@....com>,
        "open list:DRM DRIVERS" <dri-devel@...ts.freedesktop.org>,
        LKML <linux-kernel@...r.kernel.org>,
        Linux PCI <linux-pci@...r.kernel.org>,
        Huacai Chen <chenhuacai@...nel.org>
Subject: Re: [PATCH] vgaarb: Use ACPI HID name to find integrated GPU

On Fri, Sep 17, 2021 at 11:49:45AM +0800, Kai-Heng Feng wrote:
> On Fri, Sep 17, 2021 at 12:38 AM Bjorn Helgaas <helgaas@...nel.org> wrote:
> >
> > [+cc Huacai, linux-pci]
> >
> > On Wed, May 19, 2021 at 09:57:23PM +0800, Kai-Heng Feng wrote:
> > > Commit 3d42f1ddc47a ("vgaarb: Keep adding VGA device in queue") assumes
> > > the first device is an integrated GPU. However, on AMD platforms an
> > > integrated GPU can have higher PCI device number than a discrete GPU.
> > >
> > > Integrated GPU on ACPI platform generally has _DOD and _DOS method, so
> > > use that as predicate to find integrated GPU. If the new strategy
> > > doesn't work, fallback to use the first device as boot VGA.
> > >
> > > Signed-off-by: Kai-Heng Feng <kai.heng.feng@...onical.com>
> > > ---
> > >  drivers/gpu/vga/vgaarb.c | 31 ++++++++++++++++++++++++++-----
> > >  1 file changed, 26 insertions(+), 5 deletions(-)
> > >
> > > diff --git a/drivers/gpu/vga/vgaarb.c b/drivers/gpu/vga/vgaarb.c
> > > index 5180c5687ee5..949fde433ea2 100644
> > > --- a/drivers/gpu/vga/vgaarb.c
> > > +++ b/drivers/gpu/vga/vgaarb.c
> > > @@ -50,6 +50,7 @@
> > >  #include <linux/screen_info.h>
> > >  #include <linux/vt.h>
> > >  #include <linux/console.h>
> > > +#include <linux/acpi.h>
> > >
> > >  #include <linux/uaccess.h>
> > >
> > > @@ -1450,9 +1451,23 @@ static struct miscdevice vga_arb_device = {
> > >       MISC_DYNAMIC_MINOR, "vga_arbiter", &vga_arb_device_fops
> > >  };
> > >
> > > +#if defined(CONFIG_ACPI)
> > > +static bool vga_arb_integrated_gpu(struct device *dev)
> > > +{
> > > +     struct acpi_device *adev = ACPI_COMPANION(dev);
> > > +
> > > +     return adev && !strcmp(acpi_device_hid(adev), ACPI_VIDEO_HID);
> > > +}
> > > +#else
> > > +static bool vga_arb_integrated_gpu(struct device *dev)
> > > +{
> > > +     return false;
> > > +}
> > > +#endif
> > > +
> > >  static void __init vga_arb_select_default_device(void)
> > >  {
> > > -     struct pci_dev *pdev;
> > > +     struct pci_dev *pdev, *found = NULL;
> > >       struct vga_device *vgadev;
> > >
> > >  #if defined(CONFIG_X86) || defined(CONFIG_IA64)
> > > @@ -1505,20 +1520,26 @@ static void __init vga_arb_select_default_device(void)
> > >  #endif
> > >
> > >       if (!vga_default_device()) {
> > > -             list_for_each_entry(vgadev, &vga_list, list) {
> > > +             list_for_each_entry_reverse(vgadev, &vga_list, list) {
> >
> > Hi Kai-Heng, do you remember why you changed the order of this list
> > traversal?
> 
> The descending order is to keep the original behavior.
> 
> Before this patch, it breaks out of the loop as early as possible, so
> the lower numbered device is picked.
> This patch makes it only break out of the loop when ACPI_VIDEO_HID
> device is found.
> So if there are more than one device that meet "cmd & (PCI_COMMAND_IO
> | PCI_COMMAND_MEMORY)", higher numbered device will be selected.
> So the traverse order reversal is to keep the original behavior.

Can you give an example of what you mean?  I don't quite follow how it
keeps the original behavior.

If we have this:

  0  PCI_COMMAND_MEMORY set   ACPI_VIDEO_HID
  1  PCI_COMMAND_MEMORY set   ACPI_VIDEO_HID

Previously we didn't look for ACPI_VIDEO_HID, so we chose 0, now we
choose 1, which seems wrong.  In the absence of other information, I
would prefer the lower-numbered device.

Or this:

  0  PCI_COMMAND_MEMORY set
  1  PCI_COMMAND_MEMORY set   ACPI_VIDEO_HID

Previously we chose 0; now we choose 1, which does seem right, but
we'd choose 1 regardless of the order.

Or this:

  0  PCI_COMMAND_MEMORY set   ACPI_VIDEO_HID
  1  PCI_COMMAND_MEMORY set

Previously we chose 0, now we still choose 0, which seems right but
again doesn't depend on the order.

The first case, where both devices are ACPI_VIDEO_HID, is the only one
where the order matters, and I suggest that we should be using the
original order, not the reversed order.

> > I guess the list_add_tail() in vga_arbiter_add_pci_device() means
> > vga_list is generally ordered with small device numbers first and
> > large ones last.
> >
> > So you pick the integrated GPU with the largest device number.  Are
> > there systems with more than one integrated GPU?  If so, I would
> > naively expect that in the absence of an indication otherwise, we'd
> > want the one with the *smallest* device number.
> 
> There's only one integrated GPU on the affected system.
> 
> The approach is to keep the list traversal in one pass.
> Is there any regression introduce by this patch?
> If that's the case, we can separate the logic and find the
> ACPI_VIDEO_HID in second pass.

No regression, I'm just looking at Huacai's VGA patches, which affect
this area.

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ