[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <aFHABY5yTYrJ4OUw@ddadap-lakeline.nvidia.com>
Date: Tue, 17 Jun 2025 14:20:37 -0500
From: Daniel Dadap <ddadap@...dia.com>
To: Mario Limonciello <superm1@...nel.org>
Cc: Bjorn Helgaas <bhelgaas@...gle.com>,
Alex Deucher <alexander.deucher@....com>,
Christian König <christian.koenig@....com>,
David Airlie <airlied@...il.com>, Simona Vetter <simona@...ll.ch>,
Lukas Wunner <lukas@...ner.de>,
Maarten Lankhorst <maarten.lankhorst@...ux.intel.com>,
Maxime Ripard <mripard@...nel.org>,
Thomas Zimmermann <tzimmermann@...e.de>,
David Woodhouse <dwmw2@...radead.org>,
Lu Baolu <baolu.lu@...ux.intel.com>, Joerg Roedel <joro@...tes.org>,
Will Deacon <will@...nel.org>, Robin Murphy <robin.murphy@....com>,
Alex Williamson <alex.williamson@...hat.com>,
Jaroslav Kysela <perex@...ex.cz>, Takashi Iwai <tiwai@...e.com>,
"open list:DRM DRIVERS" <dri-devel@...ts.freedesktop.org>,
open list <linux-kernel@...r.kernel.org>,
"open list:INTEL IOMMU (VT-d)" <iommu@...ts.linux.dev>,
"open list:PCI SUBSYSTEM" <linux-pci@...r.kernel.org>,
"open list:VFIO DRIVER" <kvm@...r.kernel.org>,
"open list:SOUND" <linux-sound@...r.kernel.org>,
Mario Limonciello <mario.limonciello@....com>
Subject: Re: [PATCH v2 6/6] vgaarb: Look at all PCI display devices in VGA
arbiter
On Tue, Jun 17, 2025 at 12:59:10PM -0500, Mario Limonciello wrote:
> From: Mario Limonciello <mario.limonciello@....com>
>
> On a mobile system with an AMD integrated GPU + NVIDIA discrete GPU the
> AMD GPU is not being selected by some desktop environments for any
> rendering tasks. This is because neither GPU is being treated as
> "boot_vga" but that is what some environments use to select a GPU [1].
>
> The VGA arbiter driver only looks at devices that report as PCI display
> VGA class devices. Neither GPU on the system is a PCI display VGA class
> device:
>
> c5:00.0 3D controller: NVIDIA Corporation Device 2db9 (rev a1)
> c6:00.0 Display controller: Advanced Micro Devices, Inc. [AMD/ATI] Device 150e (rev d1)
>
> If the GPUs were looked at the vga_is_firmware_default() function actually
> does do a good job at recognizing the case from the device used for the
> firmware framebuffer.
>
> Modify the VGA arbiter code and matching sysfs file entries to examine all
> PCI display class devices. The existing logic stays the same.
>
> This will cause all GPUs to gain a `boot_vga` file, but the correct device
> (AMD GPU in this case) will now show `1` and the incorrect device shows `0`.
> Userspace then picks the right device as well.
>
> Link: https://github.com/robherring/libpciaccess/commit/b2838fb61c3542f107014b285cbda097acae1e12 [1]
> Suggested-by: Daniel Dadap <ddadap@...dia.com>
> Acked-by: Thomas Zimmermann <tzimmermann@...e.de>
> Signed-off-by: Mario Limonciello <mario.limonciello@....com>
> ---
> drivers/pci/pci-sysfs.c | 2 +-
> drivers/pci/vgaarb.c | 8 ++++----
> 2 files changed, 5 insertions(+), 5 deletions(-)
>
> diff --git a/drivers/pci/pci-sysfs.c b/drivers/pci/pci-sysfs.c
> index 268c69daa4d57..c314ee1b3f9ac 100644
> --- a/drivers/pci/pci-sysfs.c
> +++ b/drivers/pci/pci-sysfs.c
> @@ -1707,7 +1707,7 @@ static umode_t pci_dev_attrs_are_visible(struct kobject *kobj,
> struct device *dev = kobj_to_dev(kobj);
> struct pci_dev *pdev = to_pci_dev(dev);
>
> - if (a == &dev_attr_boot_vga.attr && pci_is_vga(pdev))
> + if (a == &dev_attr_boot_vga.attr && pci_is_display(pdev))
> return a->mode;
I can't help but worry about userspace clients that might be checking for
the presence of the boot_vga sysfs file but don't check its contents. I
understand that it's the intention to expose the file for non-VGA display
controllers in the case where none of the display controllers are of the
VGA subclass, but one of them is the boot console device and should be
considered "VGA" for the purposes of the overloaded meaning of "VGA", but
if it isn't too much trouble to minimize the change to UAPI here, I'd be
more comfortable with only exposing this file for devices that really are
VGA and/or the firmware default.
Maybe something like making the condition:
if (a == &dev_attr_boot_vga.attr) {
if (pci_is_vga(pdev) ||
(pci_is_display(pdev) && vga_default_device() == pdev))
return a->mode;
}
(maybe we don't even need the pci_is_display() check at that point? I
feel more comfortable leaving it in, though)
I'd expect that to do something like (assuming two-GPU hybrid system):
* Systems with one VGA controller and one 3D controller:
* VGA controller gets boot_vga file, contents are "1"
* 3D controller does not get boot_vga file
* Systems with no VGA controllers and two 3D controllers:
* 3D controller driving the console gets boot_vga file: "1"
* 3D controller not driving the console does not get boot_vga file
* Systems with two VGA controllers and no 3D controllers:
* VGA controller driving the console gets boot_vga file: "1"
* VGA controller not driving the console gets boot_vga file: "0"
i.e., the behavior would only be visibly different in the case with two
3D controllers, like the one targeted by this patch. You and I have seen
the two VGA controller case in the wild, so we know it exists. The one 3D
and one VGA controller case is what I'd expect to be the common one, and
hopefully this will have the same behavior before and after this change
regardless of whether a muxed system defaults to dGPU (like hybrid Mac
notebooks) or iGPU (like other hybrid systems I'm accustomed to).
>
> return 0;
> diff --git a/drivers/pci/vgaarb.c b/drivers/pci/vgaarb.c
> index 78748e8d2dbae..63216e5787d73 100644
> --- a/drivers/pci/vgaarb.c
> +++ b/drivers/pci/vgaarb.c
> @@ -1499,8 +1499,8 @@ static int pci_notify(struct notifier_block *nb, unsigned long action,
>
> vgaarb_dbg(dev, "%s\n", __func__);
>
> - /* Only deal with VGA class devices */
> - if (!pci_is_vga(pdev))
> + /* Only deal with PCI display class devices */
> + if (!pci_is_display(pdev))
> return 0;
>
> /*
> @@ -1546,12 +1546,12 @@ static int __init vga_arb_device_init(void)
>
> bus_register_notifier(&pci_bus_type, &pci_notifier);
>
> - /* Add all VGA class PCI devices by default */
> + /* Add all PCI display class devices by default */
> pdev = NULL;
> while ((pdev =
> pci_get_subsys(PCI_ANY_ID, PCI_ANY_ID, PCI_ANY_ID,
> PCI_ANY_ID, pdev)) != NULL) {
> - if (pci_is_vga(pdev))
> + if (pci_is_display(pdev))
> vga_arbiter_add_pci_device(pdev);
> }
>
> --
> 2.43.0
>
Powered by blists - more mailing lists