[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <1e719367-01ae-565a-2199-0ff7e260422b@linux.intel.com>
Date: Mon, 26 Feb 2024 11:24:16 +0200
From: Mathias Nyman <mathias.nyman@...ux.intel.com>
To: Linux regressions mailing list <regressions@...ts.linux.dev>,
Thomas Gleixner <tglx@...utronix.de>
Cc: "Christian A. Ehrhardt" <lk@...e.de>, niklas.neronin@...ux.intel.com,
Linux List Kernel Mailing <linux-kernel@...r.kernel.org>,
Greg KH <gregkh@...uxfoundation.org>, linux-usb@...r.kernel.org,
linux-x86_64@...r.kernel.org, netdev@...r.kernel.org,
Randy Dunlap <rdunlap@...radead.org>,
Mikhail Gavrilov <mikhail.v.gavrilov@...il.com>
Subject: Re: This is the fourth time I've tried to find what led to the
regression of outgoing network speed and each time I find the merge commit
8c94ccc7cd691472461448f98e2372c75849406c
On 26.2.2024 7.45, Linux regression tracking (Thorsten Leemhuis) wrote:
> On 21.02.24 14:44, Mathias Nyman wrote:
>> On 21.2.2024 1.43, Randy Dunlap wrote:
>>> On 2/20/24 15:41, Randy Dunlap wrote:
>>>> {+ tglx]
>>>> On 2/20/24 15:19, Mikhail Gavrilov wrote:
>>>>> On Mon, Feb 19, 2024 at 2:41 PM Mikhail Gavrilov
>>>>> <mikhail.v.gavrilov@...il.com> wrote:
>>>>> I spotted network performance regression and it turned out, this was
>>>>> due to the network card getting other interrupt. It is a side effect
>>>>> of commit 57e153dfd0e7a080373fe5853c5609443d97fa5a.
>>>> That's a merge commit (AFAIK, maybe not so much). The commit in
>>>> mainline is:
>>>>
>>>> commit f977f4c9301c
>>>> Author: Niklas Neronin <niklas.neronin@...ux.intel.com>
>>>> Date: Fri Dec 1 17:06:40 2023 +0200
>>>>
>>>> xhci: add handler for only one interrupt line
>>>>
>>>>> Installing irqbalance daemon did not help. Maybe someone experienced
>>>>> such a problem?
>>>>
>>>> Thomas, would you look at this, please?
>>>>
>>>> A network device and xhci (USB) driver are now sharing interrupts.
>>>> This causes a large performance decrease for the networking device.
>>
>> Short recap:
>
> Thx for that. As the 6.8 release is merely two or three weeks away while
> a fix is nowhere near in sight yet (afaics!) I start to wonder if we
> should consider a revert here and try reapplying the culprit in a later
> cycle when this problem is fixed.
I don't think reverting this series is a solution.
This isn't really about those usb xhci patches.
This is about which interrupt gets assigned to which CPU.
Mikhail got unlucky when the network adapter interrupts on that system was
assigned to CPU0, clearly a more "clogged" CPU, thus causing a drop in max
bandwidth.
Thanks
Mathias
Powered by blists - more mailing lists