[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <bab8164c-f74f-44fe-ac3e-c078d30ab1e4@leemhuis.info>
Date: Fri, 22 Sep 2023 11:50:09 +0200
From: "Linux regression tracking (Thorsten Leemhuis)"
<regressions@...mhuis.info>
To: Bagas Sanjaya <bagasdotme@...il.com>,
Martin Zaharinov <micron10@...il.com>
Cc: Eric Dumazet <edumazet@...gle.com>, Paolo Abeni <pabeni@...hat.com>,
netdev <netdev@...r.kernel.org>, patchwork-bot+netdevbpf@...nel.org,
Jakub Kicinski <kuba@...nel.org>,
Stephen Hemminger <stephen@...workplumber.org>, kuba+netdrv@...nel.org,
dsahern@...il.com, Florian Westphal <fw@...len.de>,
Pablo Neira Ayuso <pablo@...filter.org>,
Wangyang Guo <wangyang.guo@...el.com>,
Arjan Van De Ven <arjan.van.de.ven@...el.com>,
Thomas Gleixner <tglx@...utronix.de>,
Linux Regressions <regressions@...ts.linux.dev>
Subject: Re: Urgent Bug Report Kernel crash 6.5.2
On 22.09.23 05:06, Bagas Sanjaya wrote:
> On Thu, Sep 21, 2023 at 11:13:55AM +0300, Martin Zaharinov wrote:
>>
>> Its not easy to make this on production, have too many users on it.
>>
>> i make checks and find with kernel 6.3.12-6.5.13 all is fine.
>> on first machine that i have with kernel 6.4 and still work run kernel 6.4.2 and have problem.
This is confusing and hard to follow. You want to describe more
carefully which kernels worked (avoid ranges, as I doubt you have tested
everything between 6.3.12-6.5.13) and try to avoid complexity (you seem
to have two machines? if everything works on one, don't even bring it up
except maybe as a side note)
>> in my investigation problem is start after migration to kernel 6.4.x
>>
>> in 6.4 kernel is add rcuref :
>>
>> https://cdn.kernel.org/pub/linux/kernel/v6.x/ChangeLog-6.4
>>
>> commit bc9d3a9f2afca189a6ae40225b6985e3c775375e
>> Author: Thomas Gleixner <tglx@...utronix.de>
>> Date: Thu Mar 23 21:55:32 2023 +0100
>>
>> net: dst: Switch to rcuref_t reference counting
>
> Is it the culprit you look for? Had you done the bisection and it points
> the culprit to that commit
Martin, if you suspect this to be the culprit try to revert it on top of
the latest kernel; if the problem then goes away it likely is the cause.
> [...]
>> but this is my thinking
>
> What do you think that above causes your regression?
>
> Confused...
>
> [To Thorsten: I'm unsure if the reporter do the bisection and suddenly he found
> the culprit commit. Should I add it to regzbot?
For now: no, things are too confusing and without knowing the culprit I
guess nobody will look into this unless we are extremely lucky.
Ciao, Thorsten
Powered by blists - more mailing lists