[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <Zn7umr/VHtZ/Nhcu@visitorckw-System-Product-Name>
Date: Sat, 29 Jun 2024 01:10:50 +0800
From: Kuan-Wei Chiu <visitorckw@...il.com>
To: Linux regressions mailing list <regressions@...ts.linux.dev>
Cc: akpm@...ux-foundation.org, jserv@...s.ncku.edu.tw,
linux-kernel@...r.kernel.org, lkml@....org, alexdeucher@...il.com,
Julian Sikorski <belegdol@...il.com>
Subject: Re: [PATCH 2/2] lib/sort: Optimize heapsort with double-pop variation
On Fri, Jun 28, 2024 at 05:15:15PM +0200, Linux regression tracking (Thorsten Leemhuis) wrote:
> On 20.06.24 17:36, Julian Sikorski wrote:
> >
> > it appears that this patch has caused suspend-to-idle regression:
> >
> > https://gitlab.freedesktop.org/drm/amd/-/issues/3436
> >
> > In brief, my laptop fails to suspend completely with the following error
> > in the log:
> >
> > Jun 18 12:42:20 kernel: amd_pmc AMDI0005:00: Last suspend didn't reach
> > deepest state
> >
> > Power consumption remains high enough that my battery has already
> > unexpectedly drained twice before I noticed something was off.
> > I am not entirely sure how changes to sorting function can influence
> > suspend, but it is what it is. 6.9.5 as shipped by Fedora 40 exhibits
> > the issue, 6.9.5 as shipped by Fedora with the patch reverted does not.
>
> Andrew, could you maybe help out here or point us in the direction of
> somewhat that might be able to help? I'm asking, as from a quick search
> on lore it seems Kuan-Wei Chiu has not posted anything since nearly four
> weeks and thus also did not reply to Julian's regression report.
>
> Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
> --
> Everything you wanna know about Linux kernel regression tracking:
> https://linux-regtracking.leemhuis.info/about/#tldr
> If I did something stupid, please tell me, as explained on that page.
>
> #regzbot poke
Sorry for the late reply.
I apologize for the regression caused by the patch. I am not familiar
with the power management domain. I would guess there might be some
side effects in the compare or swap functions that I was not aware of.
While reviewing the sort calls that could potentially cause the error,
I noticed that the compare function used in the sort call within
drivers/acpi/processor_idle.c might not satisfy the transitive
relation. Although I'm not sure if this is the root cause of the
problem, specifically, if there are two valid acpi_processor_cx
elements A and B, and an invalid element C, there could be a scenario
where A < B but simultaneously A = C and B = C. However, if I
understand correctly, this issue should have existed before this patch
but might not have been triggered previously. My patch might have
exposed this issue by changing the order of comparisons and swaps.
Regards,
Kuan-Wei
Powered by blists - more mailing lists