[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CAHk-=whvhkSk6m8_AidhofgR9nq0Md+HbNad5r1RE69tZgbv6Q@mail.gmail.com>
Date: Mon, 22 May 2023 11:23:53 -0700
From: Linus Torvalds <torvalds@...ux-foundation.org>
To: Linux regressions mailing list <regressions@...ts.linux.dev>
Cc: Benjamin Tissoires <benjamin.tissoires@...hat.com>,
Bagas Sanjaya <bagasdotme@...il.com>,
Filipe Laíns <lains@...eup.net>,
Bastien Nocera <hadess@...ess.net>,
Jiri Kosina <jikos@...nel.org>,
"open list:HID CORE LAYER" <linux-input@...r.kernel.org>,
LKML <linux-kernel@...r.kernel.org>, guy.b@...ewin.ch
Subject: Re: [regression] Since kernel 6.3.1 logitech unify receiver not
working properly
On Mon, May 22, 2023 at 5:38 AM Linux regression tracking (Thorsten
Leemhuis) <regressions@...mhuis.info> wrote:
>
> FWIW, in case anybody is interested in a status update: one reporter
> bisected the problem down to 586e8fede79 ("HID: logitech-hidpp: Retry
> commands when device is busy"); reverting that commit on-top of 6.3
> fixes the problem for that reporter. For that reporter things also work
> on 6.4-rc; but for someone else that is affected that's not the case.
Hmm. It's likely timing-dependent.
But that code is clearly buggy.
If the wait_event_timeout() returns early, the device hasn't replied,
but the code does
if (!wait_event_timeout(hidpp->wait, hidpp->answer_available,
5*HZ)) {
dbg_hid("%s:timeout waiting for response\n", __func__);
memset(response, 0, sizeof(struct hidpp_report));
ret = -ETIMEDOUT;
}
and then continues to look at the response _anyway_.
Now, depending on out hardening options, that response may have been
initialized by the compiler, or may just be random stack contents.
That bug is pre-existing (ie the problem was not introduced by that
commit), but who knows if the retry makes things worse (ie if it then
triggers on a retry, the response data will be the *previous*
response).
The whole "goto exit" games should be removed too, because we're in a
for-loop, and instead of "goto exit" it should just do "break".
IOW, something like this might be worth testing.
That said, while I think the code is buggy, I doubt this is the actual
cause of the problem people are reporting. But it would be lovely to
hear if the attached patch makes any difference, and I think this is
fixing a real - but unlikely - problem anyway.
And obviously it might be helpful to actually enable those dbg_hid()
messages, but I didn't look at what the magic config option to do so
was.
NOTE! Patch below *ENTIRELY* untested. I just looked at the code when
that commit was mentioned, and went "that's not right"...
Linus
View attachment "patch.diff" of type "text/x-patch" (1524 bytes)
Powered by blists - more mailing lists