[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20250812075656.GE4067720@noisy.programming.kicks-ass.net>
Date: Tue, 12 Aug 2025 09:56:56 +0200
From: Peter Zijlstra <peterz@...radead.org>
To: Yunseong Kim <ysk@...lloc.com>
Cc: Ingo Molnar <mingo@...hat.com>,
Arnaldo Carvalho de Melo <acme@...nel.org>,
Namhyung Kim <namhyung@...nel.org>,
Mark Rutland <mark.rutland@....com>,
Alexander Shishkin <alexander.shishkin@...ux.intel.com>,
Jiri Olsa <jolsa@...nel.org>, Ian Rogers <irogers@...gle.com>,
Adrian Hunter <adrian.hunter@...el.com>,
Liang Kan <kan.liang@...ux.intel.com>,
Will Deacon <will@...nel.org>, Yeoreum Yun <yeoreum.yun@....com>,
Austin Kim <austindh.kim@...il.com>,
linux-perf-users@...r.kernel.org, linux-kernel@...r.kernel.org,
syzkaller@...glegroups.com
Subject: Re: [PATCH v2] perf: Avoid undefined behavior from stopping/starting
inactive events
On Tue, Aug 12, 2025 at 01:27:22AM +0000, Yunseong Kim wrote:
> Calling pmu->start()/stop() on events in PERF_EVENT_STATE_OFF can leave
> event->hw.idx at -1, which may lead to UBSAN shift-out-of-bounds reports
> when the PMU code later shifts by a negative exponent.
Yeah, but how do we get there? I suppose there is a race somewhere?
Please describe.
> Move the state check into perf_event_throttle()/perf_event_unthrottle() so
> that inactive events are skipped entirely. This ensures only active events
> with a valid hw.idx are processed, preventing undefined behavior and
> silencing UBSAN warnings.
> The problem can be reproduced with the syzkaller reproducer:
> Link: https://lore.kernel.org/lkml/714b7ba2-693e-42e4-bce4-feef2a5e7613@kzalloc.com/
>
> Fixes: 9734e25fbf5a ("perf: Fix the throttle logic for a group")
> Cc: Mark Rutland <mark.rutland@....com>
> Signed-off-by: Yunseong Kim <ysk@...lloc.com>
> ---
> kernel/events/core.c | 6 ++++++
> 1 file changed, 6 insertions(+)
>
> diff --git a/kernel/events/core.c b/kernel/events/core.c
> index 8060c2857bb2..c9322029a8ae 100644
> --- a/kernel/events/core.c
> +++ b/kernel/events/core.c
> @@ -2665,6 +2665,9 @@ static void perf_log_itrace_start(struct perf_event *event);
>
> static void perf_event_unthrottle(struct perf_event *event, bool start)
> {
> + if (event->state <= PERF_EVENT_STATE_OFF)
> + return;
This seems wrong. We should only {,un}throttle ACTIVE events, no?
> event->hw.interrupts = 0;
> if (start)
> event->pmu->start(event, 0);
> @@ -2674,6 +2677,9 @@ static void perf_event_unthrottle(struct perf_event *event, bool start)
>
> static void perf_event_throttle(struct perf_event *event)
> {
> + if (event->state <= PERF_EVENT_STATE_OFF)
> + return;
> +
> event->hw.interrupts = MAX_INTERRUPTS;
> event->pmu->stop(event, 0);
> if (event == event->group_leader)
> --
> 2.50.0
>
Powered by blists - more mailing lists