[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <CAEf4BzYKArxcAFSLuOSeMNsi0M9y1LuDPmQhTEhGvqkRg4BzSQ@mail.gmail.com>
Date: Fri, 29 Mar 2024 09:27:48 -0700
From: Andrii Nakryiko <andrii.nakryiko@...il.com>
To: Sandipan Das <sandipan.das@....com>
Cc: Andrii Nakryiko <andrii@...nel.org>, bpf@...r.kernel.org, linux-kernel@...r.kernel.org,
jolsa@...nel.org, song@...nel.org, kernel-team@...a.com, x86@...nel.org,
peterz@...radead.org, mingo@...hat.com, tglx@...utronix.de
Subject: Re: [PATCH v2] perf/x86/amd: support capturing LBR from software events
On Fri, Mar 29, 2024 at 2:29 AM Sandipan Das <sandipan.das@....com> wrote:
>
> On 3/28/2024 7:03 PM, Andrii Nakryiko wrote:
> > [0] added ability to capture LBR (Last Branch Records) on Intel CPUs
> > from inside BPF program at pretty much any arbitrary point. This is
> > extremely useful capability that allows to figure out otherwise
> > hard-to-debug problems, because LBR is now available based on some
> > application-defined conditions, not just hardware-supported events.
> >
> > retsnoop ([1]) is one such tool that takes a huge advantage of this
> > functionality and has proved to be an extremely useful tool in
> > practice.
> >
> > Now, AMD Zen4 CPUs got support for similar LBR functionality, but
> > necessary wiring inside the kernel is not yet setup. This patch seeks to
> > rectify this and follows a similar approach to the original patch [0]
> > for Intel CPUs.
> >
> > Given LBR can be set up to capture any indirect jumps, it's critical to
> > minimize indirect jumps on the way to requesting LBR from BPF program,
> > so we split amd_pmu_lbr_disable_all() into a wrapper with some generic
> > conditions vs always-inlined __amd_pmu_lbr_disable() called directly
> > from BPF subsystem (through perf_snapshot_branch_stack static call).
> >
> > Now that it's possible to capture LBR on AMD CPU from BPF at arbitrary
> > point, there is no reason to artificially limit this feature to sampling
> > events. So corresponding check is removed. AFAIU, there is no
> > correctness implications of doing this (and it was possible to bypass
> > this check by just setting perf_event's sample_period to 1 anyways, so
> > it doesn't guard all that much).
> >
> > This was tested on AMD Bergamo CPU and worked well when utilized from
> > the aforementioned retsnoop tool.
> >
> > [0] https://lore.kernel.org/bpf/20210910183352.3151445-2-songliubraving@fb.com/
> > [1] https://github.com/anakryiko/retsnoop
> >
> > Signed-off-by: Andrii Nakryiko <andrii@...nel.org>
>
> Changes look good to me.
>
> Reviewed-by: Sandipan Das <sandipan.das@....com>
Thanks for your review! Will split out the check removal, as Ingo
asked, and will post v3, thanks!
>
> > ---
> > arch/x86/events/amd/core.c | 29 ++++++++++++++++++++++++++++-
> > arch/x86/events/amd/lbr.c | 11 +----------
> > arch/x86/events/perf_event.h | 11 +++++++++++
> > 3 files changed, 40 insertions(+), 11 deletions(-)
> >
[...]
Powered by blists - more mailing lists