Message-Id: <20211015172132.1162559-1-irogers@google.com>
Date: Fri, 15 Oct 2021 10:21:11 -0700
From: Ian Rogers <irogers@...gle.com>
To: Andi Kleen <ak@...ux.intel.com>, Jiri Olsa <jolsa@...hat.com>,
Jin Yao <yao.jin@...ux.intel.com>,
Namhyung Kim <namhyung@...nel.org>,
John Garry <john.garry@...wei.com>,
Kajol Jain <kjain@...ux.ibm.com>,
"Paul A . Clarke" <pc@...ibm.com>,
Arnaldo Carvalho de Melo <acme@...nel.org>,
Riccardo Mancini <rickyman7@...il.com>,
Kan Liang <kan.liang@...ux.intel.com>,
Peter Zijlstra <peterz@...radead.org>,
Ingo Molnar <mingo@...hat.com>,
Mark Rutland <mark.rutland@....com>,
Alexander Shishkin <alexander.shishkin@...ux.intel.com>,
Kees Cook <keescook@...omium.org>,
Sami Tolvanen <samitolvanen@...gle.com>,
Nick Desaulniers <ndesaulniers@...gle.com>,
Andrew Morton <akpm@...ux-foundation.org>,
Jacob Keller <jacob.e.keller@...el.com>,
Zhen Lei <thunder.leizhen@...wei.com>,
ToastC <mrtoastcheng@...il.com>,
Joakim Zhang <qiangqing.zhang@....com>,
Felix Fietkau <nbd@....name>,
Jiapeng Chong <jiapeng.chong@...ux.alibaba.com>,
Song Liu <songliubraving@...com>, Fabian Hemmer <copy@...y.sh>,
Alexander Antonov <alexander.antonov@...ux.intel.com>,
Nicholas Fraser <nfraser@...eweavers.com>,
Adrian Hunter <adrian.hunter@...el.com>,
Denys Zagorui <dzagorui@...co.com>,
Wan Jiabing <wanjiabing@...o.com>,
Thomas Richter <tmricht@...ux.ibm.com>,
Sumanth Korikkar <sumanthk@...ux.ibm.com>,
Heiko Carstens <hca@...ux.ibm.com>,
Changbin Du <changbin.du@...el.com>,
linux-kernel@...r.kernel.org, linux-perf-users@...r.kernel.org,
Andrew Kilroy <andrew.kilroy@....com>
Cc: Stephane Eranian <eranian@...gle.com>,
Ian Rogers <irogers@...gle.com>
Subject: [PATCH v2 00/21] perf metric: Fixes and allow modifiers
There are 4 main changes in this patch set:
- perf metric: Modify resolution and recursion check.
- perf parse-events: Add new "metric-id" term.
- perf metrics: Modify setup and deduplication
- perf metric: Allow modifiers on metrics.
In overview, the changes first simplify the metric code, then fix
various bugs, and finally add a new feature that allows modifiers on
metrics, for example:

$ perf stat -M IPC:u,IPC:k -a sleep 1

 Performance counter stats for 'system wide':

        93,269,988      inst_retired.any:k           #    0.26 IPC:k
       352,037,460      cpu_clk_unhalted.thread:k
        70,317,865      inst_retired.any:u           #    0.76 IPC:u
        92,762,220      cpu_clk_unhalted.thread:u

       1.003754577 seconds time elapsed

The previous complexity came from using evsel->name as the identifier
for events in metrics. That name isn't stable and has issues around
wildcard expansion. These changes fix this by adding a dedicated
metric_id to evsels, deduplicating IDs before event parsing, and no
longer handling all evsels on a single evlist.
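
As a rough illustration of deduplicating IDs before anything is parsed,
here is a minimal sketch. It is not the code in this series: dedup_ids()
and its array-based interface are hypothetical, and a real
implementation would likely use a hash map rather than a quadratic scan.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/*
 * Hypothetical helper, not the perf implementation: copy each distinct
 * ID from ids[] into out[] (which must have room for n entries),
 * preserving first-seen order, and return the number of distinct IDs.
 */
size_t dedup_ids(const char *const *ids, size_t n, const char **out)
{
	size_t n_out = 0;

	assert(ids && out);
	for (size_t i = 0; i < n; i++) {
		size_t j;

		/* Look for an earlier copy of this ID. */
		for (j = 0; j < n_out; j++) {
			if (!strcmp(ids[i], out[j]))
				break;
		}
		if (j == n_out)	/* not seen before, keep it */
			out[n_out++] = ids[i];
	}
	return n_out;
}
```

Only the IDs left in out[] would then be turned into an event string and
parsed, so an event shared by several metrics is opened once.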
The recursion check and the metric_ref logic are simplified: the former
by moving data from the heap to the stack, the latter by building the
metric_refs in an array rather than a linked list. This logic is
integral to metric setup, and simplifying it makes the effects of the
changes easier to follow, in particular because fewer structs are
maintained.
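
To show what a stack-based recursion check can look like, here is a
small sketch. It is an assumption-laden toy, not the code from the
patches: struct toy_metric, struct visited and resolve() are made-up
names, and a name that is not in the table is simply treated as a plain
event.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical metric: a name plus the IDs its expression references. */
struct toy_metric {
	const char *name;
	const char *refs[4];	/* unused slots are NULL */
};

/* One link per metric currently being resolved; lives on the stack. */
struct visited {
	const char *name;
	const struct visited *parent;
};

static const struct toy_metric *find_metric(const struct toy_metric *tbl,
					    size_t n, const char *name)
{
	for (size_t i = 0; i < n; i++) {
		if (!strcmp(tbl[i].name, name))
			return &tbl[i];
	}
	return NULL;
}

/* Return 0 if 'name' resolves, -1 if it (indirectly) references itself. */
int resolve(const struct toy_metric *tbl, size_t n, const char *name,
	    const struct visited *parent)
{
	const struct visited self = { .name = name, .parent = parent };
	const struct toy_metric *m;

	assert(tbl && name);
	m = find_metric(tbl, n, name);
	if (!m)
		return 0;	/* not a metric: treat as a plain event */

	/* Walk the chain of callers; seeing our own name means recursion. */
	for (const struct visited *v = parent; v; v = v->parent) {
		if (!strcmp(v->name, name))
			return -1;
	}
	for (size_t i = 0; i < 4 && m->refs[i]; i++) {
		if (resolve(tbl, n, m->refs[i], &self))
			return -1;
	}
	return 0;
}
```

Because each link is a local variable in the caller's frame, nothing has
to be allocated or freed while walking the metric graph.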
Event parsing is modified to allow qualifiers on kernel PMU events.
This is necessary to allow the metric-id to be added, but it also allows
qualifiers in other cases, such as specifying a callgraph or a name.
A certain amount of commenting and const-ification is also included,
with a view to making the code more intention-revealing and its logic
easier to follow. For example, the pmu event tables should never change
and it would be a bug if they ever did, so it is strange to access them
through non-const pointers.
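
The const-ification idea can be shown with a toy table. The struct, the
table contents and toy_find_event() below are hypothetical and only
illustrate the pattern; they are not the generated pmu-events tables.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for an entry in a generated event table. */
struct toy_pmu_event {
	const char *name;
	const char *desc;
};

/* The table itself is const, so it can live in read-only data. */
static const struct toy_pmu_event toy_events[] = {
	{ "inst_retired.any", "illustrative description only" },
	{ "cpu_clk_unhalted.thread", "illustrative description only" },
};

/* Lookups hand out const pointers, so callers cannot modify entries. */
const struct toy_pmu_event *toy_find_event(const char *name)
{
	assert(name);
	for (size_t i = 0; i < sizeof(toy_events) / sizeof(toy_events[0]); i++) {
		if (!strcmp(toy_events[i].name, name))
			return &toy_events[i];
	}
	return NULL;
}
```

With this shape, an accidental write such as assigning to the desc
member through the returned pointer is rejected at compile time instead
of silently corrupting the table.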
The kernel's list_sort.c/h are added so that metrics can be sorted,
allowing a smaller group to reuse (deduplicate) events from a larger
one. This was previously done by inserting in size order, but that only
worked within a metric group.
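
The sort-then-reuse idea can be sketched as follows. This is only an
approximation under stated assumptions: it sorts an array with the C
library's qsort() as a stand-in for list_sort(), and struct toy_group,
ids_subset() and assign_reuse() are hypothetical names rather than
functions from the series.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical view of one metric and the event IDs it needs. */
struct toy_group {
	const char *metric;
	const char *ids[4];
	size_t nr_ids;
	const struct toy_group *reuses;	/* larger group providing our events */
};

/* Order groups so that those with the most IDs come first. */
static int cmp_larger_first(const void *pa, const void *pb)
{
	const struct toy_group *a = pa, *b = pb;

	return (b->nr_ids > a->nr_ids) - (b->nr_ids < a->nr_ids);
}

/* True if every ID needed by 'small' is also present in 'big'. */
static bool ids_subset(const struct toy_group *small,
		       const struct toy_group *big)
{
	for (size_t i = 0; i < small->nr_ids; i++) {
		bool found = false;

		for (size_t j = 0; j < big->nr_ids && !found; j++)
			found = !strcmp(small->ids[i], big->ids[j]);
		if (!found)
			return false;
	}
	return true;
}

/* Sort largest first, then point each group at an earlier superset. */
void assign_reuse(struct toy_group *groups, size_t n)
{
	assert(groups || n == 0);
	if (n == 0)
		return;
	qsort(groups, n, sizeof(*groups), cmp_larger_first);
	for (size_t i = 0; i < n; i++) {
		groups[i].reuses = NULL;
		for (size_t j = 0; j < i; j++) {
			/* Only reuse from groups that own their events. */
			if (!groups[j].reuses &&
			    ids_subset(&groups[i], &groups[j])) {
				groups[i].reuses = &groups[j];
				break;
			}
		}
	}
}
```

Sorting the whole set first is what lets a small metric find a superset
in any other metric, not just in its own metric group.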
Some of the commit messages show the TopDownL1 metrics being used on a
SkylakeX machine. These metrics were removed by commit
c4ad8fabd03f76ed3a2a4c8aef6baf6cd4f24542 ("perf vendor events: Update
metrics for SkyLake Server"), so the data was gathered with that commit
reverted.
v2:
- Add METADATA fixes to 'Add list_sort' suggested by
  Arnaldo Carvalho de Melo <acme@...nel.org>.
- Fix arm64 function declaration in 'Add const to pmu_events_map' found
  by Andrew Kilroy <andrew.kilroy@....com>.
- Add a static in 'Make pmu_event tables const' suggested by
  John Garry <john.garry@...wei.com>.
- Add acked-by: Andi Kleen <ak@...ux.intel.com>.
- Fix/handle 0 events case in 'Modify setup and deduplication'.
- Fix a missed strdup that resulted in a double free in 'Allow modifiers
  on metrics.'
Ian Rogers (21):
tools lib: Add list_sort.
perf pmu: Add const to pmu_events_map.
perf pmu: Make pmu_sys_event_tables const.
perf pmu: Make pmu_event tables const.
perf metric: Move runtime value to the expr context
perf metric: Add documentation and rename a variable.
perf metric: Add metric new and free
perf metric: Only add a referenced metric once
perf metric: Modify resolution and recursion check.
perf metric: Comment data structures.
perf metric: Document the internal 'struct metric'
perf metric: Simplify metric_refs calculation.
perf parse-events: Add const to evsel name
perf parse-events: Add new "metric-id" term.
perf parse-events: Allow config on kernel PMU events
perf metric: Encode and use metric-id as qualifier
perf expr: Add subset utility.
perf metrics: Modify setup and deduplication
perf metric: Switch fprintf to pr_err.
perf parse-events: Identify broken modifiers.
perf metric: Allow modifiers on metrics.
tools/include/linux/list_sort.h | 14 +
tools/lib/list_sort.c | 252 +++++
tools/perf/MANIFEST | 1 +
tools/perf/arch/arm64/util/pmu.c | 2 +-
tools/perf/arch/powerpc/util/header.c | 2 +-
tools/perf/check-headers.sh | 2 +
tools/perf/pmu-events/jevents.c | 6 +-
tools/perf/pmu-events/pmu-events.h | 8 +-
tools/perf/tests/expand-cgroup.c | 2 +-
tools/perf/tests/expr.c | 29 +-
tools/perf/tests/parse-metric.c | 2 +-
tools/perf/tests/pmu-events.c | 59 +-
tools/perf/util/Build | 5 +
tools/perf/util/evsel.c | 17 +
tools/perf/util/evsel.h | 2 +
tools/perf/util/expr.c | 56 +-
tools/perf/util/expr.h | 16 +-
tools/perf/util/expr.l | 6 +-
tools/perf/util/expr.y | 2 +-
tools/perf/util/metricgroup.c | 1461 ++++++++++++++-----------
tools/perf/util/metricgroup.h | 35 +-
tools/perf/util/parse-events-hybrid.c | 34 +-
tools/perf/util/parse-events-hybrid.h | 6 +-
tools/perf/util/parse-events.c | 166 +--
tools/perf/util/parse-events.h | 11 +-
tools/perf/util/parse-events.l | 18 +-
tools/perf/util/parse-events.y | 27 +-
tools/perf/util/pfm.c | 3 +-
tools/perf/util/pmu.c | 22 +-
tools/perf/util/pmu.h | 10 +-
tools/perf/util/python-ext-sources | 1 +
tools/perf/util/s390-sample-raw.c | 6 +-
tools/perf/util/stat-shadow.c | 27 +-
33 files changed, 1470 insertions(+), 840 deletions(-)
create mode 100644 tools/include/linux/list_sort.h
create mode 100644 tools/lib/list_sort.c
--
2.33.0.1079.g6e70778dc9-goog