[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <e05d01dd-dd6f-4bb6-857f-3458b9b7159d@linux.intel.com>
Date: Thu, 26 Oct 2023 14:16:34 -0400
From: "Liang, Kan" <kan.liang@...ux.intel.com>
To: Ian Rogers <irogers@...gle.com>,
Peter Zijlstra <peterz@...radead.org>,
Ingo Molnar <mingo@...hat.com>,
Arnaldo Carvalho de Melo <acme@...nel.org>,
Mark Rutland <mark.rutland@....com>,
Alexander Shishkin <alexander.shishkin@...ux.intel.com>,
Jiri Olsa <jolsa@...nel.org>,
Namhyung Kim <namhyung@...nel.org>,
Adrian Hunter <adrian.hunter@...el.com>,
Maxime Coquelin <mcoquelin.stm32@...il.com>,
Alexandre Torgue <alexandre.torgue@...s.st.com>,
linux-kernel@...r.kernel.org, linux-perf-users@...r.kernel.org,
Edward Baker <edward.baker@...el.com>
Subject: Re: [PATCH v1 1/9] perf vendor events intel: Update
alderlake/alderlake events to v1.23
On 2023-10-25 8:31 p.m., Ian Rogers wrote:
> Update alderlake and alderlaken events from v1.21 to v1.23 adding the
> changes from:
> https://github.com/intel/perfmon/commit/8df4db9433a2aab59dbbac1a70281032d1af7734
> https://github.com/intel/perfmon/commit/846bd247c6e04acc572ca56c992e9e65852bbe63
>
> The tsx_cycles_per_elision metric is updated from PR:
> https://github.com/intel/perfmon/pull/116
>
> Signed-off-by: Ian Rogers <irogers@...gle.com>
Thanks Ian. The whole patch series looks good to me.
Reviewed-by: Kan Liang <kan.liang@...ux.intel.com>
Thanks,
Kan
> ---
> .../arch/x86/alderlake/adl-metrics.json | 2 +-
> .../arch/x86/alderlake/frontend.json | 42 ++++++++++--
> .../pmu-events/arch/x86/alderlake/memory.json | 4 +-
> .../arch/x86/alderlake/pipeline.json | 20 +++++-
> .../x86/alderlake/uncore-interconnect.json | 2 +
> .../arch/x86/alderlaken/memory.json | 4 +-
> .../arch/x86/alderlaken/pipeline.json | 16 +++++
> .../x86/alderlaken/uncore-interconnect.json | 66 +++++++++++++++++++
> tools/perf/pmu-events/arch/x86/mapfile.csv | 4 +-
> 9 files changed, 146 insertions(+), 14 deletions(-)
>
> diff --git a/tools/perf/pmu-events/arch/x86/alderlake/adl-metrics.json b/tools/perf/pmu-events/arch/x86/alderlake/adl-metrics.json
> index 8b6bed3bc766..3388b58b8f1a 100644
> --- a/tools/perf/pmu-events/arch/x86/alderlake/adl-metrics.json
> +++ b/tools/perf/pmu-events/arch/x86/alderlake/adl-metrics.json
> @@ -99,7 +99,7 @@
> },
> {
> "BriefDescription": "Number of cycles within a transaction divided by the number of elisions.",
> - "MetricExpr": "(cycles\\-t / el\\-start if has_event(cycles\\-t) else 0)",
> + "MetricExpr": "(cycles\\-t / el\\-start if has_event(el\\-start) else 0)",
> "MetricGroup": "transaction",
> "MetricName": "tsx_cycles_per_elision",
> "ScaleUnit": "1cycles / elision"
> diff --git a/tools/perf/pmu-events/arch/x86/alderlake/frontend.json b/tools/perf/pmu-events/arch/x86/alderlake/frontend.json
> index 81349100fe32..542ba4a81996 100644
> --- a/tools/perf/pmu-events/arch/x86/alderlake/frontend.json
> +++ b/tools/perf/pmu-events/arch/x86/alderlake/frontend.json
> @@ -394,31 +394,61 @@
> "Unit": "cpu_core"
> },
> {
> - "BriefDescription": "Uops not delivered by IDQ when backend of the machine is not stalled",
> + "BriefDescription": "Uops not delivered by IDQ when backend of the machine is not stalled [This event is alias to IDQ_UOPS_NOT_DELIVERED.CORE]",
> + "EventCode": "0x9c",
> + "EventName": "IDQ_BUBBLES.CORE",
> + "PublicDescription": "Counts the number of uops not delivered to by the Instruction Decode Queue (IDQ) to the back-end of the pipeline when there was no back-end stalls. This event counts for one SMT thread in a given cycle. [This event is alias to IDQ_UOPS_NOT_DELIVERED.CORE]",
> + "SampleAfterValue": "1000003",
> + "UMask": "0x1",
> + "Unit": "cpu_core"
> + },
> + {
> + "BriefDescription": "Cycles when no uops are not delivered by the IDQ when backend of the machine is not stalled [This event is alias to IDQ_UOPS_NOT_DELIVERED.CYCLES_0_UOPS_DELIV.CORE]",
> + "CounterMask": "6",
> + "EventCode": "0x9c",
> + "EventName": "IDQ_BUBBLES.CYCLES_0_UOPS_DELIV.CORE",
> + "PublicDescription": "Counts the number of cycles when no uops were delivered by the Instruction Decode Queue (IDQ) to the back-end of the pipeline when there was no back-end stalls. This event counts for one SMT thread in a given cycle. [This event is alias to IDQ_UOPS_NOT_DELIVERED.CYCLES_0_UOPS_DELIV.CORE]",
> + "SampleAfterValue": "1000003",
> + "UMask": "0x1",
> + "Unit": "cpu_core"
> + },
> + {
> + "BriefDescription": "Cycles when optimal number of uops was delivered to the back-end when the back-end is not stalled [This event is alias to IDQ_UOPS_NOT_DELIVERED.CYCLES_FE_WAS_OK]",
> + "CounterMask": "1",
> + "EventCode": "0x9c",
> + "EventName": "IDQ_BUBBLES.CYCLES_FE_WAS_OK",
> + "Invert": "1",
> + "PublicDescription": "Counts the number of cycles when the optimal number of uops were delivered by the Instruction Decode Queue (IDQ) to the back-end of the pipeline when there was no back-end stalls. This event counts for one SMT thread in a given cycle. [This event is alias to IDQ_UOPS_NOT_DELIVERED.CYCLES_FE_WAS_OK]",
> + "SampleAfterValue": "1000003",
> + "UMask": "0x1",
> + "Unit": "cpu_core"
> + },
> + {
> + "BriefDescription": "Uops not delivered by IDQ when backend of the machine is not stalled [This event is alias to IDQ_BUBBLES.CORE]",
> "EventCode": "0x9c",
> "EventName": "IDQ_UOPS_NOT_DELIVERED.CORE",
> - "PublicDescription": "Counts the number of uops not delivered to by the Instruction Decode Queue (IDQ) to the back-end of the pipeline when there was no back-end stalls. This event counts for one SMT thread in a given cycle.",
> + "PublicDescription": "Counts the number of uops not delivered to by the Instruction Decode Queue (IDQ) to the back-end of the pipeline when there was no back-end stalls. This event counts for one SMT thread in a given cycle. [This event is alias to IDQ_BUBBLES.CORE]",
> "SampleAfterValue": "1000003",
> "UMask": "0x1",
> "Unit": "cpu_core"
> },
> {
> - "BriefDescription": "Cycles when no uops are not delivered by the IDQ when backend of the machine is not stalled",
> + "BriefDescription": "Cycles when no uops are not delivered by the IDQ when backend of the machine is not stalled [This event is alias to IDQ_BUBBLES.CYCLES_0_UOPS_DELIV.CORE]",
> "CounterMask": "6",
> "EventCode": "0x9c",
> "EventName": "IDQ_UOPS_NOT_DELIVERED.CYCLES_0_UOPS_DELIV.CORE",
> - "PublicDescription": "Counts the number of cycles when no uops were delivered by the Instruction Decode Queue (IDQ) to the back-end of the pipeline when there was no back-end stalls. This event counts for one SMT thread in a given cycle.",
> + "PublicDescription": "Counts the number of cycles when no uops were delivered by the Instruction Decode Queue (IDQ) to the back-end of the pipeline when there was no back-end stalls. This event counts for one SMT thread in a given cycle. [This event is alias to IDQ_BUBBLES.CYCLES_0_UOPS_DELIV.CORE]",
> "SampleAfterValue": "1000003",
> "UMask": "0x1",
> "Unit": "cpu_core"
> },
> {
> - "BriefDescription": "Cycles when optimal number of uops was delivered to the back-end when the back-end is not stalled",
> + "BriefDescription": "Cycles when optimal number of uops was delivered to the back-end when the back-end is not stalled [This event is alias to IDQ_BUBBLES.CYCLES_FE_WAS_OK]",
> "CounterMask": "1",
> "EventCode": "0x9c",
> "EventName": "IDQ_UOPS_NOT_DELIVERED.CYCLES_FE_WAS_OK",
> "Invert": "1",
> - "PublicDescription": "Counts the number of cycles when the optimal number of uops were delivered by the Instruction Decode Queue (IDQ) to the back-end of the pipeline when there was no back-end stalls. This event counts for one SMT thread in a given cycle.",
> + "PublicDescription": "Counts the number of cycles when the optimal number of uops were delivered by the Instruction Decode Queue (IDQ) to the back-end of the pipeline when there was no back-end stalls. This event counts for one SMT thread in a given cycle. [This event is alias to IDQ_BUBBLES.CYCLES_FE_WAS_OK]",
> "SampleAfterValue": "1000003",
> "UMask": "0x1",
> "Unit": "cpu_core"
> diff --git a/tools/perf/pmu-events/arch/x86/alderlake/memory.json b/tools/perf/pmu-events/arch/x86/alderlake/memory.json
> index 73d92d5c9f9d..23d36164433f 100644
> --- a/tools/perf/pmu-events/arch/x86/alderlake/memory.json
> +++ b/tools/perf/pmu-events/arch/x86/alderlake/memory.json
> @@ -248,7 +248,7 @@
> "Unit": "cpu_core"
> },
> {
> - "BriefDescription": "Counts demand data reads that were not supplied by the L3 cache.",
> + "BriefDescription": "Counts demand data reads that were not supplied by the L3 cache. [L3_MISS_LOCAL is alias to L3_MISS]",
> "EventCode": "0xB7",
> "EventName": "OCR.DEMAND_DATA_RD.L3_MISS_LOCAL",
> "MSRIndex": "0x1a6,0x1a7",
> @@ -278,7 +278,7 @@
> "Unit": "cpu_core"
> },
> {
> - "BriefDescription": "Counts demand reads for ownership (RFO) and software prefetches for exclusive ownership (PREFETCHW) that were not supplied by the L3 cache.",
> + "BriefDescription": "Counts demand reads for ownership (RFO) and software prefetches for exclusive ownership (PREFETCHW) that were not supplied by the L3 cache. [L3_MISS_LOCAL is alias to L3_MISS]",
> "EventCode": "0xB7",
> "EventName": "OCR.DEMAND_RFO.L3_MISS_LOCAL",
> "MSRIndex": "0x1a6,0x1a7",
> diff --git a/tools/perf/pmu-events/arch/x86/alderlake/pipeline.json b/tools/perf/pmu-events/arch/x86/alderlake/pipeline.json
> index a92013cdf136..f9876bef16da 100644
> --- a/tools/perf/pmu-events/arch/x86/alderlake/pipeline.json
> +++ b/tools/perf/pmu-events/arch/x86/alderlake/pipeline.json
> @@ -238,6 +238,15 @@
> "UMask": "0x8",
> "Unit": "cpu_core"
> },
> + {
> + "BriefDescription": "Counts the number of near taken branch instructions retired.",
> + "EventCode": "0xc4",
> + "EventName": "BR_INST_RETIRED.NEAR_TAKEN",
> + "PEBS": "1",
> + "SampleAfterValue": "200003",
> + "UMask": "0xc0",
> + "Unit": "cpu_atom"
> + },
> {
> "BriefDescription": "Taken branch instructions retired.",
> "EventCode": "0xc4",
> @@ -411,6 +420,15 @@
> "UMask": "0x7e",
> "Unit": "cpu_atom"
> },
> + {
> + "BriefDescription": "Counts the number of mispredicted near taken branch instructions retired.",
> + "EventCode": "0xc5",
> + "EventName": "BR_MISP_RETIRED.NEAR_TAKEN",
> + "PEBS": "1",
> + "SampleAfterValue": "200003",
> + "UMask": "0x80",
> + "Unit": "cpu_atom"
> + },
> {
> "BriefDescription": "Number of near branch instructions retired that were mispredicted and taken.",
> "EventCode": "0xc5",
> @@ -842,7 +860,7 @@
> "Unit": "cpu_core"
> },
> {
> - "BriefDescription": "INT_MISC.UNKNOWN_BRANCH_CYCLES",
> + "BriefDescription": "Bubble cycles of BAClear (Unknown Branch).",
> "EventCode": "0xad",
> "EventName": "INT_MISC.UNKNOWN_BRANCH_CYCLES",
> "MSRIndex": "0x3F7",
> diff --git a/tools/perf/pmu-events/arch/x86/alderlake/uncore-interconnect.json b/tools/perf/pmu-events/arch/x86/alderlake/uncore-interconnect.json
> index 34fc052d00e4..8bf020a9dfa8 100644
> --- a/tools/perf/pmu-events/arch/x86/alderlake/uncore-interconnect.json
> +++ b/tools/perf/pmu-events/arch/x86/alderlake/uncore-interconnect.json
> @@ -25,6 +25,7 @@
> },
> {
> "BriefDescription": "This event is deprecated. Refer to new event UNC_ARB_REQ_TRK_REQUEST.DRD",
> + "Deprecated": "1",
> "EventCode": "0x81",
> "EventName": "UNC_ARB_DAT_REQUESTS.RD",
> "PerPkg": "1",
> @@ -33,6 +34,7 @@
> },
> {
> "BriefDescription": "This event is deprecated. Refer to new event UNC_ARB_DAT_OCCUPANCY.ALL",
> + "Deprecated": "1",
> "EventCode": "0x85",
> "EventName": "UNC_ARB_IFA_OCCUPANCY.ALL",
> "PerPkg": "1",
> diff --git a/tools/perf/pmu-events/arch/x86/alderlaken/memory.json b/tools/perf/pmu-events/arch/x86/alderlaken/memory.json
> index 37259d38a222..863a3ba2b4b2 100644
> --- a/tools/perf/pmu-events/arch/x86/alderlaken/memory.json
> +++ b/tools/perf/pmu-events/arch/x86/alderlaken/memory.json
> @@ -59,7 +59,7 @@
> "UMask": "0x1"
> },
> {
> - "BriefDescription": "Counts demand data reads that were not supplied by the L3 cache.",
> + "BriefDescription": "Counts demand data reads that were not supplied by the L3 cache. [L3_MISS_LOCAL is alias to L3_MISS]",
> "EventCode": "0xB7",
> "EventName": "OCR.DEMAND_DATA_RD.L3_MISS_LOCAL",
> "MSRIndex": "0x1a6,0x1a7",
> @@ -77,7 +77,7 @@
> "UMask": "0x1"
> },
> {
> - "BriefDescription": "Counts demand reads for ownership (RFO) and software prefetches for exclusive ownership (PREFETCHW) that were not supplied by the L3 cache.",
> + "BriefDescription": "Counts demand reads for ownership (RFO) and software prefetches for exclusive ownership (PREFETCHW) that were not supplied by the L3 cache. [L3_MISS_LOCAL is alias to L3_MISS]",
> "EventCode": "0xB7",
> "EventName": "OCR.DEMAND_RFO.L3_MISS_LOCAL",
> "MSRIndex": "0x1a6,0x1a7",
> diff --git a/tools/perf/pmu-events/arch/x86/alderlaken/pipeline.json b/tools/perf/pmu-events/arch/x86/alderlaken/pipeline.json
> index fa53ff11a509..3153bab527a9 100644
> --- a/tools/perf/pmu-events/arch/x86/alderlaken/pipeline.json
> +++ b/tools/perf/pmu-events/arch/x86/alderlaken/pipeline.json
> @@ -90,6 +90,14 @@
> "SampleAfterValue": "200003",
> "UMask": "0xf7"
> },
> + {
> + "BriefDescription": "Counts the number of near taken branch instructions retired.",
> + "EventCode": "0xc4",
> + "EventName": "BR_INST_RETIRED.NEAR_TAKEN",
> + "PEBS": "1",
> + "SampleAfterValue": "200003",
> + "UMask": "0xc0"
> + },
> {
> "BriefDescription": "This event is deprecated. Refer to new event BR_INST_RETIRED.INDIRECT",
> "Deprecated": "1",
> @@ -183,6 +191,14 @@
> "SampleAfterValue": "200003",
> "UMask": "0x7e"
> },
> + {
> + "BriefDescription": "Counts the number of mispredicted near taken branch instructions retired.",
> + "EventCode": "0xc5",
> + "EventName": "BR_MISP_RETIRED.NEAR_TAKEN",
> + "PEBS": "1",
> + "SampleAfterValue": "200003",
> + "UMask": "0x80"
> + },
> {
> "BriefDescription": "This event is deprecated. Refer to new event BR_MISP_RETIRED.INDIRECT",
> "Deprecated": "1",
> diff --git a/tools/perf/pmu-events/arch/x86/alderlaken/uncore-interconnect.json b/tools/perf/pmu-events/arch/x86/alderlaken/uncore-interconnect.json
> index 4af695a5e755..8bf020a9dfa8 100644
> --- a/tools/perf/pmu-events/arch/x86/alderlaken/uncore-interconnect.json
> +++ b/tools/perf/pmu-events/arch/x86/alderlaken/uncore-interconnect.json
> @@ -7,6 +7,56 @@
> "UMask": "0x1",
> "Unit": "ARB"
> },
> + {
> + "BriefDescription": "Each cycle counts number of any coherent request at memory controller that were issued by any core.",
> + "EventCode": "0x85",
> + "EventName": "UNC_ARB_DAT_OCCUPANCY.ALL",
> + "PerPkg": "1",
> + "UMask": "0x1",
> + "Unit": "ARB"
> + },
> + {
> + "BriefDescription": "Each cycle counts number of coherent reads pending on data return from memory controller that were issued by any core.",
> + "EventCode": "0x85",
> + "EventName": "UNC_ARB_DAT_OCCUPANCY.RD",
> + "PerPkg": "1",
> + "UMask": "0x2",
> + "Unit": "ARB"
> + },
> + {
> + "BriefDescription": "This event is deprecated. Refer to new event UNC_ARB_REQ_TRK_REQUEST.DRD",
> + "Deprecated": "1",
> + "EventCode": "0x81",
> + "EventName": "UNC_ARB_DAT_REQUESTS.RD",
> + "PerPkg": "1",
> + "UMask": "0x2",
> + "Unit": "ARB"
> + },
> + {
> + "BriefDescription": "This event is deprecated. Refer to new event UNC_ARB_DAT_OCCUPANCY.ALL",
> + "Deprecated": "1",
> + "EventCode": "0x85",
> + "EventName": "UNC_ARB_IFA_OCCUPANCY.ALL",
> + "PerPkg": "1",
> + "UMask": "0x1",
> + "Unit": "ARB"
> + },
> + {
> + "BriefDescription": "Each cycle count number of 'valid' coherent Data Read entries . Such entry is defined as valid when it is allocated till deallocation. Doesn't include prefetches [This event is alias to UNC_ARB_TRK_OCCUPANCY.RD]",
> + "EventCode": "0x80",
> + "EventName": "UNC_ARB_REQ_TRK_OCCUPANCY.DRD",
> + "PerPkg": "1",
> + "UMask": "0x2",
> + "Unit": "ARB"
> + },
> + {
> + "BriefDescription": "Number of all coherent Data Read entries. Doesn't include prefetches [This event is alias to UNC_ARB_TRK_REQUESTS.RD]",
> + "EventCode": "0x81",
> + "EventName": "UNC_ARB_REQ_TRK_REQUEST.DRD",
> + "PerPkg": "1",
> + "UMask": "0x2",
> + "Unit": "ARB"
> + },
> {
> "BriefDescription": "Each cycle counts number of all outgoing valid entries in ReqTrk. Such entry is defined as valid from its allocation in ReqTrk till deallocation. Accounts for Coherent and non-coherent traffic.",
> "EventCode": "0x80",
> @@ -15,6 +65,14 @@
> "UMask": "0x1",
> "Unit": "ARB"
> },
> + {
> + "BriefDescription": "Each cycle count number of 'valid' coherent Data Read entries . Such entry is defined as valid when it is allocated till deallocation. Doesn't include prefetches [This event is alias to UNC_ARB_REQ_TRK_OCCUPANCY.DRD]",
> + "EventCode": "0x80",
> + "EventName": "UNC_ARB_TRK_OCCUPANCY.RD",
> + "PerPkg": "1",
> + "UMask": "0x2",
> + "Unit": "ARB"
> + },
> {
> "BriefDescription": "Counts the number of coherent and in-coherent requests initiated by IA cores, processor graphic units, or LLC.",
> "EventCode": "0x81",
> @@ -22,5 +80,13 @@
> "PerPkg": "1",
> "UMask": "0x1",
> "Unit": "ARB"
> + },
> + {
> + "BriefDescription": "Number of all coherent Data Read entries. Doesn't include prefetches [This event is alias to UNC_ARB_REQ_TRK_REQUEST.DRD]",
> + "EventCode": "0x81",
> + "EventName": "UNC_ARB_TRK_REQUESTS.RD",
> + "PerPkg": "1",
> + "UMask": "0x2",
> + "Unit": "ARB"
> }
> ]
> diff --git a/tools/perf/pmu-events/arch/x86/mapfile.csv b/tools/perf/pmu-events/arch/x86/mapfile.csv
> index 57ba7e814563..c09b81d8d5e1 100644
> --- a/tools/perf/pmu-events/arch/x86/mapfile.csv
> +++ b/tools/perf/pmu-events/arch/x86/mapfile.csv
> @@ -1,6 +1,6 @@
> Family-model,Version,Filename,EventType
> -GenuineIntel-6-(97|9A|B7|BA|BF),v1.21,alderlake,core
> -GenuineIntel-6-BE,v1.21,alderlaken,core
> +GenuineIntel-6-(97|9A|B7|BA|BF),v1.23,alderlake,core
> +GenuineIntel-6-BE,v1.23,alderlaken,core
> GenuineIntel-6-(1C|26|27|35|36),v4,bonnell,core
> GenuineIntel-6-(3D|47),v28,broadwell,core
> GenuineIntel-6-56,v11,broadwellde,core
Powered by blists - more mailing lists