[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <ff20aa57-9858-4cca-8709-51cbb67037d5@amd.com>
Date: Mon, 11 Aug 2025 18:51:56 -0500
From: "Moger, Babu" <babu.moger@....com>
To: Reinette Chatre <reinette.chatre@...el.com>, corbet@....net,
tony.luck@...el.com, james.morse@....com, tglx@...utronix.de,
mingo@...hat.com, bp@...en8.de, dave.hansen@...ux.intel.com
Cc: Dave.Martin@....com, x86@...nel.org, hpa@...or.com,
akpm@...ux-foundation.org, paulmck@...nel.org, rostedt@...dmis.org,
Neeraj.Upadhyay@....com, david@...hat.com, arnd@...db.de, fvdl@...gle.com,
seanjc@...gle.com, jpoimboe@...nel.org, pawan.kumar.gupta@...ux.intel.com,
xin@...or.com, manali.shukla@....com, tao1.su@...ux.intel.com,
sohil.mehta@...el.com, kai.huang@...el.com, xiaoyao.li@...el.com,
peterz@...radead.org, xin3.li@...el.com, kan.liang@...ux.intel.com,
mario.limonciello@....com, thomas.lendacky@....com, perry.yuan@....com,
gautham.shenoy@....com, chang.seok.bae@...el.com, linux-doc@...r.kernel.org,
linux-kernel@...r.kernel.org, peternewman@...gle.com, eranian@...gle.com
Subject: Re: [PATCH v16 30/34] fs/resctrl: Introduce the interface to modify
assignments in a group
Hi Reinette,
On 7/30/2025 3:10 PM, Reinette Chatre wrote:
> Hi Babu,
>
> On 7/25/25 11:29 AM, Babu Moger wrote:
>> Enable the mbm_l3_assignments resctrl file to be used to modify counter
>> assignments of CTRL_MON and MON groups when the "mbm_event" counter
>> assignment mode is enabled.
>>
>> The assignment modifications are done in the following format:
> (needs imperative)
Sure.
>
>> <Event>:<Domain id>=<Assignment state>
>>
>> Event: A valid MBM event in the
>> /sys/fs/resctrl/info/L3_MON/event_configs directory.
>>
>> Domain ID: A valid domain ID. When writing, '*' applies the changes
>> to all domains.
>>
>> Assignment states:
>>
>> _ : Unassign a counter.
>>
>> e : Assign a counter exclusively.
>>
>> Examples:
>>
>> $ cd /sys/fs/resctrl
>> $ cat /sys/fs/resctrl/mbm_L3_assignments
>> mbm_total_bytes:0=e;1=e
>> mbm_local_bytes:0=e;1=e
>>
>> To unassign the counter associated with the mbm_total_bytes event on
>> domain 0:
>>
>> $ echo "mbm_total_bytes:0=_" > mbm_L3_assignments
>> $ cat /sys/fs/resctrl/mbm_L3_assignments
>> mbm_total_bytes:0=_;1=e
>> mbm_local_bytes:0=e;1=e
>>
>> To unassign the counter associated with the mbm_total_bytes event on
>> all the domains:
>>
>> $ echo "mbm_total_bytes:*=_" > mbm_L3_assignments
>> $ cat /sys/fs/resctrl/mbm_L3_assignments
>> mbm_total_bytes:0=_;1=_
>> mbm_local_bytes:0=e;1=e
>>
>> Signed-off-by: Babu Moger <babu.moger@....com>
>> ---
> ...
>
>> ---
>> Documentation/filesystems/resctrl.rst | 146 +++++++++++++++++++++++++-
>> fs/resctrl/internal.h | 3 +
>> fs/resctrl/monitor.c | 94 +++++++++++++++++
>> fs/resctrl/rdtgroup.c | 48 ++++++++-
>> 4 files changed, 289 insertions(+), 2 deletions(-)
>>
>> diff --git a/Documentation/filesystems/resctrl.rst b/Documentation/filesystems/resctrl.rst
>> index 0b8ce942f112..0c8701103214 100644
>> --- a/Documentation/filesystems/resctrl.rst
>> +++ b/Documentation/filesystems/resctrl.rst
>> @@ -525,7 +525,8 @@ When the "mba_MBps" mount option is used all CTRL_MON groups will also contain:
>> Event: A valid MBM event in the
>> /sys/fs/resctrl/info/L3_MON/event_configs directory.
>>
>> - Domain ID: A valid domain ID.
>> + Domain ID: A valid domain ID. When writing, '*' applies the changes
>> + to all the domains.
>>
>> Assignment states:
>>
>> @@ -542,6 +543,34 @@ When the "mba_MBps" mount option is used all CTRL_MON groups will also contain:
>> mbm_total_bytes:0=e;1=e
>> mbm_local_bytes:0=e;1=e
>>
>> + Assignments can be modified by writing to the interface.
>> +
>> + Example:
>> + To unassign the counter associated with the mbm_total_bytes event on domain 0:
> The alignment is off when looking at the generated html. What seems to be intended is that
> "Example" is some sort of heading but it ends up just being part of the sentence that follows
> and thus not apply to other examples that follow.
> It can also be "Examples" since there are more than one.
Checking it again.
>
>> + ::
>> +
>> + # echo "mbm_total_bytes:0=_" > /sys/fs/resctrl/mbm_L3_assignments
>> + # cat /sys/fs/resctrl/mbm_L3_assignments
>> + mbm_total_bytes:0=_;1=e
>> + mbm_local_bytes:0=e;1=e
>> +
>> + To unassign the counter associated with the mbm_total_bytes event on all the domains:
>> + ::
>> +
>> + # echo "mbm_total_bytes:*=_" > /sys/fs/resctrl/mbm_L3_assignments
>> + # cat /sys/fs/resctrl/mbm_L3_assignments
>> + mbm_total_bytes:0=_;1=_
>> + mbm_local_bytes:0=e;1=e
>> +
>> + To assign a counter associated with the mbm_total_bytes event on all domains in
>> + exclusive mode:
>> + ::
>> +
>> + # echo "mbm_total_bytes:*=e" > /sys/fs/resctrl/mbm_L3_assignments
>> + # cat /sys/fs/resctrl/mbm_L3_assignments
>> + mbm_total_bytes:0=e;1=e
>> + mbm_local_bytes:0=e;1=e
>> +
>> Resource allocation rules
>> -------------------------
>>
>> @@ -1577,6 +1606,121 @@ View the llc occupancy snapshot::
>> # cat /sys/fs/resctrl/p1/mon_data/mon_L3_00/llc_occupancy
>> 11234000
>>
>> +
>> +Examples on working with mbm_assign_mode
>> +========================================
>> +
>> +a. Check if MBM counter assignment mode is supported.
>> +::
>> +
>> + # mount -t resctrl resctrl /sys/fs/resctrl/
>> +
>> + # cat /sys/fs/resctrl/info/L3_MON/mbm_assign_mode
>> + [mbm_event]
>> + default
>> +
>> +The "mbm_event" mode is detected and enabled.
>> +
>> +b. Check how many assignable counters are supported.
>> +::
>> +
>> + # cat /sys/fs/resctrl/info/L3_MON/num_mbm_cntrs
>> + 0=32;1=32
>> +
>> +c. Check how many assignable counters are available for assignment in each domain.
>> +::
>> +
>> + # cat /sys/fs/resctrl/info/L3_MON/available_mbm_cntrs
>> + 0=30;1=30
>> +
>> +d. To list the default group's assign states.
>> +::
>> +
>> + # cat /sys/fs/resctrl/mbm_L3_assignments
>> + mbm_total_bytes:0=e;1=e
>> + mbm_local_bytes:0=e;1=e
>> +
>> +e. To unassign the counter associated with the mbm_total_bytes event on domain 0.
>> +::
>> +
>> + # echo "mbm_total_bytes:0=_" > /sys/fs/resctrl/mbm_L3_assignments
>> + # cat /sys/fs/resctrl/mbm_L3_assignments
>> + mbm_total_bytes:0=_;1=e
>> + mbm_local_bytes:0=e;1=e
>> +
>> +f. To unassign the counter associated with the mbm_total_bytes event on all domains.
>> +::
>> +
>> + # echo "mbm_total_bytes:*=_" > /sys/fs/resctrl/mbm_L3_assignments
>> + # cat /sys/fs/resctrl/mbm_L3_assignment
>> + mbm_total_bytes:0=_;1=_
>> + mbm_local_bytes:0=e;1=e
>> +
>> +g. To assign a counter associated with the mbm_total_bytes event on all domains in
>> +exclusive mode.
>> +::
>> +
>> + # echo "mbm_total_bytes:*=e" > /sys/fs/resctrl/mbm_L3_assignments
>> + # cat /sys/fs/resctrl/mbm_L3_assignments
>> + mbm_total_bytes:0=e;1=e
>> + mbm_local_bytes:0=e;1=e
>> +
>> +h. Read the events mbm_total_bytes and mbm_local_bytes of the default group. There is
>> +no change in reading the events with the assignment. If the event is unassigned when
>> +reading, then the read will come back as "Unassigned".
> While this example is for a single resource group the supporting text goes back
> and forth between being specific to one resource group and describing what happens
> when there are multiple resource groups (see (j)). If it is just one resource group then above is
> fine, but for multiple there are much more involved with the "unassigned". Same as what
> was mentioned during previous version.
Removed the "Unassigned" related text. Also removed texts about
multiple groups.
We already have details on "Unassigned" in mon_data section.
>
>> +::
>> +
>> + # cat /sys/fs/resctrl/mon_data/mon_L3_00/mbm_total_bytes
>> + 779247936
>> + # cat /sys/fs/resctrl/mon_data/mon_L3_00/mbm_local_bytes
>> + 765207488
>> +
>> +i. Check the event configurations.
>> +::
>> +
>> + # cat /sys/fs/resctrl/info/L3_MON/event_configs/mbm_total_bytes/event_filter
>> + local_reads,remote_reads,local_non_temporal_writes,remote_non_temporal_writes,
>> + local_reads_slow_memory,remote_reads_slow_memory,dirty_victim_writes_all
>> +
>> + # cat /sys/fs/resctrl/info/L3_MON/event_configs/mbm_local_bytes/event_filter
>> + local_reads,local_non_temporal_writes,local_reads_slow_memory
>> +
>> +j. Change the event configuration for mbm_local_bytes.
>> +::
>> +
>> + # echo "local_reads, local_non_temporal_writes, local_reads_slow_memory, remote_reads" >
>> + /sys/fs/resctrl/info/L3_MON/event_configs/mbm_local_bytes/event_filter
>> +
>> + # cat /sys/fs/resctrl/info/L3_MON/event_configs/mbm_local_bytes/event_filter
>> + local_reads,local_non_temporal_writes,local_reads_slow_memory,remote_reads
>> +
>> +This will update all (across all domains of all monitor groups) counter assignments
>> +associated with the mbm_local_bytes event.
>> +
>> +k. Now read the local event again. The first read may come back with "Unavailable"
>> +status. The subsequent read of mbm_local_bytes will display the current value.
>> +::
>> +
>> + # cat /sys/fs/resctrl/mon_data/mon_L3_00/mbm_local_bytes
>> + Unavailable
>> + # cat /sys/fs/resctrl/mon_data/mon_L3_00/mbm_local_bytes
>> + 314101
>> +
>> +l. Users have the option to go back to 'default' mbm_assign_mode if required. This can be
>> +done using the following command. Note that switching the mbm_assign_mode may reset all
>> +the MBM counters (and thus all MBM events) of all the resctrl groups.
>> +::
>> +
>> + # echo "default" > /sys/fs/resctrl/info/L3_MON/mbm_assign_mode
>> + # cat /sys/fs/resctrl/info/L3_MON/mbm_assign_mode
>> + mbm_event
>> + [default]
>> +
>> +m. Unmount the resctrl filesystem.
>> +::
>> +
>> + # umount /sys/fs/resctrl/
>> +
>> Intel RDT Errata
>> ================
>>
>> diff --git a/fs/resctrl/internal.h b/fs/resctrl/internal.h
>> index e2e3fc0c5fab..1350fc273258 100644
>> --- a/fs/resctrl/internal.h
>> +++ b/fs/resctrl/internal.h
>> @@ -418,6 +418,9 @@ int event_filter_show(struct kernfs_open_file *of, struct seq_file *seq, void *v
>> ssize_t event_filter_write(struct kernfs_open_file *of, char *buf, size_t nbytes,
>> loff_t off);
>>
>> +int resctrl_parse_mbm_assignment(struct rdt_resource *r, struct rdtgroup *rdtgrp,
>> + char *event, char *tok);
>> +
>> #ifdef CONFIG_RESCTRL_FS_PSEUDO_LOCK
>> int rdtgroup_locksetup_enter(struct rdtgroup *rdtgrp);
>>
>> diff --git a/fs/resctrl/monitor.c b/fs/resctrl/monitor.c
>> index ebc049105949..1e4f8e3bedc6 100644
>> --- a/fs/resctrl/monitor.c
>> +++ b/fs/resctrl/monitor.c
>> @@ -1311,3 +1311,97 @@ void resctrl_update_cntr_allrdtgrp(struct mon_evt *mevt)
>> rdtgroup_update_cntr_event(r, crgrp, mevt->evtid);
>> }
>> }
>> +
>> +/*
>> + * mbm_get_mon_event_by_name() - Return the mon_evt entry for the matching
>> + * event name.
>> + */
>> +static struct mon_evt *mbm_get_mon_event_by_name(struct rdt_resource *r, char *name)
>> +{
>> + struct mon_evt *mevt;
>> +
>> + for_each_mon_event(mevt) {
>> + if (mevt->rid == r->rid && mevt->enabled &&
>> + resctrl_is_mbm_event(mevt->evtid) &&
>> + !strcmp(mevt->name, name))
>> + return mevt;
>> + }
>> +
>> + return NULL;
>> +}
>> +
>> +static int rdtgroup_modify_assign_state(char *assign, struct rdt_mon_domain *d,
>> + struct rdtgroup *rdtgrp, struct mon_evt *mevt)
>> +{
>> + int ret = 0;
>> +
>> + if (!assign || strlen(assign) != 1)
>> + return -EINVAL;
>> +
>> + switch (*assign) {
>> + case 'e':
>> + ret = rdtgroup_assign_cntr_event(d, rdtgrp, mevt);
>> + break;
>> + case '_':
>> + rdtgroup_unassign_cntr_event(d, rdtgrp, mevt);
>> + break;
>> + default:
>> + ret = -EINVAL;
>> + break;
>> + }
>> +
>> + return ret;
>> +}
>> +
>> +int resctrl_parse_mbm_assignment(struct rdt_resource *r, struct rdtgroup *rdtgrp,
>> + char *event, char *tok)
>> +{
>> + struct rdt_mon_domain *d;
>> + unsigned long dom_id = 0;
>> + char *dom_str, *id_str;
>> + struct mon_evt *mevt;
>> + int ret;
>> +
>> + mevt = mbm_get_mon_event_by_name(r, event);
>> + if (!mevt) {
>> + rdt_last_cmd_printf("Invalid event %s\n", event);
>> + return -ENOENT;
> Extra space
Sure.
>
>> + }
>> +
>> +next:
>> + if (!tok || tok[0] == '\0')
>> + return 0;
>> +
>> + /* Start processing the strings for each domain */
>> + dom_str = strim(strsep(&tok, ";"));
>> +
>> + id_str = strsep(&dom_str, "=");
>> +
>> + /* Check for domain id '*' which means all domains */
>> + if (id_str && *id_str == '*') {
>> + ret = rdtgroup_modify_assign_state(dom_str, NULL, rdtgrp, mevt);
>> + if (ret)
>> + rdt_last_cmd_printf("Assign operation '%s:*=%s' failed\n",
>> + event, dom_str);
>> + return ret;
>> + } else if (!id_str || kstrtoul(id_str, 10, &dom_id)) {
>> + rdt_last_cmd_puts("Missing domain id\n");
>> + return -EINVAL;
>> + }
>> +
>> + /* Verify if the dom_id is valid */
>> + list_for_each_entry(d, &r->mon_domains, hdr.list) {
>> + if (d->hdr.id == dom_id) {
>> + ret = rdtgroup_modify_assign_state(dom_str, d, rdtgrp, mevt);
>> + if (ret) {
>> + rdt_last_cmd_printf("Assign operation '%s:%ld=%s' failed\n",
>> + event, dom_id, dom_str);
>> + return ret;
>> + }
>> + goto next;
>> + }
>> + }
>> +
>> + rdt_last_cmd_printf("Invalid domain id %ld\n", dom_id);
>> + return -EINVAL;
>> +}
>> diff --git a/fs/resctrl/rdtgroup.c b/fs/resctrl/rdtgroup.c
>> index 47716e623a9c..2d2b91cd1f67 100644
>> --- a/fs/resctrl/rdtgroup.c
>> +++ b/fs/resctrl/rdtgroup.c
>> @@ -1979,6 +1979,51 @@ static int mbm_L3_assignments_show(struct kernfs_open_file *of, struct seq_file
>> return ret;
>> }
>>
>> +static ssize_t mbm_L3_assignments_write(struct kernfs_open_file *of, char *buf,
> Please move to monitor.c
Sure.
Thanks
Babu
Powered by blists - more mailing lists