[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <a37917d2-8ad9-4192-a3f8-9789193d53d6@intel.com>
Date: Mon, 17 Jun 2024 08:53:38 -0700
From: Reinette Chatre <reinette.chatre@...el.com>
To: Dave Martin <Dave.Martin@....com>
CC: <x86@...nel.org>, <linux-kernel@...r.kernel.org>, James Morse
<james.morse@....com>, Fenghua Yu <fenghua.yu@...el.com>, Thomas Gleixner
<tglx@...utronix.de>, Ingo Molnar <mingo@...hat.com>, Borislav Petkov
<bp@...en8.de>, H Peter Anvin <hpa@...or.com>, Babu Moger
<Babu.Moger@....com>, <shameerali.kolothum.thodi@...wei.com>, "D Scott
Phillips OS" <scott@...amperecomputing.com>, <carl@...amperecomputing.com>,
<lcherian@...vell.com>, <bobo.shaobowang@...wei.com>, "Shaopeng Tan
(Fujitsu)" <tan.shaopeng@...itsu.com>, <baolin.wang@...ux.alibaba.com>,
"Jamie Iles" <quic_jiles@...cinc.com>, Xin Hao <xhao@...ux.alibaba.com>,
Peter Newman <peternewman@...gle.com>, <dfustini@...libre.com>,
<amitsinght@...vell.com>, David Hildenbrand <david@...hat.com>, Rex Nie
<rex.nie@...uarmicro.com>
Subject: Re: [PATCH] x86/resctrl: Don't try to free nonexistent RMIDs
Hi Dave,
On 6/17/24 4:55 AM, Dave Martin wrote:
> Hi Reinette,
>
> On Fri, Jun 14, 2024 at 03:47:58PM -0700, Reinette Chatre wrote:
>> Hi Dave,
>>
>> On 6/14/24 9:08 AM, Dave Martin wrote:
>>> Commit 6791e0ea3071 ("x86/resctrl: Access per-rmid structures by
>>> index") adds logic to map individual monitoring groups into a
>>> global index space used for tracking allocated RMIDs.
>>>
>>> That patch keept the logic to ignore requests to free the default
>>
>> keept -> kept
>>
>> nitpick: I actually do not know if "that patch" gets same hate as
>> "this patch" so to avoid any potential feedback about this I'd like
>> to suggest that this is rewritten without this term. Perhaps
>> something like: "Requests to free the default RMID in free_rmid()
>> are ignored, and this works fine on x86."
>>
>>> RMID in free_rmid(), and this works fine on x86.
>>>
>
> How about recasting the first paragraph into the past tense (since it
> relates a past commit), and rewording as "Requests to free the default
> RMID continued to be ignored in free_rmid(), and this works fine on
> x86."
Please keep it in the present tense. I do not see this as relating to
a "past commit" but instead it is an existing commit responsible for
current behavior. Documentation/process/maintainer-tip.rst contains
some example changelogs created by x86 maintainers that captures their
requirements. The beginning "context" portion is always in present
tense.
>
> (I agree that "this patch" would have been ambiguous. "That patch" was
> an attempt to be clearer, but felt a bit clumsy. Naming the commit
> again felt worse, though would have been clearer. I've noticed that
> people who do not have English as their first language tend to use
> "this" and "that" a little differently from native English speakers, so
> there is probably more scope for confusion here that I like to
> assume...)
>
>>> With arm64 MPAM, there is a latent bug here: on platforms with no
>>> monitors exposed through resctrl, each control group still gets a
>>> different monitoring group ID as seen by the hardware, since the
>>> CLOSID always forms part of the monitoring group ID. This means
>>> that when removing a control group, the code may try to free this
>>> group's default monitoring group RMID for real. If there are no
>>> monitors however, the RMID tracking table rmid_ptrs[] would be a
>>> waste of memory and is never allocated, leading to a splat when a
>>> free_rmid() tries to dereference the table.
>>>
>>> One option would be to treat RMID 0 as special for every CLOSID,
>>> but this would be ugly since we still want to do bookkeeping for
>>> these monitoring group IDs when there are monitors present in the
>>> hardware.
>>>
>>> Instead, add a gating check of resctrl_arch_mon_capable() in
>>> free_rmid(), and just do nothing if the hardware doesn't have
>>> monitors.
>>>
>>> This fix mirrors the gating checks already present in
>>> mkdir_rdt_prepare_rmid_alloc() and elsewhere.
>>>
>>> No functional change on x86.
>>>
>>> Fixes: 6791e0ea3071 ("x86/resctrl: Access per-rmid structures by index")
>>> Signed-off-by: Dave Martin <Dave.Martin@....com>
>>>
>>> ---
>>>
>>> Based on v6.10-rc3.
>>>
>>> Tested on x86 (But so far for the monitors-present case.
>>
>> Tested by booting with "rdt=!cmt,!mbmtotal,!mbmlocal".
>
> Thanks (I take it that's your test, not a request to be more specific
> about mine?)
Yes, I did test it with those parameters. You are also welcome to
add:
Tested-by: Reinette Chatre <reinette.chatre@...el.com>
>
> As it happens I tested with rdt=cmt,mbmtotal,mbmlocal,l3cat,l3cdp
> (though I made no effort to exercise these features other than running
> the selftests). I can note this in the commit if you prefer.
hmmm ... those parameters should not be necessary unless the system
has those features forced off by default because of errata. Doing
functional testing on these systems via such enabling is fine
though.
Reinette
Powered by blists - more mailing lists