[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <71eb17c7-acca-412c-bd59-17ee5aa0aa07@intel.com>
Date: Tue, 12 Nov 2024 14:00:36 -0800
From: Reinette Chatre <reinette.chatre@...el.com>
To: Fenghua Yu <fenghua.yu@...el.com>, "Luck, Tony" <tony.luck@...el.com>,
Peter Newman <peternewman@...gle.com>, Jonathan Corbet <corbet@....net>,
Shuah Khan <skhan@...uxfoundation.org>, "x86@...nel.org" <x86@...nel.org>
CC: James Morse <james.morse@....com>, Jamie Iles <quic_jiles@...cinc.com>,
Babu Moger <babu.moger@....com>, Randy Dunlap <rdunlap@...radead.org>,
"Shaopeng Tan (Fujitsu)" <tan.shaopeng@...itsu.com>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
"linux-doc@...r.kernel.org" <linux-doc@...r.kernel.org>,
"patches@...ts.linux.dev" <patches@...ts.linux.dev>
Subject: Re: [PATCH v8 6/7] x86/resctrl: Add write option to "mba_MBps_event"
file
Hi Fenghua and Tony,
On 11/1/24 5:57 PM, Fenghua Yu wrote:
> Hi, Tony,
>
> On 11/1/24 16:55, Luck, Tony wrote:
>>>> + if (!strcmp(buf, "mbm_local_bytes")) {
>>>> + if (is_mbm_local_enabled())
>>>> + rdtgrp->mba_mbps_event = QOS_L3_MBM_LOCAL_EVENT_ID;
>>>> + else
>>>> + ret = -ENXIO;
>>>> + } else if (!strcmp(buf, "mbm_total_bytes")) {
>>>> + if (is_mbm_total_enabled())
>>>> + rdtgrp->mba_mbps_event = QOS_L3_MBM_TOTAL_EVENT_ID;
>>>
>>>
>>> User may think each time toggling the local/total event will effect MBA.
>>> And they may create usage case like frequently changing the events to
>>> maintain/adjust both total and local within bw boundary.
>>>
>>> But toggling mba_mbps_event faster than 1sec doesn't have any effect on
>>> MBA SC because MBA SC is called every one second.
>>>
>>> Maybe need to add a ratelimit of 1 second on calling this function? And
>>> adding info in the document that toggling speed should be slower than 1
>>> second?
>>
>> The limit would need to be per ctrl_mon group, not on calls to this function.
>> It's perfectly ok to switch multiple groups in a short interval.
>
> Agree.
>
>>
>> I'm not sure how to rate limit here. I could add a delay so that the write()
>> call blocks until enough time passes before making the change. But
>> what should I do if a user submits more writes to the file? Queue them
>> all and apply at one second intervals?
>
> Maybe define "mba_mbps_last_time" in rdtgroup. Then
>
> if (time_before(jiffies, rdtgrp->mba_mbps_last_time + HZ) {
> rdt_last_cmd_printf("Too fast (>1/s) mba_MBps event change)\n");
> rdtgroup_kn_unlock(of->kn);
> return -EAGAIN;
> }
> rdtgrp->mba_mbps_last_time = jiffies;
This seems like enforcing an unnecessary limitation. For example, this would mean
that user space needs to wait for a second to undo a change to the monitoring event
in case there was a mistake. This seems like an unnecessary restriction to me.
I am also afraid that there may be some corner cases where a write to the file and
the actual run of the overflow handler (on every domain) may not be exactly HZ apart.
Bandwidth allocation remains to be adjusted in either direction with at most the bandwidth
granularity. A user attempting to toggle bandwidth event cannot expecting large
changes in allocated bandwidth even if the events differ significantly.
Surely we can explore more if we learn about a specific use case.
>> Maybe it would be better to just to add some additional text to the
>> documentation pointing out that resctrl only checks bandwidth once
>> per second to make throttling adjustments. So changes to the event
>> will only have effect after some seconds have passed?
>
>
> Add additional text would be great.
Agreed.
Reinette
Powered by blists - more mailing lists