lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <3c7ac433-480b-4cb3-9645-4d2cc495f1a3@huawei.com>
Date: Mon, 16 Jun 2025 21:22:27 +0800
From: "zhenglifeng (A)" <zhenglifeng1@...wei.com>
To: Beata Michalska <beata.michalska@....com>, Will Deacon <will@...nel.org>
CC: <sudeep.holla@....com>, <catalin.marinas@....com>,
	<linux-arm-kernel@...ts.infradead.org>, <linux-kernel@...r.kernel.org>,
	<linuxarm@...wei.com>, <jonathan.cameron@...wei.com>,
	<viresh.kumar@...aro.org>, <yangyicong@...ilicon.com>,
	<zhanjie9@...ilicon.com>, <lihuisong@...wei.com>, <yubowen8@...wei.com>,
	<vincent.guittot@...aro.org>
Subject: Re: [PATCH] arm64: topology: Setup amu fie when cpu hotplugging

On 2025/6/13 4:49, Beata Michalska wrote:
> On Thu, Jun 12, 2025 at 04:17:36PM +0100, Will Deacon wrote:
>> [+arm topology folks]
>>
>> On Sat, Jun 07, 2025 at 05:45:33PM +0800, Lifeng Zheng wrote:
>>> Amu fie was set up by a cpufreq policy notifier after the policy was
>>> created. This caused some problems:
>>>
>>> 1. The cpus related to the same policy would all fail to set up amu fie if
>>> one of them couldn't pass the freq_counters_valid() check.
>>>
> I believe that was actually deliberate - it's all or nothing.
> On the other hand, that validation is pretty straightforward.
> Would you mind sharing some more details on the setup that triggers it?
>>> 2. The cpus fail to set up amu fie would never have a chance to set up
>>> again.
>>
> I'd say that's the consequence of 1.
>> I don't fully understand this (we don't tend to use the past tense in commit
>> messages), but it sounds like you're saying that the singleton nature of
>> the AMU driver is causing you problems with late CPU hotplug. Is that
>> correct? Can you perhaps be a bit more specific about what goes wrong and
>> how to reproduce the issue, please?

Here's the problem I met:

Assuming cpu0 and cpu1 share the same cpufreq policy and LPI(Low Power Idle
States) is on. When boot with a parameter maxcpus=1, Only cpu0 will go
online. The support amu flag of cpu0 will be set but the flag of cpu1 will
not. This will cause amu fie set up fail for cpu0 when cpufreq policy0 is
creating. After that, when hotplug cpu1 on, the support amu flag of it will
finally be set. But for cpu0 and cpu1, they'll never have the chance to set
up amu fie, even though they're eligible.

So I think the best time to set up amu fie is when the cpu is going online,
not when cpufreq policy is being created. This is what this patch does.

>>
>>> When boot with maxcpu=1 restrict, the support amu flags of the offline cpus
>>> would never be setup. After that, when cpufreq policy was being created,
>>> the online cpu might set up amu fie fail because the other cpus related to
>>> the same policy couldn't pass the freq_counters_valid() check. Hotplug the
>>> offline cpus, since the policy was already created, amu_fie_setup() would
>>> never be called again. All cpus couldn't setup amu fie in this situation.
>>>
>>> After commit 1f023007f5e7 ("arm64/amu: Use capacity_ref_freq() to set AMU
>>> ratio"), the max_freq stores in policy data is never needed when setting up
>>> amu fie.  This indicates that the setting up of amu fie does not depend on
>>> the policy any more. So each cpu can set up amu fie separately during
>>> hotplug and the problems above will be solved.
> I do not think this is the conclusion I would draw from that change.
> It aligns the AMU FIE code to use the right ratio based on the reference freq
> instead of cpuinfo.max_freq

Yes, it's the purpose of this commit. What I'm trying to say is that thanks
for this commit, cpuinfo.max_freq or even the whole policy is not needed
anymore for setting up amu fie. And this is the precondition to this patch.

>>>
>>> Signed-off-by: Lifeng Zheng <zhenglifeng1@...wei.com>
>>> ---
>>>  arch/arm64/kernel/topology.c | 56 ++++++++++++++----------------------
>>>  1 file changed, 21 insertions(+), 35 deletions(-)
>>>
>>> diff --git a/arch/arm64/kernel/topology.c b/arch/arm64/kernel/topology.c
>>> index 5d07ee85bdae..207eab4fa31f 100644
>>> --- a/arch/arm64/kernel/topology.c
>>> +++ b/arch/arm64/kernel/topology.c
>>> @@ -351,63 +351,49 @@ int arch_freq_get_on_cpu(int cpu)
>>>  	return freq;
>>>  }
>>>  
>>> -static void amu_fie_setup(const struct cpumask *cpus)
>>> +static void amu_fie_setup(unsigned int cpu)
>>>  {
>>> -	int cpu;
>>> -
>>> -	/* We are already set since the last insmod of cpufreq driver */
>>>  	if (cpumask_available(amu_fie_cpus) &&
>>> -	    unlikely(cpumask_subset(cpus, amu_fie_cpus)))
>>> +	    cpumask_test_cpu(cpu, amu_fie_cpus))
>>>  		return;
>>>  
>>> -	for_each_cpu(cpu, cpus)
>>> -		if (!freq_counters_valid(cpu))
>>> -			return;
>>> +	if (!freq_counters_valid(cpu))
>>> +		return;
>>>  
>>>  	if (!cpumask_available(amu_fie_cpus) &&
>>>  	    !zalloc_cpumask_var(&amu_fie_cpus, GFP_KERNEL)) {
>>> -		WARN_ONCE(1, "Failed to allocate FIE cpumask for CPUs[%*pbl]\n",
>>> -			  cpumask_pr_args(cpus));
>>> +		WARN_ONCE(1, "Failed to allocate FIE cpumask for CPUs[%u]\n",
>>> +			  cpu);
>>>  		return;
>>>  	}
>>>  
>>> -	cpumask_or(amu_fie_cpus, amu_fie_cpus, cpus);
>>> +	cpumask_set_cpu(cpu, amu_fie_cpus);
>>>  
>>>  	topology_set_scale_freq_source(&amu_sfd, amu_fie_cpus);
>>
>> Isn't it going to be potentially expensive to call this every time a CPU
>> comes online?
>>
> It does seem *bit excessive - I guess this should use single-CPU mask, to start
> with.

For the cpus which already setup amu fie, they won't pass the first "if" in
this function, so it's not so expensive I think. For the other cpus, I
believe it's worth a try to set.

>> Will
>>
>> [left the rest of the patch below for the folks I added]
> 
> I think I kinda get the idea, but having more details would help to clarify
> things.
> 
> ---
> BR
> Beata
>>
>>>  
>>> -	pr_debug("CPUs[%*pbl]: counters will be used for FIE.",
>>> -		 cpumask_pr_args(cpus));
>>> +	pr_debug("CPUs[%u]: counters will be used for FIE.", cpu);
>>>  }
>>>  
>>> -static int init_amu_fie_callback(struct notifier_block *nb, unsigned long val,
>>> -				 void *data)
>>> +static int cpuhp_topology_online(unsigned int cpu)
>>>  {
>>> -	struct cpufreq_policy *policy = data;
>>> -
>>> -	if (val == CPUFREQ_CREATE_POLICY)
>>> -		amu_fie_setup(policy->related_cpus);
>>> -
>>> -	/*
>>> -	 * We don't need to handle CPUFREQ_REMOVE_POLICY event as the AMU
>>> -	 * counters don't have any dependency on cpufreq driver once we have
>>> -	 * initialized AMU support and enabled invariance. The AMU counters will
>>> -	 * keep on working just fine in the absence of the cpufreq driver, and
>>> -	 * for the CPUs for which there are no counters available, the last set
>>> -	 * value of arch_freq_scale will remain valid as that is the frequency
>>> -	 * those CPUs are running at.
>>> -	 */
>>> +	amu_fie_setup(cpu);
>>>  
>>>  	return 0;
>>>  }
>>>  
>>> -static struct notifier_block init_amu_fie_notifier = {
>>> -	.notifier_call = init_amu_fie_callback,
>>> -};
>>> -
>>>  static int __init init_amu_fie(void)
>>>  {
>>> -	return cpufreq_register_notifier(&init_amu_fie_notifier,
>>> -					CPUFREQ_POLICY_NOTIFIER);
>>> +	int ret;
>>> +
>>> +	ret = cpuhp_setup_state(CPUHP_AP_ONLINE_DYN,
>>> +				"arm64/topology:online",
>>> +				cpuhp_topology_online,
>>> +				NULL);
>>> +
>>> +	if (ret < 0)
>>> +		return ret;
>>> +
>>> +	return 0;
>>>  }
>>>  core_initcall(init_amu_fie);
>>>  
>>> -- 
>>> 2.33.0
>>>
> 


Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ