[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-Id: <28e408f0-d773-6d91-9282-92aa65ef043e@linux.vnet.ibm.com>
Date: Mon, 25 Sep 2017 17:22:01 +0530
From: Anju T Sudhakar <anju@...ux.vnet.ibm.com>
To: Michael Ellerman <mpe@...erman.id.au>
Cc: linuxppc-dev@...ts.ozlabs.org, linux-kernel@...r.kernel.org,
maddy@...ux.vnet.ibm.com, hemant@...ux.vnet.ibm.com
Subject: Re: [PATCH] powerpc/perf: Fix for core/nest imc call trace on
cpuhotplug
Hi mpe,
On Thursday 21 September 2017 10:04 AM, Michael Ellerman wrote:
> Anju T Sudhakar <anju@...ux.vnet.ibm.com> writes:
>
>> Nest/core pmu units are enabled only when it is used. A reference count is
>> maintained for the events which uses the nest/core pmu units. Currently in
>> *_imc_counters_release function a WARN() is used for notification of any
>> underflow of ref count. Replace WARN() with a pr_info since it is an overkill.
> As discussed elsewhere this is not the right solution.
>
> If it's OK for the reference count to be negative, then we shouldn't
> print anything when it is.
>
> But I don't understand how it can be OK for the refcount to be negative.
> That means someone has a negative number of references to something?
>
> cheers
>
Scenario where this happens is in a stress test where perf session is
started, followed by offlining of all cpus in a given core. And finally
terminate the perf session.
So, in cpuhotplug offline path(ppc_core_imc_cpu_offline), function set
the ref->count to zero, if the current cpu which is about to offline is
the last cpu in a given core and make an OPAL call to disable the engine
in that core.
And on perf session termination, perf->destory
(core_imc_counters_release) will first decrement the ref->count for this
core and based on the ref->count value an opal call is made to disable
the core-imc engine.
Now, since cpuhotplug path already clears the ref->count for core and
disabled the engine, perf->destroy() decrementing again at event
termination make it negative which in turn fires the WARN_ON.
So we do prefer to remove the message as this wont happen in normal
operation and the core counters are working as expected.
I will send out a patch by removing the message asap.
Thanks,
Anju
Powered by blists - more mailing lists