lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite for Android: free password hash cracker in your pocket
[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <9ff8f11c-c7d8-d0cc-1b7d-97778bfab7b5@linaro.org>
Date:   Sun, 13 Sep 2020 08:36:25 -0500
From:   Alex Elder <elder@...aro.org>
To:     Bjorn Andersson <bjorn.andersson@...aro.org>
Cc:     davem@...emloft.net, kuba@...nel.org, evgreen@...omium.org,
        subashab@...eaurora.org, cpratapa@...eaurora.org,
        netdev@...r.kernel.org, linux-kernel@...r.kernel.org
Subject: Re: [PATCH net-next v2 3/7] net: ipa: verify reference flag values

On 9/12/20 9:25 PM, Bjorn Andersson wrote:
> On Fri 11 Sep 19:45 CDT 2020, Alex Elder wrote:
> 
>> We take a single IPA clock reference to keep the clock running until
>> we get a system suspend operation, and maintain a flag indicating
>> whether that reference has been taken.  When a suspend request
>> arrives, we drop that reference and clear the flag.
>>
>> In most places we simply set or clear the extra-reference flag.
>> Instead--primarily to catch coding errors--test the previous value
>> of the flag and report an error in the event the previous value is
>> unexpected.  And if the clock reference is already taken, don't take
>> another.
>>
>> In a couple of cases it's pretty clear atomic access is not
>> necessary and an error should never be reported.  Report these
>> anyway, conveying our surprise with an added exclamation point.
>>
>> Signed-off-by: Alex Elder <elder@...aro.org>
>> ---
>> v2: Updated to operate on a bitmap bit rather than an atomic_t.
>>
>>  drivers/net/ipa/ipa_main.c | 23 ++++++++++++++++-------
>>  1 file changed, 16 insertions(+), 7 deletions(-)
>>
>> diff --git a/drivers/net/ipa/ipa_main.c b/drivers/net/ipa/ipa_main.c
>> index 409375b96eb8f..cfdf60ded86ca 100644
>> --- a/drivers/net/ipa/ipa_main.c
>> +++ b/drivers/net/ipa/ipa_main.c
>> @@ -83,6 +83,7 @@ static void ipa_suspend_handler(struct ipa *ipa, enum ipa_irq_id irq_id)
>>  	/* Take a a single clock reference to prevent suspend.  All
>>  	 * endpoints will be resumed as a result.  This reference will
>>  	 * be dropped when we get a power management suspend request.
>> +	 * The first call activates the clock; ignore any others.
>>  	 */
>>  	if (!test_and_set_bit(IPA_FLAG_CLOCK_HELD, ipa->flags))
>>  		ipa_clock_get(ipa);
>> @@ -502,14 +503,17 @@ static void ipa_resource_deconfig(struct ipa *ipa)
>>   */
>>  static int ipa_config(struct ipa *ipa, const struct ipa_data *data)
>>  {
>> +	struct device *dev = &ipa->pdev->dev;
>>  	int ret;
>>  
>>  	/* Get a clock reference to allow initialization.  This reference
>>  	 * is held after initialization completes, and won't get dropped
>>  	 * unless/until a system suspend request arrives.
>>  	 */
>> -	__set_bit(IPA_FLAG_CLOCK_HELD, ipa->flags);
>> -	ipa_clock_get(ipa);
>> +	if (!__test_and_set_bit(IPA_FLAG_CLOCK_HELD, ipa->flags))
>> +		ipa_clock_get(ipa);
>> +	else
>> +		dev_err(dev, "suspend clock reference already taken!\n");
>>  
>>  	ipa_hardware_config(ipa);
>>  
>> @@ -544,7 +548,8 @@ static int ipa_config(struct ipa *ipa, const struct ipa_data *data)
>>  err_hardware_deconfig:
>>  	ipa_hardware_deconfig(ipa);
>>  	ipa_clock_put(ipa);
>> -	__clear_bit(IPA_FLAG_CLOCK_HELD, ipa->flags);
>> +	if (!__test_and_clear_bit(IPA_FLAG_CLOCK_HELD, ipa->flags))
>> +		dev_err(dev, "suspend clock reference already dropped!\n");
>>  
>>  	return ret;
>>  }
>> @@ -562,7 +567,8 @@ static void ipa_deconfig(struct ipa *ipa)
>>  	ipa_endpoint_deconfig(ipa);
>>  	ipa_hardware_deconfig(ipa);
>>  	ipa_clock_put(ipa);
>> -	__clear_bit(IPA_FLAG_CLOCK_HELD, ipa->flags);
>> +	if (!test_and_clear_bit(IPA_FLAG_CLOCK_HELD, ipa->flags))
> 
> Doesn't this imply that we ran with the clocks disabled, which
> presumably would have nasty side effects?

Yes.  This is one of those that I mentioned "can't happen"
but I added the check anyway.

We call ipa_config() as the last step of ipa_probe().  The inverse
of ipa_config() is ipa_deconfig(), and that is called in two cases:
- If the AP is loading firmware, it does so *after* ipa_config()
  has been called and returned success.  If firmware loading fails,
  ipa_deconfig() is called in the error path to clean up.  If we
  never reached ipa_config() in the probe function, we will never
  call ipa_deconfig() in the error path.
- If ipa_config() fails when called in ipa_probe(), it will clean
  up all changed state and return an error value.  I *assume* that
  if the ->probe function returns an error, the ->remove function
  will never be called.  So again, we will never call ipa_deconfig()
  unless ipa_config() has been called.

That's the reasoning anyway.  That being said, you make a very
good point, in that the whole purpose of checking this at all
is to catch coding errors, and a WARN() call would provide much
better information than just an error message would.

So I will plan to update this in a new version of this patch
(and series).  I'll wait until tonight or tomorrow to see if
there is any other feedback before preparing that.

Thanks a lot.

					-Alex

> This seems like something that is worthy of more than just a simple
> printout - which no one will actually read.  If you instead use a
> WARN_ON() to highlight this at least some of the test environments out
> there will pick it up and report it...
> 
> Regards,
> Bjorn
> 
>> +		dev_err(&ipa->pdev->dev, "no suspend clock reference\n");
>>  }
>>  
>>  static int ipa_firmware_load(struct device *dev)
>> @@ -913,7 +919,8 @@ static int ipa_suspend(struct device *dev)
>>  	struct ipa *ipa = dev_get_drvdata(dev);
>>  
>>  	ipa_clock_put(ipa);
>> -	__clear_bit(IPA_FLAG_CLOCK_HELD, ipa->flags);
>> +	if (!test_and_clear_bit(IPA_FLAG_CLOCK_HELD, ipa->flags))
>> +		dev_err(dev, "suspend: missing suspend clock reference\n");
>>  
>>  	return 0;
>>  }
>> @@ -933,8 +940,10 @@ static int ipa_resume(struct device *dev)
>>  	/* This clock reference will keep the IPA out of suspend
>>  	 * until we get a power management suspend request.
>>  	 */
>> -	__set_bit(IPA_FLAG_CLOCK_HELD, ipa->flags);
>> -	ipa_clock_get(ipa);
>> +	if (!test_and_set_bit(IPA_FLAG_CLOCK_HELD, ipa->flags))
>> +		ipa_clock_get(ipa);
>> +	else
>> +		dev_err(dev, "resume: duplicate suspend clock reference\n");
>>  
>>  	return 0;
>>  }
>> -- 
>> 2.20.1
>>

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ