lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <2919c400-9626-4cf7-a889-63ab50e989af@roeck-us.net>
Date: Wed, 30 Jul 2025 16:19:20 -0700
From: Guenter Roeck <linux@...ck-us.net>
To: yukuai@...nel.org, Tejun Heo <tj@...nel.org>
Cc: Josef Bacik <josef@...icpanda.com>, Jens Axboe <axboe@...nel.dk>,
 Yu Kuai <yukuai3@...wei.com>, cgroups@...r.kernel.org,
 linux-block@...r.kernel.org, linux-kernel@...r.kernel.org
Subject: Re: [PATCH 1/2] block/blk-throttle: Fix throttle slice time for SSDs

On 7/30/25 11:30, Yu Kuai wrote:
> Hi,
> 
> 在 2025/7/31 0:48, Guenter Roeck 写道:
>> Commit d61fcfa4bb18 ("blk-throttle: choose a small throtl_slice for SSD")
>> introduced device type specific throttle slices if BLK_DEV_THROTTLING_LOW
>> was enabled. Commit bf20ab538c81 ("blk-throttle: remove
>> CONFIG_BLK_DEV_THROTTLING_LOW") removed support for BLK_DEV_THROTTLING_LOW,
>> but left the device type specific throttle slices in place. This
>> effectively changed throttling behavior on systems with SSD which now use
>> a different and non-configurable slice time compared to non-SSD devices.
>> Practical impact is that throughput tests with low configured throttle
>> values (65536 bps) experience less than expected throughput on SSDs,
>> presumably due to rounding errors associated with the small throttle slice
>> time used for those devices. The same tests pass when setting the throttle
>> values to 65536 * 4 = 262144 bps.
>>
>> The original code sets the throttle slice time to DFL_THROTL_SLICE_HD if
>> CONFIG_BLK_DEV_THROTTLING_LOW is disabled. Restore that code to fix the
>> problem. With that, DFL_THROTL_SLICE_SSD is no longer necessary. Revert to
>> the original code and re-introduce DFL_THROTL_SLICE to replace both
>> DFL_THROTL_SLICE_HD and DFL_THROTL_SLICE_SSD. This effectively reverts
>> commit d61fcfa4bb18 ("blk-throttle: choose a small throtl_slice for SSD").
>>
>> After the removal of CONFIG_BLK_DEV_THROTTLING_LOW, it is no longer
>> necessary to enable block accounting, so remove the call to
>> blk_stat_enable_accounting(). With that, the track_bio_latency variable
>> is no longer used and can be deleted from struct throtl_data. Also,
>> including blk-stat.h is no longer necessary.
>>
>> While at it, also remove MAX_THROTL_SLICE since it is not used anymore.
>>
>> Fixes: bf20ab538c81 ("blk-throttle: remove CONFIG_BLK_DEV_THROTTLING_LOW")
>> Cc: Yu Kuai <yukuai3@...wei.com>
>> Cc: Tejun Heo <tj@...nel.org>
>> Signed-off-by: Guenter Roeck <linux@...ck-us.net>
>> ---
>>   block/blk-throttle.c | 15 ++-------------
>>   1 file changed, 2 insertions(+), 13 deletions(-)
>>
>> diff --git a/block/blk-throttle.c b/block/blk-throttle.c
>> index 397b6a410f9e..924d09b51b69 100644
>> --- a/block/blk-throttle.c
>> +++ b/block/blk-throttle.c
>> @@ -12,7 +12,6 @@
>>   #include <linux/blktrace_api.h>
>>   #include "blk.h"
>>   #include "blk-cgroup-rwstat.h"
>> -#include "blk-stat.h"
>>   #include "blk-throttle.h"
>>   /* Max dispatch from a group in 1 round */
>> @@ -22,9 +21,7 @@
>>   #define THROTL_QUANTUM 32
>>   /* Throttling is performed over a slice and after that slice is renewed */
>> -#define DFL_THROTL_SLICE_HD (HZ / 10)
>> -#define DFL_THROTL_SLICE_SSD (HZ / 50)
>> -#define MAX_THROTL_SLICE (HZ)
>> +#define DFL_THROTL_SLICE (HZ / 10)
>>   /* A workqueue to queue throttle related work */
>>   static struct workqueue_struct *kthrotld_workqueue;
>> @@ -45,8 +42,6 @@ struct throtl_data
>>       /* Work for dispatching throttled bios */
>>       struct work_struct dispatch_work;
>> -
>> -    bool track_bio_latency;
>>   };
>>   static void throtl_pending_timer_fn(struct timer_list *t);
>> @@ -1345,13 +1340,7 @@ static int blk_throtl_init(struct gendisk *disk)
>>           goto out;
>>       }
>> -    if (blk_queue_nonrot(q))
>> -        td->throtl_slice = DFL_THROTL_SLICE_SSD;
>> -    else
>> -        td->throtl_slice = DFL_THROTL_SLICE_HD;
>> -    td->track_bio_latency = !queue_is_mq(q);
>> -    if (!td->track_bio_latency)
>> -        blk_stat_enable_accounting(q);
>> +    td->throtl_slice = DFL_THROTL_SLICE;
>>   out:
>>       blk_mq_unquiesce_queue(disk->queue);
> This looks correct, I do missed the throtl_slice for ssd is only used with
> BLK_DEV_THROTTLING_LOW. However, I think it's better to factor the
> track_bio_latency changes into a separate patch.
> 

I had combined it because it is another left-over from bf20ab538c81 and
I don't know if enabling statistics has other side effects. But, sure,
I can split it out if that is preferred. Let's wait for feedback from
Jens and/or Tejun; I'll follow their guidance.

Thanks,
Guenter


Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ