linux-kernel - Re: [PATCH v1] printk_ringbuffer: don't needlessly wrap data blocks around

lists.openwall.net		lists / announce owl-users owl-dev john-users john-dev passwdqc-users yescrypt popa3d-users / oss-security kernel-hardening musl sabotage tlsify passwords / crypt-dev xvendor / Bugtraq Full-Disclosure linux-kernel linux-netdev linux-ext4 linux-hardening linux-cve-announce PHC
Open Source and information security mailing list archives
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <02e9e8d3-3b82-4972-8b66-06065557a4f1@yandex-team.ru>
Date: Thu, 4 Sep 2025 19:50:24 +0300
From: Daniil Tatianin <d-tatianin@...dex-team.ru>
To: Petr Mladek <pmladek@...e.com>, John Ogness <john.ogness@...utronix.de>
Cc: linux-kernel@...r.kernel.org, Steven Rostedt <rostedt@...dmis.org>,
 Sergey Senozhatsky <senozhatsky@...omium.org>
Subject: Re: [PATCH v1] printk_ringbuffer: don't needlessly wrap data blocks
 around


On 9/4/25 7:33 PM, Petr Mladek wrote:
> On Thu 2025-09-04 16:04:30, John Ogness wrote:
>> On 2025-09-03, Daniil Tatianin <d-tatianin@...dex-team.ru> wrote:
>>> Previously, data blocks that perfectly fit the data ring buffer would
>>> get wrapped around to the beginning for no reason since the calculated
>>> offset of the next data block would belong to the next wrap. Since this
>>> offset is not actually part of the data block, but rather the offset of
>>> where the next data block is going to start, there is no reason to
>>> include it when deciding whether the current block fits the buffer.
>> This is a nice catch!
>>
>> Although note that this patch avoids wasting a maximum of 8 bytes of
>> ringbuffer space. If you are interested in tackling the wasted-space
>> issue of the printk ringbuffer there are much larger [0] fish to catch.
>>
>> [0] https://lore.kernel.org/lkml/84y10vz7ty.fsf@jogness.linutronix.de
>>
>> My comments below...
>>
>>> diff --git a/kernel/printk/printk_ringbuffer.c b/kernel/printk/printk_ringbuffer.c
>>> index d9fb053cff67..f885ba8be5e6 100644
>>> --- a/kernel/printk/printk_ringbuffer.c
>>> +++ b/kernel/printk/printk_ringbuffer.c
>>> @@ -1002,6 +1002,18 @@ static bool desc_reserve(struct printk_ringbuffer *rb, unsigned long *id_out)
>>>   	return true;
>>>   }
>>>   
>>> +static bool same_lpos_wraps(struct prb_data_ring *data_ring,
>>> +			     unsigned long begin_lpos, unsigned long next_lpos)
>> We need a better name here since it is not actually using the value of
>> @next_lpos to check the wrap count. Perhaps inverting the return value
>> and naming it blk_lpos_wraps(). So it would be identifying if the given
>> blk_lpos values lead to a wrapping data block. Some like this:
>>
>> diff --git a/kernel/printk/printk_ringbuffer.c b/kernel/printk/printk_ringbuffer.c
>> index d9fb053cff67d..cf0fcd9b106ae 100644
>> --- a/kernel/printk/printk_ringbuffer.c
>> +++ b/kernel/printk/printk_ringbuffer.c
>> @@ -1002,6 +995,17 @@ static bool desc_reserve(struct printk_ringbuffer *rb, unsigned long *id_out)
>>   	return true;
>>   }
>>   
>> +/* Identify if given blk_lpos values represent a wrapping data block. */
>> +static bool blk_lpos_wraps(struct prb_data_ring *data_ring,
>> +			   unsigned long begin_lpos, unsigned long next_lpos)
>> +{
>> +	/*
>> +	 * Subtract one from @next_lpos since it is not actually part of this
>> +	 * data block. This allows perfectly fitting records to not wrap.
>> +	 */
>> +	return (DATA_WRAPS(data_ring, begin_lpos) != DATA_WRAPS(data_ring, next_lpos - 1));
>> +}
> Or a combination of my and this proposal: is_blk_wrapped().
>
>> +
>>   /* Determine the end of a data block. */
>>   static unsigned long get_next_lpos(struct prb_data_ring *data_ring,
>>   				   unsigned long lpos, unsigned int size)
>>
>>> +{
>>> +	/*
>>> +	 * Subtract one from next_lpos since it's not actually part of this data
>>> +	 * block. We do this to prevent perfectly fitting records from wrapping
>>> +	 * around for no reason.
>>> +	 */
>>> +	return DATA_WRAPS(data_ring, begin_lpos) ==
>>> +	       DATA_WRAPS(data_ring, next_lpos - 1);
>>> +}
>>> +
>>>   /* Determine the end of a data block. */
>>>   static unsigned long get_next_lpos(struct prb_data_ring *data_ring,
>>>   				   unsigned long lpos, unsigned int size)
>> The rest looked fine to me and also passed various private
>> tests. However, we should also update data_check_size(), since now data
>> blocks are allowed to occupy the entire data ring. Something like this:
>>
>> diff --git a/kernel/printk/printk_ringbuffer.c b/kernel/printk/printk_ringbuffer.c
>> index d9fb053cff67d..e6bdfb8237a3d 100644
>> --- a/kernel/printk/printk_ringbuffer.c
>> +++ b/kernel/printk/printk_ringbuffer.c
>> @@ -397,21 +397,14 @@ static unsigned int to_blk_size(unsigned int size)
>>    */
>>   static bool data_check_size(struct prb_data_ring *data_ring, unsigned int size)
>>   {
>> -	struct prb_data_block *db = NULL;
>> -
>>   	if (size == 0)
>>   		return true;
>>   
>>   	/*
>>   	 * Ensure the alignment padded size could possibly fit in the data
>> -	 * array. The largest possible data block must still leave room for
>> -	 * at least the ID of the next block.
>> +	 * array.
>>   	 */
>> -	size = to_blk_size(size);
>> -	if (size > DATA_SIZE(data_ring) - sizeof(db->id))
>> -		return false;
>> -
>> -	return true;
>> +	return (to_blk_size(size) <= DATA_SIZE(data_ring));
>>   }
> I hope that we would never reach this limit. A buffer for one
> message does not look practical. I originally suggested to avoid
> messages bigger than 1/4 of the buffer size ;-)
>
> That said, strictly speaking, the above change looks correct.
> I would just do it in a separate patch. The use of the full
> buffer and the limit of the maximal message are related
> but they are not the same things. Also separate patch might
> help with bisection in case of problems.

Sounds good to me.
Do you need more time for extra testing, or can I go ahead and submit a 
new series?

Since you asked, I noticed this issue when studying the code to make a 
similar lockless log ring for my hobby OS :)
I might also take a look at reducing the memory usage for the metadata 
at some point (from discussions linked in John's email).

Thanks for the quick reviews!

>
> Best Regards,
> Petr