lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <aMQzD9CLP1F01Rry@pathway.suse.cz>
Date: Fri, 12 Sep 2025 16:49:51 +0200
From: Petr Mladek <pmladek@...e.com>
To: John Ogness <john.ogness@...utronix.de>
Cc: Daniil Tatianin <d-tatianin@...dex-team.ru>,
	linux-kernel@...r.kernel.org, Steven Rostedt <rostedt@...dmis.org>,
	Sergey Senozhatsky <senozhatsky@...omium.org>
Subject: Re: [PATCH v2 0/2] printk_ringbuffer: don't needlessly wrap data
 blocks around

On Fri 2025-09-12 11:55:02, Petr Mladek wrote:
> On Fri 2025-09-12 11:25:09, Petr Mladek wrote:
> > On Thu 2025-09-11 18:18:32, John Ogness wrote:
> > > On 2025-09-11, Petr Mladek <pmladek@...e.com> wrote:
> > > > diff --git a/kernel/printk/printk_ringbuffer_kunit_test.c b/kernel/printk/printk_ringbuffer_kunit_test.c
> > > > index 2282348e869a..241f7ef49ac6 100644
> > > > --- a/kernel/printk/printk_ringbuffer_kunit_test.c
> > > > +++ b/kernel/printk/printk_ringbuffer_kunit_test.c
> > > > @@ -56,7 +56,7 @@ struct prbtest_rbdata {
> > > >  	char text[] __counted_by(size);
> > > >  };
> > > >  
> > > > -#define MAX_RBDATA_TEXT_SIZE 0x80
> > > > +#define MAX_RBDATA_TEXT_SIZE (0x256 - sizeof(struct prbtest_rbdata))
> > > 
> > > I guess this should be:
> > > 
> > > #define MAX_RBDATA_TEXT_SIZE (256  - sizeof(struct prbtest_rbdata))
> > 
> > Great catch!
> > 
> > But the KUnit test fails even with this change, see below. And I am
> > not surprised. The test should work even with larger-than-allowed
> > messages. prbtest_writer() should skip then because prb_reserve()
> > should fail.
> > 
> > Here is test result with:
> > 
> > #define MAX_RBDATA_TEXT_SIZE (256 - sizeof(struct prbtest_rbdata))
> > #define MAX_PRB_RECORD_SIZE (sizeof(struct prbtest_rbdata) + MAX_RBDATA_TEXT_SIZE)
> > 
> > DEFINE_PRINTKRB(test_rb, 4, 4);
> > 
> > and with this patchset reverted, aka, sources from
> > printk/linux.git, branch for-next:
> > 
> > It is well reproducible. It always fails after reading few records.
> > Here are results from few other runs:
> 
> And I am not longer able to reproduce it after limiting the size
> of the record to 1/4 of the data buffer size. I did it with
> the following change:
> 
> diff --git a/kernel/printk/printk_ringbuffer.c b/kernel/printk/printk_ringbuffer.c
> index bc811de18316..2f02254705aa 100644
> --- a/kernel/printk/printk_ringbuffer.c
> +++ b/kernel/printk/printk_ringbuffer.c
> @@ -398,8 +398,6 @@ static unsigned int to_blk_size(unsigned int size)
>   */
>  static bool data_check_size(struct prb_data_ring *data_ring, unsigned int size)
>  {
> -	struct prb_data_block *db = NULL;
> -
>  	if (size == 0)
>  		return true;
>  
> @@ -409,7 +407,7 @@ static bool data_check_size(struct prb_data_ring *data_ring, unsigned int size)
>  	 * at least the ID of the next block.
>  	 */
>  	size = to_blk_size(size);
> -	if (size > DATA_SIZE(data_ring) - sizeof(db->id))
> +	if (size > DATA_SIZE(data_ring) / 4)
>  		return false;
>  
>  	return true;
> 
> 
> I guess that there is a race when we need to make all existing records
> reusable when making space for the next one.

It looks to me that the reader API is not ready to handle the
situation when all records are "reusable".

At least, it looks like prb_next_seq() might end up in an "infinite"
loop because it blindly increments "seq" until finding a valid record...

Honestly, I would really like to limit the maximal record size to
1/4 of the buffer size. I do not want to make the design more
complicated just to be able to fill just one record, definitely.


That said, I still a bit nervous because I do not understand why
the KUnit test fails. It does not depend on prb_next_seq().

It seems that prb_read_valid() retuns success even when
returning a garbage. I have added some debug output using
trace_printk():

--- a/kernel/printk/printk_ringbuffer.c
+++ b/kernel/printk/printk_ringbuffer.c
@@ -1856,6 +1856,10 @@ static bool copy_data(struct prb_data_ring *data_ring,
 	data_size = min_t(unsigned int, buf_size, len);
 
 	memcpy(&buf[0], data, data_size); /* LMM(copy_data:A) */
+	if (data_ring != &prb->text_data_ring) {
+		trace_printk("  %s: reading from 0x%lx (%d bytes)\n",
+			     __func__, (unsigned long)data, data_size);
+	}
 	return true;
 }
 
@@ -1939,6 +1943,9 @@ static int prb_read(struct printk_ringbuffer *rb, u64 seq,
 	if (r->info)
 		memcpy(r->info, info, sizeof(*(r->info)));
 
+	if (rb != prb)
+		trace_printk("%s: Calling copy_data() for seq=%llu\n", __func__, seq);
+
 	/* Copy text data. If it fails, this is a data-less record. */
 	if (!copy_data(&rb->text_data_ring, &desc.text_blk_lpos, info->text_len,
 		       r->text_buf, r->text_buf_size, line_count)) {
@@ -2160,6 +2167,11 @@ static bool _prb_read_valid(struct printk_ringbuffer *rb, u64 *seq,
 		}
 	}
 
+	if (rb != prb && r && r->text_buf) {
+		trace_printk("%s: Successfully read record with seq=%llu\n",
+			     __func__, *seq);
+	}
+
 	return true;
 }
 

The test failed with the output:

[   57.654444] KTAP version 1
[   57.654739] 1..1
[   57.655363]     KTAP version 1
[   57.655766]     # Subtest: printk-ringbuffer
[   57.656105]     # module: printk_ringbuffer_kunit_test
[   57.656137]     1..1
[   57.657997]     # test_readerwriter: running for 10000 ms
[   57.659135]     # test_readerwriter: start thread 001 (writer)
[   57.659328]     # test_readerwriter: start thread 002 (writer)
[   57.659626]     # test_readerwriter: start thread 003 (writer)
[   57.659827]     # test_readerwriter: start thread 004 (writer)
[   57.660008]     # test_readerwriter: start thread 005 (writer)
[   57.660209]     # test_readerwriter: start thread 006 (writer)
[   57.661648]     # test_readerwriter: start thread 007 (writer)
[   57.662040]     # test_readerwriter: start thread 008 (writer)
[   57.662283]     # test_readerwriter: start thread 009 (writer)
[   57.662539]     # test_readerwriter: start thread 010 (writer)
[   57.662833]     # test_readerwriter: start thread 011 (writer)
[   57.662841]     # test_readerwriter: starting test
[   57.662945]     # test_readerwriter: start reader
[   57.663072]     # test_readerwriter: EXPECTATION FAILED at kernel/printk/printk_ringbuffer_kunit_test.c:80
               BAD RECORD: seq=1519 size=1145324612 text=
[   57.663253]     # test_readerwriter: EXPECTATION FAILED at kernel/printk/printk_ringbuffer_kunit_test.c:80
               BAD RECORD: seq=1520 size=4473924 text=
[   57.663432]     # test_readerwriter: EXPECTATION FAILED at kernel/printk/printk_ringbuffer_kunit_test.c:80
               BAD RECORD: seq=1524 size=1145324612 text=
[   57.672125]     # test_readerwriter: EXPECTATION FAILED at kernel/printk/printk_ringbuffer_kunit_test.c:80
               BAD RECORD: seq=1526 size=4473924 text=
[   57.672322]     # test_readerwriter: EXPECTATION FAILED at kernel/printk/printk_ringbuffer_kunit_test.c:80
               BAD RECORD: seq=1528 size=1145324612 text=
[   57.675564]     # test_readerwriter: EXPECTATION FAILED at kernel/printk/printk_ringbuffer_kunit_test.c:80
               BAD RECORD: seq=1529 size=1145324612 text=
[   57.675761]     # test_readerwriter: EXPECTATION FAILED at kernel/printk/printk_ringbuffer_kunit_test.c:80
               BAD RECORD: seq=1531 size=1145324612 text=
[   67.904538]     # test_readerwriter: end reader: read=14 seq=1531
[   67.905316]     # test_readerwriter: completed test
[   67.907023]     # test_readerwriter: end thread 011: wrote=0 by attempts=10260578
[   67.907974]     # test_readerwriter: end thread 010: wrote=0 by attempts=10271385
[   67.908929]     # test_readerwriter: end thread 009: wrote=0 by attempts=10281631
[   67.909675]     # test_readerwriter: end thread 008: wrote=0 by attempts=10266089
[   67.910571]     # test_readerwriter: end thread 007: wrote=0 by attempts=10278468
[   67.911294]     # test_readerwriter: end thread 006: wrote=0 by attempts=13344945
[   67.912009]     # test_readerwriter: end thread 005: wrote=160 by attempts=10290490
[   67.912800]     # test_readerwriter: end thread 004: wrote=247 by attempts=13351217
[   67.913499]     # test_readerwriter: end thread 003: wrote=294 by attempts=13337712
[   67.914167]     # test_readerwriter: end thread 002: wrote=281 by attempts=13373189
[   67.914858]     # test_readerwriter: end thread 001: wrote=439 by attempts=13283005
[   67.915980]     # test_readerwriter.speed: slow
[   67.916068]     not ok 1 test_readerwriter
[   67.916930] not ok 1 printk-ringbuffer


with the mathing trace output:

kunit_try_catch-1487    [000] .....    57.792815: _prb_read_valid: prb_read: Calling copy_data() for seq=1518
 kunit_try_catch-1487    [000] .....    57.792818: _prb_read_valid:   copy_data: reading from 0xffffffffc02554c8 (78 bytes)
 kunit_try_catch-1487    [000] .....    57.792819: _prb_read_valid: _prb_read_valid: Successfully read record with seq=1518
 kunit_try_catch-1487    [000] .....    57.792821: _prb_read_valid: prb_read: Calling copy_data() for seq=1519
 kunit_try_catch-1487    [000] .....    57.792822: _prb_read_valid:   copy_data: reading from 0xffffffffc0255520 (17 bytes)
 kunit_try_catch-1487    [000] .....    57.792823: _prb_read_valid: _prb_read_valid: Successfully read record with seq=1519
 kunit_try_catch-1487    [000] .....    57.793011: _prb_read_valid: prb_read: Calling copy_data() for seq=1520
 kunit_try_catch-1487    [000] .....    57.793012: _prb_read_valid:   copy_data: reading from 0xffffffffc0255558 (15 bytes)
 kunit_try_catch-1487    [000] .....    57.793013: _prb_read_valid: _prb_read_valid: Successfully read record with seq=1520
 kunit_try_catch-1487    [000] .....    57.793182: _prb_read_valid: prb_read: Calling copy_data() for seq=1521
 kunit_try_catch-1487    [000] .....    57.793183: _prb_read_valid:   copy_data: reading from 0xffffffffc02554c8 (131 bytes)
 kunit_try_catch-1487    [000] .....    57.793184: _prb_read_valid: _prb_read_valid: Successfully read record with seq=1521
 kunit_try_catch-1487    [000] .....    57.793185: _prb_read_valid: prb_read: Calling copy_data() for seq=1522
 kunit_try_catch-1487    [000] .....    57.793186: _prb_read_valid:   copy_data: reading from 0xffffffffc02554c8 (95 bytes)
 kunit_try_catch-1487    [000] .....    57.793186: _prb_read_valid: _prb_read_valid: Successfully read record with seq=1522
 kunit_try_catch-1487    [000] .....    57.793187: _prb_read_valid: prb_read: Calling copy_data() for seq=1523
 kunit_try_catch-1487    [000] .....    57.793188: _prb_read_valid:   copy_data: reading from 0xffffffffc02554c8 (131 bytes)
 kunit_try_catch-1487    [000] .....    57.793189: _prb_read_valid: _prb_read_valid: Successfully read record with seq=1523
 kunit_try_catch-1487    [000] .....    57.793190: _prb_read_valid: prb_read: Calling copy_data() for seq=1524
 kunit_try_catch-1487    [000] .....    57.793191: _prb_read_valid:   copy_data: reading from 0xffffffffc0255530 (87 bytes)
 kunit_try_catch-1487    [000] .....    57.793192: _prb_read_valid: _prb_read_valid: Successfully read record with seq=1524
 kunit_try_catch-1487    [000] .....    57.801868: _prb_read_valid: prb_read: Calling copy_data() for seq=1525
 kunit_try_catch-1487    [000] .....    57.801869: _prb_read_valid:   copy_data: reading from 0xffffffffc02554c8 (107 bytes)
 kunit_try_catch-1487    [000] .....    57.801870: _prb_read_valid: _prb_read_valid: Successfully read record with seq=1525
 kunit_try_catch-1487    [000] .....    57.801872: _prb_read_valid: prb_read: Calling copy_data() for seq=1526
 kunit_try_catch-1487    [000] .....    57.801873: _prb_read_valid:   copy_data: reading from 0xffffffffc0255558 (60 bytes)
 kunit_try_catch-1487    [000] .....    57.801873: _prb_read_valid: _prb_read_valid: Successfully read record with seq=1526
 kunit_try_catch-1487    [000] .....    57.802067: _prb_read_valid: prb_read: Calling copy_data() for seq=1527
 kunit_try_catch-1487    [000] .....    57.802068: _prb_read_valid:   copy_data: reading from 0xffffffffc02554c8 (51 bytes)
 kunit_try_catch-1487    [000] .....    57.802068: _prb_read_valid: _prb_read_valid: Successfully read record with seq=1527
 kunit_try_catch-1487    [000] .....    57.802070: _prb_read_valid: prb_read: Calling copy_data() for seq=1528
 kunit_try_catch-1487    [000] .....    57.802070: _prb_read_valid:   copy_data: reading from 0xffffffffc0255540 (67 bytes)
 kunit_try_catch-1487    [000] .....    57.802071: _prb_read_valid: _prb_read_valid: Successfully read record with seq=1528
 kunit_try_catch-1487    [000] .....    57.805307: _prb_read_valid: prb_read: Calling copy_data() for seq=1529
 kunit_try_catch-1487    [000] .....    57.805308: _prb_read_valid:   copy_data: reading from 0xffffffffc0255508 (57 bytes)
 kunit_try_catch-1487    [000] .....    57.805309: _prb_read_valid: _prb_read_valid: Successfully read record with seq=1529
 kunit_try_catch-1487    [000] .....    57.805505: _prb_read_valid: prb_read: Calling copy_data() for seq=1530
 kunit_try_catch-1487    [000] .....    57.805506: _prb_read_valid:   copy_data: reading from 0xffffffffc02554c8 (45 bytes)
 kunit_try_catch-1487    [000] .....    57.805507: _prb_read_valid: _prb_read_valid: Successfully read record with seq=1530
 kunit_try_catch-1487    [000] .....    57.805509: _prb_read_valid: prb_read: Calling copy_data() for seq=1531
 kunit_try_catch-1487    [000] .....    57.805509: _prb_read_valid:   copy_data: reading from 0xffffffffc0255550 (109 bytes)
 kunit_try_catch-1487    [000] .....    57.805510: _prb_read_valid: _prb_read_valid: Successfully read record with seq=1531


It seems that the first successfully read record was OK.

BTW: I have added the trace_printk() "Calling copy_data()..."
     because before I saw that the test failed more times than
     _prb_read_valid() returned success. So, I was curious
     whether it at least copied the data for the seq number.


BTW2: It seems the even writers had problems to reserve space this
      time. Only few of them succeeded few times from many attempts.
      I the numbers by this change:

--- a/kernel/printk/printk_ringbuffer_kunit_test.c
+++ b/kernel/printk/printk_ringbuffer_kunit_test.c
@@ -112,6 +112,7 @@ static int prbtest_writer(void *data)
 	struct prbtest_rbdata *dat;
 	u32 record_size, text_size;
 	unsigned long count = 0;
+	unsigned long success_count = 0;
 	struct printk_record r;
 
 	kunit_info(tr->test_data->test, "start thread %03lu (writer)\n", tr->num);
@@ -154,6 +155,7 @@ static int prbtest_writer(void *data)
 			prb_commit(&e);
 
 			wake_up_interruptible(&tr->test_data->new_record_wait);
+			success_count++;
 		}
 
 		if ((count++ & 0x3fff) == 0)
@@ -163,7 +165,9 @@ static int prbtest_writer(void *data)
 			break;
 	}
 
-	kunit_info(tr->test_data->test, "end thread %03lu: wrote=%lu\n", tr->num, count);
+	kunit_info(tr->test_data->test,
+		   "end thread %03lu: wrote=%lu by attempts=%lu\n",
+		   tr->num, success_count, count);
 
 	return 0;
 }


Best Regards,
Petr

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ