[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <alpine.DEB.1.10.0810291439460.13214@gandalf.stny.rr.com>
Date: Wed, 29 Oct 2008 18:48:00 -0400 (EDT)
From: Steven Rostedt <rostedt@...dmis.org>
To: LKML <linux-kernel@...r.kernel.org>
cc: Ingo Molnar <mingo@...e.hu>, Thomas Gleixner <tglx@...utronix.de>,
Peter Zijlstra <peterz@...radead.org>,
Linus Torvalds <torvalds@...ux-foundation.org>,
Andrew Morton <akpm@...ux-foundation.org>
Subject: [PATCH] ring-buffer: add paranoid checks for loops
[ for 2.6.28 ]
While writing a new tracer, I had a bug where I caused the ring-buffer
to recurse in a bad way. The bug was with the tracer I was writing
and not the ring-buffer itself. But it took a long time to find the
problem.
This patch adds paranoid checks into the ring-buffer infrastructure
that will catch bugs of this nature.
Note: I put the bug back in the tracer and this patch showed the error
nicely and prevented the lockup.
Signed-off-by: Steven Rostedt <srostedt@...hat.com>
---
kernel/trace/ring_buffer.c | 45 +++++++++++++++++++++++++++++++++++++++++++++
1 file changed, 45 insertions(+)
Index: linux-tip.git/kernel/trace/ring_buffer.c
===================================================================
--- linux-tip.git.orig/kernel/trace/ring_buffer.c 2008-10-29 12:38:54.000000000 -0400
+++ linux-tip.git/kernel/trace/ring_buffer.c 2008-10-29 16:11:00.000000000 -0400
@@ -1022,8 +1022,20 @@ rb_reserve_next_event(struct ring_buffer
struct ring_buffer_event *event;
u64 ts, delta;
int commit = 0;
+ int paranoid = 0;
again:
+ /*
+ * If we loop here 1,000 times, that means we are either
+ * in an interrupt storm, or we have something buggy.
+ * Bail!
+ */
+ if (unlikely(paranoid > 1000)) {
+ RB_WARN_ON(cpu_buffer, 1);
+ return NULL;
+ }
+ paranoid++;
+
ts = ring_buffer_time_stamp(cpu_buffer->cpu);
/*
@@ -1532,10 +1544,21 @@ rb_get_reader_page(struct ring_buffer_pe
{
struct buffer_page *reader = NULL;
unsigned long flags;
+ int paranoid = 0;
spin_lock_irqsave(&cpu_buffer->lock, flags);
again:
+ /*
+ * We can call here a couple of times, lets only allow 5.
+ */
+ if (unlikely(paranoid > 4)) {
+ RB_WARN_ON(cpu_buffer, 1);
+ reader = NULL;
+ goto out;
+ }
+ paranoid++;
+
reader = cpu_buffer->reader_page;
/* If there's more to read, return this page */
@@ -1665,6 +1688,7 @@ ring_buffer_peek(struct ring_buffer *buf
struct ring_buffer_per_cpu *cpu_buffer;
struct ring_buffer_event *event;
struct buffer_page *reader;
+ int paranoid = 0;
if (!cpu_isset(cpu, buffer->cpumask))
return NULL;
@@ -1672,6 +1696,16 @@ ring_buffer_peek(struct ring_buffer *buf
cpu_buffer = buffer->buffers[cpu];
again:
+ /*
+ * This could happen a few times, but if more than
+ * 10 times, then something is probably wrong.
+ */
+ if (unlikely(paranoid > 10)) {
+ RB_WARN_ON(cpu_buffer, 1);
+ return NULL;
+ }
+ paranoid++;
+
reader = rb_get_reader_page(cpu_buffer);
if (!reader)
return NULL;
@@ -1722,6 +1756,7 @@ ring_buffer_iter_peek(struct ring_buffer
struct ring_buffer *buffer;
struct ring_buffer_per_cpu *cpu_buffer;
struct ring_buffer_event *event;
+ int paranoid = 0;
if (ring_buffer_iter_empty(iter))
return NULL;
@@ -1730,6 +1765,16 @@ ring_buffer_iter_peek(struct ring_buffer
buffer = cpu_buffer->buffer;
again:
+ /*
+ * This could happen a few times, but if more than
+ * 10 times, then something is probably wrong.
+ */
+ if (unlikely(paranoid > 10)) {
+ RB_WARN_ON(cpu_buffer, 1);
+ return NULL;
+ }
+ paranoid++;
+
if (rb_per_cpu_empty(cpu_buffer))
return NULL;
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists