[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20120906222239.GQ2448@linux.vnet.ibm.com>
Date: Thu, 6 Sep 2012 15:22:39 -0700
From: "Paul E. McKenney" <paulmck@...ux.vnet.ibm.com>
To: Steven Rostedt <rostedt@...dmis.org>
Cc: Peter Zijlstra <peterz@...radead.org>,
linux-kernel@...r.kernel.org, mingo@...e.hu, laijs@...fujitsu.com,
dipankar@...ibm.com, akpm@...ux-foundation.org,
mathieu.desnoyers@...ymtl.ca, josh@...htriplett.org,
niv@...ibm.com, tglx@...utronix.de, Valdis.Kletnieks@...edu,
dhowells@...hat.com, eric.dumazet@...il.com, darren@...art.com,
fweisbec@...il.com, sbw@....edu, patches@...aro.org,
"Paul E. McKenney" <paul.mckenney@...aro.org>
Subject: Re: [PATCH tip/core/rcu 11/15] rcu: Avoid spurious RCU CPU stall
warnings
On Thu, Sep 06, 2012 at 06:05:53PM -0400, Steven Rostedt wrote:
> On Thu, 2012-09-06 at 14:58 -0700, Paul E. McKenney wrote:
>
> > > All these cases deserve a warning.
> >
> > Agreed, and that is the whole purpose of the stall warnings.
>
> Then let me ask the question again. According to the change log:
>
> > If a given CPU avoids the idle loop but also avoids starting a new
> > RCU grace period for a full minute, RCU can issue spurious RCU CPU
> > stall warnings. This commit fixes this issue by adding a check for
> > ongoing grace period to avoid these spurious stall warnings.
>
> I'm still confused by what is "this issue"? And why is it being fixed.
> It sounds to me that the "issue" was a CPU avoiding starting a new RCU
> grace period for a full minute. Which to me sounds like a bug in which
> we *want* a warning. Why is this patch needed?
Ah!
It is perfectly legal to avoid -starting- an RCU grace period for a
minute, or even longer. If RCU has nothing to do, in other words, if no
one registers any RCU callbacks, then RCU need not start a grace period.
Of course, this would mean that it would eventually be a full minute
since the last start of a grace period. This is not a problem, after
all, Linux went through a full ten years before experiencing its first
grace period.
But the stall-warning code just checked how long it had been since
the last start of a grace period, failing to note that this grace
period had long since completed. So it splatted out a warning.
This warning was spurious in the sense that there was no bug aside
from the missing check that the grace period was still in progress.
And this commit fixes that bug in RCU.
Thanx, Paul
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists