[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <4AD531E5.6070103@cn.fujitsu.com>
Date: Wed, 14 Oct 2009 10:05:25 +0800
From: Lai Jiangshan <laijs@...fujitsu.com>
To: paulmck@...ux.vnet.ibm.com
CC: linux-kernel@...r.kernel.org, mingo@...e.hu, dipankar@...ibm.com,
akpm@...ux-foundation.org, mathieu.desnoyers@...ymtl.ca,
josh@...htriplett.org, dvhltc@...ibm.com, niv@...ibm.com,
tglx@...utronix.de, peterz@...radead.org, rostedt@...dmis.org,
Valdis.Kletnieks@...edu, dhowells@...hat.com, avi@...hat.com,
mtosatti@...hat.com, torvalds@...ux-foundation.org
Subject: Re: [PATCH RFC tip/core/rcu 1/3] rcu: The Bloatwatch Edition, v7
Paul E. McKenney wrote:
> On Wed, Oct 14, 2009 at 08:37:18AM +0800, Lai Jiangshan wrote:
>> Paul E. McKenney wrote:
>>>> It's an old issue.
>>>> It's not only about RCUTINY, it's also about other rcu implementations:
>>>>
>>>> rcu_enter_nohz()/rcu_exit_nohz() are not called in pairs.
>>>>
>>>> irq_exit() calls tick_nohz_stop_sched_tick() which calls rcu_enter_nohz(),
>>>> where is the corresponding rcu_exit_nohz()?
>>>> (or tick_nohz_restart_sched_tick())?
>>> The tick_nohz_restart_sched_tick() function is called from the various
>>> per-architecture cpu_idle() functions (or default_idle() or whatever
>>> name that the architecture uses). For example, in:
>>>
>>> arch/x86/kernel/process_64.c
>>>
>>> the cpu_idle() function invokes tick_nohz_restart_sched_tick() just
>>> before invoking schedule() to exit the idle loop.
>>>
>>> And, as you say, tick_nohz_restart_sched_tick() invokes rcu_exit_nohz().
>> These tick_nohz_restart_sched_tick() which are called from the various
>> per-architecture cpu_idle() functions are not the opposite of
>> the tick_nohz_stop_sched_tick() in *irq_exit()*. So I figure that
>> rcu_enter_nohz()/rcu_exit_nohz() are not called in pairs.
>
> OK, let's start with rcu_enter_nohz(), which tells RCU that the running
> CPU is going into dyntick-idle mode, and thus should be ignored by RCU.
> Let's do the idle loop first:
>
> o Upon entry to the idle() loop (using cpu_idle() in
> arch/x86/kernel/process_64.c for this exercise),
> we invoke tick_nohz_stop_sched_tick(1), which says we
> are in an idle loop. (This is in contrast to the call
> from irq_exit(), where we are not in the idle loop.)
>
> o tick_nohz_stop_sched_tick() invokes rcu_enter_nohz(),
> does a bunch of timer checking, and returns. If anything
> indicated that entering dyntick-idle mode would be bad,
> we raise TIMER_SOFTIRQ to kick us out of this mode.
>
> Either way, we return to the idle loop.
>
> o The idle loops until need_resched(). Upon exit from the
> idle loop, we call tick_nohz_restart_sched_tick(), which
> invokes rcu_exit_nohz(), which tells RCU to start paying
> attention to this CPU once more.
>
> OK, now for interrupts.
>
> o The hardware interrupt handlers invoke irq_enter(), which in
> turn invokes rcu_irq_enter(). This has no real effect (other
> than incrementing a counter) if the interrupt did not come
> from dyntick-idle mode.
>
> Either way, RCU is now paying attention to RCU read-side
> critical sections on this CPU.
>
> o Upon return from interrupt, the hardware interrupt handlers
> invoke irq_exit(), which in turn invokes rcu_irq_exit().
> This has no real effect (other than decrementing a counter)
> if the interrupt is not returning to dyntick-idle mode.
>
> However, if the interrupt -is- returning to dyntick-idle
> mode, then RCU will stop paying attention to RCU read-side
> critical sections on this CPU.
You haven't explain the tick_nohz_stop_sched_tick() in *irq_exit()*.
(tick_nohz_stop_sched_tick() calls rcu_enter_nohz())
void irq_exit(void)
{
....
rcu_irq_exit(); /* This is OK, the opposite is in irq_enter() */
if (idle_cpu(smp_processor_id()) && !in_interrupt() && !need_resched())
tick_nohz_stop_sched_tick(0); /* where is the opposite ??? */
....
}
This means if the interrupt -is- returning to dyntick-idle mode,
rcu_enter_nohz() is called again.
Take this flow as example:
cpu_idle():
while(1) {
tick_nohz_stop_sched_tick()
rcu_enter_nohz() *****
------->interrupt happen
irq_enter()
irq_exit()
tick_nohz_stop_sched_tick()
rcu_enter_nohz() *****
<-------interrupt returns
tick_nohz_restart_sched_tick()
rcu_exit_nohz() *****
} /* while(1) */
You can see that rcu_enter_nohz() is called twice and
rcu_exit_nohz() is only called once in this flow.
It's because tick_nohz_stop_sched_tick()/tick_nohz_restart_sched_tick()
are not called in pairs, so rcu_enter_nohz() and rcu_exit_nohz()
are not called in pairs either.
Lai
>
> So I do believe that rcu_enter_nohz() and rcu_exit_nohz() are in fact
> invoked in pairs. One strange thing about this is that the idle loop
> first invokes rcu_enter_nohz(), then invokes rcu_exit_nohz(), while
> an interrupt handler first invokes rcu_irq_enter() and then invokes
> rcu_irq_exit(). So the idle loop enters dyntick-idle mode and then
> leaves it, while an interrupt handler might leave dyntick-idle mode and
> then re-enter it.
>
> Or am I still missing something here?
>
> Thanx, Paul
>
>
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists