[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20101021151613.GB2363@linux.vnet.ibm.com>
Date: Thu, 21 Oct 2010 08:16:13 -0700
From: "Paul E. McKenney" <paulmck@...ux.vnet.ibm.com>
To: Eric Dumazet <eric.dumazet@...il.com>
Cc: Hans Schillstrom <hans.schillstrom@...csson.com>,
Daniel Lezcano <daniel.lezcano@...e.fr>,
"lvs-devel@...r.kernel.org" <lvs-devel@...r.kernel.org>,
"netdev@...r.kernel.org" <netdev@...r.kernel.org>,
"netfilter-devel@...r.kernel.org" <netfilter-devel@...r.kernel.org>,
"horms@...ge.net.au" <horms@...ge.net.au>, "ja@....bg" <ja@....bg>,
"wensong@...ux-vs.org" <wensong@...ux-vs.org>
Subject: Re: [RFC PATCH 1/9] ipvs network name space aware
On Thu, Oct 21, 2010 at 10:58:24AM +0200, Eric Dumazet wrote:
>
> > You said that there were a lot of "stepi" commands to get through
> > rcu_read_lock() on x86_64. This is quite surprising, especially if you
> > built with CONFIG_RCU_TREE. Even if you built with CONFIG_PREEMPT_RCU_TREE,
> > you should only see something like the following from rcu_read_lock():
> >
> > 000000b7 <__rcu_read_lock>:
> > b7: 55 push %ebp
> > b8: 64 a1 00 00 00 00 mov %fs:0x0,%eax
> > be: ff 80 80 01 00 00 incl 0x180(%eax)
> > c4: 89 e5 mov %esp,%ebp
> > c6: 5d pop %ebp
> > c7: c3 ret
> >
> > Unless you have some sort of debugging options turned on. Or unless
> > six instructions counts for "quite many" stepi commands. ;-)
>
> Paul, this should be inlined, dont you think ?
Indeed it should!!! It is out-of-line due to header-file issues.
Lai Jiangshan proposed a change to kbuild to allow it to be inlined as
part of his ring-RCU patch, and I have asked him to submit a version
of that for Tree and Tiny preemptible RCU. This is the usual trick of
having the build system compile the data structure and emit offsets,
which are then used in the main kernel build. (Yes, I did something
similar in DYNIX/ptx, but never managed to work up the courage to attempt
the equivalent in Linux's kbuild, so props to Lai!)
> Also, I dont understand why we use ACCESS_ONCE() in rcu_read_lock()
>
> ACCESS_ONCE(current->rcu_read_lock_nesting)++;
>
> Apparently, some compilers are a bit noisy here.
>
> mov 0x1b0(%rdx),%eax
> inc %eax
> mov %eax,0x1b0(%rdx)
>
> instead of :
>
> incl 0x1b0(%rax)
>
> So if the ACCESS_ONCE() is needed, we might add a comment, because it's
> not obvious ;)
Here is what it looks like in my -rcu tree:
void __rcu_read_lock(void)
{
current->rcu_read_lock_nesting++;
barrier(); /* needed if we ever invoke rcu_read_lock in rcutree.c */
}
So yes, I finally did convince myself that the ACCESS_ONCE was not
needed. ;-)
This is not yet in mainline, but Ingo sent the series containing this
commit (80dcf60e) to Linus earlier today, so there is hope!
Thanx, Paul
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists