[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-Id: <1190551422.4256.36.camel@localhost>
Date: Sun, 23 Sep 2007 08:43:41 -0400
From: jamal <hadi@...erus.ca>
To: john ye <johny@...mco.com.cn>
Cc: David Miller <davem@...emloft.net>, netdev@...r.kernel.org,
kuznet@....inr.ac.ru, pekkas@...core.fi, jmorris@...ei.org,
kaber@...eworks.de
Subject: Re: [PATCH: 2.6.13-15-SMP 3/3] network: concurrently run
softirqnetwork code on SMP
On Sun, 2007-23-09 at 12:45 +0800, john ye wrote:
> I do randomly select a CPU to dispatch the skb to. Previously, I
> dispatch
> skb evenly to all CPUs( round robin, one by one). but I didn't find a
> quick
> coding. for_each_online_cpu is not quick enough.
for_each_online_cpu doenst look that expensive - but even round robin
wont fix the reordering problem. What you need to do is make sure that a
flow always goes to the same cpu over some period of time.
> According to my test result, it did make packet INPUT speed doubled
> because
> another CPU is used concurrently.
How did you measure "speed" - was it throughput? Did you measure how
much cpu was being utilized?
> It seems the packets still keep "roughly ordering" after turning on
> BS patch.
Linux TCP is very resilient to reordering compared to other OSes, but
even then if you hit it with enough packets it is going to start
sweating it.
> The test is simple: use an 2400 lines of iptables -t filter -A INPUT
> -p
> tcp -s x.x.x.x --dport yy -j XXXX.
> these rules make the current softirq be very busy on one CPU and make
> the
> incoming net very slow. after turning on BS, the speed doubled.
>
Ok, but how do you observe "doubled"?
Do you have conntrack on? It maybe that what you have just found is
netfilter needs to have its work defered from packet rcv.
You need some real fast traffic generator; possibly one that can do
thousands of tcp sessions.
> For NAT test, I didn't get a good result like INPUT because real
> environment limitation.
> The test is very basic and is far from "full".
What happens when you totally compile out netfilter and you just use
this machine as a server?
> It seems to me that the cross-cpu spinlock_xxxx for the queue doesn't
> have
> big cost and is allowable in terms of CPU time consumption, compared
> with
> the gains by making other CPUs joint in the work.
>
> I have made BS patch into a loadable module.
> http://linux.chinaunix.net/bbs/thread-909725-2-1.html and let others
> help with testing.
It is still very hard to read; and i am not sure how you are going to
get the performance you claim eventually - you are registering as a tap
for ip packets, which means you will process two of each incoming
packets.
cheers,
jamal
-
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists