[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <1240566136.6554.220.camel@blade.ines.ro>
Date: Fri, 24 Apr 2009 12:42:16 +0300
From: Radu Rendec <radu.rendec@...s.ro>
To: Jesper Dangaard Brouer <hawk@...u.dk>
Cc: Jarek Poplawski <jarkao2@...il.com>,
Denys Fedoryschenko <denys@...p.net.lb>, netdev@...r.kernel.org
Subject: Re: htb parallelism on multi-core platforms
On Thu, 2009-04-23 at 22:19 +0200, Jesper Dangaard Brouer wrote:
> >> It also proves that most of the packet processing work is actually in
> >> htb.
>
> I'm not sure that statement is true.
> Can you run Oprofile on the system? That will tell us exactly where time
> is spend...
I've never used oprofile, but it looks very powerful and simple to use.
I'll compile a 2.6.29 (so that I also benefit from the htb patch you
told me about) then put oprofile on top of it. I'll get back to you by
evening (or maybe Monday noon) with real facts :)
> > ...
> > I thought about using some trick with virtual devs instead, but maybe
> > I'm totally wrong.
>
> I like the idea with virtual devices, as each virtual device could be
> bound to a hardware tx-queue.
Is there any current support for this or do you talk about it as an
approach to use in future development?
The idea looks interesting indeed. If there's current support for it,
I'd like to try it out. If not, perhaps I can help at least with testing
(or even some coding as well).
> Then you just have to construct your HTB trees on each virtual
> device, and assign customers accordingly.
I don't think it's that easy. Let's say we have the same HTB trees on
both virtual devices A and B (each of them is bound to a different
hardware tx queue). If packets for a specific destination ip address
(pseudo)randomly arrive at both A and B, tokens will be extracted from
both A and B trees, resulting in an erroneus overall bandwidth (at worst
double the ceil, if packets reach the ceil on both A and B).
I have to make sure packets belonging to a certain customer (or ip
address) always come through a specific virtual device. Then HTB trees
don't even need to be identical.
However, this is not trivial at all. A single customer can have
different subnets (even from different class-B networks) but share a
single HTB bucket for all of them. Using a simple hash function on the
ip address to determine which virtual device to send through doesn't
seem to be an option since it does not guarantee all packets for a
certain customer will go together.
What I had in mind for parallel shaping was this:
NIC0 -> mux -----> Thread 0: classify/shape -----> NIC2
\/ \/
/\ /\
NIC1 -> mux -----> Thread 1: classify/shape -----> NIC3
Of course the number of input NICs, processing threads and output NICs
would be adjustable. But this idea has 2 major problems:
* shaping data must be shared between processing threads (in order to
extract tokens from the same bucket regardless of the thread that does
the actual processesing)
* it seems to be impossible to do this with (unmodified) HTB
> I just realized, you don't use a multi-queue capably NIC right?
> Then it would be difficult to use the hardware tx-queue idea.
> Have you though of using several physical NICs?
The machine we are preparing for production has this:
2 x Intel Corporation 82571EB Gigabit Ethernet Controller
2 x Intel Corporation 80003ES2LAN Gigabit Ethernet Controller
All 4 NICs use the e1000e driver and I think they are multi-queue
capable. So in theory I can use several NICs and/or multi-queue.
Thanks,
Radu Rendec
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists