[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <20080822.034217.135152551.davem@davemloft.net>
Date: Fri, 22 Aug 2008 03:42:17 -0700 (PDT)
From: David Miller <davem@...emloft.net>
To: herbert@...dor.apana.org.au
Cc: hadi@...erus.ca, kaber@...sh.net, netdev@...r.kernel.org,
johannes@...solutions.net, linux-wireless@...r.kernel.org
Subject: Re: [PATCH 20/31]: pkt_sched: Perform bulk of qdisc destruction in
RCU.
From: Herbert Xu <herbert@...dor.apana.org.au>
Date: Fri, 22 Aug 2008 17:41:15 +1000
> On Fri, Aug 22, 2008 at 12:16:20AM -0700, David Miller wrote:
> >
> > I suppose you're talking about the grand unified flow cache
> > that never gets implemented right? I classify that in the
> > same category as net channels at the moment, theoretically
> > very interesting but no practical implementation in sight.
>
> In any case, we could implement what I suggested even without
> a flow cache, by simply storing the info in the dst. This means
> that all flows on the same dst will end up in the same queue, which
> is not as good as what a flow cache could give, but it's useful
> enough to be an option.
I had another idea in the back of my head that I wanted to mention.
The SKB has a hash, and sources set the hash. It doesn't matter
where in the stack it comes from.
At transmit time, the select queue logic in dev_queue_xmit() will use
the SKB hash if one has been set already. Otherwise it will do the
hashing it does currently.
So in the simplest case for forwarding, the RX side puts the RSS
hash or whatever into this SKB hash location.
Then, taking things to the next level, protocols set hashes for
locally created packets. And this leads to being able to delete
the simple_tx_hash() code entirely and also we won't have to add
support for every protocol on the planet to that function. :)
TCP sockets maybe could even do something clever, like, using the
incoming SKB hash of the SYN and SYN+ACK packets for all subsequent
SKBs sent by that socket.
Another nice side effect, we have the choice of allowing IPSEC
encapsulation of locally generated frames to not change the TX queue.
This would have to be controlled by a sysctl or similar because I
can see both ways being useful in different circumstances.
You get the idea.
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists