Message-ID: <1354075918.14302.77.camel@edumazet-glaptop>
Date: Tue, 27 Nov 2012 20:11:58 -0800
From: Eric Dumazet <eric.dumazet@...il.com>
To: Joe Perches <joe@...ches.com>
Cc: David Miller <davem@...emloft.net>,
netdev <netdev@...r.kernel.org>,
Ling Ma <ling.ma.program@...il.com>
Subject: Re: [PATCH net-next] net: move inet_dport/inet_num in sock_common
On Tue, 2012-11-27 at 18:23 -0800, Joe Perches wrote:
> Still, the logical tests that are likely to be in the same
> cacheline could be ANDed together to avoid a test and jump.
The point of having the conditional jump on sk_hash/hash was that in one
compare, we catch the yes/no status with a 99.999999% success rate.
All the following compares are predicted by the cpu and are essentially
free. Adding the AND or OR will basically have the same cpu cost.
If we wanted to do a full test of all tuple fields with a single
conditional jump, we would not have to include the hash test at all.
(If the 4-tuple matches, then sk_hash/hash value _must_ be the same by
definition)
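
For illustration, the two strategies discussed above can be sketched in
plain C. This is a simplified userspace sketch, not the kernel code:
struct flow_key, toy_hash(), match_hash_first() and match_single_branch()
are hypothetical names, and the hash function is a placeholder that only
has to be a pure function of the 4-tuple.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical, simplified stand-in for the fields used by the lookup. */
struct flow_key {
	uint32_t hash;   /* precomputed from the 4-tuple below */
	uint32_t saddr;
	uint32_t daddr;
	uint16_t sport;
	uint16_t dport;
};

/* Placeholder hash: any pure function of the 4-tuple works for the sketch. */
static uint32_t toy_hash(const struct flow_key *k)
{
	uint32_t h = k->saddr * 2654435761u;

	h ^= k->daddr * 40503u;
	h ^= ((uint32_t)k->sport << 16) | k->dport;
	return h;
}

/* Hash first: one well-predicted branch rejects almost every non-match,
 * and the remaining compares only run for likely matches. */
static bool match_hash_first(const struct flow_key *sk,
			     const struct flow_key *pkt)
{
	if (sk->hash != pkt->hash)
		return false;
	return sk->saddr == pkt->saddr && sk->daddr == pkt->daddr &&
	       sk->sport == pkt->sport && sk->dport == pkt->dport;
}

/* Single branch: fold all field differences with XOR/OR and test once.
 * The hash is not consulted, since equal tuples imply equal hashes. */
static bool match_single_branch(const struct flow_key *sk,
				const struct flow_key *pkt)
{
	uint32_t diff = (sk->saddr ^ pkt->saddr) |
			(sk->daddr ^ pkt->daddr) |
			(uint32_t)(sk->sport ^ pkt->sport) |
			(uint32_t)(sk->dport ^ pkt->dport);

	return diff == 0;
}
```

Both functions return the same answer for any pair of keys whose hash
field was computed from the tuple; they differ only in how many loads
and branches are executed on the common non-matching path.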
Note it's quite different from the optimization we did in
ipv6_addr_equal(), as it allowed fewer memory loads and instructions.
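
As a rough illustration of why that kind of change saves loads and
instructions, a 128-bit address can be compared as two 64-bit words
instead of four 32-bit words. The sketch below is a userspace
approximation under that assumption, not the kernel's ipv6_addr_equal();
struct addr128 and addr128_equal() are hypothetical names.

```c
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for a 128-bit IPv6 address. */
struct addr128 {
	uint8_t bytes[16];
};

/* Compare two addresses with two 64-bit loads per operand and one test.
 * memcpy() keeps the loads free of alignment and aliasing problems;
 * compilers typically turn each call into a single 64-bit load. */
static bool addr128_equal(const struct addr128 *a, const struct addr128 *b)
{
	uint64_t a_lo, a_hi, b_lo, b_hi;

	memcpy(&a_lo, a->bytes, sizeof(a_lo));
	memcpy(&a_hi, a->bytes + 8, sizeof(a_hi));
	memcpy(&b_lo, b->bytes, sizeof(b_lo));
	memcpy(&b_hi, b->bytes + 8, sizeof(b_hi));

	return ((a_lo ^ b_lo) | (a_hi ^ b_hi)) == 0;
}
```

Here the gain comes from halving the number of loads, which is a
different effect from merging already well-predicted branches.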
I would say this can come later, as the meat of my patch was about
avoiding a full cache line miss, which is far more expensive than any
tricks we can even think about.
Note it will be hard to actually measure any further gains, since I did
TCP_RR tests (200 threads) and the lookup cost went from 1.4 % to 0.8 %
of the grand total, and is now mostly dominated by the atomic that
increments the socket refcount.