[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <5542B9A0.70605@gmail.com>
Date: Thu, 30 Apr 2015 16:24:16 -0700
From: Alexander Duyck <alexander.duyck@...il.com>
To: Eric Dumazet <eric.dumazet@...il.com>,
Alexander Duyck <alexander.h.duyck@...hat.com>
CC: netdev@...r.kernel.org, davem@...emloft.net
Subject: Re: [PATCH 1/3] etherdev: Avoid unnecessary byte swap in check for
Ethertype
On 04/30/2015 04:03 PM, Eric Dumazet wrote:
> On Thu, 2015-04-30 at 14:53 -0700, Alexander Duyck wrote:
>> This change takes advantage of the fact that ETH_P_802_3_MIN is aligned to
>> 512 so as a result we can actually ignore the lower 8b when comparing the
>> Ethertype to ETH_P_802_3_MIN. This allows us to avoid a byte swap by simply
>> masking the value and comparing it to the byte swapped value for
>> ETH_P_802_3_MIN.
>>
>> Signed-off-by: Alexander Duyck <alexander.h.duyck@...hat.com>
>> ---
>> net/ethernet/eth.c | 2 +-
>> 1 file changed, 1 insertion(+), 1 deletion(-)
>>
>> diff --git a/net/ethernet/eth.c b/net/ethernet/eth.c
>> index f3bad41d725f..60069318d5d1 100644
>> --- a/net/ethernet/eth.c
>> +++ b/net/ethernet/eth.c
>> @@ -178,7 +178,7 @@ __be16 eth_type_trans(struct sk_buff *skb, struct net_device *dev)
>> if (unlikely(netdev_uses_dsa(dev)))
>> return htons(ETH_P_XDSA);
>>
>> - if (likely(ntohs(eth->h_proto) >= ETH_P_802_3_MIN))
>> + if (likely((eth->h_proto & htons(0xFF00)) >= htons(ETH_P_802_3_MIN)))
>> return eth->h_proto;
> Then, a byte operation on x86 is shorter/faster than u16 one.
>
> You also could use
>
> if (likely(*(u8 *)ð->h_proto >= (ETH_P_802_3_MIN>>8)))
> return eth->h_proto;
>
> I would at least leave a comment here to explain the logic.
Actually a byte operation itself is not faster. Note in the next line
we are returning the value. So what you typically end up with by doing
it that way would be 2 reads, one for the u8 and one for the u16 return
value. That is actually what I am trying to address in the second patch
in the set since we were doing a 8b test on the first byte of the
address followed by a 64b read.
The advantage with the way I wrote this is that the compiler itself
should be able to sort out how it wants to test the value while
accessing it in a 16b size. So at worst case it is a mask and compare,
followed by a return of the value. From what I have seen the compiler
seems to be smart enough on x86 anyway to just convert this into a one
byte compare on AL and then return the result in AX. I would suspect
that for bit-endian systems it would likely just perform the compare.
- Alex
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists