[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <48EB5D28.7000503@cosmosbay.com>
Date: Tue, 07 Oct 2008 14:59:20 +0200
From: Eric Dumazet <dada1@...mosbay.com>
To: Benny Amorsen <benny+usenet@...rsen.dk>
Cc: David Miller <davem@...emloft.net>, minyard@....org,
linux-kernel@...r.kernel.org, netdev@...r.kernel.org,
shemminger@...tta.com, paulmck@...ux.vnet.ibm.com
Subject: Re: [PATCH 3/3] Convert the UDP hash lock to RCU
Benny Amorsen a écrit :
> Eric Dumazet <dada1@...mosbay.com> writes:
>
>> Most UDP sockets are setup for long periods (RTP trafic), or if an application really
>> wants to {open/send or receive one UDP frame/close} many sockets, it already hits
>> RCU handling of its file structures and should not be slowed down that much.
>>
I should have say 'Many' instead of 'Most' :)
>> By 'long period' I mean thousand of packets sent/received by each RTP session, being
>> voice (50 packets/second) or even worse video...
>
> Does DNS with port randomization need short lived sockets?
>
Yes very true, but current allocation of a random port can be very expensive,
since we scan all the UDP hash table to select the smaller hash chain.
We stop the scan if we find an empty slot, but on machines with say more than 200
bound UDP sockets, they are probably no empty slots. (UDP_HTABLE_SIZE is 128)
bind(NULL port) algo is then O(N), N being number of bound UDP sockets.
So heavy DNS servers/proxies probably use a pool/range of pre-allocated sockets
to avoid costs of allocating/freeing them ? If they dont care about that cost,
the extra call_rcu() will be unnoticed.
For pathological (yet very common :) ) cases like single DNS query/answer, RCU
would mean :
Pros :
- one few rwlock hit when receiving the answer (if any)
Cons :
- one call_rcu() to delay socket freeing/reuse after RCU period.
So it might be a litle bit more expensive than without RCU
I agree I am more interested in optimizing UDP stack for heavy users like RTP
servers/proxies handling xxx.000 packets/second than DNS users/servers.
Shame on me :)
(2 weeks ago, Corey mentioned a 10x increase on UDP throughput on a 16-way machine,
that sounds promising)
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists