[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <20100506.220443.135536330.davem@davemloft.net>
Date: Thu, 06 May 2010 22:04:43 -0700 (PDT)
From: David Miller <davem@...emloft.net>
To: eric.dumazet@...il.com
Cc: bhaskie@...il.com, shemminger@...tta.com,
bhutchings@...arflare.com, netdev@...r.kernel.org
Subject: Re: TCP-MD5 checksum failure on x86_64 SMP
From: Eric Dumazet <eric.dumazet@...il.com>
Date: Thu, 06 May 2010 14:06:26 +0200
> Le jeudi 06 mai 2010 à 17:25 +0530, Bhaskar Dutta a écrit :
>
>> I put in the above change and ran some load tests with around 50
>> active TCP connections doing MD5.
>> I could see only 1 bad packet in 30 min (earlier the problem used to
>> occur instantaneously and repeatedly).
>>
>> I think there is another possibility of being preempted when calling
>> tcp_alloc_md5sig_pool()
>> this function releases the spinlock when calling __tcp_alloc_md5sig_pool().
>>
>> I will run some more tests after changing the tcp_alloc_md5sig_pool
>> and see if the problem is completely resolved.
>>
>
> This code should be completely rewritten for linux-2.6.35, its very ugly
> and over complex, yet it is not scalable.
>
> It could use true percpu data, with no central lock or refcount.
Yes I've always disliked the way the TCP MD5 pool stuff is coded, it's
frankly crap and this is like the 5th major bug that had to get fixed
in it. :-)
But let's fix this as simply as possible in net-2.6 and -stable, so Eric
let me know when you have a tested patch for me to apply.
Thanks a lot!
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists