[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <4EEB9D64.5080901@hp.com>
Date: Fri, 16 Dec 2011 11:35:00 -0800
From: Rick Jones <rick.jones2@...com>
To: Eric Dumazet <eric.dumazet@...il.com>
CC: Stephen Hemminger <shemminger@...tta.com>,
Vijay Subramanian <subramanian.vijay@...il.com>,
tcpdump-workers@...ts.tcpdump.org, netdev@...r.kernel.org,
Matthew Vick <matthew.vick@...el.com>,
Jeff Kirsher <jeffrey.t.kirsher@...el.com>
Subject: Re: twice past the taps, thence out to net?
>> but is that getting a little close?
>>
>> rick jones
>
> Sure !
>
> I only pointed out a possible problem, and not gave a full patch, since
> we also need to change the opposite threshold (when we XON the queue at
> TX completion)
>
> You can see its not even consistent with the minimum for a single TSO
> frame ! Most probably your high requeue numbers come from this too low
> value given the real requirements of the hardware (4 + nr_frags
> descriptors per skb)
>
> /* How many Tx Descriptors do we need to call netif_wake_queue ? */
> #define IGB_TX_QUEUE_WAKE 16
>
>
> Maybe we should CC Intel guys
>
> Could you try following patch ?
I would *love* to. All my accessible igb-driven hardware is in an
environment locked to the kernels already there :( Not that it makes it
more possible for me to do it, but I suspect it does not require 30
receivers to reproduce the dups with netperf TCP_STREAM. Particularly
if the tx queue len is at 256 it may only take 6 or 8. In fact let me
try that now...
Yep, with just 8 destinations/concurrent TCP_STREAM tests from the one
system one can still see the duplicates in the packet trace taken on the
sender.
Perhaps we can trouble the Intel guys to try to reproduce what I've seen?
rick
>
> Thanks !
>
> diff --git a/drivers/net/ethernet/intel/igb/igb.h b/drivers/net/ethernet/intel/igb/igb.h
> index c69feeb..93ce118 100644
> --- a/drivers/net/ethernet/intel/igb/igb.h
> +++ b/drivers/net/ethernet/intel/igb/igb.h
> @@ -51,8 +51,8 @@ struct igb_adapter;
> /* TX/RX descriptor defines */
> #define IGB_DEFAULT_TXD 256
> #define IGB_DEFAULT_TX_WORK 128
> -#define IGB_MIN_TXD 80
> -#define IGB_MAX_TXD 4096
> +#define IGB_MIN_TXD max_t(unsigned, 80U, IGB_TX_QUEUE_WAKE * 2)
> +#define IGB_MAX_TXD 4096
>
> #define IGB_DEFAULT_RXD 256
> #define IGB_MIN_RXD 80
> @@ -121,8 +121,11 @@ struct vf_data_storage {
> #define IGB_RXBUFFER_16384 16384
> #define IGB_RX_HDR_LEN IGB_RXBUFFER_512
>
> -/* How many Tx Descriptors do we need to call netif_wake_queue ? */
> -#define IGB_TX_QUEUE_WAKE 16
> +/* How many Tx Descriptors should be available
> + * before calling netif_wake_subqueue() ?
> + */
> +#define IGB_TX_QUEUE_WAKE (MAX_SKB_FRAGS * 4)
> +
> /* How many Rx Buffers do we bundle into one write to the hardware ? */
> #define IGB_RX_BUFFER_WRITE 16 /* Must be power of 2 */
>
>
>
> --
> To unsubscribe from this list: send the line "unsubscribe netdev" in
> the body of a message to majordomo@...r.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists