Message-Id: <20141107.170053.1003349690694025765.davem@redhat.com>
Date: Fri, 07 Nov 2014 17:00:53 -0500 (EST)
From: David Miller <davem@...hat.com>
To: eric.dumazet@...il.com
Cc: netdev@...r.kernel.org, amirv@...lanox.com, ogerlitz@...lanox.com, willemb@...gle.com
Subject: Re: [PATCH v2 net-next 2/2] mlx4: use napi_complete_done()
From: Eric Dumazet <eric.dumazet@...il.com>
Date: Thu, 06 Nov 2014 21:10:11 -0800
> From: Eric Dumazet <edumazet@...gle.com>
>
> To enable gro_flush_timeout, a driver has to use napi_complete_done()
> instead of napi_complete().
>
> Tested:
> Ran 200 netperf TCP_STREAM tests from A to B (10GbE mlx4 link, 8 RX
> queues).
>
> Without this feature, we send back about 305,000 ACKs per second.
>
> The GRO aggregation ratio is low (811/305 = 2.65 segments per GRO
> packet).
>
> Setting a timer of 2000 nsec is enough to increase GRO packet sizes
> and reduce the number of ACK packets (811/19.2 = 42).
>
> The receiver makes fewer calls into the upper stacks and has fewer
> wakeups. This also reduces CPU usage on the sender, as it receives
> fewer ACK packets.
>
> Note that reducing the number of wakeups increases CPU efficiency, but
> can decrease QPS, as applications won't have the chance to warm up CPU
> caches by doing a partial read of RPC requests/answers if they fit in
> one skb.
>
> B:~# sar -n DEV 1 10 | grep eth0 | tail -1
> Average: eth0 811269.80 305732.30 1199462.57 19705.72 0.00 0.00 0.50
>
> B:~# echo 2000 >/sys/class/net/eth0/gro_flush_timeout
>
> B:~# sar -n DEV 1 10 | grep eth0 | tail -1
> Average: eth0 811577.30 19230.80 1199916.51 1239.80 0.00 0.00 0.50
>
> Signed-off-by: Eric Dumazet <edumazet@...gle.com>
Applied.
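
For reference, the ratios quoted in the commit message can be reproduced
from the two sar averages. This is only a sketch of the arithmetic: the
function name is hypothetical, and it assumes (as the commit message
does) that packets transmitted by B are essentially all ACKs, so that
rx packets per tx packet approximates segments per GRO packet.

```python
# Averages copied from the two "sar -n DEV" runs quoted above.
# Assumption: the second and third fields are rxpck/s and txpck/s for eth0,
# and txpck/s on receiver B is a proxy for ACKs sent back to A.

def segments_per_ack(rx_pps: float, tx_pps: float) -> float:
    """Received packets per transmitted packet (hypothetical helper)."""
    return rx_pps / tx_pps

# Without gro_flush_timeout
before = segments_per_ack(811269.80, 305732.30)

# With gro_flush_timeout set to 2000 nsec
after = segments_per_ack(811577.30, 19230.80)

# Factor by which the ACK rate dropped between the two runs
ack_reduction = 305732.30 / 19230.80

print(f"segments per ACK before: {before:.2f}")
print(f"segments per ACK after:  {after:.2f}")
print(f"ACK rate reduction:      {ack_reduction:.1f}x")
```

The two ratios come out at roughly 2.65 and 42, matching the figures in
the commit message, and the ACK rate drops by roughly a factor of 16.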