Message-ID: <20200406144758.GC301483@lunn.ch>
Date: Mon, 6 Apr 2020 16:47:58 +0200
From: Andrew Lunn <andrew@...n.ch>
To: Alexander Lobakin <79537434260@...dex.com>
Cc: "David S. Miller" <davem@...emloft.net>,
Jakub Kicinski <kuba@...nel.org>,
Woojung Huh <woojung.huh@...rochip.com>,
Florian Fainelli <f.fainelli@...il.com>,
Hauke Mehrtens <hauke@...ke-m.de>,
Linus Walleij <linus.walleij@...aro.org>,
Sean Wang <sean.wang@...iatek.com>,
Russell King <linux@...linux.org.uk>,
Microchip Linux Driver Support <UNGLinuxDriver@...rochip.com>,
Claudiu Manoil <claudiu.manoil@....com>,
netdev@...r.kernel.org, linux-mediatek@...ts.infradead.org,
linux-arm-kernel@...ts.infradead.org,
Philipp Zabel <p.zabel@...gutronix.de>,
Vladimir Oltean <vladimir.oltean@....com>,
Matthias Brugger <matthias.bgg@...il.com>,
Oleksij Rempel <linux@...pel-privat.de>,
Vivien Didelot <vivien.didelot@...il.com>,
linux-kernel@...r.kernel.org, Mao Wenan <maowenan@...wei.com>
Subject: Re: [PATCH net-next] net: dsa: add GRO support via gro_cells
On Mon, Apr 06, 2020 at 01:59:10PM +0300, Alexander Lobakin wrote:
> gro_cells lib is used by different encapsulating netdevices, such as
> geneve, macsec, vxlan etc. to speed up decapsulated traffic processing.
> CPU tag is a sort of "encapsulation", and we can use the same mechs to
> greatly improve overall DSA performance.
> skbs are passed to the GRO layer after removing CPU tags, so we don't
> need any new packet offload types, as I had proposed in the first
> GRO-over-DSA variant [1].
>
> The size of struct gro_cells is sizeof(void *), so hot struct
> dsa_slave_priv becomes only 4/8 bytes bigger, and all critical fields
> remain in one 32-byte cacheline.
> The other positive side effect is that drivers for network devices
> that can be shipped as CPU ports of DSA-driven switches can now use
> napi_gro_frags() to pass skbs to the kernel. Packets built that way are
> completely non-linear and are likely to be dropped without GRO.
>
> This was tested on a to-be-mainlined-soon Ethernet driver that uses
> napi_gro_frags(), and the overall performance was on par with the
> variant from [1], sometimes even better due to minimal overhead.
> net.core.gro_normal_batch tuning may help to push it to the limit
> on particular setups and platforms.
>
> [1] https://lore.kernel.org/netdev/20191230143028.27313-1-alobakin@dlink.ru/

Hi Alexander

net-next is closed at the moment, so you should have posted this with an
RFC prefix.

The implementation looks nice and simple. But it would be nice to have
some performance figures.

Andrew