[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20160617170012.GA24589@e104818-lin.cambridge.arm.com>
Date: Fri, 17 Jun 2016 18:00:13 +0100
From: Catalin Marinas <catalin.marinas@....com>
To: Robin Murphy <robin.murphy@....com>
Cc: will.deacon@....com, luke.starrett@...adcom.com,
bcm-kernel-feedback-list@...adcom.com,
linux-kernel@...r.kernel.org, linux-arm-kernel@...ts.infradead.org
Subject: Re: [PATCH] arm64: Implement optimised IP checksum helpers
On Thu, May 12, 2016 at 03:26:48PM +0100, Robin Murphy wrote:
> AArch64 is capable of 128-bit memory accesses without alignment
> restrictions, which makes it both possible and highly practical to slurp
> up a typical 20-byte IP header in just 2 loads. Implement our own
> version of ip_fast_checksum() to take advantage of that, resulting in
> considerably fewer instructions and memory accesses than the generic
> version. We can also get more optimal code generation for csum_fold() by
> defining it a slightly different way round from the generic version, so
> throw that into the mix too.
>
> Suggested-by: Luke Starrett <luke.starrett@...adcom.com>
> Signed-off-by: Robin Murphy <robin.murphy@....com>
Queued for 4.8. Thanks.
--
Catalin
Powered by blists - more mailing lists