[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-Id: <cover.1434978970.git.christophe.leroy@c-s.fr>
Date: Mon, 22 Jun 2015 19:01:05 +0200 (CEST)
From: Christophe Leroy <christophe.leroy@....fr>
To: Benjamin Herrenschmidt <benh@...nel.crashing.org>,
Paul Mackerras <paulus@...ba.org>,
Michael Ellerman <mpe@...erman.id.au>, scottwood@...escale.com
Cc: linux-kernel@...r.kernel.org, linuxppc-dev@...ts.ozlabs.org,
Joakim Tjernlund <joakim.tjernlund@...nsmode.se>
Subject: [PATCH 0/2] powerpc32: optimisation of csum_partial_copy_generic()
This patch optimises csum_partial_copy_generic() by making use of cache
instructions (dcbt/dcbz) just like copy_tofrom_user() does
On a TCP benchmark using socklib on the loopback interface on which checksum
offload and scatter/gather have been deactivated, we get about 20% performance
increase.
Christophe Leroy (2):
powerpc32: checksum_wrappers_64 becomes checksum_wrappers
powerpc32: rewrite of csum_partial_copy_generic based of copy_tofrom_user
arch/powerpc/include/asm/checksum.h | 9 -
arch/powerpc/lib/Makefile | 3 +-
arch/powerpc/lib/checksum_32.S | 320 +++++++++++++++++++++-----------
arch/powerpc/lib/checksum_wrappers.c | 102 ++++++++++
arch/powerpc/lib/checksum_wrappers_64.c | 102 ----------
5 files changed, 312 insertions(+), 224 deletions(-)
create mode 100644 arch/powerpc/lib/checksum_wrappers.c
delete mode 100644 arch/powerpc/lib/checksum_wrappers_64.c
--
2.1.0
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists