[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <20170510.112059.169845404310247896.davem@davemloft.net>
Date: Wed, 10 May 2017 11:20:59 -0400 (EDT)
From: David Miller <davem@...emloft.net>
To: pasha.tatashin@...cle.com
Cc: mhocko@...nel.org, linux-kernel@...r.kernel.org,
sparclinux@...r.kernel.org, linux-mm@...ck.org,
linuxppc-dev@...ts.ozlabs.org, linux-s390@...r.kernel.org,
borntraeger@...ibm.com, heiko.carstens@...ibm.com
Subject: Re: [v3 0/9] parallelized "struct page" zeroing
From: Pasha Tatashin <pasha.tatashin@...cle.com>
Date: Wed, 10 May 2017 11:01:40 -0400
> Perhaps you are right, and I will measure on x86. But, I suspect hit
> can become unacceptable on some platfoms: there is an overhead of
> calling a function, even if it is leaf-optimized, and there is an
> overhead in memset() to check for alignments of size and address,
> types of setting (zeroing vs. non-zeroing), etc., that adds up
> quickly.
Another source of overhead on the sparc64 side is that we much
do memory barriers around the block initializiing stores. So
batching calls to memset() amortize that as well.
Powered by blists - more mailing lists