[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-Id: <20131213235903.8236C539@viggo.jf.intel.com>
Date: Fri, 13 Dec 2013 15:59:03 -0800
From: Dave Hansen <dave@...1.net>
To: linux-kernel@...r.kernel.org
Cc: linux-mm@...ck.org, Pravin B Shelar <pshelar@...ira.com>,
Christoph Lameter <cl@...ux-foundation.org>,
"Kirill A. Shutemov" <kirill.shutemov@...ux.intel.com>,
Andi Kleen <ak@...ux.intel.com>, Dave Hansen <dave@...1.net>
Subject: [RFC][PATCH 0/7] re-shrink 'struct page' when SLUB is on.
SLUB depends on a 16-byte cmpxchg for an optimization. For the
purposes of this series, I'm assuming that it is a very important
optimization that we desperately need to keep around.
In order to get guaranteed 16-byte alignment (required by the
hardware on x86), 'struct page' is padded out from 56 to 64
bytes.
Those 8-bytes matter. We've gone to great lengths to keep
'struct page' small in the past. It's a shame that we bloat it
now just for alignment reasons when we have extra space. Plus,
bloating such a commonly-touched structure *HAS* to have cache
footprint implications.
These patches attempt _internal_ alignment instead of external
alignment for slub.
I also got a bug report from some folks running a large database
benchmark. Their old kernel uses slab and their new one uses
slub. They were swapping and couldn't figure out why. It turned
out to be the 2GB of RAM that the slub padding wastes on their
system.
On my box, that 2GB cost about $200 to populate back when we
bought it. I want my $200 back.
This set takes me from 16909584K of reserved memory at boot
down to 14814472K, so almost *exactly* 2GB of savings! It also
helps performance, presumably because it touches 14% fewer
struct page cachelines. A 30GB dd to a ramfs file:
dd if=/dev/zero of=bigfile bs=$((1<<30)) count=30
is sped up by about 4.4% in my testing.
This is compile tested and lightly runtime tested. I'm curious
what people think of it before we push it futher. I believe this
gets rid of the concerns Christoph had about adding additional
branches in the fast path, although I still disagree that this
has any benefit in practice.
I also wrote up a document describing 'struct page's layout:
http://tinyurl.com/n6kmedz
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists