Message-ID: <21a0ba8b-bf05-0799-7c78-2a35f8c8d52a@os.amperecomputing.com>
Date: Thu, 14 Sep 2023 19:40:22 -0700 (PDT)
From: "Lameter, Christopher" <cl@...amperecomputing.com>
To: Feng Tang <feng.tang@...el.com>
cc: Hyeonggon Yoo <42.hyeyoo@...il.com>,
Vlastimil Babka <vbabka@...e.cz>,
Andrew Morton <akpm@...ux-foundation.org>,
Pekka Enberg <penberg@...nel.org>,
David Rientjes <rientjes@...gle.com>,
Joonsoo Kim <iamjoonsoo.kim@....com>,
Roman Gushchin <roman.gushchin@...ux.dev>, linux-mm@...ck.org,
linux-kernel@...r.kernel.org
Subject: Re: [RFC Patch 3/3] mm/slub: setup maxim per-node partial according
to cpu numbers
On Thu, 14 Sep 2023, Feng Tang wrote:
> One reason I wanted to revisit the MIN_PARTIAL is, it was changed from
> 2 to 5 in 2007 by Christoph, in commit 76be895001f2 ("SLUB: Improve
> hackbench speed"), the system has been much huger since then.
> Currently while a per-cpu partial can already have 5 or more slabs,
> the limit for a node with possible 100+ CPU could be reconsidered.
Well, the trick that I keep using on large systems with lots of memory is
to use huge-page-sized page allocations. The applications on those systems
are already using the same page size. Doing so usually removes a lot of
overhead and speeds things up significantly.
Try booting with "slab_min_order=9"
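
For readers wondering why 9: a minimal sketch of the arithmetic, assuming a 4 KiB base page, where order 9 works out to 2 MiB (the usual huge page size on x86-64). The helper names `slab_bytes` and `read_slab_order` are hypothetical, and the sysfs path `/sys/kernel/slab/<cache>/order` is assumed to be present (it needs SLUB sysfs support). Kernels from around the time of this thread may spell the boot parameter `slub_min_order`, so check the kernel-parameters documentation for your version.

```python
import os


def slab_bytes(order: int, page_size: int) -> int:
    """Size in bytes of one slab allocated at the given page order."""
    # An order-N allocation spans 2**N contiguous base pages.
    return page_size << order


def read_slab_order(cache: str, sysfs_root: str = "/sys/kernel/slab"):
    """Return the page order a cache currently uses, or None if unreadable.

    Assumes the SLUB sysfs layout <sysfs_root>/<cache>/order.
    """
    path = os.path.join(sysfs_root, cache, "order")
    try:
        with open(path) as f:
            return int(f.read().strip())
    except (OSError, ValueError):
        return None


if __name__ == "__main__":
    page_size = os.sysconf("SC_PAGE_SIZE")
    size = slab_bytes(9, page_size)
    print(f"order 9 with {page_size}-byte pages: {size // 1024} KiB per slab")

    # "kmalloc-64" is only an example cache name; it may not exist everywhere.
    order = read_slab_order("kmalloc-64")
    if order is None:
        print("kmalloc-64: order not readable on this system")
    else:
        print(f"kmalloc-64 currently uses order {order}")
```

After rebooting with the parameter set, comparing the reported order across a few caches is a quick way to confirm that the minimum took effect.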