[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <5ed7f24b-d21b-75a1-ff74-49a9e21a7b39@suse.cz>
Date: Wed, 11 Mar 2020 18:58:00 +0100
From: Vlastimil Babka <vbabka@...e.cz>
To: Rik van Riel <riel@...riel.com>,
Andrew Morton <akpm@...ux-foundation.org>
Cc: linux-mm@...ck.org, linux-kernel@...r.kernel.org,
kernel-team@...com, Roman Gushchin <guro@...com>,
Qian Cai <cai@....pw>,
Mel Gorman <mgorman@...hsingularity.net>,
Anshuman Khandual <anshuman.khandual@....com>,
Joonsoo Kim <iamjoonsoo.kim@....com>
Subject: Re: [PATCH] mm,page_alloc,cma: conditionally prefer cma pageblocks
for movable allocations
On 3/8/20 2:23 PM, Rik van Riel wrote:
> On Sat, 2020-03-07 at 14:38 -0800, Andrew Morton wrote:
>> On Fri, 6 Mar 2020 15:01:02 -0500 Rik van Riel <riel@...riel.com>
>> wrote:
>
>> > --- a/mm/page_alloc.c
>> > +++ b/mm/page_alloc.c
>> > @@ -2711,6 +2711,18 @@ __rmqueue(struct zone *zone, unsigned int
>> > order, int migratetype,
>> > {
>> > struct page *page;
>> >
>> > + /*
>> > + * Balance movable allocations between regular and CMA areas by
>> > + * allocating from CMA when over half of the zone's free memory
>> > + * is in the CMA area.
>> > + */
>> > + if (migratetype == MIGRATE_MOVABLE &&
>> > + zone_page_state(zone, NR_FREE_CMA_PAGES) >
>> > + zone_page_state(zone, NR_FREE_PAGES) / 2) {
>> > + page = __rmqueue_cma_fallback(zone, order);
>> > + if (page)
>> > + return page;
>> > + }
>> > retry:
>> > page = __rmqueue_smallest(zone, order, migratetype);
>> > if (unlikely(!page)) {
>>
>> __rmqueue() is a hot path (as much as any per-page operation can be a
>> hot path). What is the impact here?
>
> That is a good question. For MIGRATE_MOVABLE allocations,
> most allocations seem to be order 0, which go through the
> per cpu pages array, and rmqueue_pcplist, or be order 9.
>
> For order 9 allocations, other things seem likely to dominate
> the allocation anyway, while for order 0 allocations the
> pcp list should take away the sting?
I agree it should be in the noise. But please do put it behind CONFIG_CMA
#ifdef. My x86_64 desktop distro kernel doesn't have CONFIG_CMA. Even if this is
effectively no-op with __rmqueue_cma_fallback() returning NULL immediately, I
think the compiler cannot eliminate the two zone_page_state()'s which are
atomic_long_read(), even if it's just ultimately READ_ONCE() here, that's a
volatile cast which means elimination not possible AFAIK? Other architectures
might be even more involved.
Otherwise I agree this is a reasonable solution until CMA is rewritten.
> What I do not know is how much impact this change would
> have on other allocations, like order 3 or order 4 network
> buffer allocations from irq context...
>
> Are there cases in particular that we should be testing?
>
Powered by blists - more mailing lists