[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <fdc656df7c54819f60d9a1682c84b14f@codeaurora.org>
Date: Wed, 09 Jan 2019 11:51:00 +0530
From: Arun KS <arunks@...eaurora.org>
To: Alexander Duyck <alexander.h.duyck@...ux.intel.com>
Cc: arunks.linux@...il.com, akpm@...ux-foundation.org,
mhocko@...nel.org, vbabka@...e.cz, osalvador@...e.de,
linux-kernel@...r.kernel.org, linux-mm@...ck.org,
getarunks@...il.com
Subject: Re: [PATCH v7] mm/page_alloc.c: memory_hotplug: free pages as higher
order
On 2019-01-09 03:47, Alexander Duyck wrote:
> On Fri, 2019-01-04 at 10:31 +0530, Arun KS wrote:
>> When freeing pages are done with higher order, time spent on
>> coalescing
>> pages by buddy allocator can be reduced. With section size of 256MB,
>> hot
>> add latency of a single section shows improvement from 50-60 ms to
>> less
>> than 1 ms, hence improving the hot add latency by 60 times. Modify
>> external providers of online callback to align with the change.
>>
>> Signed-off-by: Arun KS <arunks@...eaurora.org>
>> Acked-by: Michal Hocko <mhocko@...e.com>
>> Reviewed-by: Oscar Salvador <osalvador@...e.de>
>
> Sorry, ended up encountering a couple more things that have me a bit
> confused.
>
> [...]
>
>> diff --git a/drivers/hv/hv_balloon.c b/drivers/hv/hv_balloon.c
>> index 5301fef..211f3fe 100644
>> --- a/drivers/hv/hv_balloon.c
>> +++ b/drivers/hv/hv_balloon.c
>> @@ -771,7 +771,7 @@ static void hv_mem_hot_add(unsigned long start,
>> unsigned long size,
>> }
>> }
>>
>> -static void hv_online_page(struct page *pg)
>> +static int hv_online_page(struct page *pg, unsigned int order)
>> {
>> struct hv_hotadd_state *has;
>> unsigned long flags;
>> @@ -783,10 +783,12 @@ static void hv_online_page(struct page *pg)
>> if ((pfn < has->start_pfn) || (pfn >= has->end_pfn))
>> continue;
>>
>> - hv_page_online_one(has, pg);
>> + hv_bring_pgs_online(has, pfn, (1UL << order));
>> break;
>> }
>> spin_unlock_irqrestore(&dm_device.ha_lock, flags);
>> +
>> + return 0;
>> }
>>
>> static int pfn_covered(unsigned long start_pfn, unsigned long
>> pfn_cnt)
>
> So the question I have is why was a return value added to these
> functions? They were previously void types and now they are int. What
> is the return value expected other than 0?
Earlier with returning a void there was now way for an arch code to
denying onlining of this particular page. By using an int as return
type, we can implement this. In one of the boards I was using, there are
some pages which should not be onlined because they are used for other
purposes(like secure trust zone or hypervisor).
>
>> diff --git a/drivers/xen/balloon.c b/drivers/xen/balloon.c
>> index ceb5048..95f888f 100644
>> --- a/drivers/xen/balloon.c
>> +++ b/drivers/xen/balloon.c
>> @@ -345,8 +345,8 @@ static enum bp_state
>> reserve_additional_memory(void)
>>
>> /*
>> * add_memory_resource() will call online_pages() which in its turn
>> - * will call xen_online_page() callback causing deadlock if we don't
>> - * release balloon_mutex here. Unlocking here is safe because the
>> + * will call xen_bring_pgs_online() callback causing deadlock if we
>> + * don't release balloon_mutex here. Unlocking here is safe because
>> the
>> * callers drop the mutex before trying again.
>> */
>> mutex_unlock(&balloon_mutex);
>> @@ -369,15 +369,22 @@ static enum bp_state
>> reserve_additional_memory(void)
>> return BP_ECANCELED;
>> }
>>
>> -static void xen_online_page(struct page *page)
>> +static int xen_bring_pgs_online(struct page *pg, unsigned int order)
>
> Why did we rename this function? I see it was added as a new function
> in v3, however in v4 we ended up replacing it completely. So why not
> just keep the same name and make it easier for us to identify that the
> is the Xen version of the XXX_online_pages callback?
Point taken. Will send a patch.
>
> [...]
>
>> +static int online_pages_blocks(unsigned long start, unsigned long
>> nr_pages)
>> +{
>> + unsigned long end = start + nr_pages;
>> + int order, ret, onlined_pages = 0;
>> +
>> + while (start < end) {
>> + order = min(MAX_ORDER - 1,
>> + get_order(PFN_PHYS(end) - PFN_PHYS(start)));
>> +
>> + ret = (*online_page_callback)(pfn_to_page(start), order);
>> + if (!ret)
>> + onlined_pages += (1UL << order);
>> + else if (ret > 0)
>> + onlined_pages += ret;
>> +
>
> So if the ret > 0 it is supposed to represent how many pages were
> onlined within a given block? What if the ret was negative? Really I am
> not a fan of adding a return value to the online functions unless we
> specifically document what the expected return values are supposed to
> be. If we don't have any return values other than 0 there isn't much
> point in having one anyway.
I ll document this.
Regards,
Arun
Powered by blists - more mailing lists