[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20181112203737.e4jnsp4rxpie4trr@soleen.tm1wkky2jk1uhgkn0ivaxijq1c.bx.internal.cloudapp.net>
Date: Mon, 12 Nov 2018 20:37:37 +0000
From: Pavel Tatashin <pasha.tatashin@...een.com>
To: Alexander Duyck <alexander.duyck@...il.com>
Cc: Andrew Morton <akpm@...ux-foundation.org>,
alexander.h.duyck@...ux.intel.com,
Daniel Jordan <daniel.m.jordan@...cle.com>,
linux-mm <linux-mm@...ck.org>, sparclinux@...r.kernel.org,
LKML <linux-kernel@...r.kernel.org>, linux-nvdimm@...ts.01.org,
David Miller <davem@...emloft.net>,
pavel.tatashin@...rosoft.com, Michal Hocko <mhocko@...e.com>,
Ingo Molnar <mingo@...nel.org>,
"Kirill A. Shutemov" <kirill.shutemov@...ux.intel.com>,
dan.j.williams@...el.com, dave.jiang@...el.com,
rppt@...ux.vnet.ibm.com, Matthew Wilcox <willy@...radead.org>,
Vlastimil Babka <vbabka@...e.cz>, khalid.aziz@...cle.com,
ldufour@...ux.vnet.ibm.com,
Mel Gorman <mgorman@...hsingularity.net>,
yi.z.zhang@...ux.intel.com
Subject: Re: [mm PATCH v5 0/7] Deferred page init improvements
On 18-11-12 11:10:35, Alexander Duyck wrote:
>
> The point I was trying to make is that it doesn't. You say it is an
> order of magnitude better but it is essentially 3.5x vs 3.8x and to
> achieve the 3.8x you are using a ton of system resources. My approach
> is meant to do more with less, while this approach will throw a
> quarter of the system at page initialization.
3.8x is a bug, that is going to be fixed before ktasks are accepted. The
final results will be close to time/nthreads.
Using more resources to initialize pages is fine, because other CPUs are
idling during this time in boot.
Lets wait for what Daniel finds out after Linux Plumber. And we can
continue this discussion in ktask thread.
>
> An added advantage to my approach is that it speeds up things
> regardless of the number of cores used, whereas the scaling approach
Yes, I agree, I like your approach. It is clean, simplifies, and
improves the performance. I have tested it on both ARM and x86, and
verified the performance improvements. So:
Tested-by: Pavel Tatashin <pasha.tatashin@...een.com>
> requires that there be more cores available to use. So for example on
> some of the new AMD Zen stuff I am not sure the benefit would be all
> that great since if I am not mistaken each tile is only 8 processors
> so at most you are only doubling the processing power applied to the
> initialization. In such a case it is likely that my approach would
> fare much better then this approach since I don't require additional
> cores to achieve the same results.
>
> Anyway there are tradeoffs we have to take into account.
>
> I will go over the changes you suggested after Plumbers. I just need
> to figure out if I am doing incremental changes, or if Andrew wants me
> to just resubmit the whole set. I can probably deal with these changes
> either way since most of them are pretty small.
Send the full series again, Andrew is very good at taking only
incremental changes once a new version is posted of something
that is already in mm-tree.
Thank you,
Pasha
Powered by blists - more mailing lists