[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <d8347087-18a6-1709-8aa8-3c6f2d16aa94@linux.intel.com>
Date: Thu, 18 Jan 2018 06:45:00 -0800
From: Dave Hansen <dave.hansen@...ux.intel.com>
To: "Kirill A. Shutemov" <kirill@...temov.name>,
Tetsuo Handa <penguin-kernel@...ove.SAKURA.ne.jp>
Cc: torvalds@...ux-foundation.org, kirill.shutemov@...ux.intel.com,
akpm@...ux-foundation.org, hannes@...xchg.org,
iamjoonsoo.kim@....com, mgorman@...hsingularity.net,
tony.luck@...el.com, vbabka@...e.cz, mhocko@...nel.org,
aarcange@...hat.com, hillf.zj@...baba-inc.com, hughd@...gle.com,
oleg@...hat.com, peterz@...radead.org, riel@...hat.com,
srikar@...ux.vnet.ibm.com, vdavydov.dev@...il.com,
mingo@...nel.org, linux-kernel@...r.kernel.org, linux-mm@...ck.org,
x86@...nel.org
Subject: Re: [mm 4.15-rc8] Random oopses under memory pressure.
On 01/18/2018 04:25 AM, Kirill A. Shutemov wrote:
> [ 10.084024] diff: -858690919
> [ 10.084258] hpage_nr_pages: 1
> [ 10.084386] check1: 0
> [ 10.084478] check2: 0
...
> diff --git a/mm/page_vma_mapped.c b/mm/page_vma_mapped.c
> index d22b84310f6d..57b4397f1ea5 100644
> --- a/mm/page_vma_mapped.c
> +++ b/mm/page_vma_mapped.c
> @@ -70,6 +70,14 @@ static bool check_pte(struct page_vma_mapped_walk *pvmw)
> }
> if (pte_page(*pvmw->pte) < pvmw->page)
> return false;
> +
> + if (pte_page(*pvmw->pte) - pvmw->page) {
> + printk("diff: %d\n", pte_page(*pvmw->pte) - pvmw->page);
> + printk("hpage_nr_pages: %d\n", hpage_nr_pages(pvmw->page));
> + printk("check1: %d\n", pte_page(*pvmw->pte) - pvmw->page < 0);
> + printk("check2: %d\n", pte_page(*pvmw->pte) - pvmw->page >= hpage_nr_pages(pvmw->page));
> + BUG();
> + }
This says that pte_page(*pvmw->pte) and pvmw->page are roughly 4GB away
from each other (858690919*4=0xccba559c0). That's not the compiler
being wonky, it just means that the virtual addresses of the memory
sections are that far apart.
This won't happen when you have vmemmap or flatmem because the mem_map[]
is virtually contiguous and pointer arithmetic just works against all
'struct page' pointers. But with classic sparsemem, it doesn't.
You need to make sure that the PFNs are in the same section before you
can do the math that you want to do here.
Powered by blists - more mailing lists