Message-ID: <20210720045000.GA2211@PWN>
Date: Tue, 20 Jul 2021 00:50:00 -0400
From: Peilin Ye <yepeilin.cs@...il.com>
To: Joerg Roedel <jroedel@...e.de>
Cc: x86@...nel.org, Thomas Gleixner <tglx@...utronix.de>,
Ingo Molnar <mingo@...hat.com>, Borislav Petkov <bp@...en8.de>,
Jonathan Corbet <corbet@....net>,
"H. Peter Anvin" <hpa@...or.com>,
Cong Wang <xiyou.wangcong@...il.com>,
Zefang Han <hanzefang@...il.com>,
Wei Lin Chang <r09922117@...e.ntu.edu.tw>,
linux-kernel@...r.kernel.org, linux-doc@...r.kernel.org
Subject: Re: [PATCH] docs: x86: Remove obsolete information about x86_64
vmalloc() faulting

Hi Joerg,

On Mon, Jul 19, 2021 at 02:34:31PM +0200, Joerg Roedel wrote:
> On Fri, Jul 16, 2021 at 02:09:58AM -0400, Peilin Ye wrote:
> > This information is out-of-date, and it took me quite some time of
> > ftrace'ing before I figured it out... I think it would be beneficial to
> > update, or at least remove it.
> >
> > As a proof that I understand what I am talking about, on my x86_64 box:
> >
> > 1. I allocated a vmalloc() area containing linear address `addr`;
> > 2. I manually pagewalked `addr` in different page tables, including
> > `init_mm.pgd`;
> > 3. The corresponding PGD entries for `addr` in the different page
> > tables all immediately pointed at the same PUD table (my box uses
> > 4-level paging), at the same physical address;
> > 4. No "lazy synchronization" via page fault handling happened at all,
> > since it is the same PUD table pre-allocated by
> > preallocate_vmalloc_pages() during boot time.
>
> Yes, this is the story for x86-64, because all PUD/P4D pages for the vmalloc
> area are pre-allocated at boot. So no faulting or synchronization needs
> to happen.
>
> On x86-32 this is a bit different. Pre-allocation of PMD/PTE pages is
> not an option there (even less when 4MB large-pages with 2-level paging
> come into the picture).
>
> So what happens there is that vmalloc related changes to the init_mm.pgd
> are synchronized to all page-tables in the system. But this
> synchronization is subject to race conditions in a way that another CPU
> might vmalloc an area below a PMD which is not fully synchronized yet.
>
> When this happens there is a fault, which is handled as a vmalloc()
> fault on x86-32 just as before. So vmalloc faults still exist on 32-bit,
> they are just less likely than they used to be.
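
The fallback described in the quoted text can be pictured with a toy model. This is only a sketch, not kernel code: plain dicts stand in for page tables, and the names `handle_vmalloc_fault` and `access` are hypothetical, chosen for illustration. It shows a stale per-process table picking up a missing top-level entry from a reference table (playing the role of `init_mm.pgd`) on first access.

```python
# Toy model only: dicts stand in for page tables, strings for lower-level
# tables. None of these names correspond to real kernel functions.

def handle_vmalloc_fault(process_pgd, reference_pgd, index):
    """Copy a missing top-level entry from the reference table.

    Returns True if the fault was resolved, False if the reference table
    has no entry either (a genuine bad access in this model).
    """
    if index not in reference_pgd:
        return False
    process_pgd[index] = reference_pgd[index]
    return True


def access(process_pgd, reference_pgd, index):
    """Look up `index`, falling back to the fault handler on a miss."""
    if index not in process_pgd:
        if not handle_vmalloc_fault(process_pgd, reference_pgd, index):
            raise KeyError(index)
    return process_pgd[index]


reference = {5: "pmd_A"}  # entry already installed in the reference table
stale = {}                # a table the synchronization has not reached yet
print(access(stale, reference, 5))  # prints pmd_A
```

In the pre-allocated x86-64 case, every table would already hold the same entry, so in this model the fault path would never run.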

Thanks a lot for the information! I will improve my commit message and
send a v2 soon.

I think for this patch, removing that out-of-date statement is
sufficient, since mm.rst is x86-64-specific, but maybe we should
document this behavior for x86-32 somewhere as well...
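
For reference, the manual page walk in the quoted steps comes down to splitting the linear address into per-level table indices. Below is a minimal sketch, assuming 4-level paging with 4 KiB pages (9 index bits per level above a 12-bit page offset). The helper name `table_indices` and the example address are chosen only for illustration.

```python
# Sketch: split an x86-64 linear address into 4-level page-table indices.
# Assumes 4 KiB pages and 4-level paging (9 index bits per level).

PAGE_SHIFT = 12
PMD_SHIFT = 21
PUD_SHIFT = 30
PGDIR_SHIFT = 39
PTRS_PER_TABLE = 512  # 2**9 entries per table


def table_indices(addr):
    """Return the (pgd, pud, pmd, pte) indices used to walk `addr`."""
    mask = PTRS_PER_TABLE - 1
    return (
        (addr >> PGDIR_SHIFT) & mask,
        (addr >> PUD_SHIFT) & mask,
        (addr >> PMD_SHIFT) & mask,
        (addr >> PAGE_SHIFT) & mask,
    )


# Example address, picked only for illustration.
example_addr = 0xFFFFC90000000000
print(table_indices(example_addr))  # prints (402, 0, 0, 0)
```

Two addresses with the same PGD index go through the same PGD slot, which is why a PUD table shared by every page table's slot makes later vmalloc() mappings in that range visible everywhere without any further synchronization.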

Thank you,
Peilin Ye