[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <ea518cf0-890d-4292-b775-dd3880c85bc6@redhat.com>
Date: Mon, 26 Aug 2024 17:36:00 +0200
From: David Hildenbrand <david@...hat.com>
To: Qi Zheng <zhengqi.arch@...edance.com>, hughd@...gle.com,
willy@...radead.org, muchun.song@...ux.dev, vbabka@...nel.org,
akpm@...ux-foundation.org, rppt@...nel.org, vishal.moola@...il.com,
peterx@...hat.com, ryan.roberts@....com, christophe.leroy2@...soprasteria.com
Cc: linux-kernel@...r.kernel.org, linux-mm@...ck.org,
linux-arm-kernel@...ts.infradead.org, linuxppc-dev@...ts.ozlabs.org
Subject: Re: [PATCH v2 06/14] mm: handle_pte_fault() use
pte_offset_map_rw_nolock()
On 22.08.24 09:13, Qi Zheng wrote:
> In handle_pte_fault(), we may modify the vmf->pte after acquiring the
> vmf->ptl, so convert it to using pte_offset_map_rw_nolock(). But since we
> will do the pte_same() check, so there is no need to get pmdval to do
> pmd_same() check, just pass a dummy variable to it.
>
> Signed-off-by: Qi Zheng <zhengqi.arch@...edance.com>
> ---
> mm/memory.c | 12 ++++++++++--
> 1 file changed, 10 insertions(+), 2 deletions(-)
>
> diff --git a/mm/memory.c b/mm/memory.c
> index 93c0c25433d02..7b6071a0e21e2 100644
> --- a/mm/memory.c
> +++ b/mm/memory.c
> @@ -5499,14 +5499,22 @@ static vm_fault_t handle_pte_fault(struct vm_fault *vmf)
> vmf->pte = NULL;
> vmf->flags &= ~FAULT_FLAG_ORIG_PTE_VALID;
> } else {
> + pmd_t dummy_pmdval;
> +
> /*
> * A regular pmd is established and it can't morph into a huge
> * pmd by anon khugepaged, since that takes mmap_lock in write
> * mode; but shmem or file collapse to THP could still morph
> * it into a huge pmd: just retry later if so.
> + *
> + * Use the maywrite version to indicate that vmf->pte will be
> + * modified, but since we will use pte_same() to detect the
> + * change of the pte entry, there is no need to get pmdval, so
> + * just pass a dummy variable to it.
> */
> - vmf->pte = pte_offset_map_nolock(vmf->vma->vm_mm, vmf->pmd,
> - vmf->address, &vmf->ptl);
> + vmf->pte = pte_offset_map_rw_nolock(vmf->vma->vm_mm, vmf->pmd,
> + vmf->address, &dummy_pmdval,
> + &vmf->ptl);
> if (unlikely(!vmf->pte))
> return 0;
> vmf->orig_pte = ptep_get_lockless(vmf->pte);
No I understand why we don't need the PMD val in these cases ... the PTE
would also be pte_none() at the point the page table is freed, so we
would detect the change as well.
I do enjoy documenting why we use a dummy value, though. Likely without
that, new users will just pass NULL and call it a day.
Acked-by: David Hildenbrand <david@...hat.com>
--
Cheers,
David / dhildenb
Powered by blists - more mailing lists