[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20250814141136.GG802098@nvidia.com>
Date: Thu, 14 Aug 2025 11:11:36 -0300
From: Jason Gunthorpe <jgg@...dia.com>
To: Mika Penttilä <mpenttil@...hat.com>
Cc: linux-mm@...ck.org, linux-kernel@...r.kernel.org,
David Hildenbrand <david@...hat.com>,
Leon Romanovsky <leonro@...dia.com>,
Alistair Popple <apopple@...dia.com>,
Balbir Singh <balbirs@...dia.com>
Subject: Re: [RFC PATCH 1/4] mm: use current as mmu notifier's owner
On Thu, Aug 14, 2025 at 04:20:42PM +0300, Mika Penttilä wrote:
>
> On 8/14/25 16:04, Jason Gunthorpe wrote:
>
> > On Thu, Aug 14, 2025 at 03:53:00PM +0300, Mika Penttilä wrote:
> >> On 8/14/25 15:40, Jason Gunthorpe wrote:
> >>> On Thu, Aug 14, 2025 at 10:19:26AM +0300, Mika Penttilä wrote:
> >>>> When doing migration in combination with device fault handling,
> >>>> detect the case in the interval notifier.
> >>>>
> >>>> Without that, we would livelock with our own invalidations
> >>>> while migrating and splitting pages during fault handling.
> >>>>
> >>>> Note, pgmap_owner, used in some other code paths as owner for filtering,
> >>>> is not readily available for split path, so use current for this use case.
> >>>> Also, current and pgmap_owner, both being pointers to memory, can not be
> >>>> mis-interpreted to each other.
> >>>>
> >>>> Cc: David Hildenbrand <david@...hat.com>
> >>>> Cc: Jason Gunthorpe <jgg@...dia.com>
> >>>> Cc: Leon Romanovsky <leonro@...dia.com>
> >>>> Cc: Alistair Popple <apopple@...dia.com>
> >>>> Cc: Balbir Singh <balbirs@...dia.com>
> >>>>
> >>>> Signed-off-by: Mika Penttilä <mpenttil@...hat.com>
> >>>> ---
> >>>> lib/test_hmm.c | 5 +++++
> >>>> mm/huge_memory.c | 6 +++---
> >>>> mm/rmap.c | 4 ++--
> >>>> 3 files changed, 10 insertions(+), 5 deletions(-)
> >>>>
> >>>> diff --git a/lib/test_hmm.c b/lib/test_hmm.c
> >>>> index 761725bc713c..cd5c139213be 100644
> >>>> --- a/lib/test_hmm.c
> >>>> +++ b/lib/test_hmm.c
> >>>> @@ -269,6 +269,11 @@ static bool dmirror_interval_invalidate(struct mmu_interval_notifier *mni,
> >>>> range->owner == dmirror->mdevice)
> >>>> return true;
> >>>>
> >>>> + if (range->event == MMU_NOTIFY_CLEAR &&
> >>>> + range->owner == current) {
> >>>> + return true;
> >>>> + }
> >>> I don't understand this, there is nothing in hmm that says only
> >>> current can call hmm_range_fault, and indeed most applications won't
> >>> even gurantee that.
> >> No it's the opposite, if we are ourselves the invalidator, don't care.
> > I think you've missed the point, you cannot use range->owner in any
> > way to tell "we are ourselves the invalidator". It is simply not the
> > meaning of range->owner.
>
> Usually it is the device but used similarly, look for instance nouveau:
Yes, dev is fine, but current or struct task is not reasonable.
> Yes the commit message could be better.
> But this is essentially the same as nouveau is doing with
> MMU_NOTIFY_EXCLUSIVE handling, just not using the dev as the
> qualifier,
I wouldn't necesarily assume anything nouveau is correct, but this is
certainly not the same thing. nouveau is trying to eliminate an
unncessary invalidation for it's HW when it knows the memory is
already only private to this local device.
This is not a statement about callchain or recursion.
Jason
Powered by blists - more mailing lists