[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20250403082801.GA14239@mazurka.cambridge.arm.com>
Date: Thu, 3 Apr 2025 08:28:01 +0000
From: Mikołaj Lenczewski <miko.lenczewski@....com>
To: Ryan Roberts <ryan.roberts@....com>
Cc: suzuki.poulose@....com, yang@...amperecomputing.com, corbet@....net,
catalin.marinas@....com, will@...nel.org, jean-philippe@...aro.org,
robin.murphy@....com, joro@...tes.org, akpm@...ux-foundation.org,
ardb@...nel.org, mark.rutland@....com, joey.gouly@....com,
maz@...nel.org, james.morse@....com, broonie@...nel.org,
oliver.upton@...ux.dev, baohua@...nel.org, david@...hat.com,
ioworker0@...il.com, jgg@...pe.ca, nicolinc@...dia.com,
mshavit@...gle.com, jsnitsel@...hat.com, smostafa@...gle.com,
linux-doc@...r.kernel.org, linux-kernel@...r.kernel.org,
linux-arm-kernel@...ts.infradead.org, iommu@...ts.linux.dev
Subject: Re: [PATCH v5 3/3] arm64/mm: Elide tlbi in contpte_convert() under
BBML2
On Thu, Apr 03, 2025 at 09:14:43AM +0100, Ryan Roberts wrote:
> On 25/03/2025 05:36, Mikołaj Lenczewski wrote:
> > diff --git a/arch/arm64/mm/contpte.c b/arch/arm64/mm/contpte.c
> > index 55107d27d3f8..77ed03b30b72 100644
> > --- a/arch/arm64/mm/contpte.c
> > +++ b/arch/arm64/mm/contpte.c
> > @@ -68,7 +68,8 @@ static void contpte_convert(struct mm_struct *mm, unsigned long addr,
> > pte = pte_mkyoung(pte);
> > }
> >
> > - __flush_tlb_range(&vma, start_addr, addr, PAGE_SIZE, true, 3);
> > + if (!system_supports_bbml2_noabort())
> > + __flush_tlb_range(&vma, start_addr, addr, PAGE_SIZE, true, 3);
> >
> > __set_ptes(mm, start_addr, start_ptep, pte, CONT_PTES);
>
> Despite all the conversation we had about completely eliding the TLBI for the
> BBML2 case, I've continued to be a bit uneasy about it. I had another chat with
> Alex C and we concluded that it is safe, but there could be conceivable
> implementations where it is not performant. Alex suggested doing a TLBI without
> the DSB and I think that's a good idea. So after the __set_ptes(), I suggest adding:
>
> if (system_supports_bbml2_noabort())
> __flush_tlb_range_nosync(mm, start_addr, addr, PAGE_SIZE,
> true, 3);
>
> That will issue the TLBI but won't wait for it to complete. So it should be very
> fast. We are guranteed correctness immediately. We are guranteed performance
> after the next DSB (worst-case; next context switch).
>
> Thanks,
> Ryan
Hi Ryan,
Sure, perfectly happy to add that on. Will respin and add a note about
this behaviour to the source code and to the patch / cover letter.
--
Kind regards,
Mikołaj Lenczewski
Powered by blists - more mailing lists