[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <57049B6B.1010009@linaro.org>
Date: Wed, 6 Apr 2016 13:15:23 +0800
From: Alex Shi <alex.shi@...aro.org>
To: Andy Lutomirski <luto@...capital.net>
Cc: X86 ML <x86@...nel.org>, Thomas Gleixner <tglx@...utronix.de>,
Ingo Molnar <mingo@...hat.com>,
"open list:X86 ARCHITECTURE (32-BIT AND 64-BIT)"
<linux-kernel@...r.kernel.org>,
Andrew Morton <akpm@...ux-foundation.org>,
"H. Peter Anvin" <hpa@...or.com>, Rik van Riel <riel@...hat.com>
Subject: Re: [REF PATCH] x86/tlb: just do tlb flush on one of siblings of SMT
On 04/06/2016 12:47 PM, Andy Lutomirski wrote:
> On Apr 5, 2016 8:17 PM, "Alex Shi" <alex.shi@...aro.org> wrote:
>>
>> It seems Intel core still share the TLB pool, flush both of threads' TLB
>> just cause a extra useless IPI and a extra flush. The extra flush will
>> flush out TLB again which another thread just introduced.
>> That's double waste.
>
> Do you have a reference in both the SDM and the APM for this?
No. as I said in the end of commit log. There are no any official
guarantee for this usage, but it seems working widely in Intel CPUs.
And the performance benefit is so tempted...
Is there Intel's guys like to dig it more? :)
>
> Do we have a guarantee that this serialized the front end such that
> the non-targetted sibling won't execute an instruction that it decoded
> from a stale translation?
Is your worrying an evidence for my guess? Otherwise the stale
instruction happens either before IPI coming in... :)
>
> This will conflict rather deeply with my PCID series, too.
>
> --Andy
>
Powered by blists - more mailing lists