[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <455D0155.9000305@goop.org>
Date:	Thu, 16 Nov 2006 16:24:53 -0800
From:	Jeremy Fitzhardinge <jeremy@...p.org>
To:	Ingo Molnar <mingo@...e.hu>
CC:	Arjan van de Ven <arjan@...radead.org>, Andi Kleen <ak@...e.de>,
	Eric Dumazet <dada1@...mosbay.com>, akpm@...l.org,
	linux-kernel@...r.kernel.org
Subject: Re: [PATCH] i386-pda UP optimization
Ingo Molnar wrote:
> what point would there be in using it? It's not like the kernel could 
> make use of the thread keyword anytime soon (it would need /all/ 
> architectures to support it) ...
The plan was to implement the x86 arch-specific percpu stuff to use it,
since it allows gcc better optimisation opportunities.
>  and the kernel doesnt mind how the 
> current per_cpu() primitives are implemented, via assembly or via C. In 
> any case, it very much matters to see the precise cost of having the pda 
> selector value in %gs versus %fs.
>   
Hm, well, unfortunately for me, there is a small but distinct advantage
to using %fs rather than %gs (around 0-5ns per iteration).  The notable
exception being the "AMD-K6(tm) 3D+ Processor", where %gs is about 25%
(15ns) faster.
I'll revise the patches to use %fs and resubmit.
    J
View attachment "results-mixed.txt" of type "text/plain" (3721 bytes)
Powered by blists - more mailing lists
 
