[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <f2b55d220702221847p6b28f6eehed1bb5b1b0edf9f4@mail.gmail.com>
Date: Thu, 22 Feb 2007 18:47:46 -0800
From: "Michael K. Edwards" <medwards.linux@...il.com>
To: Alan <alan@...rguk.ukuu.org.uk>
Cc: "Ingo Molnar" <mingo@...e.hu>,
"Evgeniy Polyakov" <johnpol@....mipt.ru>,
"Ulrich Drepper" <drepper@...hat.com>,
linux-kernel@...r.kernel.org,
"Linus Torvalds" <torvalds@...ux-foundation.org>,
"Arjan van de Ven" <arjan@...radead.org>,
"Christoph Hellwig" <hch@...radead.org>,
"Andrew Morton" <akpm@....com.au>,
"Zach Brown" <zach.brown@...cle.com>,
"David S. Miller" <davem@...emloft.net>,
"Suparna Bhattacharya" <suparna@...ibm.com>,
"Davide Libenzi" <davidel@...ilserver.org>,
"Jens Axboe" <jens.axboe@...cle.com>,
"Thomas Gleixner" <tglx@...utronix.de>
Subject: Re: [patch 00/13] Syslets, "Threadlets", generic AIO support, v3
On 2/22/07, Alan <alan@...rguk.ukuu.org.uk> wrote:
> > to do anything but chase pointers through cache. Done right, it
> > hardly even branches (although the branch misprediction penalty is a
> > lot less of a worry on current x86_64 than it was in the
> > mega-superscalar-out-of-order-speculative-execution days). It's damn
>
> Actually it costs a lot more on at least one vendors processor because
> you stall very long pipelines.
You're right; I overreached there. I haven't measured branch
misprediction penalties in dog's years (I focus more on system latency
issues these days), so I'm just going on rumor. If your CPU vendor is
still playing the tune-for-SpecINT-at-the-expense-of-real-code game
(*cough* Itanic *cough*), get another CPU vendor -- while you still
can.
> > threadlets promise that they will not touch anything thread-local, and
> > that when the FPU is handed to them in a specific, known state, they
> > leave it in that same state. (Some of the flags can be
>
> We don't use the FPU in the kernel except in very weird cases where it
> makes an enormous performance difference. The threadlets also have the
> same page tables so they have the same %cr3 so its very cheap to switch,
> basically a predicted jump and some register loads
Do you not understand that real user code touches FPU state at
unpredictable (to the kernel) junctures? Maybe not in a database or a
web server, but in the GUIs and web-based monitoring applications that
are 99% of the potential customers for kernel AIO? I have no idea
what a %cr3 is, but if you don't fence off thread-local stuff from the
threadlets you are just begging for end-user Heisenbugs and a place in
the dustheap of history next to Symbolics LISP.
> > Do me a favor. Do some floating point math and a memcpy() in between
> > syscalls in the threadlet. Actually fiddle with errno and the FPU
>
> We don't have an errno in the kernel because its a stupid idea. Errno is
> a user space hack for compatibility with 1970's bad design. So its not
> relevant either.
Dude, it's thread-local, and the glibc wrapper around most synchronous
syscalls touches it. If you don't instantiate a new TLS context (or
whatever the right lingo for that is) for every threadlet, you are
TOAST -- if you let the user call stuff out of <stdlib.h> (let alone
<stdio.h>) from within the threadlet.
Cheers,
- Michael
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists