[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <Pine.LNX.4.64.0801091503270.13306@alien.or.mcafeemobile.com>
Date: Wed, 9 Jan 2008 15:15:49 -0800 (PST)
From: Davide Libenzi <davidel@...ilserver.org>
To: Rusty Russell <rusty@...tcorp.com.au>
cc: Zach Brown <zach.brown@...cle.com>,
Linux Kernel Mailing List <linux-kernel@...r.kernel.org>,
Linus Torvalds <torvalds@...ux-foundation.org>,
Ingo Molnar <mingo@...e.hu>,
Ulrich Drepper <drepper@...hat.com>,
Arjan van de Ven <arjan@...radead.org>,
Andrew Morton <akpm@....com.au>,
Alan Cox <alan@...rguk.ukuu.org.uk>,
Evgeniy Polyakov <johnpol@....mipt.ru>,
"David S. Miller" <davem@...emloft.net>,
Suparna Bhattacharya <suparna@...ibm.com>,
Jens Axboe <jens.axboe@...cle.com>,
Thomas Gleixner <tglx@...utronix.de>,
Dan Williams <dan.j.williams@...il.com>,
Jeff Moyer <jmoyer@...hat.com>,
Simon Holm Thogersen <odie@...aau.dk>,
suresh.b.siddha@...el.com
Subject: Re: [PATCH 5/6] syslets: add generic syslets infrastructure
On Thu, 10 Jan 2008, Rusty Russell wrote:
> On Thursday 10 January 2008 05:16:57 Zach Brown wrote:
> > > The latter. A ring is optimal for processing a huge number of requests,
> > > but if you're really going to be firing off syslet threads all over the
> > > place you're not going to be optimal anyway. And being able to point the
> > > return value to the stack or into some datastructure is way nicer to code
> > > (zero setup == easy to understand and easy to convert).
> >
> > One of Linus' rhetorical requirements for the syslets work is that we be
> > able to wait for the result without spending overhead building up state
> > in some completion context. The userland rings in the current syslet
> > patches achieve that and don't seem outrageously complicated.
>
> I'd have to read his original statement, but eventfd doesn't build up state,
> so I think it qualifies.
I think you and Zach are talking about different issues ;)
Eventfd born for two different reasons. First, to be able to have
userspace to signal to a poll/select/epoll based listener an event. This
can elso be done with pipes, but eventfd has a few advantages over pipes
(3-4 times faster and *a lot* less memory footprint). Second, as a generic
way for kernel subsystems to signal completions to a poll/select/epoll
userspace listener. And this is what is used in the new KAIO eventfd
feature (patch was like 5 lines IIRC).
This allow for KAIO events to be signaled to poll/select/epoll in a pretty
easy way, using a simple extension of the AIO API.
What we talked originally with Ingo, when the first syslet code came up,
was the ability to do the reverse thing. That is, host an epoll_wait()
inside a syslet, and gather the completion using whatever the syslet code
was/is going to use for it.
Given that 1) the eventfd completion patch was trivial and immediately
available 2) the future of the whole syslet concept was/is still unclear,
I believe it makes/made sense. If the syslets will become mainline, it'll
mean that userspace will then be able to select the event-completion
"hosting" method that better suits their needs. That are, either AIO
completion hosted inside an epoll_wait() via an eventfd, or an
epoll_wait() hosted inside a syslet.
- Davide
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists