[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <45506D51.30604@garzik.org>
Date: Tue, 07 Nov 2006 06:26:09 -0500
From: Jeff Garzik <jeff@...zik.org>
To: Evgeniy Polyakov <johnpol@....mipt.ru>
CC: David Miller <davem@...emloft.net>,
Ulrich Drepper <drepper@...hat.com>,
Andrew Morton <akpm@...l.org>, netdev <netdev@...r.kernel.org>,
linux-kernel@...r.kernel.org, Linus Torvalds <torvalds@...l.org>
Subject: Re: [take21 0/4] kevent: Generic event handling mechanism.
Evgeniy Polyakov wrote:
> Generic event handling mechanism.
>
> Consider for inclusion.
>
> Changes from 'take20' patchset:
> * new ring buffer implementation
> * removed artificial limit on possible number of kevents
> With this release and fixed userspace web server it was possible to
> achive 3960+ req/s with client connection rate of 4000 con/s
> over 100 Mbit lan, data IO over network was about 10582.7 KB/s, which
> is too close to wire speed if we get into account headers and the like.
OK, now that ring buffer is here, I definitely like the direction this
code is taking. I just committed the patches to a local repo for a good
in-depth review.
Could you write up a simple text file, documenting (a) your proposed
syscalls and (b) your ring buffer design?
Overall I have a Linux "design wish", that I hope kevent can fulfill:
To develop completely async applications (generally network servers, in
Linux-land) and increase the chance of zero-copy I/O, network and file
I/O submission and completion should be as async as possible.
As such, syscalls themselves have come a serializing bottleneck that
isn't strictly necessary. A fully-async application should be able to
submit file read, file write, and network write requests
asynchronously... in batches. Network reads, and file I/O completions
should be received asynchronously, potentially in batches.
Even with epoll and AIO syscalls, Linux isn't quite up to the task.
So to me, the design of the userspace interface that solves this problem
is a fundamental issue.
My best guess at a solution would be two classes of mmap'd ring buffers,
request and response. Let the app allocate one or more. Then have two
hooks, (a) kick the kernel to read the request ring, and (b) kick the
app when one or more events have arrived on a ring.
But that's just thinking out loud. I welcome any solution that gives
userspace a fully-async submission/completion interface for both network
and file I/O.
Setting the standard for a good interface here means Linux will kick ass
for decades more to come ;-) This is IMO a Big Deal(tm).
Jeff
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists