[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20080408172652.GA4877@2ka.mipt.ru>
Date: Tue, 8 Apr 2008 21:26:52 +0400
From: Evgeniy Polyakov <johnpol@....mipt.ru>
To: Eric Dumazet <dada1@...mosbay.com>
Cc: David Miller <davem@...emloft.net>, Jens Axboe <axboe@...nel.dk>,
netdev@...r.kernel.org, Rusty Russell <rusty@...tcorp.com.au>
Subject: Re: [take 2] Fix for the fundamental network/block layer race in sendfile().
On Tue, Apr 08, 2008 at 02:58:43PM +0200, Eric Dumazet (dada1@...mosbay.com) wrote:
> >With this patch it is now guaranteed that data was transferred over the
> >wire after splice/sendfile returns.
> >
> 1) So a nonblocking operation (O_NDELAY) can become a blocking one now ?
No, it should not - ->sendpage() will return -EAGAIN and we will break
from the main loop before going into sleep.
> >P.S. Previous patch was not an April 1 joke as long as this one :)
> >
> >
> Hum, I see you forgot me in CC list, dont try to escape :)
My fault, sorry :)
> >+static void skb_splice_destructor(struct skb_shared_info *shi)
> >+{
> >+ if (shi->nr_frags) {
> >+ int i;
> >+ struct pipe_inode_info *pipe;
> >+
> >+ for (i = 0; i < shi->nr_frags; i++) {
> >+ struct page *page = shi->frags[i].page;
> >+
> >+ if (page->lru.prev == (struct list_head *)page) {
> >+ pipe = (struct pipe_inode_info
> >*)page->lru.next;
> >
> Unless I mistaken, you store in page->lru.next some info to find
> your pipe pointer, assuming it is unique for this page.
>
> What happens if two threads are doing a splice()/sendfile() using the
> same underlying (source) file (and same pages in this file)
Page will be referenced twice (for each thread) and each thread will
have own pipe_inode_info structure, so each one will sleep on own wait
queue and will be awakened separately, where its release counter will be
dropped.
> >diff --git a/fs/splice.c b/fs/splice.c
> >index 0670c91..dcda32e 100644
> >--- a/fs/splice.c
> >+++ b/fs/splice.c
> >@@ -29,6 +29,7 @@
> > #include <linux/syscalls.h>
> > #include <linux/uio.h>
> > #include <linux/security.h>
> >+#include <net/sock.h>
> >
> > /*
> > * Attempt to steal a page from a pipe buffer. This should perhaps go into
> >@@ -535,6 +536,8 @@ static int pipe_to_sendpage(struct pipe_inode_info
> >*pipe,
> > if (!ret) {
> > more = (sd->flags & SPLICE_F_MORE) || sd->len <
> > sd->total_len;
> >
> >+ buf->page->lru.next = (void *)pipe;
> >
> is it really allowed here, are you the only user ot this page ?
Yes, it should be safe - afaics no slab pages or slab allocated data are
allowed in ->sendpage() (there is even BUG_ON() somewhere in
tcp_sendpage()/release code), so both lru pointers can be reused.
--
Evgeniy Polyakov
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists