[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20061008134831.GV8814@kernel.dk>
Date: Sun, 8 Oct 2006 15:48:31 +0200
From: Jens Axboe <jens.axboe@...cle.com>
To: Nick Piggin <npiggin@...e.de>
Cc: Linux Kernel Mailing List <linux-kernel@...r.kernel.org>
Subject: Re: [rfc][patch 2.6.18-rc7] block: explicit plugging
On Sun, Oct 08 2006, Nick Piggin wrote:
> On Fri, Oct 06, 2006 at 01:57:31PM +0200, Jens Axboe wrote:
> > On Sat, Sep 16 2006, Nick Piggin wrote:
> > >
> > > On a parallel tiobench benchmark, of the 800 000 calls to __make_request
> > > performed, this patch avoids 490 000 (62%) of queue_lock aquisitions by
> > > early merging on the private plugged list.
> >
> > Nick, this looks pretty good in general from the vm side, and the
> > concept is nice for reduced lock bouncing. I've merged this for more
> > testing in a 'plug' branch in the block repo.
>
> Thanks, glad you like it :)
>
> I had a browse through your git branch and it looks pretty sane. The
> queue delay looks like a nice elegant fix for the stuff I butchered
> out. I didn't think it would take a huge amount of fixing, but I'm
> glad you did it, because I know very little about SCSI :P
Yeah, it was definitely needed as otherwise we could soft hang devices
in some conditions.
The md/ directory is also still currently largely broken (md as well as
dm), I'll take a gander at fixing that up as well.
> > > Testing and development is in early stages yet. In particular, the lack of
> > > a timer based unplug kick probably breaks some block device drivers in
> > > funny ways (though works here for me with SCSI and UML so far). Also needs
> > > much wider testing.
> >
> > Your SCSI changes are pretty broken, I've fixed them up. We need some
> > way of asking the block layer to back off and rerun is sometime soon,
> > which is what the plugging does in that case. I've introduced a new
> > mechanism for that.
> >
> > Changes:
> >
> > - Don't invoke ->request_fn() in blk_queue_invalidate_tags
> >
> > - Fixup all filesystems for block_sync_page()
> >
> > - Add blk_delay_queue() to handle the old plugging-on-shortage
> > usage.
> >
> > - Unconditionally run replug_current_nested() in ioschedule()
> >
> > - Merge to current git tree.
>
> All looks good to me... I don't know about namespace though: do you
> think prepending a blk_ or block_ to the plug operations would be nicer?
Yep I think I'll change that too, so it's clear that we are dealing with
the io side of things.
> > I'll try to do some serious testing on this soon. It would also be nice
> > to retain the plugging information for blktrace, even if it isn't per
> > queue anymore. Hmmm.
>
> I guess you still merge against a particular queue, because you'll
> flush the private list when submitting to a different queue. However
> trying to combine the stats when you don't hold the queue lock might
> be interesting? I guess you don't want to reintroduce any cacheline
> bouncing if you can help it.
The merging is good enough as is, I don't think you broke accounting
there. Currently we lack the queue to signal a plug against in the
private plugging, this is what needs some more massaging (if possible,
it may just make more sense to forget about it).
> I will be very interested to know what happens to performance in IO
> critical applications.
Me too, I'll try and get some testing done next week.
--
Jens Axboe
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists