lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite for Android: free password hash cracker in your pocket
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Date:	Thu, 8 Oct 2009 14:57:46 +0200
From:	Jens Axboe <jens.axboe@...cle.com>
To:	Nick Piggin <npiggin@...e.de>
Cc:	Linus Torvalds <torvalds@...ux-foundation.org>,
	Linux Kernel Mailing List <linux-kernel@...r.kernel.org>,
	linux-fsdevel@...r.kernel.org,
	Ravikiran G Thirumalai <kiran@...lex86.org>,
	Peter Zijlstra <peterz@...radead.org>
Subject: Re: [rfc][patch] store-free path walking

On Thu, Oct 08 2009, Nick Piggin wrote:
> On Wed, Oct 07, 2009 at 07:56:33AM -0700, Linus Torvalds wrote:
> > On Wed, 7 Oct 2009, Nick Piggin wrote:
> > > 
> > > OK, I have a really basic patch that does store-free path walking
> > > (except on the final element).
> > 
> > Yay!
> > 
> > > dbench is pretty nasty still because it seems to do a lot of stupid 
> > > things like reading from /proc/mounts all the time.
> > 
> > You should largely forget about dbench, it can certainly be a useful 
> > benchmark, but at the same time it's certainly not a _meaningful_ one.
> > There are better things to try.
> 
> OK, here's one you might find interesting. It is a cached git diff
> workload in a linux kernel tree. I actually ran it in a loop 100
> times in order to get some reasonable sample sizes, then I ran
> parallel and serial configs (PreloadIndex = true/false). Compared
> plain kernel with all vfs patches to now.
> 
> 2.6.32-rc3 serial
> 5.35user 7.12system 0:12.47elapsed 100%CPU
> 
> 2.6.32-rc3 parallel
> 5.79user 17.69system 0:09.41elapsed 249%CPU
> 
> vfs serial
> 5.30user 5.62system 0:10.92elapsed 100%CPU
> 
> vfs parallel
> 4.86user 0.68system 0:06.82elapsed 81%CPU

Since the box was booted anyway, I tried the git test too. Results are
with 2.6.32-rc3 serial being the baseline 1.00 scores, smaller than 1.00
are faster and vice versa.

2.6.32-rc3 serial
real    1.00
user    1.00
sys     1.00

2.6.32-rc3 parallel
real    0.80
user    0.83
sys     8.38

sys time, auch...

vfs serial
real    0.86
user    0.93
sys     0.84

vfs parallel
real    0.43
user    0.72
sys     0.13

Let me know if you want profiles or anything like that. I'd say that
looks veeeery tasty.

-- 
Jens Axboe

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ