[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <20140201103721.9011.5971.stgit@birch.djwong.org>
Date: Sat, 01 Feb 2014 02:37:21 -0800
From: "Darrick J. Wong" <darrick.wong@...cle.com>
To: tytso@....edu, darrick.wong@...cle.com
Cc: linux-ext4@...r.kernel.org
Subject: [RFC PATCH v2 0/3] e2fsck metadata prefetch
This is the second go at a patchset that tries to reduce e2fsck run
times by pre-loading ext4 metadata concurrent with e2fsck execution.
The first patch is Andreas Dilger's patch to add a readahead method to
the IO manager interface. The second patch extends libext2fs with a
function call to invoke readahead on a list of blocks, and a second
call that invokes readahead on the bitmaps and inode tables of a bunch
of groups. The third patch enhances e2fsck to start threads that call
the readahead functions.
Crude testing has been done via:
# echo 3 > /proc/sys/vm/drop_caches
# READAHEAD=1 /usr/bin/time ./e2fsck/e2fsck -Fnfvtt /dev/XXX
So far in my crude testing on a cold system, I've seen about a ~20%
speedup on a SSD, a ~40% speedup on a 3x RAID1 SATA array, and maybe
a 5% speedup on a single-spindle SATA disk. On a single-queue USB
HDD, performance doesn't change much. It looks as though in general,
single-spindle HDDs will not benefit, which doesn't surprise me. The
SSD numbers are harder to quantify since they're already fast.
This second version of the patch uses posix_fadvise to hint to the
kernel that it really wants to have the blocks loaded in the page
cache ready to go. This is much easier to manage, because all we need
to do is throw a list of blocks at it and let it go... and if we're
careful not to change any FS state, we can easily offload the
readahead work to a thread without weird crashes.
Note that this draft code does little to prevent page cache thrashing.
It doesn't hold back from issuing a large flood of IO. It's not clear
if it's better to try to constrain how far the prefetcher gets ahead
of the checker code, or better to let the kernel sort it out.
I've tested these e2fsprogs changes against the -next branch as of
1/31. These days, I use an 8GB ramdisk and whatever hardware I have
lying around. The make check tests should pass.
Comments and questions are, as always, welcome.
--D
--
To unsubscribe from this list: send the line "unsubscribe linux-ext4" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists