[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20100316081919.GA4258@infradead.org>
Date: Tue, 16 Mar 2010 04:19:19 -0400
From: Christoph Hellwig <hch@...radead.org>
To: Anthony Liguori <anthony@...emonkey.ws>
Cc: Christoph Hellwig <hch@...radead.org>,
Chris Webb <chris@...chsys.com>, Avi Kivity <avi@...hat.com>,
balbir@...ux.vnet.ibm.com,
KVM development list <kvm@...r.kernel.org>,
Rik van Riel <riel@...riel.com>,
KAMEZAWA Hiroyuki <kamezawa.hiroyu@...fujitsu.com>,
"linux-mm@...ck.org" <linux-mm@...ck.org>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>
Subject: Re: [PATCH][RF C/T/D] Unmapped page cache control - via boot
parameter
On Mon, Mar 15, 2010 at 08:27:25PM -0500, Anthony Liguori wrote:
>> Actually cache=writeback is as safe as any normal host is with a
>> volatile disk cache, except that in this case the disk cache is
>> actually a lot larger. With a properly implemented filesystem this
>> will never cause corruption.
>
> Metadata corruption, not necessarily corruption of data stored in a file.
Again, this will not cause metadata corruption either if the filesystem
loses barriers, although we may lose up to the cache size of new (data
or metadata operations). The consistency of the filesystem is still
guaranteed.
> Not all software uses fsync as much as they should. And often times,
> it's for good reason (like ext3).
If an application needs data on disk it must call fsync, or there
is no guaranteed at all, even on ext3. And with growing disk caches
these issues show up on normal disks often enough that people have
realized it by now.
> IIUC, an O_DIRECT write using cache=writeback is not actually on the
> spindle when the write() completes. Rather, an explicit fsync() would
> be required. That will cause data corruption in many applications (like
> databases) regardless of whether the fs gets metadata corruption.
It's neither for O_DIRECT without qemu involved. The O_DIRECT write
goes through the disk cache and requires and explicit fsync or O_SYNC
open flag to make sure it goes to disk.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists