[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <503DA954.80009@ce.jp.nec.com>
Date: Wed, 29 Aug 2012 14:32:04 +0900
From: "Jun'ichi Nomura" <j-nomura@...jp.nec.com>
To: Dave Chinner <david@...morbit.com>
CC: Naoya Horiguchi <n-horiguchi@...jp.nec.com>,
Andi Kleen <andi.kleen@...el.com>,
Wu Fengguang <fengguang.wu@...el.com>,
Andrew Morton <akpm@...ux-foundation.org>,
Tony Luck <tony.luck@...el.com>,
Rik van Riel <riel@...hat.com>, linux-mm@...ck.org,
linux-kernel@...r.kernel.org
Subject: Re: [PATCH 3/3] HWPOISON: prevent inode cache removal to keep AS_HWPOISON
sticky
On 08/29/12 11:59, Dave Chinner wrote:
> On Mon, Aug 27, 2012 at 06:05:06PM -0400, Naoya Horiguchi wrote:
>> And yes, I understand it's ideal, but many applications choose not to
>> do that for performance reason.
>> So I think it's helpful if we can surely report to such applications.
I suspect "performance vs. integrity" is not a correct
description of the problem.
> If performance is chosen over data integrity, we are under no
> obligation to keep the error around indefinitely. Fundamentally,
> ensuring a write completes successfully is the reponsibility of the
> application, not the kernel. There are so many different filesytem
> and storage errors that can be lost right now because data is not
> fsync()d, adding another one to them really doesn't change anything.
> IOWs, a memory error is no different to a disk failing or the system
> crashing when it comes to data integrity. If you care, you use
> fsync().
I agree that applications should fsync() or O_SYNC
when it wants to make sure the written data in on disk.
AFAIU, what Naoya is going to address is the case where
fsync() is not necessarily needed.
For example, if someone do:
$ patch -p1 < ../a.patch
$ tar cf . > ../a.tar
and disk failure occurred between "patch" and "tar",
"tar" will either see uptodate data or I/O error.
OTOH, if the failure was detected on dirty pagecache, the current memory
failure handler invalidates the dirty page and the "tar" command will
re-read old contents from disk without error.
(Well, the failures above are permanent failures. IOW, the current
memory failure handler turns permanent failure into transient error,
which is often more difficult to handle, I think.)
Naoya's patch will keep the failure information and allows the reader
to get I/O error when it reads from broken pagecache.
--
Jun'ichi Nomura, NEC Corporation
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists