[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <45A7F396.4080600@tls.msk.ru>
Date: Fri, 12 Jan 2007 23:46:14 +0300
From: Michael Tokarev <mjt@....msk.ru>
To: Chris Mason <chris.mason@...cle.com>
CC: Linus Torvalds <torvalds@...l.org>, dean gaudet <dean@...tic.org>,
Viktor <vvp01@...ox.ru>, Aubrey <aubreylee@...il.com>,
Hua Zhong <hzhong@...il.com>, Hugh Dickins <hugh@...itas.com>,
linux-kernel@...r.kernel.org, hch@...radead.org,
kenneth.w.chen@...el.com, akpm@...l.org
Subject: Re: O_DIRECT question
Chris Mason wrote:
[]
> I recently spent some time trying to integrate O_DIRECT locking with
> page cache locking. The basic theory is that instead of using
> semaphores for solving O_DIRECT vs buffered races, you put something
> into the radix tree (I call it a placeholder) to keep the page cache
> users out, and lock any existing pages that are present.
But seriously - what about just disallowing non-O_DIRECT opens together
with O_DIRECT ones ?
If the thing will allow non-DIRECT READ-ONLY open, I personally see no
problems whatsoever, at all. If non-DIRECT READONLY open will be disallowed
too, -- well, a bit less nice, but still workable (allowing online backup
of database files opened in O_DIRECT mode using other tools such as `cp' --
if non-direct opens aren't allowed, i'll switch to using dd or somesuch).
Yes there may be still a race between ftruncate() and reads (either direct
or not), or when filling gaps by writing into places which were skipped
by using ftruncate. I don't know how serious those races are.
That to say - if the whole thing will be a bit more strict wrt allowing
set of operations, races (or some of them, anyway) will just go away
(and maybe it will work even better due to quite some code and lock
contention removal), and maybe after that, Linus will like the whole
thing a bit better... ;)
After all the explanations, I still don't see anything wrong with the
interface itself. O_DIRECT isn't "different semantics" - we're still
writing and reading some data. Yes, O_DIRECT and non-O_DIRECT usages
somewhat contradicts with each other, but there are other ways to make
the two happy, instead of introducing alot of stupid, complex, and racy
code all over.
/mjt
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists