[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20190528202659.GA12412@mit.edu>
Date: Tue, 28 May 2019 16:26:59 -0400
From: "Theodore Ts'o" <tytso@....edu>
To: Amir Goldstein <amir73il@...il.com>
Cc: Jan Kara <jack@...e.cz>,
"Darrick J . Wong" <darrick.wong@...cle.com>,
Dave Chinner <david@...morbit.com>, Chris Mason <clm@...com>,
Al Viro <viro@...iv.linux.org.uk>,
linux-fsdevel@...r.kernel.org, linux-xfs@...r.kernel.org,
linux-ext4@...r.kernel.org, linux-btrfs@...r.kernel.org,
linux-api@...r.kernel.org
Subject: Re: [RFC][PATCH] link.2: AT_ATOMIC_DATA and AT_ATOMIC_METADATA
On Mon, May 27, 2019 at 08:26:55PM +0300, Amir Goldstein wrote:
>
> Following our discussions on LSF/MM and beyond [1][2], here is
> an RFC documentation patch.
>
> Ted, I know we discussed limiting the API for linking an O_TMPFILE
> to avert the hardlinks issue, but I decided it would be better to
> document the hardlinks non-guaranty instead. This will allow me to
> replicate the same semantics and documentation to renameat(2).
> Let me know how that works out for you.
>
> I also decided to try out two separate flags for data and metadata.
> I do not find any of those flags very useful without the other, but
> documenting them seprately was easier, because of the fsync/fdatasync
> reference. In the end, we are trying to solve a social engineering
> problem, so this is the least confusing way I could think of to describe
> the new API.
The way you have stated thigs is very confusing, and prone to be
misunderstood. I think it would be helpful to state things in the
positive, instead of the negative.
Let's review what you had wanted:
*If* the filename is visible in the directory after the crash,
*then* all of the metadata/data that had been written to the file
before the linkat(2) would be visible.
HOWEVER, you did not want to necessarily force an fsync(2) in
order to provide that guarantee. That is, the filename would
not necessarily be guaranteed to be visible after a crash when
linkat(2) returns, but if the existence of the filename is
persisted, then the data would be too.
Also, at least initially we talked about this only making
sense for O_TMPFILE file desacriptors. I believe you were
trying to generalize things so it wouldn't necessarily have to
be a file created using O_TMPFILE. Is that correct?
So instead of saying "A filesystem that accepts this flag will
guaranty, that old inode data will not be exposed in the new linked
name." It's much clearer to state this in the affirmative:
A filesystem which accepts this flag will guarantee that if
the new pathname exists after a crash, all of the data written
to the file at the time of the linkat(2) call will be visible.
I would think it's much simpler to say what *will* happen, instead of
what will not be visible. (After all, technically speaking, returning
all zeros or random garbage data fufills the requirement "old data
will not be exposed", but that's probably not what you had in mind. :-)
Also please note that it's spelled "guarantee".
Cheers,
- Ted
Powered by blists - more mailing lists