lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <20150415124658.7ef9bde65965cc3031239522@linux-foundation.org>
Date:	Wed, 15 Apr 2015 12:46:58 -0700
From:	Andrew Morton <akpm@...ux-foundation.org>
To:	Jens Axboe <axboe@...com>
Cc:	<linux-fsdevel@...r.kernel.org>, <hch@...radead.org>,
	<viro@...iv.linux.org.uk>, <linux-kernel@...r.kernel.org>
Subject: Re: [PATCH] direct-io: only inc/dec inode->i_dio_count for file
 systems

On Wed, 15 Apr 2015 13:26:51 -0600 Jens Axboe <axboe@...com> wrote:

> > Is there similar impact to direct-io-to-file?  It would be nice to fix
> > that up also.  Many filesystems do something along the lines of
> >
> > 	atomic_inc(i_dio_count);
> > 	wibble()
> > 	atomic_dev(i_dio_count);
> > 	__blockdev_direct_IO(...);
> >
> > and with your patch I think we could change them to
> >
> > 	atomic_inc(i_dio_count);
> > 	wibble()
> > 	__blockdev_direct_IO(..., flags|DIO_IGNORE_TRUNCATE);
> > 	atomic_dev(i_dio_count);
> >
> > which would halve the atomic op load.
> 
> I haven't checked pure file, but without extending, I suspect that we 
> should see similar benefits there. In any case, it'd make sense doing, 
> having twice the atomic inc/dec is just a bad idea in general, if we can 
> get rid of it.
> 
> A quick grep doesn't show this use case, or I'm just blind. Where do you 
> see that?

btrfs_direct_IO() holds i_dio_count across its call to
__blockdev_direct_IO() for writes.  That makes the dio_bio_count
manipulation in do_blockdev_direct_IO() unneeded?  ext4 is similar.

Reducing from 4 ops to 2 probably won't make as much difference as
reducing from 2 to 0 - most of the cost comes from initially grabbing that
cacheline from a different CPU.

> > But that's piling hack on top of hack.  Can we change the
> 
> I'd more view it as reducing the hack, the real hack is the way that we 
> manually do atomic_inc() on i_dio_count, and then call a magic 
> inode_dio_done() that decrements it again. It's not very pretty, I'm 
> just reducing the scope of the hack :-)

A magic flag which says "you don't need to do this in that case because
I know this inode is special".  direct-io already has too much of this :(

> > do_blockdev_direct_IO() interface to "caller shall hold i_mutex, or
> > increment i_dio_count"?  ie: exclusion against truncate is wholly the
> > caller's responsibility.  That way, this awkward sharing of
> > responsibility between caller and callee gets cleaned up and
> > DIO_IGNORE_TRUNCATE goes away.
> 
> That would clean it up, at the expense of touching more churn. I'd be 
> fine with doing it that way.

OK, could be done later I suppose.

> > inode_dio_begin() would be a good place to assert that i_mutex is held,
> > btw.
> 
> How would you check it? If the i_dio_count is bumped, then you'd not 
> need to hold i_mutex.

	if (atomic_add_return() == 1)
		assert()

I guess.  It was just a thought.  Having wandered around the code, I'm
not 100% confident that everyone is holding i_mutex - it's not all
obviously correct.

otoh, the caller doesn't *have* to choose i_mutex for the external
exclusion, and perhaps some callers have used something else.

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ