Message-ID: <20080804153331.GD9397@skywalker>
Date:	Mon, 4 Aug 2008 21:03:31 +0530
From:	"Aneesh Kumar K.V" <aneesh.kumar@...ux.vnet.ibm.com>
To:	"Theodore Ts'o" <tytso@....edu>
Cc:	linux-ext4@...r.kernel.org, Mingming Cao <cmm@...ibm.com>
Subject: Re: Problem with delayed allocation

Hi Ted,

On Mon, Aug 04, 2008 at 08:57:30PM +0530, Aneesh Kumar K.V wrote:
> On Mon, Aug 04, 2008 at 08:22:49PM +0530, Aneesh Kumar K.V wrote:
> > On Sat, Aug 02, 2008 at 04:07:19PM -0400, Theodore Ts'o wrote:
> > > 
> > > Apparently __fsync_super(), which is called right before remounting a
> > > filesystem read-only, isn't working correctly.  To reproduce, create a
> > > script which does this:
> > > 
> > > #!/bin/sh
> > > DEVICE=/dev/closure/test
> > > mke2fs -t ext4dev /dev/closure/test
> > > mount $DEVICE /mnt
> > > cd /mnt
> > > tar xfj /var/tmp/linux-2.6.26.tar.gz  <----- or some really big file
> > > du -s
> > > cd ..
> > > mount -o remount,ro /mnt
> > > sync
> > > dmesg > /tmp/dmesg.out  <----- note all of the ext4_da_writepages error messages
> > > umount /mnt
> > > du -s /mnt
> > > sync
> > > mount $DEVICE /mnt
> > > du -s /mnt    <---  note that the size of the unpacked hierarchy is much smaller
> > > 
> > > This doesn't happen if the ext4 filesystem is mounted with nodelalloc,
> > > so I assume the problem is in ext4_da_writepages().
> > > 
> > 
> > Can you try this patch and see if it makes any difference ?
> > 
> > diff --git a/fs/fs-writeback.c b/fs/fs-writeback.c
> > index 25adfc3..5a8a2d3 100644
> > --- a/fs/fs-writeback.c
> > +++ b/fs/fs-writeback.c
> > @@ -518,6 +518,7 @@ void generic_sync_sb_inodes(struct super_block *sb,
> >  		spin_lock(&inode_lock);
> >  		if (wbc->nr_to_write <= 0) {
> >  			wbc->more_io = 1;
> > +			printk(KERN_CRIT "Breaking from the %s loop\n", __func__);
> >  			break;
> >  		}
> >  		if (!list_empty(&sb->s_more_io))
> > @@ -611,6 +612,8 @@ void sync_inodes_sb(struct super_block *sb, int wait)
> >  			(inodes_stat.nr_inodes - inodes_stat.nr_unused) +
> >  			nr_dirty + nr_unstable;
> >  	wbc.nr_to_write += wbc.nr_to_write / 2;		/* Bit more for luck */
> > +	wbc.nr_to_write =  LONG_MAX;
> > +
> >  	sync_sb_inodes(sb, &wbc);
> >  }
> >  
> 
> 
> I guess this could be the reason. I am not hitting the error during
> remount,ro with this change. But I also have other changes accumulated
> as part of the rewrite.
> 
> 
> diff --git a/fs/ext4/inode.c b/fs/ext4/inode.c
> index 4a50445..ecabe77 100644
> --- a/fs/ext4/inode.c
> +++ b/fs/ext4/inode.c
> @@ -2225,8 +2288,10 @@ static int ext4_da_writepages(struct address_space *mapping,
>  	if (!mapping->nrpages || !mapping_tagged(mapping, PAGECACHE_TAG_DIRTY))
>  		return 0;
>  
> +#if 0
>  	if (wbc->nr_to_write > mapping->nrpages)
>  		wbc->nr_to_write = mapping->nrpages;
> +#endif
>  
>  
>  	if (!wbc->range_cyclic) {

The reason you are able to reproduce it with Linus's tree is this
check, which caps nr_to_write:

/*
 * set the max dirty pages could be write at a time
 * to fit into the reserved transaction credits
 */
if (wbc->nr_to_write > EXT4_MAX_WRITEBACK_PAGES)
	wbc->nr_to_write = EXT4_MAX_WRITEBACK_PAGES;



-aneesh
