lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20110907173444.GF7725@quack.suse.cz>
Date:	Wed, 7 Sep 2011 19:34:44 +0200
From:	Jan Kara <jack@...e.cz>
To:	Masayoshi MIZUMA <m.mizuma@...fujitsu.com>
Cc:	Jan Kara <jack@...e.cz>, Andrew Morton <akpm@...ux-foundation.org>,
	Andreas Dilger <adilger.kernel@...ger.ca>,
	linux-ext4@...r.kernel.org, linux-fsdevel@...r.kernel.org,
	Valerie Aurora <val@...consulting.com>
Subject: Re: [BUG] ext3: cannot unfreeze a filesystem due to a deadlock

  Hello,

  Thanks for report!

On Wed 07-09-11 12:29:30, Masayoshi MIZUMA wrote:
> When I checked the freeze feature for ext3 filesystem using fsfreeze
> command at 3.1.0-rc4, I think the following deadlock problem happened.
> 
> How to reproduce:
>  # mkfs -t ext3 /dev/sdd1
>  # mount /dev/sdd1 /MNT
>  # ./fsstress -d /MNT/tmp -n 10 -p 1000 > /dev/null 2>&1 &
>  # fsfreeze -f /MNT
>  # fsfreeze -u /MNT
> 
>  If this deadlock is reproduced, "fsfreeze -u /MNT" does not return.
> 
> The detail of deadlock:
> o [flush-8:16:1523]
>   wb_do_writeback
>    wb_writeback
>    ...
>      ext3_journalled_writepage
>       journal_start
>        start_this_handle
>        # waiting until journal->j_barrier_count turns 0...
>        # j_barrier_count was incremented by journal_lock_updates()
>        # via ext3_freeze().
> 
> o [fsstress:2673]
>   sys_sync
>    sync_filesystems
>     iterate_supers
>      down_read(sb->s_umount)
>      sync_one_sb
>       __sync_filesystem
>        writeback_inodes_sb
>         writeback_inodes_sb_nr
>          wait_for_completion
>           wait_for_common
>           # waiting for completion of [flush-8:16:1523]...
> 
> o [fsfreeze:2749]
>   sys_ioctl
>    do_vfs_ioctl
>     thaw_super
>     # waiting for down_write(sb->s_umount)...
>     # [fsfreeze:2673] did down_read(sb->s_umount).
  Yes, this is a classical deadlock that can happen for any filesystem. The
problem is flusher thread holds s_umount semaphore (either directly, or as
in your case, indirectly via blocked sync) and tries to do some IO which
blocks on frozen filesystem. It's particularly easy to hit for ext3 because
it doesn't do vfs_check_frozen() checks but all other filesystems have the
race window as well. Val Henson is working on fixing the problem - she even
has some first version of patches I believe.

								Honza

-- 
Jan Kara <jack@...e.cz>
SUSE Labs, CR
--
To unsubscribe from this list: send the line "unsubscribe linux-ext4" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ