[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-Id: <20080130144930.e23f78de.akpm@linux-foundation.org>
Date: Wed, 30 Jan 2008 14:49:30 -0800
From: Andrew Morton <akpm@...ux-foundation.org>
To: linux-ext4@...r.kernel.org, Andreas Gruenbacher <agruen@...e.de>
Cc: bugme-daemon@...zilla.kernel.org, kmshanah@...b.org.au
Subject: Re: [Bugme-new] [Bug 9855] New: ext3 ACL corruption
On Wed, 30 Jan 2008 14:29:27 -0800 (PST)
bugme-daemon@...zilla.kernel.org wrote:
> http://bugzilla.kernel.org/show_bug.cgi?id=9855
>
> Summary: ext3 ACL corruption
> Product: File System
> Version: 2.5
> KernelVersion: 2.6.23
> Platform: All
> OS/Version: Linux
> Tree: Mainline
> Status: NEW
> Severity: normal
> Priority: P1
> Component: ext3
> AssignedTo: akpm@...l.org
> ReportedBy: kmshanah@...b.org.au
>
>
> Latest working kernel version: Unknown
> Earliest failing kernel version: Definitely 2.6.23 and 2.6.23.8 but earlier is
> possible
> Distribution: Debian Etch
> Hardware Environment: Multiple x86 machines
>
> Software Environment:
> Filesystem is Ext3 on LVM on RAID-1 (on SATA).
> # e2fsck -V
> e2fsck 1.40-WIP (14-Nov-2006)
> Using EXT2FS Library version 1.40-WIP, 14-Nov-2006
>
> Problem Description:
> On several occasions now I have had e2fsck prune away ACLs on my file systems
> during a file system check after rebooting a number of (reasonably) long
> running Samba servers. This morning I decided to manually run fsck before
> rebooting one of these:
>
> # e2fsck -pfv /dev/mapper/vg_main-lv_samba
> (entry->e_value_offs + entry->e_value_size: 116, offs: 120)
> /dev/mapper/vg_main-lv_samba: Extended attribute in inode 163841 has a value
> offset (56) which is invalid
> CLEARED.
> (entry->e_value_offs + entry->e_value_size: 116, offs: 120)
> /dev/mapper/vg_main-lv_samba: Extended attribute in inode 262146 has a value
> offset (56) which is invalid
> CLEARED.
> [ snip lots of (near) identical errors]
>
> 8301 inodes used (0.08%)
> 1621 non-contiguous inodes (19.5%)
> # of inodes with ind/dind/tind blocks: 3837/24/0
> 1108478 blocks used (5.29%)
> 0 bad blocks
> 1 large file
>
> 7590 regular files
> 662 directories
> 0 character device files
> 0 block device files
> 0 fifos
> 0 links
> 40 symbolic links (38 fast symbolic links)
> 0 sockets
> --------
> 8292 files
>
> (Note: after remounting)
> # tune2fs -l /dev/mapper/vg_main-lv_samba
> tune2fs 1.40-WIP (14-Nov-2006)
> Filesystem volume name: <none>
> Last mounted on: <not available>
> Filesystem UUID: 88677414-c1f8-41ba-b737-d9f6170d771b
> Filesystem magic number: 0xEF53
> Filesystem revision #: 1 (dynamic)
> Filesystem features: has_journal ext_attr resize_inode dir_index filetype
> needs_recovery sparse_super large_file
> Filesystem flags: signed directory hash
> Default mount options: (none)
> Filesystem state: clean
> Errors behavior: Continue
> Filesystem OS type: Linux
> Inode count: 10485760
> Block count: 20971520
> Reserved block count: 1048576
> Free blocks: 19863038
> Free inodes: 10477459
> First block: 0
> Block size: 4096
> Fragment size: 4096
> Reserved GDT blocks: 1019
> Blocks per group: 32768
> Fragments per group: 32768
> Inodes per group: 16384
> Inode blocks per group: 1024
> Filesystem created: Wed Feb 21 21:38:33 2007
> Last mount time: Thu Jan 31 03:18:54 2008
> Last write time: Thu Jan 31 03:18:54 2008
> Mount count: 1
> Maximum mount count: 30
> Last checked: Thu Jan 31 03:16:51 2008
> Check interval: 15552000 (6 months)
> Next check after: Tue Jul 29 02:16:51 2008
> Reserved blocks uid: 0 (user root)
> Reserved blocks gid: 0 (group root)
> First inode: 11
> Inode size: 256
> Journal inode: 8
> Default directory hash: tea
> Directory Hash Seed: be8c201b-3563-4fa5-a2a6-e2864e4b73e2
> Journal backup: inode blocks
>
>
> Steps to reproduce:
> Unfortunately, precise steps are not known. Restoring all the filesystem's ACLs
> from a recent dump made using "getfacl -RP" fixes the ACLs without causing the
> corruption to return.
>
> These are production Samba servers making fairly extensive use of file and
> directory ACLs. Thus far, I've only noticed the corruptions when it came time
> to upgrade to a new kernel and reboot (and the boot scripts then run fsck).
> Note that I've never noticed any issues at runtime because of this - only when
> I later realised that ACLs had been removed from random files and/or
> directories.
>
> I think I will implement some scripts to unmount and run fsck nightly from
> cron, so I can at least detect the corruption a little earlier. If there is
> some more helpful debugging output I can provide, please let me know.
>
-
To unsubscribe from this list: send the line "unsubscribe linux-ext4" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists