[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20220923102827.s6osm6tmgqviomvh@quack3>
Date: Fri, 23 Sep 2022 12:28:27 +0200
From: Jan Kara <jack@...e.cz>
To: Boyang Xue <bxue@...hat.com>
Cc: Jan Kara <jack@...e.cz>, linux-ext4@...r.kernel.org,
linux-fsdevel@...r.kernel.org, Lukas Czerner <lczerner@...hat.com>
Subject: Re: [bug report] disk quota exceed after multiple write/delete loops
On Fri 23-09-22 15:37:55, Boyang Xue wrote:
> Hi Jan,
>
> On Thu, Sep 22, 2022 at 8:02 PM Jan Kara <jack@...e.cz> wrote:
> >
> > Hello!
> >
> > On Tue 23-08-22 12:16:46, Boyang Xue wrote:
> > > On the latest kernel 6.0.0-0.rc2, I find the user quota limit in an
> > > ext4 mount is unstable, that after several successful "write file then
> > > delete" loops, it will finally fail with "Disk quota exceeded". This
> > > bug can be reproduced on at least kernel-6.0.0-0.rc2 and
> > > kernel-5.14.0-*, but can't be reproduced on kernel-4.18.0 based RHEL8
> > > kernel.
> >
> > <snip reproducer>
> >
> > > Run log on kernel-6.0.0-0.rc2
> > > ```
> > > (...skip successful Run#[1-2]...)
> > > *** Run#3 ***
> > > --- Quota before writing file ---
> > > Disk quotas for user quota_test_user1 (uid 1003):
> > > Filesystem blocks quota limit grace files quota limit grace
> > > /dev/loop0 0 200000 300000 0 2000 3000
> > > --- ---
> > > dd: error writing '/mntpt/test_300m': Disk quota exceeded
> > > 299997+0 records in
> > > 299996+0 records out
> > > 307195904 bytes (307 MB, 293 MiB) copied, 1.44836 s, 212 MB/s
> >
> > So this shows that we have failed allocating the last filesystem block. I
> > suspect this happens because the file gets allocted from several free space
> > extens and so one extra indirect tree block needs to be allocated (or
> > something like that). To verify that you can check the created file with
> > "filefrag -v".
>
> By hooking a "filefrag -v" in each run, I find a pattern that only
> when the dd command writes out of disk quota, "filefrag -v" shows
> "unwritten extents", like this:
> ```
> Filesystem type is: ef53
> File size of /mntpt/test_300m is 307195904 (74999 blocks of 4096 bytes)
> ext: logical_offset: physical_offset: length: expected: flags:
> 0: 0.. 1023: 98976.. 99999: 1024:
> 1: 1024.. 18431: 112640.. 130047: 17408: 100000:
> 2: 18432.. 51199: 131072.. 163839: 32768: 130048:
> 3: 51200.. 55236: 165888.. 169924: 4037: 163840: unwritten
> 4: 55237.. 74998: 0.. 0: 0:
> last,unknown_loc,delalloc,eof
> /mntpt/test_300m: 5 extents found
> ```
OK, this matches what I've said. The unwritten extent is there because the
inode is just undergoing writeback and that may (at least temporarily)
increase number of extents. The inode can hold 4 extents, once fifth extent
is added we have to allocate indirect block which is what breaks your test.
So nothing unexpected here. Thanks for checking!
Honza
--
Jan Kara <jack@...e.com>
SUSE Labs, CR
Powered by blists - more mailing lists