[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20230423154604.p65lfge3ari3jgeu@zlang-mailbox>
Date: Sun, 23 Apr 2023 23:46:04 +0800
From: Zorro Lang <zlang@...hat.com>
To: "Ritesh Harjani (IBM)" <ritesh.list@...il.com>
Cc: fstests@...r.kernel.org, linux-ext4@...r.kernel.org,
Baokun Li <libaokun1@...wei.com>
Subject: Re: [RFC 1/2] ext4/060: Regression test against dioread_nolock mount
option inconsistency
On Sat, Apr 22, 2023 at 09:47:33PM +0530, Ritesh Harjani (IBM) wrote:
> During ext4_writepages, ext4 queries dioread_nolock mount option twice
> and if someone remount the filesystem in between with ^dioread_nolock,
> then this can cause an inconsistency causing WARN_ON() to be triggered.
>
> This fix describes the problem in more detail -
>
> https://lore.kernel.org/linux-ext4/20230328090534.662l7yxj2e425j7w@quack3/T/#md19c34646e8b4a816498532c298a66ecf2ae77d4
>
> This test reproduces below warning for me w/o the fix.
>
> ------------[ cut here ]------------
> WARNING: CPU: 2 PID: 26 at fs/ext4/page-io.c:231 ext4_put_io_end_defer+0xfb/0x140
> Modules linked in:
> CPU: 2 PID: 26 Comm: ksoftirqd/2 Not tainted 6.3.0-rc6-xfstests-00044-ga5c68786f1b1 #23
> Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.15.0-0-g2dd4b9b3f840-prebuilt.qemu.org 04/01/2014
> RIP: 0010:ext4_put_io_end_defer+0xfb/0x140
> Code: 5d 41 5e 41 5f e9 a5 73 d0 00 5b 48 89 ef 5d 41 5c 41 5d 41 5e 41 5f e9 d3 fa ff ff 49 83 be a8 03 00 00 00 0f 84 7b ff fd
> <...>
> Call Trace:
> <TASK>
> blk_update_request+0x116/0x4c0
> ? finish_task_switch.isra.0+0xfb/0x320
> blk_mq_end_request+0x1e/0x40
> blk_complete_reqs+0x40/0x50
> __do_softirq+0xd8/0x3e1
> ? smpboot_thread_fn+0x30/0x280
> run_ksoftirqd+0x3a/0x60
> smpboot_thread_fn+0x1d8/0x280
> ? __pfx_smpboot_thread_fn+0x10/0x10
> kthread+0xf6/0x120
> ? __pfx_kthread+0x10/0x10
> ret_from_fork+0x2c/0x50
> </TASK>
> [
>
> Signed-off-by: Ritesh Harjani (IBM) <ritesh.list@...il.com>
> ---
> tests/ext4/060 | 88 ++++++++++++++++++++++++++++++++++++++++++++++
> tests/ext4/060.out | 2 ++
> 2 files changed, 90 insertions(+)
> create mode 100755 tests/ext4/060
> create mode 100644 tests/ext4/060.out
>
> diff --git a/tests/ext4/060 b/tests/ext4/060
> new file mode 100755
> index 00000000..d9fe1a99
> --- /dev/null
> +++ b/tests/ext4/060
> @@ -0,0 +1,88 @@
> +#! /bin/bash
> +# SPDX-License-Identifier: GPL-2.0
> +# Copyright (c) 2023 IBM Corporation. All Rights Reserved.
> +#
> +# FS QA Test 060
> +#
> +# This is to test a ext4 regression against inconsistent values of
Great, a new regression test case!
> +# dioread_nolock mount option while in ext4_writepages path.
> +# See - https://lore.kernel.org/linux-ext4/20230328090534.662l7yxj2e425j7w@quack3/T/#md19c34646e8b4a816498532c298a66ecf2ae77d4
You can use the commit id and subject to replace the link.
> +#
> +. ./common/preamble
> +_begin_fstest auto quick
also add mount/remount tag?
> +
> +PID1=""
> +PIDS=""
> +trap "_cleanup; exit \$status" 0 1 2 3 15
> +# Override the default cleanup function.
> + _cleanup()
> +{
> + {
> + kill -SIGKILL $PID1 $PIDS
> + wait $PID1 $PIDS
> + } > /dev/null 2>&1
I think the curly braces "{ }" is not necessary. Refer to generic/390 to deal
with the background processes.
[ -n "$PIDS" ] && kill -9 $PIDS
wait $PIDS
> +
> + cd /
> + rm -r -f $tmp.*
> +}
> +
> +# Import common functions.
> + . ./common/filter
> +
> +# real QA test starts here
> +
> +# Modify as appropriate.
Remove this comment.
> +_supported_fs ext4
_fixed_by_kernel_commit ?
> +_require_scratch
> +
> +_scratch_mkfs_ext4 >> $seqres.full 2>&1
> +_scratch_mount
> +_scratch_remount "dioread_nolock" >> $seqres.full 2>&1
> +ret=$?
If the "$ret" is only used once as below...
> +if [ $ret -ne 0 ]; then
... then we can use "$?" directly.
> + _notrun "dioread_nolock mount option not supported"
When ext4 start to support dioread_nolock/dioread_lock ?
If it's old enough, we don't need to check this option. Or we can have a new
helper (e.g. require_scratch_ext4_mount_option()). You can refer to
_require_scratch_ext4_feature(), or maybe we can change it to support mount
option test.
> +fi
> +
> +testfile=$SCRATCH_MNT/testfile
> +
> +function run_buff_io_loop()
> +{
> + # add buffered io case here
> + while [ 1 ]; do
> + xfs_io -fc "truncate 0" -c "pwrite 0 200M" -c "fsync" "$testfile.$1" > /dev/null 2>&1
I only find the $testfile is used at here once, if so you can make it as
a local variable of this function.
> + sleep 2;
> + done
> +}
> +
> +function run_remount_loop()
> +{
> + # add remount loop case here
> + while [ 1 ]; do
> + _scratch_remount "dioread_nolock" >> $seqres.full 2>&1
> + sleep 1
> + _scratch_remount "dioread_lock" >> $seqres.full 2>&1
> + sleep 1
> + done
> +}
> +
> +run_remount_loop &
> +PID1=$!
If you don't need to kill these processes in a specific order, I think
you can:
PIDS=$!
> +for i in $(seq 1 20); do
> + run_buff_io_loop $i &
> + PID=$!
> + PIDS="${PIDS} ${PID}"
PIDS="$PIDS $!"
> +done
> +
> +sleep 10
$((10 * TIME_FACTOR)) ?
> +
> +{
> + kill -SIGKILL $PID1 $PIDS
> + wait $PID1 $PIDS
> +} > /dev/null 2>&1
kill -9 $$PIDS
wait $PIDS
unset PIDS
> +
> +echo "Silence is golden"
> +
> +# success, all done
> +status=0
> +exit
> diff --git a/tests/ext4/060.out b/tests/ext4/060.out
> new file mode 100644
> index 00000000..8ffce4de
> --- /dev/null
> +++ b/tests/ext4/060.out
> @@ -0,0 +1,2 @@
> +QA output created by 060
> +Silence is golden
> --
> 2.39.2
>
Powered by blists - more mailing lists