lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20260121192346.GI5945@frogsfrogsfrogs>
Date: Wed, 21 Jan 2026 11:23:46 -0800
From: "Darrick J. Wong" <djwong@...nel.org>
To: Jiaming Zhang <r772577952@...il.com>
Cc: cem@...nel.org, linux-xfs@...r.kernel.org, linux-kernel@...r.kernel.org,
	pchelkin@...ras.ru, syzkaller@...glegroups.com
Subject: Re: [Linux Kernel Bugs] general protection fault in xchk_btree and
 another slab-use-after-free issue

On Wed, Jan 21, 2026 at 06:14:06PM +0800, Jiaming Zhang wrote:
> Darrick J. Wong <djwong@...nel.org> 于2026年1月21日周三 03:17写道:
> >
> > On Tue, Jan 20, 2026 at 06:13:44PM +0800, Jiaming Zhang wrote:
> > > Dear Linux kernel developers and maintainers,
> > >
> > > We are writing to report a general protection fault discovered in the
> > > xfs subsystem with our generated syzkaller specifications. This issue
> > > is reproducible on the latest version of linux (v6.19-rc6, commit
> > > 24d479d26b25bce5faea3ddd9fa8f3a6c3129ea7). The KASAN report from
> > > kernel is listed below (formatted by syz-symbolize):
> > >
> > > ---
> > >
> > > loop0: detected capacity change from 0 to 32768
> > > XFS (loop0): Mounting V5 Filesystem 9f91832a-3b79-45c3-9d6d-ed0bc7357fe4
> > > XFS (loop0): Ending clean mount
> > > XFS (loop0): Injecting error at file fs/xfs/libxfs/xfs_btree.c, line
> > > 309, on filesystem "loop0"
> > > Oops: general protection fault, probably for non-canonical address
> > > 0xdffffc0000000009: 0000 [#1] SMP KASAN NOPTI
> > > KASAN: null-ptr-deref in range [0x0000000000000048-0x000000000000004f]
> > > CPU: 1 UID: 0 PID: 9920 Comm: repro.out Not tainted 6.19.0-rc6 #24 PREEMPT(full)
> > > Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.15.0-1 04/01/2014
> > > RIP: 0010:xchk_btree+0xb9/0x1380 fs/xfs/scrub/btree.c:701
> > > Code: f2 00 66 43 c7 44 35 0d f3 f3 43 c6 44 35 0f f3 e8 1c 44 39 fe
> > > 48 89 5c 24 40 48 83 c3 48 48 89 d8 48 c1 e8 03 48 89 44 24 30 <42> 0f
> > > b6 04 30 84 c0 0f 85 d6 11 00 00 44 0f b6 33 41 ff ce bf 53
> > > RSP: 0018:ffffc9000854f360 EFLAGS: 00010206
> > > RAX: 0000000000000009 RBX: 0000000000000048 RCX: ffff888020bebd80
> > > RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff888044710e00
> > > RBP: ffffc9000854f510 R08: ffffc9000854f540 R09: 0000000000000002
> > > R10: 0000000000000006 R11: 0000000000000000 R12: ffffffff837c5b20
> > > R13: 1ffff920010a9e88 R14: dffffc0000000000 R15: ffffffff8ba6c880
> > > FS:  000000001d1543c0(0000) GS:ffff8880ec5e0000(0000) knlGS:0000000000000000
> > > CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> > > CR2: 0000200000002700 CR3: 00000000229e4000 CR4: 0000000000752ef0
> > > PKRU: 55555554
> > > Call Trace:
> > >  <TASK>
> > >  xchk_allocbt+0x112/0x190 fs/xfs/scrub/alloc.c:173
> > >  xrep_revalidate_allocbt+0xf3/0x160 fs/xfs/scrub/alloc_repair.c:930
> > >  xfs_scrub_metadata+0xc08/0x1920 fs/xfs/scrub/scrub.c:-1
> > >  xfs_ioc_scrubv_metadata+0x74a/0xaf0 fs/xfs/scrub/scrub.c:981
> > >  xfs_file_ioctl+0x751/0x1560 fs/xfs/xfs_ioctl.c:1266
> > >  vfs_ioctl fs/ioctl.c:51 [inline]
> > >  __do_sys_ioctl fs/ioctl.c:597 [inline]
> > >  __se_sys_ioctl+0xfc/0x170 fs/ioctl.c:583
> > >  do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
> > >  do_syscall_64+0xe8/0xf80 arch/x86/entry/syscall_64.c:94
> > >  entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > > RIP: 0033:0x45a879
> > > Code: 28 00 00 00 75 05 48 83 c4 28 c3 e8 31 18 00 00 90 48 89 f8 48
> > > 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d
> > > 01 f0 ff ff 73 01 c3 48 c7 c1 c0 ff ff ff f7 d8 64 89 01 48
> > > RSP: 002b:00007ffda72db7f8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
> > > RAX: ffffffffffffffda RBX: 00000000004004b8 RCX: 000000000045a879
> > > RDX: 00002000000000c0 RSI: 00000000c0285840 RDI: 0000000000000005
> > > RBP: 00007ffda72db860 R08: 0000000000000004 R09: 0000000000000005
> > > R10: 0000000000000004 R11: 0000000000000246 R12: 000000000040b990
> > > R13: 0000000000000000 R14: 00000000004ca018 R15: 00000000004004b8
> > >  </TASK>
> > > Modules linked in:
> > > ---[ end trace 0000000000000000 ]---
> > > RIP: 0010:xchk_btree+0xb9/0x1380 fs/xfs/scrub/btree.c:701
> > > Code: f2 00 66 43 c7 44 35 0d f3 f3 43 c6 44 35 0f f3 e8 1c 44 39 fe
> > > 48 89 5c 24 40 48 83 c3 48 48 89 d8 48 c1 e8 03 48 89 44 24 30 <42> 0f
> > > b6 04 30 84 c0 0f 85 d6 11 00 00 44 0f b6 33 41 ff ce bf 53
> > > RSP: 0018:ffffc9000854f360 EFLAGS: 00010206
> > > RAX: 0000000000000009 RBX: 0000000000000048 RCX: ffff888020bebd80
> > > RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff888044710e00
> > > RBP: ffffc9000854f510 R08: ffffc9000854f540 R09: 0000000000000002
> > > R10: 0000000000000006 R11: 0000000000000000 R12: ffffffff837c5b20
> > > R13: 1ffff920010a9e88 R14: dffffc0000000000 R15: ffffffff8ba6c880
> > > FS:  000000001d1543c0(0000) GS:ffff8880ec5e0000(0000) knlGS:0000000000000000
> > > CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> > > CR2: 00007fbbbc0430c8 CR3: 00000000229e4000 CR4: 0000000000752ef0
> > > PKRU: 55555554
> > > ----------------
> > > Code disassembly (best guess):
> > >    0: f2 00 66 43          repnz add %ah,0x43(%rsi)
> > >    4: c7 44 35 0d f3 f3 43 movl   $0xc643f3f3,0xd(%rbp,%rsi,1)
> > >    b: c6
> > >    c: 44 35 0f f3 e8 1c    rex.R xor $0x1ce8f30f,%eax
> > >   12: 44 39 fe              cmp    %r15d,%esi
> > >   15: 48 89 5c 24 40        mov    %rbx,0x40(%rsp)
> > >   1a: 48 83 c3 48          add    $0x48,%rbx
> > >   1e: 48 89 d8              mov    %rbx,%rax
> > >   21: 48 c1 e8 03          shr    $0x3,%rax
> > >   25: 48 89 44 24 30        mov    %rax,0x30(%rsp)
> > > * 2a: 42 0f b6 04 30        movzbl (%rax,%r14,1),%eax <-- trapping instruction
> > >   2f: 84 c0                test   %al,%al
> > >   31: 0f 85 d6 11 00 00    jne    0x120d
> > >   37: 44 0f b6 33          movzbl (%rbx),%r14d
> > >   3b: 41 ff ce              dec    %r14d
> > >   3e: bf                    .byte 0xbf
> > >   3f: 53                    push   %rbx
> > >
> > > ---
> > >
> > > The root cause of this issue is that in xchk_btree(), where the
> > > argument cur can be NULL but the function assume cur is not NULL,
> > > leading to a NULL pointer dereference when accessing member
> > > (https://github.com/torvalds/linux/blob/v6.19-rc6/fs/xfs/scrub/btree.c#L701).
> > >
> > > We can add a NULL check at the beginning of xchk_btree() to fix this issue:
> > > ```
> > > --- a/fs/xfs/scrub/btree.c
> > > +++ b/fs/xfs/scrub/btree.c
> > > @@ -693,6 +693,9 @@ xchk_btree(
> > >   int level;
> > >   int error = 0;
> > >
> > > + if (!cur)
> > > + return -EINVAL;
> >
> > Uh, no, don't just fling EINVAL up to userspace.  Line 930 is the cntbt
> > revalidation in xrep_revalidate_allocbt.  Why is that pointer
> > 0xdffffc0000000009?  Did we somehow fail to allocate a cntbt cursor in
> > xchk_ag_btcur_init?  Did that xchk_should_check_xref free it?  Did we
> > fail to attach the AGF to sc->sa.agf_bp?
> 
> Thanks for the feedback! I dug deeper into the root cause as you
> suggested, here is what I found:
> 
> (1) The program executes XFS_IOC_ERROR_INJECTION branch in
> xfs_file_ioctl(), causes xfs_btree_check_block() to return
> -EFSCORRUPTED, which consequently marks the AG as sick (via
> xfs_btree_mark_sick).

"marks the AG as sick" ... which structure, specifically?  I'm guessing
XFS_SICK_AG_CNTBT from context, but it'd be useful to state these things
from the data you've collected rather than relying on me to infer what's
going on.

> (2) Then, the program executes XFS_IOC_SCRUBV_METADATA branch in
> xfs_file_ioctl(), the setup function (xchk_setup_ag_allocbt())

Which scrub type is it calling, bnobt or cntbt?

> attempts to initialize the cursor. However, the sick flag makes kernel
> executes xchk_ag_btree_del_cursor_if_sick(), the cursor is freed and
> nullified.

Are you talking about the ->setup call after we've rebuilt both free
space btrees, just prior to step three?  When that happens,
xchk_ag_btree_del_cursor_if_sick will see the XREP_ALREADY_FIXED flag
and mask out sc->sick_mask from mask:

	/*
	 * If we just repaired some AG metadata, sc->sick_mask will reflect all
	 * the per-AG metadata types that were repaired.  Exclude these from
	 * the filesystem health query because we have not yet updated the
	 * health status and we want everything to be scanned.
	 */
	if ((sc->flags & XREP_ALREADY_FIXED) &&
	    type_to_health_flag[sc->sm->sm_type].group == XHG_AG)
		mask &= ~sc->sick_mask;

> (3) Lastly, the repair_eval function (xrep_revalidate_allocbt()) calls
> xchk_allocbt(). Since xchk_allocbt() assumes the cursor is valid, it
> passes the NULL pointer to xchk_btree, leading to the null-ptr-deref.

Sure, but xrep_allocbt sets sc->sick_mask to XFS_SICK_AG_BNOBT |
XFS_SICK_AG so any pre-existing bnobt or cntbt sick state in the
xfs_group will be ignored and neither cursor will be deleted when
setting up the revalidation.

> Based on above analysis, I think sc->sa.cnt_cur being NULL is expected
> when the AG is sick. I think the appropriate fix is to check NULL
> inside xchk_allocbt():
> ```
> --- a/fs/xfs/scrub/alloc.c
> +++ b/fs/xfs/scrub/alloc.c
> @@ -170,6 +170,9 @@ xchk_allocbt(
>         return -EIO;
>     }
> 
> +   if (!cur)
> +       return -ENOENT;

This is not correct either.  We've just rebuilt the bnobt and cntbt for
the AG which means that cursors for both btrees should be loaded and
ready for revalidation.

I think you need to look into xchk_ag_btree_del_cursor_if_sick to figure
out exactly what the xfs_group's sick state is, what @mask is, and what
sc->sick_mask is, and from that figure out if it's really deleting the
cntbt cursor.  This is made more difficult because XFS error injection
is probabilistic so it could trigger on /any/ btree.

--D

> +
>     return xchk_btree(sc, cur, xchk_allocbt_rec, &XFS_RMAP_OINFO_AG, &ca);
>  }
>  ```
> What do you think? :)
> 
> >
> > >   /*
> > >   * Allocate the btree scrub context from the heap, because this
> > >   * structure can get rather large.  Don't let a caller feed us a
> > > ```
> > >
> > > After applying changes above and re-running reproducer, another issues
> > > is triggered:
> > >
> > > ---
> > > TITLE: KASAN: slab-use-after-free Read in xchk_btree_check_block_owner
> > >
> > > XFS (loop6): Mounting V5 Filesystem 9f91832a-3b79-45c3-9d6d-ed0bc7357fe4
> > > XFS (loop6): Ending clean mount
> > > ==================================================================
> > > BUG: KASAN: slab-use-after-free in
> > > xchk_btree_check_block_owner+0x3a2/0x600 fs/xfs/scrub/btree.c:401
> > > Read of size 8 at addr ffff88806af035d8 by task syz.6.59/14096
> > >
> > > CPU: 1 UID: 0 PID: 14096 Comm: syz.6.59 Not tainted 6.19.0-rc6-dirty
> > > #30 PREEMPT(full)
> > > Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.15.0-1 04/01/2014
> > > Call Trace:
> > >  <TASK>
> > >  __dump_stack lib/dump_stack.c:94 [inline]
> > >  dump_stack_lvl+0x10e/0x190 lib/dump_stack.c:120
> > >  print_address_description mm/kasan/report.c:378 [inline]
> > >  print_report+0x17e/0x810 mm/kasan/report.c:482
> > >  kasan_report+0x147/0x180 mm/kasan/report.c:595
> > >  xchk_btree_check_block_owner+0x3a2/0x600 fs/xfs/scrub/btree.c:401
> > >  xchk_btree+0x57e/0x1320 fs/xfs/scrub/btree.c:797
> > >  xchk_allocbt+0x112/0x190 fs/xfs/scrub/alloc.c:173
> > >  xrep_revalidate_allocbt+0x69/0x160 fs/xfs/scrub/alloc_repair.c:925
> > >  xfs_scrub_metadata+0xc08/0x1920 fs/xfs/scrub/scrub.c:-1
> > >  xfs_ioc_scrubv_metadata+0x74a/0xaf0 fs/xfs/scrub/scrub.c:981
> > >  xfs_file_ioctl+0x751/0x1560 fs/xfs/xfs_ioctl.c:1266
> > >  vfs_ioctl fs/ioctl.c:51 [inline]
> > >  __do_sys_ioctl fs/ioctl.c:597 [inline]
> > >  __se_sys_ioctl+0xfc/0x170 fs/ioctl.c:583
> > >  do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
> > >  do_syscall_64+0xe8/0xf80 arch/x86/entry/syscall_64.c:94
> > >  entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > > RIP: 0033:0x7f71bddb459d
> > > Code: 02 b8 ff ff ff ff c3 66 0f 1f 44 00 00 f3 0f 1e fa 48 89 f8 48
> > > 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d
> > > 01 f0 ff ff 73 01 c3 48 c7 c1 a8 ff ff ff f7 d8 64 89 01 48
> > > RSP: 002b:00007f71bed71f98 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
> > > RAX: ffffffffffffffda RBX: 00007f71be045fa0 RCX: 00007f71bddb459d
> > > RDX: 00002000000000c0 RSI: 00000000c0285840 RDI: 0000000000000005
> > > RBP: 00007f71bde52610 R08: 0000000000000000 R09: 0000000000000000
> > > R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
> > > R13: 00007f71be046038 R14: 00007f71be045fa0 R15: 00007f71bed52000
> > >  </TASK>
> > >
> > > Allocated by task 14096:
> > >  kasan_save_stack mm/kasan/common.c:57 [inline]
> > >  kasan_save_track+0x3e/0x80 mm/kasan/common.c:78
> > >  unpoison_slab_object mm/kasan/common.c:340 [inline]
> > >  __kasan_slab_alloc+0x6c/0x80 mm/kasan/common.c:366
> > >  kasan_slab_alloc include/linux/kasan.h:253 [inline]
> > >  slab_post_alloc_hook mm/slub.c:4953 [inline]
> > >  slab_alloc_node mm/slub.c:5263 [inline]
> > >  kmem_cache_alloc_noprof+0x37d/0x710 mm/slub.c:5270
> > >  xfs_btree_alloc_cursor fs/xfs/libxfs/xfs_btree.h:683 [inline]
> > >  xfs_bnobt_init_cursor+0x64/0x210 fs/xfs/libxfs/xfs_alloc_btree.c:485
> > >  xchk_ag_btcur_init+0xe0/0x5d0 fs/xfs/scrub/common.c:612
> > >  xchk_ag_init fs/xfs/scrub/common.c:698 [inline]
> > >  xchk_setup_ag_btree+0x295/0x310 fs/xfs/scrub/common.c:943
> > >  xchk_setup_ag_allocbt+0x70/0x190 fs/xfs/scrub/alloc.c:35
> > >  xfs_scrub_metadata+0xa9e/0x1920 fs/xfs/scrub/scrub.c:709
> > >  xfs_ioc_scrubv_metadata+0x74a/0xaf0 fs/xfs/scrub/scrub.c:981
> > >  xfs_file_ioctl+0x751/0x1560 fs/xfs/xfs_ioctl.c:1266
> > >  vfs_ioctl fs/ioctl.c:51 [inline]
> > >  __do_sys_ioctl fs/ioctl.c:597 [inline]
> > >  __se_sys_ioctl+0xfc/0x170 fs/ioctl.c:583
> > >  do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
> > >  do_syscall_64+0xe8/0xf80 arch/x86/entry/syscall_64.c:94
> > >  entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > >
> > > Freed by task 14096:
> > >  kasan_save_stack mm/kasan/common.c:57 [inline]
> > >  kasan_save_track+0x3e/0x80 mm/kasan/common.c:78
> > >  kasan_save_free_info+0x46/0x50 mm/kasan/generic.c:584
> > >  poison_slab_object mm/kasan/common.c:253 [inline]
> > >  __kasan_slab_free+0x58/0x80 mm/kasan/common.c:285
> > >  kasan_slab_free include/linux/kasan.h:235 [inline]
> > >  slab_free_hook mm/slub.c:2540 [inline]
> > >  slab_free mm/slub.c:6670 [inline]
> > >  kmem_cache_free+0x197/0x620 mm/slub.c:6781
> > >  xchk_should_check_xref+0xf9/0x420 fs/xfs/scrub/common.c:1351
> > >  xchk_xref_is_used_space+0x14b/0x210 fs/xfs/scrub/alloc.c:190
> > >  xchk_btree_check_block_owner+0x2fe/0x600 fs/xfs/scrub/btree.c:395
> > >  xchk_btree+0x57e/0x1320 fs/xfs/scrub/btree.c:797
> > >  xchk_allocbt+0x112/0x190 fs/xfs/scrub/alloc.c:173
> > >  xrep_revalidate_allocbt+0x69/0x160 fs/xfs/scrub/alloc_repair.c:925
> > >  xfs_scrub_metadata+0xc08/0x1920 fs/xfs/scrub/scrub.c:-1
> > >  xfs_ioc_scrubv_metadata+0x74a/0xaf0 fs/xfs/scrub/scrub.c:981
> > >  xfs_file_ioctl+0x751/0x1560 fs/xfs/xfs_ioctl.c:1266
> > >  vfs_ioctl fs/ioctl.c:51 [inline]
> > >  __do_sys_ioctl fs/ioctl.c:597 [inline]
> > >  __se_sys_ioctl+0xfc/0x170 fs/ioctl.c:583
> > >  do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
> > >  do_syscall_64+0xe8/0xf80 arch/x86/entry/syscall_64.c:94
> > >  entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > >
> > > The buggy address belongs to the object at ffff88806af035c8
> > >  which belongs to the cache xfs_bnobt_cur of size 232
> > > The buggy address is located 16 bytes inside of
> > >  freed 232-byte region [ffff88806af035c8, ffff88806af036b0)
> > >
> > > The buggy address belongs to the physical page:
> > > page: refcount:0 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x6af03
> > > ksm flags: 0x4fff00000000000(node=1|zone=1|lastcpupid=0x7ff)
> > > page_type: f5(slab)
> > > raw: 04fff00000000000 ffff88801dd96a00 ffffea000094fdc0 0000000000000003
> > > raw: 0000000000000000 00000000800d000d 00000000f5000000 0000000000000000
> > > page dumped because: kasan: bad access detected
> > > page_owner tracks the page as allocated
> > > page last allocated via order 0, migratetype Unmovable, gfp_mask
> > > 0x1052c40(GFP_NOFS|__GFP_NOWARN|__GFP_NORETRY|__GFP_COMP|__GFP_NOLOCKDEP),
> > > pid 13126, tgid 13119 (syz.4.29), ts 58027826746, free_ts 57929202822
> > >  set_page_owner include/linux/page_owner.h:32 [inline]
> > >  post_alloc_hook+0x234/0x290 mm/page_alloc.c:1884
> > >  prep_new_page mm/page_alloc.c:1892 [inline]
> > >  get_page_from_freelist+0x24e4/0x2580 mm/page_alloc.c:3945
> > >  __alloc_frozen_pages_noprof+0x181/0x370 mm/page_alloc.c:5240
> > >  alloc_pages_mpol+0x232/0x4a0 mm/mempolicy.c:2486
> > >  alloc_slab_page mm/slub.c:3075 [inline]
> > >  allocate_slab+0x86/0x3b0 mm/slub.c:3248
> > >  new_slab mm/slub.c:3302 [inline]
> > >  ___slab_alloc+0xe70/0x1860 mm/slub.c:4656
> > >  __slab_alloc+0x65/0x100 mm/slub.c:4779
> > >  __slab_alloc_node mm/slub.c:4855 [inline]
> > >  slab_alloc_node mm/slub.c:5251 [inline]
> > >  kmem_cache_alloc_noprof+0x40f/0x710 mm/slub.c:5270
> > >  xfs_btree_alloc_cursor fs/xfs/libxfs/xfs_btree.h:683 [inline]
> > >  xfs_cntbt_init_cursor+0x64/0x210 fs/xfs/libxfs/xfs_alloc_btree.c:511
> > >  xfs_free_ag_extent+0x570/0x1890 fs/xfs/libxfs/xfs_alloc.c:2149
> > >  __xfs_free_extent+0x2a7/0x460 fs/xfs/libxfs/xfs_alloc.c:4047
> > >  xfs_extent_free_finish_item+0x299/0x840 fs/xfs/xfs_extfree_item.c:555
> > >  xfs_defer_finish_one+0x5a6/0xcc0 fs/xfs/libxfs/xfs_defer.c:595
> > >  xfs_defer_finish_noroll+0x94a/0x1300 fs/xfs/libxfs/xfs_defer.c:707
> > >  xfs_defer_finish+0x1e/0x270 fs/xfs/libxfs/xfs_defer.c:741
> > >  xrep_defer_finish+0x16e/0x240 fs/xfs/scrub/repair.c:242
> > > page last free pid 785 tgid 785 stack trace:
> > >  reset_page_owner include/linux/page_owner.h:25 [inline]
> > >  free_pages_prepare mm/page_alloc.c:1433 [inline]
> > >  __free_frozen_pages+0xbc4/0xd40 mm/page_alloc.c:2973
> > >  vfree+0x25a/0x400 mm/vmalloc.c:3466
> > >  delayed_vfree_work+0x55/0x80 mm/vmalloc.c:3385
> > >  process_one_work kernel/workqueue.c:3257 [inline]
> > >  process_scheduled_works+0xa45/0x1670 kernel/workqueue.c:3340
> > >  worker_thread+0x8a0/0xda0 kernel/workqueue.c:3421
> > >  kthread+0x711/0x8a0 kernel/kthread.c:463
> > >  ret_from_fork+0x510/0xa50 arch/x86/kernel/process.c:158
> > >  ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:246
> > >
> > > Memory state around the buggy address:
> > >  ffff88806af03480: fc fc fc fc fa fb fb fb fb fb fb fb fb fb fb fb
> > >  ffff88806af03500: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
> > > >ffff88806af03580: fb fc fc fc fc fc fc fc fc fa fb fb fb fb fb fb
> > >                                                     ^
> > >  ffff88806af03600: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
> > >  ffff88806af03680: fb fb fb fb fb fb fc fc fc fc fc fc fc fc fa fb
> > > ==================================================================
> > > ---
> > >
> > > I also analyzed the root cause of this issue. In
> > > xchk_btree_check_block_owner(), bs->cur is an alias for
> > > bs->sc->sa.bnocur (or rmap_cur,
> > > https://github.com/torvalds/linux/blob/v6.19-rc6/fs/xfs/scrub/btree.c#L396-L400).
> > > The issue occurs when error injection triggers a failure path:
> > >
> > > 1. xchk_btree_check_block_owner() calls xchk_xref_is_used_space()
> > > 2. In xchk_xref_is_used_space(), xfs_alloc_has_records() returns a
> > > non-zero error due to error injection
> > > 3. Non-zero error causes xchk_should_check_xref() to free curpp (which
> > > points to bs->sc->sa.bnocur).
> > > 4. Memory pointed to by bs->cur is freed.
> > >
> > > Control returns to xchk_btree_check_block_owner(), which subsequently
> > > accesses bs->cur->bc_ops, triggering the UAF.
> > >
> > > P.S. this issue can also be triggered independently by syzkaller using
> > > our generated specs.
> > >
> > > To fix this issue, we can cache values of
> > > xfs_btree_is_bno(bs->cur->bc_ops) and
> > > xfs_btree_is_rmap(bs->cur->bc_ops) at the beginning of the function:
> > > ```
> > > --- a/fs/xfs/scrub/btree.c
> > > +++ b/fs/xfs/scrub/btree.c
> > > @@ -371,6 +371,8 @@ xchk_btree_check_block_owner(
> > >   xfs_agnumber_t agno;
> > >   xfs_agblock_t agbno;
> > >   bool init_sa;
> > > + bool is_bno;
> > > + bool is_rmap;
> > >   int error = 0;
> > >
> > >   if (!bs->cur)
> > > @@ -379,6 +381,9 @@ xchk_btree_check_block_owner(
> > >   agno = xfs_daddr_to_agno(bs->cur->bc_mp, daddr);
> > >   agbno = xfs_daddr_to_agbno(bs->cur->bc_mp, daddr);
> > >
> > > + is_bno = xfs_btree_is_bno(bs->cur->bc_ops);
> > > + is_rmap = xfs_btree_is_rmap(bs->cur->bc_ops);
> > > +
> > >   /*
> > >   * If the btree being examined is not itself a per-AG btree, initialize
> > >   * sc->sa so that we can check for the presence of an ownership record
> > > @@ -398,11 +403,11 @@ xchk_btree_check_block_owner(
> > >   * have to nullify it (to shut down further block owner checks) if
> > >   * self-xref encounters problems.
> > >   */
> > > - if (!bs->sc->sa.bno_cur && xfs_btree_is_bno(bs->cur->bc_ops))
> > > + if (!bs->sc->sa.bno_cur && is_bno)
> > >   bs->cur = NULL;
> > >
> > >   xchk_xref_is_only_owned_by(bs->sc, agbno, 1, bs->oinfo);
> > > - if (!bs->sc->sa.rmap_cur && xfs_btree_is_rmap(bs->cur->bc_ops))
> > > + if (!bs->sc->sa.rmap_cur && is_rmap)
> >
> > Indentation problems notwithstanding, that looks like a correct
> > resolution to the UAF problem.
> >
> > >   bs->cur = NULL;
> > >
> > >  out_free:
> > > ```
> > >
> > > After applying above changes, reproducer ran for ~35 minutes without
> > > triggering any issues.
> > >
> > > If above solutions are acceptable, we are happy to submit patches :)
> > >
> > > The kernel console output, kernel config, syzkaller reproducer, and C
> > > reproducer are also attached to help with analysis.
> > >
> > > Please let me know if any further information is required.
> > >
> > > Best Regards,
> > > Jiaming Zhang
> >
> > Please just link to your dashboard, don't send a 1MB email to dozens
> > of people.
> >
> > --D
> 

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ