[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <CAJuCfpHMJ-5hP+eHpD1jyJtdmFb9qaSfaZ8GdsHY8BrorvUdkg@mail.gmail.com>
Date: Tue, 27 Jan 2026 15:52:12 -0800
From: Suren Baghdasaryan <surenb@...gle.com>
To: Andrii Nakryiko <andrii.nakryiko@...il.com>
Cc: Hillf Danton <hdanton@...a.com>,
syzbot <syzbot+4e70c8e0a2017b432f7a@...kaller.appspotmail.com>, axboe@...nel.dk,
linux-block@...r.kernel.org, Lorenzo Stoakes <lorenzo.stoakes@...cle.com>,
linux-mm@...ck.org, linux-kernel@...r.kernel.org,
syzkaller-bugs@...glegroups.com
Subject: Re: [syzbot] [block?] possible deadlock in blkdev_read_iter
On Tue, Jan 27, 2026 at 10:51 AM Andrii Nakryiko
<andrii.nakryiko@...il.com> wrote:
>
> On Mon, Jan 26, 2026 at 6:22 PM Suren Baghdasaryan <surenb@...gle.com> wrote:
> >
> > On Mon, Jan 26, 2026 at 2:33 PM Suren Baghdasaryan <surenb@...gle.com> wrote:
> > >
> > > On Mon, Jan 26, 2026 at 9:20 AM Suren Baghdasaryan <surenb@...gle.com> wrote:
> > > >
> > > > On Sat, Jan 24, 2026 at 3:32 AM Hillf Danton <hdanton@...a.com> wrote:
> > > > >
> > > > > Add Lorenzo and Suren
> > > >
> > > > Thanks!
> > > >
> > > > >
> > > > > > Date: Fri, 23 Jan 2026 15:14:36 -0800
> > > > > > Hello,
> > > > > >
> > > > > > syzbot found the following issue on:
> > > > > >
> > > > > > HEAD commit: 24d479d26b25 Linux 6.19-rc6
> > > > > > git tree: upstream
> > > > > > console output: https://syzkaller.appspot.com/x/log.txt?x=100033fa580000
> > > > > > kernel config: https://syzkaller.appspot.com/x/.config?x=1859476832863c41
> > > > > > dashboard link: https://syzkaller.appspot.com/bug?extid=4e70c8e0a2017b432f7a
> > > > > > compiler: gcc (Debian 12.2.0-14+deb12u1) 12.2.0, GNU ld (GNU Binutils for Debian) 2.40
> > > > > > syz repro: https://syzkaller.appspot.com/x/repro.syz?x=11451b9a580000
> > > > > > C reproducer: https://syzkaller.appspot.com/x/repro.c?x=1045e852580000
> > > > > >
> > > > > > Downloadable assets:
> > > > > > disk image (non-bootable): https://storage.googleapis.com/syzbot-assets/d900f083ada3/non_bootable_disk-24d479d2.raw.xz
> > > > > > vmlinux: https://storage.googleapis.com/syzbot-assets/d0f3c47f6869/vmlinux-24d479d2.xz
> > > > > > kernel image: https://storage.googleapis.com/syzbot-assets/800231513703/bzImage-24d479d2.xz
> > > > > >
> > > > > > IMPORTANT: if you fix the issue, please add the following tag to the commit:
> > > > > > Reported-by: syzbot+4e70c8e0a2017b432f7a@...kaller.appspotmail.com
> > > > > >
> > > > > > WARNING: possible circular locking dependency detected
> > > > > > syzkaller #0 Not tainted
> > > > > > ------------------------------------------------------
> > > > > > syz.0.17/6091 is trying to acquire lock:
> > > > > > ffff8881061287a8 (
> > > > > > &sb->s_type->i_mutex_key#8){++++}-{4:4}, at: inode_lock_shared include/linux/fs.h:1042 [inline]
> > > > > > &sb->s_type->i_mutex_key#8){++++}-{4:4}, at: blkdev_read_iter+0x19e/0x500 block/fops.c:855
> > > > > >
> > > > > > but task is already holding lock:
> > > > > > ffff888012aa0448 (vm_lock){++++}-{0:0}, at: lock_next_vma+0x10e/0xed0 mm/mmap_lock.c:334
> > > > > >
> > > > > > which lock already depends on the new lock.
> > > > > >
> > > > > >
> > > > > > the existing dependency chain (in reverse order) is:
> > > > > >
> > > > > > -> #2 (vm_lock){++++}-{0:0}:
> > > > > > __vma_enter_locked+0x260/0x770 mm/mmap_lock.c:72
> > > > > > __vma_start_write+0x21/0x160 mm/mmap_lock.c:104
> > > > > > vma_start_write include/linux/mmap_lock.h:213 [inline]
> > > > > > mprotect_fixup+0x4e3/0xb80 mm/mprotect.c:768
> > > > > > setup_arg_pages+0x4a2/0xbb0 fs/exec.c:670
> > > > > > load_elf_binary+0xb5b/0x4fe0 fs/binfmt_elf.c:1028
> > > > > > search_binary_handler fs/exec.c:1669 [inline]
> > > > > > exec_binprm fs/exec.c:1701 [inline]
> > > > > > bprm_execve fs/exec.c:1753 [inline]
> > > > > > bprm_execve+0x8c2/0x1620 fs/exec.c:1729
> > > > > > kernel_execve+0x2ef/0x3b0 fs/exec.c:1919
> > > > > > try_to_run_init_process init/main.c:1506 [inline]
> > > > > > kernel_init+0x14a/0x2b0 init/main.c:1634
> > > > > > ret_from_fork+0x983/0xb10 arch/x86/kernel/process.c:158
> > > > > > ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:246
> > > > > >
> > > > > > -> #1 (&mm->mmap_lock){++++}-{4:4}:
> > > > > > __might_fault mm/memory.c:7174 [inline]
> > > > > > __might_fault+0x113/0x190 mm/memory.c:7168
> > > > > > _copy_to_iter+0x1c2/0x1710 lib/iov_iter.c:196
> > > > > > copy_page_to_iter lib/iov_iter.c:374 [inline]
> > > > > > copy_page_to_iter+0x12a/0x1e0 lib/iov_iter.c:361
> > > > > > copy_folio_to_iter include/linux/uio.h:204 [inline]
> > > > > > filemap_read+0x6b1/0xe40 mm/filemap.c:2851
> > > > > > blkdev_read_iter+0x1ac/0x500 block/fops.c:856
> > > > > > new_sync_read fs/read_write.c:491 [inline]
> > > > > > vfs_read+0x8bf/0xcf0 fs/read_write.c:572
> > > > > > ksys_read+0x12a/0x250 fs/read_write.c:715
> > > > > > do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
> > > > > > do_syscall_64+0xcd/0xf80 arch/x86/entry/syscall_64.c:94
> > > > > > entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > > > > >
> > > > > > -> #0 (&sb->s_type->i_mutex_key#8){++++}-{4:4}:
> > > > > > check_prev_add kernel/locking/lockdep.c:3165 [inline]
> > > > > > check_prevs_add kernel/locking/lockdep.c:3284 [inline]
> > > > > > validate_chain kernel/locking/lockdep.c:3908 [inline]
> > > > > > __lock_acquire+0x1669/0x2890 kernel/locking/lockdep.c:5237
> > > > > > lock_acquire kernel/locking/lockdep.c:5868 [inline]
> > > > > > lock_acquire+0x179/0x330 kernel/locking/lockdep.c:5825
> > > > > > down_read+0x9b/0x460 kernel/locking/rwsem.c:1537
> > > > > > inode_lock_shared include/linux/fs.h:1042 [inline]
> > > > > > blkdev_read_iter+0x19e/0x500 block/fops.c:855
> > > > > > __kernel_read+0x3f3/0xbf0 fs/read_write.c:530
> > > > > > freader_fetch+0x1d7/0x9d0 lib/buildid.c:100
> > > > > > __build_id_parse.isra.0+0xdd/0x6c0 lib/buildid.c:297
> > > > > > do_procmap_query+0xb0e/0x1080 fs/proc/task_mmu.c:733
> > > > > > procfs_procmap_ioctl+0x9d/0xe0 fs/proc/task_mmu.c:813
> > > > > > vfs_ioctl fs/ioctl.c:51 [inline]
> > > > > > __do_sys_ioctl fs/ioctl.c:597 [inline]
> > > > > > __se_sys_ioctl fs/ioctl.c:583 [inline]
> > > > > > __x64_sys_ioctl+0x18e/0x210 fs/ioctl.c:583
> > > > > > do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
> > > > > > do_syscall_64+0xcd/0xf80 arch/x86/entry/syscall_64.c:94
> > > > > > entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > > > > >
> > > >
> > > > It looks like:
> > > > #0 is executing PROCMAP_QUERY ioclt, read-locks vm_lock and then calls
> > > > build_id_parse()->__build_id_parse(...,
> > > > may_fault=true)->__kernel_read() which eventually takes
> > > > inode->i_rwsem.
> > > > #1 is a file-backed page fault which asserts that it might take
> > > > mmap_lock for read.
> > > > #2 is load_elf_binary()->mprotect_fixup() which write-locks both
> > > > mmap_lock and vm_lock. I'm guessing it already holds inode->i_rwsem
> > > > before write-locking these locks.
> > > >
> > > > Originally I thought the issue is most liley introduced in
> > > > d9d1c2d81797 ("fs/proc/task_mmu: execute PROCMAP_QUERY ioctl under
> > > > per-vma locks"). But if #2 indeed takes inode->i_rwsem before
> > > > write-locking mmap_lock, then the problem should exist even before
> > > > that change when we didn't use vm_lock and relied on mmap_lock...
> > > >
> > > > I'll try to analyze this more before attempting a fix.
> > >
> > > I was able to reproduce the same issue even after reverting
> > > d9d1c2d81797. The deadlock in this case is simpler and involves
> > > mmap_lock instead of vm_lock (see below).
> > > Looks like the race is between the read() syscall and do_procmap_query().
> > > I'll continue investigating, in the meantime CC'ing Andrii.
> >
> > So, here is a cleaner version of that report (with d9d1c2d81797 reverted):
> >
> > -> #1 (&mm->mmap_lock){++++}-{4:4}:
> > __might_fault+0xed/0x170
> > _copy_to_iter+0x118/0x1720
> > copy_page_to_iter+0x12d/0x1e0
> > filemap_read+0x720/0x10a0
> > blkdev_read_iter+0x2b5/0x4e0
> > vfs_read+0x7f4/0xae0
> > ksys_read+0x12a/0x250
> > do_syscall_64+0xcb/0xf80
> > entry_SYSCALL_64_after_hwframe+0x77/0x7f
> >
> > -> #0 (&sb->s_type->i_mutex_key#8){++++}-{4:4}:
> > __lock_acquire+0x1509/0x26d0
> > lock_acquire+0x185/0x340
> > down_read+0x98/0x490
> > blkdev_read_iter+0x2a7/0x4e0
> > __kernel_read+0x39a/0xa90
> > freader_fetch+0x1d5/0xa80
> > __build_id_parse.isra.0+0xea/0x6a0
> > do_procmap_query+0xd75/0x1050
> > procfs_procmap_ioctl+0x7a/0xb0
> > __x64_sys_ioctl+0x18e/0x210
> > do_syscall_64+0xcb/0xf80
> > entry_SYSCALL_64_after_hwframe+0x77/0x7f
> >
> > other info that might help us debug this:
> >
> > Possible unsafe locking scenario:
> >
> > CPU0 CPU1
> > ---- ----
> > rlock(&mm->mmap_lock);
> > lock(&sb->s_type->i_mutex_key#8);
> > lock(&mm->mmap_lock);
> > rlock(&sb->s_type->i_mutex_key#8);
> >
> > *** DEADLOCK ***
> >
> > Both threads are calling blkdev_read_iter(), which uses
> > inode_lock_shared() to read-lock inode->i_rwsem. I'm not sure why CPU1
> > shows lock() instead of rlock(). So both threads read-lock
> > inode->i_rwsem and mmap_lock but in a different order. IIUC, with
> > read-locks this should not deadlock until some other thread
> > write-locks the mmap_lock in between and this becomes a real deadlock:
> >
> > CPU0 CPU1 CPU2
> > ---- ---- ----
> > rlock(&mm->mmap_lock);
> > rlock(&sb->s_type->i_mutex_key#8);
> > wlock(&mm->mmap_lock) <-- waiting for CPU0
> > rlock(&mm->mmap_lock); <-- waiting for CPU1
> > rlock(&sb->s_type->i_mutex_key#8); <-- waiting for CPU2
> >
> > I believe in the original report this write-locking thread was the one
> > calling mprotect_fixup().
> >
> > Per https://docs.kernel.org/mm/process_addrs.html#lock-ordering,
> > inode->i_rwsem should be locked before mm->mmap_lock, so
> > procfs_procmap_ioctl() has to be fixed to follow this lock ordering.
> > One possibility I can think of is to use build_id_parse_nofault()
> > first and if it fails because the required page is not faulted, we do
> > freader_init_from_file(), then drop the mmap/vma lock and execute
> > freader_fetch() outside of these locks to fault in that page. Once
> > that's done, we'll retry the whole operation and this time
> > build_id_parse_nofault() should pass (unless we already evicted that
> > page, which is extremely unlikely and in that case, we'll retry
> > again).
> >
> > I tried a POC with build_id_parse_nofault() but without the whole
> > dance with freader_init_from_file/freader_fetch and the deadlock is
> > gone. Andrii, WDYT?
>
> I don't like it :) Too much complexity, _nofault() variant only makes
> sense for BPF in non-sleepable contexts. I think this can be fixed
> simpler and cleaner. We don't need to hold VMA lock while fetching
> build ID. Build ID works with vma's vm_file, so we can just get its
> reference, drop vma lock, then fetch build id. Below diff passes our
> BPF selftests. Might need to think about a bit leaner code changes,
> but the idea should be clear. Diff below will be butchered by gmail,
> but you can fetch it at [0]. Do you mind validating that deadlock is
> gone? Thanks!
Sure. I'll test it later today, once I'm home.
>
> [0] https://git.kernel.org/pub/scm/linux/kernel/git/andrii/bpf-next.git/commit/?h=procmap-query-vma-deadlock-fix&id=7faf95b63a8a7ac6e78b6d90101c94bfa6ecdfd1
>
> Author: Andrii Nakryiko <andrii@...nel.org>
> Date: Tue Jan 27 10:46:04 2026 -0800
>
> procfs: avoid fetching build ID while holding VMA lock
>
> Signed-off-by: Andrii Nakryiko <andrii@...nel.org>
>
> diff --git a/fs/proc/task_mmu.c b/fs/proc/task_mmu.c
> index 81dfc26bfae8..564bf82e3731 100644
> --- a/fs/proc/task_mmu.c
> +++ b/fs/proc/task_mmu.c
> @@ -656,6 +656,7 @@ static int do_procmap_query(struct mm_struct *mm,
> void __user *uarg)
> struct proc_maps_locking_ctx lock_ctx = { .mm = mm };
> struct procmap_query karg;
> struct vm_area_struct *vma;
> + struct file *vm_file = NULL;
> const char *name = NULL;
> char build_id_buf[BUILD_ID_SIZE_MAX], *name_buf = NULL;
> __u64 usize;
> @@ -720,6 +721,9 @@ static int do_procmap_query(struct mm_struct *mm,
> void __user *uarg)
> karg.dev_major = MAJOR(inode->i_sb->s_dev);
> karg.dev_minor = MINOR(inode->i_sb->s_dev);
> karg.inode = inode->i_ino;
> +
> + if (karg.build_id_size)
> + vm_file = get_file(vma->vm_file);
> } else {
> karg.vma_offset = 0;
> karg.dev_major = 0;
> @@ -727,21 +731,6 @@ static int do_procmap_query(struct mm_struct *mm,
> void __user *uarg)
> karg.inode = 0;
> }
>
> - if (karg.build_id_size) {
> - __u32 build_id_sz;
> -
> - err = build_id_parse(vma, build_id_buf, &build_id_sz);
> - if (err) {
> - karg.build_id_size = 0;
> - } else {
> - if (karg.build_id_size < build_id_sz) {
> - err = -ENAMETOOLONG;
> - goto out;
> - }
> - karg.build_id_size = build_id_sz;
> - }
> - }
> -
> if (karg.vma_name_size) {
> size_t name_buf_sz = min_t(size_t, PATH_MAX,
> karg.vma_name_size);
> const struct path *path;
> @@ -779,6 +768,28 @@ static int do_procmap_query(struct mm_struct *mm,
> void __user *uarg)
> query_vma_teardown(&lock_ctx);
> mmput(mm);
>
> + if (karg.build_id_size) {
> + __u32 build_id_sz;
> +
> + err = -ENOENT;
> + if (vm_file)
> + err = build_id_parse_file(vm_file,
> build_id_buf, &build_id_sz);
> + if (err) {
> + karg.build_id_size = 0;
> + } else {
> + if (karg.build_id_size < build_id_sz) {
> + err = -ENAMETOOLONG;
> + goto out;
> + }
> + karg.build_id_size = build_id_sz;
> + }
> + }
> +
> + if (vm_file) {
> + fput(vm_file);
> + vm_file = NULL;
> + }
> +
> if (karg.vma_name_size &&
> copy_to_user(u64_to_user_ptr(karg.vma_name_addr),
> name, karg.vma_name_size)) {
> kfree(name_buf);
> @@ -797,6 +808,8 @@ static int do_procmap_query(struct mm_struct *mm,
> void __user *uarg)
>
> out:
> query_vma_teardown(&lock_ctx);
> + if (vm_file)
> + fput(vm_file);
> mmput(mm);
> kfree(name_buf);
> return err;
> diff --git a/include/linux/buildid.h b/include/linux/buildid.h
> index 831c1b4b626c..7acc06b22fb7 100644
> --- a/include/linux/buildid.h
> +++ b/include/linux/buildid.h
> @@ -7,7 +7,10 @@
> #define BUILD_ID_SIZE_MAX 20
>
> struct vm_area_struct;
> +struct file;
> +
> int build_id_parse(struct vm_area_struct *vma, unsigned char
> *build_id, __u32 *size);
> +int build_id_parse_file(struct file *file, unsigned char *build_id,
> __u32 *size);
> int build_id_parse_nofault(struct vm_area_struct *vma, unsigned char
> *build_id, __u32 *size);
> int build_id_parse_buf(const void *buf, unsigned char *build_id, u32 buf_size);
>
> diff --git a/lib/buildid.c b/lib/buildid.c
> index aaf61dfc0919..c0002129d526 100644
> --- a/lib/buildid.c
> +++ b/lib/buildid.c
> @@ -271,7 +271,7 @@ static int get_build_id_64(struct freader *r,
> unsigned char *build_id, __u32 *si
> /* enough for Elf64_Ehdr, Elf64_Phdr, and all the smaller requests */
> #define MAX_FREADER_BUF_SZ 64
>
> -static int __build_id_parse(struct vm_area_struct *vma, unsigned char
> *build_id,
> +static int __build_id_parse(struct file *file, unsigned char *build_id,
> __u32 *size, bool may_fault)
> {
> const Elf32_Ehdr *ehdr;
> @@ -279,11 +279,7 @@ static int __build_id_parse(struct vm_area_struct
> *vma, unsigned char *build_id,
> char buf[MAX_FREADER_BUF_SZ];
> int ret;
>
> - /* only works for page backed storage */
> - if (!vma->vm_file)
> - return -EINVAL;
> -
> - freader_init_from_file(&r, buf, sizeof(buf), vma->vm_file, may_fault);
> + freader_init_from_file(&r, buf, sizeof(buf), file, may_fault);
>
> /* fetch first 18 bytes of ELF header for checks */
> ehdr = freader_fetch(&r, 0, offsetofend(Elf32_Ehdr, e_type));
> @@ -324,7 +320,11 @@ static int __build_id_parse(struct vm_area_struct
> *vma, unsigned char *build_id,
> */
> int build_id_parse_nofault(struct vm_area_struct *vma, unsigned char
> *build_id, __u32 *size)
> {
> - return __build_id_parse(vma, build_id, size, false /* !may_fault */);
> + /* only works for page backed storage */
> + if (!vma->vm_file)
> + return -EINVAL;
> +
> + return __build_id_parse(vma->vm_file, build_id, size, false /*
> !may_fault */);
> }
>
> /*
> @@ -340,7 +340,16 @@ int build_id_parse_nofault(struct vm_area_struct
> *vma, unsigned char *build_id,
> */
> int build_id_parse(struct vm_area_struct *vma, unsigned char
> *build_id, __u32 *size)
> {
> - return __build_id_parse(vma, build_id, size, true /* may_fault */);
> + /* only works for page backed storage */
> + if (!vma->vm_file)
> + return -EINVAL;
> +
> + return __build_id_parse(vma->vm_file, build_id, size, true /*
> may_fault */);
> +}
> +
> +int build_id_parse_file(struct file *file, unsigned char *build_id,
> __u32 *size)
> +{
> + return __build_id_parse(file, build_id, size, true /* may_fault */);
> }
>
> /**
>
>
> >
> > >
> > > [ 62.320932][ T9229]
> > > [ 62.321471][ T9229] ======================================================
> > > [ 62.323016][ T9229] WARNING: possible circular locking dependency detected
> > > [ 62.324618][ T9229] 6.19.0-rc6-00001-g40bea6261b2a #42 Not tainted
> > > [ 62.326013][ T9229] ------------------------------------------------------
> > > [ 62.327560][ T9229] hillf/9229 is trying to acquire lock:
> > > [ 62.328821][ T9229] ffff888145b7b5a8
> > > (&sb->s_type->i_mutex_key#8){++++}-{4:4}, at:
> > > blkdev_read_iter+0x2a7/0x4e0
> > > [ 62.331102][ T9229]
> > > [ 62.331102][ T9229] but task is already holding lock:
> > > [ 62.332722][ T9229] ffff888183a6e540 (&mm->mmap_lock){++++}-{4:4},
> > > at: do_procmap_query+0x39f/0x1050
> > > [ 62.334795][ T9229]
> > > [ 62.334795][ T9229] which lock already depends on the new lock.
> > > [ 62.334795][ T9229]
> > > [ 62.337072][ T9229]
> > > [ 62.337072][ T9229] the existing dependency chain (in reverse order) is:
> > > [ 62.338998][ T9229]
> > > [ 62.338998][ T9229] -> #1 (&mm->mmap_lock){++++}-{4:4}:
> > > [ 62.340646][ T9229] __might_fault+0xed/0x170
> > > [ 62.341763][ T9229] _copy_to_iter+0x118/0x1720
> > > [ 62.342913][ T9229] copy_page_to_iter+0x12d/0x1e0
> > > [ 62.344167][ T9229] filemap_read+0x720/0x10a0
> > > [ 62.345298][ T9229] blkdev_read_iter+0x2b5/0x4e0
> > > [ 62.346480][ T9229] vfs_read+0x7f4/0xae0
> > > [ 62.347518][ T9229] ksys_read+0x12a/0x250
> > > [ 62.348584][ T9229] do_syscall_64+0xcb/0xf80
> > > [ 62.349707][ T9229] entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > > [ 62.351116][ T9229]
> > > [ 62.351116][ T9229] -> #0 (&sb->s_type->i_mutex_key#8){++++}-{4:4}:
> > > [ 62.353012][ T9229] __lock_acquire+0x1509/0x26d0
> > > [ 62.354213][ T9229] lock_acquire+0x185/0x340
> > > [ 62.355323][ T9229] down_read+0x98/0x490
> > > [ 62.356441][ T9229] blkdev_read_iter+0x2a7/0x4e0
> > > [ 62.357619][ T9229] __kernel_read+0x39a/0xa90
> > > [ 62.358767][ T9229] freader_fetch+0x1d5/0xa80
> > > [ 62.359927][ T9229] __build_id_parse.isra.0+0xea/0x6a0
> > > [ 62.361232][ T9229] do_procmap_query+0xd75/0x1050
> > > [ 62.362434][ T9229] procfs_procmap_ioctl+0x7a/0xb0
> > > [ 62.363687][ T9229] __x64_sys_ioctl+0x18e/0x210
> > > [ 62.364863][ T9229] do_syscall_64+0xcb/0xf80
> > > [ 62.365977][ T9229] entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > > [ 62.367394][ T9229]
> > > [ 62.367394][ T9229] other info that might help us debug this:
> > > [ 62.367394][ T9229]
> > > [ 62.369637][ T9229] Possible unsafe locking scenario:
> > > [ 62.369637][ T9229]
> > > [ 62.371237][ T9229] CPU0 CPU1
> > > [ 62.372441][ T9229] ---- ----
> > > [ 62.373687][ T9229] rlock(&mm->mmap_lock);
> > > [ 62.374688][ T9229]
> > > lock(&sb->s_type->i_mutex_key#8);
> > > [ 62.376444][ T9229] lock(&mm->mmap_lock);
> > > [ 62.377956][ T9229] rlock(&sb->s_type->i_mutex_key#8);
> > > [ 62.379165][ T9229]
> > > [ 62.379165][ T9229] *** DEADLOCK ***
> > > [ 62.379165][ T9229]
> > > [ 62.380952][ T9229] 1 lock held by hillf/9229:
> > > [ 62.381971][ T9229] #0: ffff888183a6e540
> > > (&mm->mmap_lock){++++}-{4:4}, at: do_procmap_query+0x39f/0x1050
> > > [ 62.384162][ T9229]
> > > [ 62.384162][ T9229] stack backtrace:
> > > [ 62.385458][ T9229] CPU: 3 UID: 0 PID: 9229 Comm: hillf Not tainted
> > > 6.19.0-rc6-00001-g40bea6261b2a #42 PREEMPT(full)
> > > [ 62.385471][ T9229] Hardware name: QEMU Standard PC (i440FX + PIIX,
> > > 1996), BIOS 1.17.0-debian-1.17.0-1 04/01/2014
> > > [ 62.385477][ T9229] Call Trace:
> > > [ 62.385482][ T9229] <TASK>
> > > [ 62.385487][ T9229] dump_stack_lvl+0x100/0x190
> > > [ 62.385505][ T9229] print_circular_bug.cold+0x185/0x1d5
> > > [ 62.385521][ T9229] check_noncircular+0x14a/0x170
> > > [ 62.385534][ T9229] __lock_acquire+0x1509/0x26d0
> > > [ 62.385547][ T9229] lock_acquire+0x185/0x340
> > > [ 62.385557][ T9229] ? blkdev_read_iter+0x2a7/0x4e0
> > > [ 62.385569][ T9229] ? __pfx___might_resched+0x10/0x10
> > > [ 62.385583][ T9229] down_read+0x98/0x490
> > > [ 62.385593][ T9229] ? blkdev_read_iter+0x2a7/0x4e0
> > > [ 62.385603][ T9229] ? __pfx_down_read+0x10/0x10
> > > [ 62.385612][ T9229] ? lock_acquire+0x185/0x340
> > > [ 62.385622][ T9229] ? is_bpf_text_address+0x25/0x1a0
> > > [ 62.385634][ T9229] blkdev_read_iter+0x2a7/0x4e0
> > > [ 62.385645][ T9229] __kernel_read+0x39a/0xa90
> > > [ 62.385658][ T9229] ? __pfx___kernel_read+0x10/0x10
> > > [ 62.385671][ T9229] ? __lock_acquire+0x481/0x26d0
> > > [ 62.385683][ T9229] freader_fetch+0x1d5/0xa80
> > > [ 62.385697][ T9229] ? find_held_lock+0x2b/0x80
> > > [ 62.385712][ T9229] ? __pfx_freader_fetch+0x10/0x10
> > > [ 62.385725][ T9229] ? __asan_memset+0x27/0x50
> > > [ 62.385737][ T9229] __build_id_parse.isra.0+0xea/0x6a0
> > > [ 62.385751][ T9229] ? __pfx___build_id_parse.isra.0+0x10/0x10
> > > [ 62.385766][ T9229] ? __pfx_find_vma+0x10/0x10
> > > [ 62.385774][ T9229] ? __might_fault+0x129/0x170
> > > [ 62.385788][ T9229] do_procmap_query+0xd75/0x1050
> > > [ 62.385798][ T9229] ? __pfx_do_procmap_query+0x10/0x10
> > > [ 62.385807][ T9229] ? __sanitizer_cov_trace_switch+0x53/0x90
> > > [ 62.385817][ T9229] ? do_vfs_ioctl+0x226/0x13b0
> > > [ 62.385828][ T9229] ? __pfx_do_vfs_ioctl+0x10/0x10
> > > [ 62.385839][ T9229] ? putname+0xfc/0x1b0
> > > [ 62.385846][ T9229] ? putname+0x101/0x1b0
> > > [ 62.385857][ T9229] ? __x64_sys_openat+0x143/0x210
> > > [ 62.385867][ T9229] procfs_procmap_ioctl+0x7a/0xb0
> > > [ 62.385877][ T9229] ? __pfx_procfs_procmap_ioctl+0x10/0x10
> > > [ 62.385888][ T9229] __x64_sys_ioctl+0x18e/0x210
> > > [ 62.385899][ T9229] do_syscall_64+0xcb/0xf80
> > > [ 62.385913][ T9229] entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > > [ 62.385923][ T9229] RIP: 0033:0x412209
> > > [ 62.385931][ T9229] Code: c0 79 93 eb d5 48 8d 7c 1d 00 eb 99 0f 1f
> > > 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b
> > > 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 d8 ff ff ff f7 d8
> > > 64 89 01 48
> > > [ 62.385940][ T9229] RSP: 002b:00007fff380d5588 EFLAGS: 00000217
> > > ORIG_RAX: 0000000000000010
> > > [ 62.385950][ T9229] RAX: ffffffffffffffda RBX: 00007fff380d56c8
> > > RCX: 0000000000412209
> > > [ 62.385956][ T9229] RDX: 0000200000000180 RSI: 00000000c0686611
> > > RDI: 0000000000000004
> > > [ 62.385962][ T9229] RBP: 00007fff380d55a0 R08: 0000000000000000
> > > R09: 00007fff380d5640
> > > [ 62.385968][ T9229] R10: 0000000000000000 R11: 0000000000000217
> > > R12: 00007fff380d56b8
> > > [ 62.385974][ T9229] R13: 0000000000000002 R14: 00000000004a0e40
> > > R15: 0000000000000002
> > > [ 62.385982][ T9229] </TASK>
> > >
> > >
> > >
> > > >
> > > > > > other info that might help us debug this:
> > > > > >
> > > > > > Chain exists of:
> > > > > > &sb->s_type->i_mutex_key#8 --> &mm->mmap_lock --> vm_lock
> > > > > >
> > > > > > Possible unsafe locking scenario:
> > > > > >
> > > > > > CPU0 CPU1
> > > > > > ---- ----
> > > > > > rlock(vm_lock);
> > > > > > lock(&mm->mmap_lock);
> > > > > > lock(vm_lock);
> > > > > > rlock(&sb->s_type->i_mutex_key#8);
> > > > > >
> > > > > > *** DEADLOCK ***
> > > > > >
> > > > > > 1 lock held by syz.0.17/6091:
> > > > > > #0: ffff888012aa0448 (vm_lock){++++}-{0:0}, at: lock_next_vma+0x10e/0xed0 mm/mmap_lock.c:334
> > > > > >
> > > > > > stack backtrace:
> > > > > > CPU: 2 UID: 0 PID: 6091 Comm: syz.0.17 Not tainted syzkaller #0 PREEMPT(full)
> > > > > > Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-debian-1.16.3-2~bpo12+1 04/01/2014
> > > > > > Call Trace:
> > > > > > <TASK>
> > > > > > __dump_stack lib/dump_stack.c:94 [inline]
> > > > > > dump_stack_lvl+0x116/0x1f0 lib/dump_stack.c:120
> > > > > > print_circular_bug+0x275/0x340 kernel/locking/lockdep.c:2043
> > > > > > check_noncircular+0x146/0x160 kernel/locking/lockdep.c:2175
> > > > > > check_prev_add kernel/locking/lockdep.c:3165 [inline]
> > > > > > check_prevs_add kernel/locking/lockdep.c:3284 [inline]
> > > > > > validate_chain kernel/locking/lockdep.c:3908 [inline]
> > > > > > __lock_acquire+0x1669/0x2890 kernel/locking/lockdep.c:5237
> > > > > > lock_acquire kernel/locking/lockdep.c:5868 [inline]
> > > > > > lock_acquire+0x179/0x330 kernel/locking/lockdep.c:5825
> > > > > > down_read+0x9b/0x460 kernel/locking/rwsem.c:1537
> > > > > > inode_lock_shared include/linux/fs.h:1042 [inline]
> > > > > > blkdev_read_iter+0x19e/0x500 block/fops.c:855
> > > > > > __kernel_read+0x3f3/0xbf0 fs/read_write.c:530
> > > > > > freader_fetch+0x1d7/0x9d0 lib/buildid.c:100
> > > > > > __build_id_parse.isra.0+0xdd/0x6c0 lib/buildid.c:297
> > > > > > do_procmap_query+0xb0e/0x1080 fs/proc/task_mmu.c:733
> > > > > > procfs_procmap_ioctl+0x9d/0xe0 fs/proc/task_mmu.c:813
> > > > > > vfs_ioctl fs/ioctl.c:51 [inline]
> > > > > > __do_sys_ioctl fs/ioctl.c:597 [inline]
> > > > > > __se_sys_ioctl fs/ioctl.c:583 [inline]
> > > > > > __x64_sys_ioctl+0x18e/0x210 fs/ioctl.c:583
> > > > > > do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
> > > > > > do_syscall_64+0xcd/0xf80 arch/x86/entry/syscall_64.c:94
> > > > > > entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > > > > > RIP: 0033:0x7ff1a238f7c9
> > > > > > Code: ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 a8 ff ff ff f7 d8 64 89 01 48
> > > > > > RSP: 002b:00007ffebbe538b8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
> > > > > > RAX: ffffffffffffffda RBX: 00007ff1a25e5fa0 RCX: 00007ff1a238f7c9
> > > > > > RDX: 0000200000000180 RSI: 00000000c0686611 RDI: 0000000000000004
> > > > > > RBP: 00007ff1a2413f91 R08: 0000000000000000 R09: 0000000000000000
> > > > > > R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
> > > > > > R13: 00007ff1a25e5fa0 R14: 00007ff1a25e5fa0 R15: 0000000000000003
> > > > > > </TASK>
Powered by blists - more mailing lists