lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Date:	Mon, 2 Feb 2009 20:27:21 +0900
From:	MinChan Kim <minchan.kim@...il.com>
To:	Peter Zijlstra <peterz@...radead.org>
Cc:	Nick Piggin <npiggin@...e.de>,
	linux kernel <linux-kernel@...r.kernel.org>,
	linux mm <linux-mm@...ck.org>
Subject: Re: [BUG??] Deadlock between kswapd and
	sys_inotify_add_watch(lockdep  report)

On Mon, Feb 02, 2009 at 11:40:02AM +0100, Peter Zijlstra wrote:
> On Mon, 2009-02-02 at 19:25 +0900, MinChan Kim wrote:
> > But, I am not sure whether it's real bug or not.
> 
> Me neither, inode life-times are tricky, but on first sight it looks
> real enough.
> 
> > I always suffer from reading lockdep report's result. :(
> > It would be better to have a document about lockdep report analysis.
> 
> I've never found them hard to read, so I'm afraid you'll have to be more
> explicit about what is unclear to you.

It's becuase not lockdep humble report but my poor knowledge. :(
Could you elaborate please ?

>[  331.718120] [ INFO: inconsistent lock state ]
>[  331.718124] 2.6.28-rc2-mm1-lockdep #6
>[  331.718126] ---------------------------------
>[  331.718129] inconsistent {ov-reclaim-W} -> {in-reclaim-W} usage.
                                         ^                 ^ 
                                        write ?           write ?
>
>[  331.718133] kswapd0/218 [HC0[0]:SC0[0]:HE0:SE1] takes:
                            ^^^^^^^^^^^^^^^^^^^^^^
                            what means ? HC,SC,HE,SE
>
>[  331.718136]  (&inode->inotify_mutex){--..+.}, at: [<c01dba70>] inotify_inode_is_dead+0x20/0x90
>             

Is it related to recursive lock of inotify_mutex ?
but, Subject means 'inconsistent {ov-reclaim-W} -> {in-reclaim-W}', 
IOW, it's related to reclaim of GFP_FS. 
What's relation inotify_mutex and reclaim of GFP_FS?

I think if reclaim context which have GFP_FS already have lock A and then 
do pageout, if writepage need the lock A, we have to catch such a case. 
I thought Nick's patch's goal catchs such a case. 
But My thought is likely worng. Such a case, the lock A will be holded recursive. 
so It can be solved by recursive lockdep.

I can't understand this. :(

One more question is that what's difference between lock inversion and
circular locking dependency ?

>[  331.718148] {ov-reclaim-W} state was registered at:
>[  331.718150]   [<c01532ee>] mark_held_locks+0x3e/0x90
>[  331.718157]   [<c015338e>] lockdep_trace_alloc+0x4e/0x80
>[  331.718162]   [<c01acee6>] kmem_cache_alloc+0x26/0xf0
>[  331.718166]   [<c0243fa0>] idr_pre_get+0x50/0x70
>[  331.718172]   [<c01db761>] inotify_handle_get_wd+0x21/0x60
>[  331.718176]   [<c01dc012>] inotify_add_watch+0x52/0xe0
>[  331.718181]   [<c01dcca8>] sys_inotify_add_watch+0x148/0x170
>[  331.718185]   [<c0104032>] syscall_call+0x7/0xb
>[  331.718190]   [<ffffffff>] 0xffffffff
>[  331.718205] irq event stamp: 1288446
>[  331.718207] hardirqs last  enabled at (1288445): [<c0179695>] call_rcu+0x75/0x90
>[  331.718213] hardirqs last disabled at (1288446): [<c0370103>] mutex_lock_nested+0x53/0x2f0
>[  331.718221] softirqs last  enabled at (1284622): [<c0132fa2>] __do_softirq+0x132/0x180
>[  331.718226] softirqs last disabled at (1284617): [<c0133079>] do_softirq+0x89/0x90
>[  331.718231]
>[  331.718232] other info that might help us debug this:
>[  331.718236] 2 locks held by kswapd0/218:
>[  331.718238]  #0:  (shrinker_rwsem){----..}, at: [<c0192d65>] shrink_slab+0x25/0x1a0
>[  331.718248]  #1:  (&type->s_umount_key#4){-----.}, at: [<c01c21fb>] shrink_dcache_memory+0xfb/0x1a0
>[  331.718259]
>[  331.718260] stack backtrace:
>[  331.718263] Pid: 218, comm: kswapd0 Not tainted 2.6.28-rc2-mm1-lockdep #6
>[  331.718266] Call Trace:
>[  331.718272]  [<c0151726>] print_usage_bug+0x176/0x1c0
>[  331.718276]  [<c0152d05>] mark_lock+0xb05/0x10b0
>[  331.718282]  [<c018c0e9>] ? __free_pages_ok+0x349/0x450
>[  331.718287]  [<c0155362>] __lock_acquire+0x602/0xa80
>[  331.718291]  [<c01540ff>] ? validate_chain+0x3ef/0x1050
>[  331.718296]  [<c0155851>] lock_acquire+0x71/0xa0
>[  331.718300]  [<c01dba70>] ? inotify_inode_is_dead+0x20/0x90
>[  331.718305]  [<c037014d>] mutex_lock_nested+0x9d/0x2f0
>[  331.718310]  [<c01dba70>] ? inotify_inode_is_dead+0x20/0x90
>[  331.718314]  [<c01dba70>] ? inotify_inode_is_dead+0x20/0x90
>[  331.718318]  [<c01dba70>] inotify_inode_is_dead+0x20/0x90
>[  331.718323]  [<c024e2d6>] ? _raw_spin_unlock+0x46/0x80
>[  331.718328]  [<c01c1d14>] dentry_iput+0xa4/0xc0
>[  331.718333]  [<c01c1dfb>] d_kill+0x3b/0x60
>[  331.718337]  [<c01c1fe6>] __shrink_dcache_sb+0x1c6/0x2c0
>[  331.718342]  [<c01c228d>] shrink_dcache_memory+0x18d/0x1a0
>[  331.718347]  [<c0192e6b>] shrink_slab+0x12b/0x1a0
>[  331.718351]  [<c01939ff>] kswapd+0x3af/0x5c0
>[  331.718356]  [<c01910a0>] ? isolate_pages_global+0x0/0x220
>[  331.718362]  [<c0142800>] ? autoremove_wake_function+0x0/0x40
>[  331.718366]  [<c0193650>] ? kswapd+0x0/0x5c0
>[  331.718371]  [<c01424f7>] kthread+0x47/0x80
>[  331.718375]  [<c01424b0>] ? kthread+0x0/0x80
>[  331.718380]  [<c01054f7>] kernel_thread_helper+0x7/0x10


-- 
Kinds Regards
MinChan Kim

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ