lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <5dd39b03-c40e-4f34-bf89-b3e5a12753dc@linux.alibaba.com>
Date: Wed, 5 Feb 2025 14:38:36 +0800
From: Baolin Wang <baolin.wang@...ux.alibaba.com>
To: "Alex Xu (Hello71)" <alex_y_xu@...oo.ca>, linux-mm@...ck.org,
 Daniel Gomez <da.gomez@...sung.com>
Cc: Barry Song <baohua@...nel.org>, David Hildenbrand <david@...hat.com>,
 Hugh Dickins <hughd@...gle.com>, Kefeng Wang <wangkefeng.wang@...wei.com>,
 Lance Yang <ioworker0@...il.com>, Matthew Wilcox <willy@...radead.org>,
 Ryan Roberts <ryan.roberts@....com>, linux-kernel@...r.kernel.org,
 Andrew Morton <akpm@...ux-foundation.org>
Subject: Re: Hang when swapping huge=within_size tmpfs from zram



On 2025/2/5 09:55, Baolin Wang wrote:
> Hi Alex,
> 
> On 2025/2/5 09:23, Alex Xu (Hello71) wrote:
>> Hi all,
>>
>> On 6.14-rc1, I found that creating a lot of files in tmpfs then deleting
>> them reliably hangs when tmpfs is mounted with huge=within_size, and it
>> is swapped out to zram (zstd/zsmalloc/no backing dev). I bisected this
>> to acd7ccb284b "mm: shmem: add large folio support for tmpfs".
>>
>> When the issue occurs, rm uses 100% CPU, cannot be killed, and has no
>> output in /proc/pid/stack or wchan. Eventually, an RCU stall is
>> detected:
> 
> Thanks for your report. Let me try to reproduce the issue locally and 
> see what happens.
> 
>> rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
>> rcu:     Tasks blocked on level-0 rcu_node (CPUs 0-11): P25160
>> rcu:     (detected by 10, t=2102 jiffies, g=532677, q=4997 ncpus=12)
>> task:rm              state:R  running task     stack:0     pid:25160 
>> tgid:25160 ppid:24309  task_flags:0x400000 flags:0x00004004
>> Call Trace:
>>   <TASK>
>>   ? __schedule+0x388/0x1000
>>   ? kmem_cache_free.part.0+0x23d/0x280
>>   ? sysvec_apic_timer_interrupt+0xa/0x80
>>   ? asm_sysvec_apic_timer_interrupt+0x16/0x20
>>   ? xas_load+0x12/0xc0
>>   ? xas_load+0x8/0xc0
>>   ? xas_find+0x144/0x190
>>   ? find_lock_entries+0x75/0x260
>>   ? shmem_undo_range+0xe6/0x5f0
>>   ? shmem_evict_inode+0xe4/0x230
>>   ? mtree_erase+0x7e/0xe0
>>   ? inode_set_ctime_current+0x2e/0x1f0
>>   ? evict+0xe9/0x260
>>   ? _atomic_dec_and_lock+0x31/0x50
>>   ? do_unlinkat+0x270/0x2b0
>>   ? __x64_sys_unlinkat+0x30/0x50
>>   ? do_syscall_64+0x37/0xe0
>>   ? entry_SYSCALL_64_after_hwframe+0x50/0x58
>>   </TASK>
>>
>> Let me know what information is needed to further troubleshoot this
>> issue.

Sorry, I can't reproduce this issue, and my testing process is as follows:
1. Mount tmpfs with huge=within_size
2. Create and write a tmpfs file
3. Swap out the large folios of the tmpfs file to zram
4. Execute 'rm' command to remove the tmpfs file

Do you have any reproduction scripts or reproduction steps guide? Thanks.

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ