lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Message-ID: <4880cdc7-f918-4da1-8a28-f64620f06f04@iwave-global.com>
Date: Mon, 1 Sep 2025 15:25:56 +0530
From: Nikhil Kashyap H R <nikhil.kashyap@...ve-global.com>
To: linux-mtd@...ts.infradead.org, linux-fsdevel@...r.kernel.org,
 linux-kernel@...r.kernel.org
Cc: murulidhar.mahesh@...ve-global.com,
 Poojashree <poojashree.shetty@...ve-global.com>,
 Deeksha N <deeksha.vishwanatha@...ve-global.com>
Subject: Kernel Workqueue Lockup on Microchip PolarFire MPFS-250T During
 mtd-utils integck UBIFS Stress Test

Dear Linux Kernel and Microchip Support Teams,

I am seeking your assistance with a critical kernel stability issue 
observed on our Microchip PolarFire Video Kit MPFS-250T platform during 
file system stress testing.
While running the integck test from the mtd-utils package (version 
2.0.0) on a 64GB Micron NAND (MT29F64G08AFAAAWP) configured with UBIFS 
as the file system, we encounter a persistent kernel workqueue lockup 
error. The exact message from the kernel logs is:

*/BUG: workqueue lockup - pool cpus=3 node=0 flags=0x0 nice=0 stuck for 
122s!/*

This workqueue lockup occurs consistently during the integrated 
filesystem integrity test that performs extensive random file creation, 
writes, truncations, renames, symlinks, deletions, and remount 
operations designed to stress both metadata and data layers of UBIFS on MTD.
Environment:
     Hardware Platform: Microchip PolarFire Video Kit MPFS-250T
     NAND Flash: Micron MT29F64G08AFAAAWP (64GB)
     Kernel Version: 6.6 (custom build)
     Filesystem: UBIFS on MTD device
     mtd-utils Version: 2.0.0 (https://github.com/lgirdk/mtd-utils)
     Memory: Approx. 1.6 GB RAM; no swap configured

Troubleshooting Steps Undertaken:
     Verified system memory availability and absence of swapping during 
test runs.
     Analyzed kernel logs confirming prolonged workqueue thread stalls.
     Reduced workload parameters with limited success.
     Validated kernel and driver versions as latest stable for this 
platform.
     Inspected the integck test source code, identifying its intensive 
asynchronous filesystem operations which may overwhelm kernel deferred 
workqueues.
     Collected syslog excerpts, memory snapshots, and environment details.

Assistance Requested:
I would appreciate your expertise and guidance regarding:
     Any known kernel or UBIFS (MTD) driver issues in kernel 6.6 that 
may trigger workqueue lockups under stress conditions.
     Recommended kernel patches, driver updates, or configurations to 
mitigate or resolve this issue.
     Diagnostic approaches or kernel debugging mechanisms for capturing 
actionable insights concerning stalled workqueues.
     Best practices when running sustained UBIFS stress tests on large 
NAND devices on this platform.

Supporting Information Provided:
     Kernel and system logs highlighting workqueue lockup and timing.
     Memory utilization statistics.
     Complete integck test source code from mtd-utils v2.0.0.
     Filesystem mount configurations and system setup details.

*
Best Regards
Nikhil Kashyap H R
*



Content of type "text/html" skipped

Download attachment "system_memeory.png" of type "image/png" (67647 bytes)

View attachment "integck_test_log.txt" of type "text/plain" (235777 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ