[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <20241023190515.a80c77fe3fa895910d554888@linux-foundation.org>
Date: Wed, 23 Oct 2024 19:05:15 -0700
From: Andrew Morton <akpm@...ux-foundation.org>
To: Lance Yang <ioworker0@...il.com>
Cc: cunhuang@...cent.com, leonylgao@...cent.com, j.granados@...sung.com,
jsiddle@...hat.com, kent.overstreet@...ux.dev, 21cnbao@...il.com,
ryan.roberts@....com, david@...hat.com, ziy@...dia.com,
libang.li@...group.com, baolin.wang@...ux.alibaba.com,
linux-kernel@...r.kernel.org, linux-mm@...ck.org
Subject: Re: [PATCH 0/2] hung_task: add detect count for hung tasks
On Tue, 22 Oct 2024 19:47:34 +0800 Lance Yang <ioworker0@...il.com> wrote:
> Hi all,
>
> This patchset adds a counter, hung_task_detect_count, to track the number of
> times hung tasks are detected. This counter provides a straightforward way
> to monitor hung task events without manually checking dmesg logs.
>
> With this counter in place, system issues can be spotted quickly, allowing
> admins to step in promptly before system load spikes occur, even if the
> hung_task_warnings value has been decreased to 0 well before.
>
> Recently, we encountered a situation where warnings about hung tasks were
> buried in dmesg logs during load spikes. Introducing this counter could
> have helped us detect such issues earlier and improve our analysis efficiency.
>
Isn't the answer to this problem "write a better parser"? I mean,
we're providing userspace with information which is already available.
Powered by blists - more mailing lists