lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20250117160254.4a366149@gandalf.local.home>
Date: Fri, 17 Jan 2025 16:02:54 -0500
From: Steven Rostedt <rostedt@...dmis.org>
To: Tomas Glozar <tglozar@...hat.com>
Cc: linux-trace-kernel@...r.kernel.org, linux-kernel@...r.kernel.org, John
 Kacur <jkacur@...hat.com>, Luis Goncalves <lgoncalv@...hat.com>, Chang Yin
 <cyin@...hat.com>, Qiao Zhao <qzhao@...hat.com>
Subject: Re: [PATCH] tools/rtla: Add basic test suite

On Fri, 17 Jan 2025 11:53:48 +0100
Tomas Glozar <tglozar@...hat.com> wrote:

> Maybe after enough time repeating the command I'm also going to hit
> the bug you saw.

Nevermind. The divide by zero fix went into my tools/urgent branch and I
pushed it up to Linus, but I had already started the tools/core branch for
the next merge window that I've been adding these patches to. It does not
include the fix.

The reason for that crash was because of the divide by zero bug. After
cherry-picking it for testing, it works.

Now, what I also noticed was. When it did crash and die. The osnoise tracer
no longer creates any osnoise threads. Can you take a look.

Remove the fix, and run that test:

 ~# echo osnoise > /sys/kernel/tracing/current_tracer
 ~# ps aux |grep osnois
root         942  105  0.0      0     0 ?        R    16:00   0:02 [osnoise/0]
root         943  104  0.0      0     0 ?        R    16:00   0:02 [osnoise/1]
root         944  105  0.0      0     0 ?        R    16:00   0:02 [osnoise/2]
root         945  105  0.0      0     0 ?        R    16:00   0:02 [osnoise/3]
root         946  105  0.0      0     0 ?        R    16:00   0:02 [osnoise/4]
root         947  105  0.0      0     0 ?        R    16:00   0:02 [osnoise/5]
root         948  104  0.0      0     0 ?        R    16:00   0:02 [osnoise/6]
root         949  130  0.0      0     0 ?        R    16:00   0:03 [osnoise/7]
root         953  9.0  0.0   6468  2152 pts/0    S+   16:00   0:00 grep osnois

 ~# echo nop > /sys/kernel/tracing/current_tracer
 ~# cd linux.git
 ~# git show cfff4d019b4c01 | patch -p1 -R
 ~# cd tools/tracing/rtla/
 ~# make
 ~# ./rtla timerlat hist -i 2 -c 0 -n -d 30s
 # RTLA timerlat histogram
 # Time unit is nanoseconds (ns)
 # Duration:   0 00:00:02
 Index   IRQ-000   Thr-000   Usr-000
 over:         1         0         0 
 count:        1         0         0 
 min:    1481365         -         - 
 avg:    1481365         -         - 
 max:    1481365         -         - 
 Floating point exception

 ~# rmdir /sys/kernel/tracing/instances/timerlat_aa/
 ~# rmdir /sys/kernel/tracing/instances/timerlat_hist/

 ~# echo osnoise > /sys/kernel/tracing/current_tracer
 ~# ps aux |grep osnois
root         969 11.1  0.0   6468  2244 pts/0    S+   15:54   0:00 grep osnois

So that failure puts the system into a state that can no longer run osnoise.
It takes a reboot to fix it.

That should be fixed where a reboot is not needed.

-- Steve

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ