[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20230119112433.611fa273@gandalf.local.home>
Date: Thu, 19 Jan 2023 11:24:33 -0500
From: Steven Rostedt <rostedt@...dmis.org>
To: Peter Zijlstra <peterz@...radead.org>
Cc: kernel test robot <oliver.sang@...el.com>, oe-lkp@...ts.linux.dev,
lkp@...el.com, linux-kernel@...r.kernel.org, x86@...nel.org,
Ingo Molnar <mingo@...nel.org>,
linux-trace-kernel@...r.kernel.org
Subject: Re: [tip:sched/core] [tracing, hardirq] 9aedeaed6f:
WARNING:suspicious_RCU_usage
On Thu, 19 Jan 2023 16:45:49 +0100
Peter Zijlstra <peterz@...radead.org> wrote:
> Steve, what's the easiest way to figure out what triggers this? Put a
> printk() in prepare_ftrace_return() or so?
Does it only trigger if all functions are enabled when running function
graph tracer?
That is, if you just trace one function, say "schedule" does it
have an issue?
trace-cmd start -p function_graph -l schedule
And then run your code, and it doesn't trigger, but:
trace-cmd reset # make sure schedule is no longer filtered
trace-cmd start -p function_graph
does trigger the issue, you can then bisect the functions with the script:
scripts/tracing/ftrace-bisect.sh
in the kernel tree.
I need to update this script, because I've optimized it with the "number"
trick. That is, instead echoing in the names of functions, you echo in the
location of were they exist in the available_filter_functions file.
That is: echo 5 > set_ftrace_filter
Will enable the fifth function in available_filter_functions. The reason
for this is because this is an O(1) operation. And echoing in thousands of
these numbers is an O(n) operation where n is the number of functions you
echo in. But by doing it via names, its an O(n*m) operation, where m is the
number of *all* functions, and this can take several minutes to complete,
were as the "number" version usually takes less than a second.
Here's what you can do (ignore the instructions in the script, as that
needs to be updated):
# cd /sys/kernel/tracing
# seq `wc -l available_filter_functions | cut -d' ' -f1` > ~/full-file
# /path/to/ftrace-bisect ~/full-file ~/test-file ~/non-test-file
# cat ~/test-file > set_ftrace_filter
run your tests. If they don't fail (bisect good), then, you can assume that
the bad function is in ~/non-test-file
# /path/to/ftrace-bisect ~/non-test-file ~/test-file.1 ~/non-test-file.1
And repeat:
# cat ~/test-file.1 > set_ftrace_filter
I suggest appending the ".1", ".2", etc, in case something goes wrong and
you want to go back (basically a bisect log).
If the test fails, then you use the first file:
# /path/to/ftrace-bisect ~/test-file.1 ~/test-file.2 ~/non-test-file.2
Keep going until you find a single function that causes the issue.
Always cat the "test-file*" into the set_ftrace_filter.
If it works use the "non-test-file*" for the next iteration. If it fails,
use the "test-file*" for the next iteration.
-- Steve
Powered by blists - more mailing lists