[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20090116222600.GA3899@elte.hu>
Date: Fri, 16 Jan 2009 23:26:00 +0100
From: Ingo Molnar <mingo@...e.hu>
To: Chris Mason <chris.mason@...cle.com>
Cc: Steven Rostedt <rostedt@...dmis.org>, linux-kernel@...r.kernel.org,
Andrew Morton <akpm@...ux-foundation.org>,
Frederic Weisbecker <fweisbec@...il.com>
Subject: Re: [PATCH 0/2] ftrace: updates to tip
* Chris Mason <chris.mason@...cle.com> wrote:
> On Thu, 2009-01-15 at 19:40 -0500, Steven Rostedt wrote:
> > Ingo,
> >
> > The first patch is critical, and needs to stay with trace_output.c
> > Not that critical since trace_output.c is not in mainline yet.
> >
> > The second patch gives the ability to stack trace functions.
> > I've been leery about adding this and still keep it a separate
> > option from the "stacktrace" that already exists. This is because
> > when enabled with no filtering, the lag between typing and seeing
> > what is typed can be up to 10 seconds or more.
> >
>
> I mostly asked for this because I often try to find the most common
> reason a given function is called, and oprofile isn't always a great way
> to catch it. systemtap can do it too if you can get systemtap to work
> against your current devel kernel, but there are limits on how much
> memory it'll use.
>
> I've attached some simple python code that parses the output of the
> function stack tracer, it makes some dumb assumptions about the format
> but isn't a bad proof of concept. The first such assumption is that
> you're only filtering on a single function.
>
> Here is some example output, trying to find the most common causes of
> native_smp_send_reschedule() during a btrfs dbench run.
>
> It relates to the Oracle OLTP thread because oracle heavily uses IPC
> semaphores to trigger wakeups of processes as various events finish.
> I'd bet that try_to_wakeup is the most common cause of the reschedule
> calls there as well.
>
> For btrfs, the btree lock mutexes come back into the profile yet again.
> It would be interesting to change the spinning mutex code to look for
> spinners and skip the wakeup on unlock, but that's a different thread
> entirely.
>
> The short version is: thanks Steve, this is really cool!
>
> 12058 hits:
> <= check_preempt_wakeup
> <= try_to_wake_up
> <= wake_up_process
> <= __mutex_unlock_slowpath
> <= mutex_unlock
> <= btrfs_tree_unlock
> <= unlock_up
> ===========
Cool! We've got scripts/tracing/ [with one Python script in it already] -
so if this is tidied up to be generally useful we could put it there.
The other thing is that there's the statistics framework of ftrace, being
worked on by Frederic and Steve. That tries to handle and provide
higher-order summaries/"views" of plain traces, like histograms and counts
- provided by the kernel.
Maybe the above type of multi-dimensional-stack-trace based histogram
could fit into the statistics framework too?
Ingo
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists