[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20130510112756.GH31235@dyad.programming.kicks-ass.net>
Date: Fri, 10 May 2013 13:27:56 +0200
From: Peter Zijlstra <peterz@...radead.org>
To: Ingo Molnar <mingo@...nel.org>
Cc: Jiri Olsa <jolsa@...hat.com>, linux-kernel@...r.kernel.org,
Corey Ashford <cjashfor@...ux.vnet.ibm.com>,
Frederic Weisbecker <fweisbec@...il.com>,
Ingo Molnar <mingo@...e.hu>,
Namhyung Kim <namhyung@...nel.org>,
Paul Mackerras <paulus@...ba.org>,
Arnaldo Carvalho de Melo <acme@...hat.com>,
Andi Kleen <ak@...ux.intel.com>,
David Ahern <dsahern@...il.com>,
Stephane Eranian <eranian@...gle.com>
Subject: Re: [PATCH 0/9] perf: Adding better precise_ip field handling
On Fri, May 10, 2013 at 12:55:36PM +0200, Ingo Molnar wrote:
> Look at the tools/perf/ patches, they don't actually need or use that
> information to adjust for skid!
>
> If user-space wants _that_ level of control because it wants to correct
> for skid (if there's skid), or if it wants to display to the user how
> precise the profiling is, then they can do the (much) more complex probing
> dance.
>
> What is absolutely indefensible is to not give a good shortcut for the
> most common case of 'give me the most precise cycles event you got'...
That's not what I'm saying... the user (not userspace, but you and me) when
staring at perf output need to interpret the result.
If you don't know WTF the thing actually measured, how are you going to do
that?
> > I see such a feature only causing confusion; I told it to be precise,
> > therefore this register op after the memory load really is the more
> > expensive thing.
>
> You are creating confusion where there's none: "give me the best profiling
> you've got" is a pretty reasonable thing to ask.
Only if it then tells you what you got. It doesn't do that.
> The thing is, there's variations in the quality of profiling between CPUs,
> sometimes even between CPU models. 99.999% of the people don't care about
> that, because 99.9% of the time the profile is unambiguous: functions are
> typically big enough, with the overhead somewhere in the middle, so skid
> just doesn't matter.
Sure at function level it doesn't matter, but once you found your most
expensive function very often the next question is _why_ is it expensive.
At that point you're going to stare at asm output. The moment you do that you
need to know the type of output you're staring at.
Also, if you think function level output is the most relevant one, you
shouldn't use PEBS at all. PEBS has an issue with REP prefixes, it severely
under accounts the cycles spend on them. And since exact placement doesn't
matter (as you just argued) the little skid you have is irrelevant.
So either skid matters and you need to know what type of output you've got, or
it doesn't and the whole precise thing is irrelevant at best.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists