linux-kernel - Re: [GIT PULL 0/8] perf/pt -> Intel PT/BTS

lists.openwall.net		lists / announce owl-users owl-dev john-users john-dev passwdqc-users yescrypt popa3d-users / oss-security kernel-hardening musl sabotage tlsify passwords / crypt-dev xvendor / Bugtraq Full-Disclosure linux-kernel linux-netdev linux-ext4 linux-hardening linux-cve-announce PHC
Open Source and information security mailing list archives
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20150702190059.GB15416@gmail.com>
Date:	Thu, 2 Jul 2015 21:00:59 +0200
From:	Ingo Molnar <mingo@...nel.org>
To:	Adrian Hunter <adrian.hunter@...el.com>
Cc:	Arnaldo Carvalho de Melo <acme@...nel.org>,
	linux-kernel@...r.kernel.org, Jiri Olsa <jolsa@...hat.com>,
	David Ahern <dsahern@...il.com>,
	Namhyung Kim <namhyung@...nel.org>,
	Peter Zijlstra <a.p.zijlstra@...llo.nl>,
	Stephane Eranian <eranian@...gle.com>,
	Arnaldo Carvalho de Melo <acme@...hat.com>,
	Vince Weaver <vince@...ter.net>,
	Alexander Shishkin <alexander.shishkin@...ux.intel.com>
Subject: Re: [GIT PULL 0/8] perf/pt -> Intel PT/BTS


* Adrian Hunter <adrian.hunter@...el.com> wrote:

> > So I really think we need an extended error reporting feature on the perf 
> > kernel side, so that a 'natural' error (plus a string) is reported back to 
> > tooling, instead of the current -EINVAL.
> > 
> > No need to do it for everything, doing it for BTS and related functionality 
> > would be a good first step to start this.
> > 
> > If you are interested you could try this, or I can try to write something 
> > (after the merge window).
> 
> Don't have time sorry.

So who on the Intel side has time to finish Intel/PT support properly?

As a first step I think we need to disable the Intel/PT kernel side - it was
merged with the express promise that proper tooling would come along promptly,
but if that's not happening due to lack of resources, then the kernel side is
obviously in limbo as well.

> > Does this make sense to you?
> 
> Normally the warts of syscalls are hidden by libraries.  I am not sure a library 
> couldn't do just as well with the advantage that it would work for old kernels 
> too.  A library could also have information about multiple architectures, not 
> just the one that is actually running e.g. your BTS example won't work on ARM.

The method I suggested works _both_ with old and new kernels, with the new kernel 
simply giving better error feedback.

And these are not warts really, it's simply a threshold issue: above a certain 
complexity of features it makes sense to introduce a qualitatively better error 
reporting interface. When I tested Intel PT support I happened to go past that 
threshold and it became obvious that we need it.

> A library could provide a similar API to the one you described.  Either it just 
> does the syscall and returns, or if extended error information is requested, it 
> probes the API with various options, checks perf_event_paranoid, and generally 
> uses whatever information it can to best figure out what went wrong.

So I think eventually a proper libperf will crystalize out of perf's syscall 
wrappers (I suggested it for a long time), but this has very little impact on the 
end user who doesn't care whether it's done in perf, or in some intermediate 
library.

So it can be done in the current perf code just as much - when libperf gets 
introduced it will be factored out into tools/lib/perf/ without much problem and 
then other tools can use it too.

> > I.e. if the BTS driver was not created, we'd still have 
> > /sys/bus/event_source/devices/intel_bts/error (and no other file), which gives 
> > tooling a good way to discover why a particular PMU is not available. This 
> > adds a tiny bit more RAM overhead, but it's for the better I think, because 
> > tooling could be a lot more certain about what the capabilities of the kernel 
> > are.
> 
> That presumes the user understands what is or is not available on their 
> architecture, because on a non-x86 architecture they still get nothing 
> x86-specific.
> 
> It also runs a bit against the way config works e.g. even if the PMU driver is 
> config'ed out it still has to provide a sysfs interface.

I partially agree - but it's really a problem that we currently only get two 
states:

  - the PMU driver is there in sysfs
  - the PMU driver is not there in sysfs

while we really want to a lot more about why it's not there - and it's the kernel 
that knows this best, not tooling.

So maybe instead of having a separate directory, we could have a sysfs file that 
listed all the PMU drivers that the kernel knows about, with a status line that 
tells us whether they are:

  - not configured
  - configured but runtime disabled due to reason X or Y or Z
  - configured and enabled

OTOH having the directories with a single file in them is a close substitute, and 
better meets the sysfs 'one file, one value' principle.

> So, isn't the patch I proposed sufficient for now?

Well, no, because it really does the check in the wrong place and introduces 
possible friction between what tooling thinks is the supported environment and 
what the kernel thinks - and thus postpones the problem.

I could live with this current solution as the initial version if a subsequent 
effort fixes it up properly by adding much better error reporting infrastructure, 
but the 'no time' comment makes me doubt the whole Intel/PT feature set.

Thanks,

	Ingo
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/