lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <026914337fe69ed388e42a59e97d4a838bea6832.camel@arm.com>
Date: Mon, 25 Nov 2024 17:11:43 +0000
From: Deepak Surti <Deepak.Surti@....com>
To: "peterz@...radead.org" <peterz@...radead.org>
CC: Ben Gainey <Ben.Gainey@....com>, "alexander.shishkin@...ux.intel.com"
	<alexander.shishkin@...ux.intel.com>, Mark Barnett <Mark.Barnett@....com>,
	James Clark <James.Clark@....com>, "adrian.hunter@...el.com"
	<adrian.hunter@...el.com>, "ak@...ux.intel.com" <ak@...ux.intel.com>,
	"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
	"linux-perf-users@...r.kernel.org" <linux-perf-users@...r.kernel.org>,
	"irogers@...gle.com" <irogers@...gle.com>, "mingo@...hat.com"
	<mingo@...hat.com>, "will@...nel.org" <will@...nel.org>, Mark Rutland
	<Mark.Rutland@....com>, "linux-arm-kernel@...ts.infradead.org"
	<linux-arm-kernel@...ts.infradead.org>, "acme@...nel.org" <acme@...nel.org>,
	"jolsa@...nel.org" <jolsa@...nel.org>, "namhyung@...nel.org"
	<namhyung@...nel.org>
Subject: Re: [PATCH v1 1/4] perf: Allow periodic events to alternate between
 two sample periods

On Thu, 2024-11-14 at 16:01 +0100, Peter Zijlstra wrote:
> On Thu, Nov 07, 2024 at 04:07:18PM +0000, Deepak Surti wrote:
> > From: Ben Gainey <ben.gainey@....com>
> > 
> > This change modifies perf_event_attr to add a second, alternative
> > sample period field, and modifies the core perf overflow handling
> > such that when specified an event will alternate between two sample
> > periods.
> > 
> > Currently, perf does not provide a  mechanism for decoupling the
> > period
> > over which counters are counted from the period between samples.
> > This is
> > problematic for building a tool to measure per-function metrics
> > derived
> > from a sampled counter group. Ideally such a tool wants a very
> > small
> > sample window in order to correctly attribute the metrics to a
> > given
> > function, but prefers a larger sample period that provides
> > representative
> > coverage without excessive probe effect, triggering throttling, or
> > generating excessive amounts of data.
> > 
> > By alternating between a long and short sample_period and
> > subsequently
> > discarding the long samples, tools may decouple the period between
> > samples that the tool cares about from the window of time over
> > which
> > interesting counts are collected.
> 
> Do you have a link to a paper or something that explains this method?

Ben had originally authored this as an internal doc but I can look at
publishing externally. Is there anything in particular about this
method that you are interested in?

> 
> 
> > +	/*
> > +	 * Indicates that the alternative_sample_period is used
> > +	 */
> > +	bool				using_alternative_sample_p
> > eriod;
> 
> I typically prefer variables names that are shorter.

Acknowledged. Will do it in version 2 of the patch.

> 
> 
> > @@ -9822,6 +9825,26 @@ static int __perf_event_overflow(struct
> > perf_event *event,
> >  	    !bpf_overflow_handler(event, data, regs))
> >  		return ret;
> >  
> > +	/*
> > +	 * Swap the sample period to the alternative period
> > +	 */
> > +	if (event->attr.alternative_sample_period) {
> > +		bool using_alt = hwc-
> > >using_alternative_sample_period;
> > +		u64 sample_period = (using_alt ? event-
> > >attr.sample_period
> > +					       : event-
> > >attr.alternative_sample_period);
> > +
> > +		hwc->sample_period = sample_period;
> > +		hwc->using_alternative_sample_period = !using_alt;
> > +
> > +		if (local64_read(&hwc->period_left) > 0) {
> > +			event->pmu->stop(event, PERF_EF_UPDATE);
> > +
> > +			local64_set(&hwc->period_left, 0);
> > +
> > +			event->pmu->start(event, PERF_EF_RELOAD);
> > +		}
> 
> This is quite terrible :-(
> 
> Getting here means we just went through the effort of programming the
> period and you'll pretty much always hit that 'period_left > 0' case.
> 
> Why do we need this case at all? If you don't do this, then the next
> overflow will pick the period you just wrote to hwc->sample_period
> (although you might want to audit all arch implementations).
> 
> Looking at it again, that truncation to 0 is just plain wrong --
> always.
> Why are you doing this?

This was due to Ben's lack of familiarity with the codebase when this
was originally written; this replicates what the IOCTL handler does to
change the sample period.

But you are right this is inefficient; we've tested with this removed
and for SW events and arm pmu events it appears to be fine.

A quick review of the other architecture overflow handlers tend to all
follow the same pattern so it's probably safe, but we will do some more
validation on that before version 2 of the patch.

Thanks,
Deepak
> 
> 
> 

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ