[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <85a11091-3c61-2d8b-28d4-2a251f3b8ffe@linux.intel.com>
Date: Wed, 4 Sep 2024 14:57:14 +0300 (EEST)
From: Ilpo Järvinen <ilpo.jarvinen@...ux.intel.com>
To: Reinette Chatre <reinette.chatre@...el.com>
cc: fenghua.yu@...el.com, shuah@...nel.org, tony.luck@...el.com,
peternewman@...gle.com, babu.moger@....com,
Maciej Wieczór-Retman <maciej.wieczor-retman@...el.com>,
linux-kselftest@...r.kernel.org, LKML <linux-kernel@...r.kernel.org>
Subject: Re: [PATCH 2/6] selftests/resctrl: Ensure measurements skip
initialization of default benchmark
On Fri, 30 Aug 2024, Reinette Chatre wrote:
>
> Thank you for taking a look.
>
> On 8/30/24 3:56 AM, Ilpo Järvinen wrote:
> > On Thu, 29 Aug 2024, Reinette Chatre wrote:
> >
>
> ...
>
> > > @@ -684,11 +622,13 @@ int resctrl_val(const struct resctrl_test *test,
> > > const char * const *benchmark_cmd,
> > > struct resctrl_val_param *param)
> > > {
> > > - struct sigaction sigact;
> > > - int ret = 0, pipefd[2];
> > > - char pipe_message = 0;
> > > - union sigval value;
> > > - int domain_id;
> > > + int domain_id, operation = 0, memflush = 1;
> > > + size_t span = DEFAULT_SPAN;
> > > + unsigned char *buf = NULL;
> > > + cpu_set_t old_affinity;
> > > + bool once = false;
> > > + int ret = 0;
> > > + pid_t ppid;
> > > if (strcmp(param->filename, "") == 0)
> > > sprintf(param->filename, "stdio");
> > > @@ -699,111 +639,80 @@ int resctrl_val(const struct resctrl_test *test,
> > > return ret;
> > > }
> > > - /*
> > > - * If benchmark wasn't successfully started by child, then child
> > > should
> > > - * kill parent, so save parent's pid
> > > - */
> > > ppid = getpid();
> > > - if (pipe(pipefd)) {
> > > - ksft_perror("Unable to create pipe");
> > > + /* Taskset test to specified CPU. */
> > > + ret = taskset_benchmark(ppid, uparams->cpu, &old_affinity);
> >
> > Previously only CPU affinity for bm_pid was set but now it's set before
> > fork(). Quickly checking the Internet, it seems that CPU affinity gets
> > inherited on fork() so now both processes will have the same affinity
> > which might make the other process to interfere with the measurement.
>
> Setting the affinity is intended to ensure that the buffer preparation
> occurs in the same topology as where the runtime portion will run.
> This preparation is done before the work to be measured starts.
>
> This does tie in with the association with the resctrl group and I
> will elaborate more below ...
Okay, that's useful to retain but thinking this further, now we're also
going do non-trivial amount of work in between the setup and the test by
forking. I guess that doesn't matter for memflush = true case but might be
meaningful for the memflush = false case that seems to be there to allow
keeping caches hot (I personally haven't thought how to use "caches hot"
test but we do have that capability by the fact that memflush paremeter
exists).
> > > + if (ret)
> > > + return ret;
> > > - return -1;
> > > + /* Write test to specified control & monitoring group in resctrl FS.
> > > */
> > > + ret = write_bm_pid_to_resctrl(ppid, param->ctrlgrp, param->mongrp);
> >
> > Previously, this was done for bm_pid but now it's done for the parent. I'm
> > not sure how inheritance goes with resctrl on fork(), will the forked PID
> > get added to the list of PIDs or not? You probably know the answer :-).
>
> Yes. A process fork()ed will land in the same resctrl group as its parent.
>
> > Neither behavior, however, seems to result in the intended behavior as we
> > either get interfering processes (if inherited) or no desired resctrl
> > setup for the benchmark process.
>
> There are two processes to consider in the resource group, the parent (that
> sets up the buffer and does the measurements) and the child (that runs the
> workload to be measured). Thanks to your commit da50de0a92f3
> ("selftests/resctrl:
> Calculate resctrl FS derived mem bw over sleep(1) only") the parent
> will be sleeping while the child runs its workload and there is no
> other interference I am aware of. The only additional measurements
> that I can see would be the work needed to actually start and stop the
> measurements and from what I can tell this falls into the noise.
>
> Please do keep in mind that the performance counters used, iMC, cannot
> actually
> be bound to a single CPU since it is a per-socket PMU. The measurements have
> thus never been as fine grained as the code pretends it to be.
I was thinking if I should note the amount of work is small. Maybe it's
fine to leave that noise there and I'm just overly cautious :-), when I
used to do networking research in the past life, I wanted to eliminate as
much noise sources so I guess it comes from that background.
--
i.
Powered by blists - more mailing lists