[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20070817171053.GB4261@austin.ibm.com>
Date: Fri, 17 Aug 2007 12:10:53 -0500
From: linas@...tin.ibm.com (Linas Vepstas)
To: Paul Mackerras <paulus@...ba.org>
Cc: Michael Neuling <mikey@...ling.org>, Andrew Morton <akpm@...l.org>,
linuxppc-dev@...abs.org, linux-kernel@...r.kernel.org,
Balbir Singh <balbir@...ux.vnet.ibm.com>
Subject: Re: [PATCH 1/2] Add scaled time to taskstats based process accounting
On Fri, Aug 17, 2007 at 08:22:40AM +1000, Paul Mackerras wrote:
> Linas Vepstas writes:
>
> > My gut impression (maybe wrong?) is that the scaled time is,
> > in a certain sense, "more accurate" than the unscaled time.
>
> The "unscaled" time is just time, as in "how many seconds did this
> task spend on the CPU". It's what all the tools (except a certain
> proprietary workload manager) expect. Top, ps, etc. get unhappy if
> the times reported (user, system, hardirq, softirq, idle, stolen)
> don't add up to elapsed wall-clock time.
OK, so to keep the tools happy, the total time needs to add up
to wall-clock time. Which tells me that the "scaled idle time"
should be defined as "wall clock time minus the other stuff".
> The "scaled" time is really CPU cycles divided by some arbitrary
> factor (the notional CPU frequency). So yes it does give some
> indication of how much progress the task should have made, in some
> sense.
Yes, good, that's what I was expecting. As a sysadmin and/or
back-of-the-envelope performance person, I would certainly like
to have ps and top report the scaled time. When I do "performance
tuning", I almost always can get away with quick-n-dirty use of
vmstat and top, and only rarely have to descend into more complex
tools. I'd hate to loose this quick-n-dirty utility, which,
again ... my gut impression is that these numbers suddenly turn
mostly meaningless.
That is, if I run the same task 3 times over the next few hours,
will vmstat/top/ps report more or less he same figures? I'm
concerned that they won't ... that I'll see different values
come out, depending on whether the chip is overheating, or whether
some other partition is stealing, or whatever causes this thing to
dynamically scale.
> Both measures are useful. Because the current user API is in terms of
> real time rather than cycles, we have to continue reporting real time,
> not scaled time, which is why the existing interfaces report unscaled
> time, and the scaled time values are reported through a new extension
> to the taskstats interface.
This begs the question of "what is the real, actual elapsed time?"
... currently, the "real time" depends very much on how often your
process got scheduled -- but, if your process is scheduled but
(due to scaling) isn't "actually running", should that count towards
the "real time"?
---
I supose that its inevitable that this stuff will get more complex;
I'm just trying to make sure we don't end up doing this backwards,
and deciding to change it around later.
I already notice that "stolen time" is causing confusion in some
areas. Its disconcerting to have lots of cores, and lots of threads
per core, only to find that some of your time has been "stolen".
I'm still wondering ... was this the right way to report this?
--linas
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists