[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20090401190445.GA16033@redhat.com>
Date: Wed, 1 Apr 2009 21:04:45 +0200
From: Oleg Nesterov <oleg@...hat.com>
To: "Metzger, Markus T" <markus.t.metzger@...el.com>
Cc: "linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
"mingo@...e.hu" <mingo@...e.hu>,
"tglx@...utronix.de" <tglx@...utronix.de>,
"hpa@...or.com" <hpa@...or.com>,
"markus.t.metzger@...il.com" <markus.t.metzger@...il.com>,
"roland@...hat.com" <roland@...hat.com>,
"eranian@...glemail.com" <eranian@...glemail.com>,
"Villacis, Juan" <juan.villacis@...el.com>,
"ak@...ux.jf.intel.com" <ak@...ux.jf.intel.com>
Subject: Re: [patch 3/21] x86, bts: wait until traced task has been
scheduled out
On 04/01, Metzger, Markus T wrote:
>
> >-----Original Message-----
> >From: Oleg Nesterov [mailto:oleg@...hat.com]
> >Sent: Wednesday, April 01, 2009 2:17 AM
> >To: Metzger, Markus T
>
> >> +static void wait_to_unschedule(struct task_struct *task)
> >> +{
> >> + unsigned long nvcsw;
> >> + unsigned long nivcsw;
> >> +
> >> + if (!task)
> >> + return;
> >> +
> >> + if (task == current)
> >> + return;
> >> +
> >> + nvcsw = task->nvcsw;
> >> + nivcsw = task->nivcsw;
> >> + for (;;) {
> >> + if (!task_is_running(task))
> >> + break;
> >> + /*
> >> + * The switch count is incremented before the actual
> >> + * context switch. We thus wait for two switches to be
> >> + * sure at least one completed.
> >> + */
> >> + if ((task->nvcsw - nvcsw) > 1)
> >> + break;
> >> + if ((task->nivcsw - nivcsw) > 1)
> >> + break;
> >> +
> >> + schedule();
> >
> >schedule() is a nop here. We can wait unpredictably long...
>
> Hmmm, As far as I understand the code, rt-workqueues use a higher sched_class
> and can thus not be preempted by normal threads. Non-rt workqueues
> use the fair_sched_class. And schedule_work() uses a non-rt workqueue.
I was unclear, sorry.
I meant, in this case
while (!CONDITION)
schedule();
is not better compared to
while (!CONDITION)
; /* do nothing */
(OK, schedule() is better without CONFIG_PREEMPT, but this doesn't matter).
wait_to_unschedule() just spins waiting for ->nXvcsw, this is not optimal.
And another problem, we can wait unpredictably long, because
> In practice, task is ptraced. It is either stopped or exiting.
> I don't expect to loop very often.
No. The task _was_ ptraced when we called (say) ptrace_detach(). But when
work->func() runs, the tracee is not traced, it is running (not necessary
of course, the tracer _can_ leave it in TASK_STOPPED).
Now, again, suppose that this task does "for (;;) ;" in user-space.
If CPU is "free", it can spin "forever" without re-scheduling. Yes sure,
this case is not likely in practice, but still.
Oleg.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists