[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <4DF5F729.4060609@redhat.com>
Date: Mon, 13 Jun 2011 14:40:25 +0300
From: Avi Kivity <avi@...hat.com>
To: Borislav Petkov <bp@...64.org>
CC: Tony Luck <tony.luck@...el.com>, Ingo Molnar <mingo@...e.hu>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
"Huang, Ying" <ying.huang@...el.com>,
Hidetoshi Seto <seto.hidetoshi@...fujitsu.com>
Subject: Re: [PATCH 08/10] NOTIFIER: Take over TIF_MCE_NOTIFY and implement
task return notifier
On 06/13/2011 12:55 PM, Borislav Petkov wrote:
> >
> > If running into the MCE again is really bad, then you need something
> > more, since other threads (or other processes) could run into the same
> > page as well.
>
> Well, the #MC handler runs on all CPUs on Intel so what we could do is
> set the current task to TASK_STOPPED or _UNINTERRUPTIBLE or something
> that doesn't make it viable for scheduling anymore.
>
> Then we can take our time running the notifier since the "problematic"
> task won't get scheduled until we're done. Then, when we finish
> analyzing the MCE, we either kill it so it has to handle SIGKILL the
> next time it gets scheduled or we unmap its page with error in it so
> that it #PFs on the next run.
If all cpus catch it, do we even know which task it is?
On the other hand, it makes user return notifiers attractive, since they
are per-cpu, and combined with MCE broadcast that turns them into a
global event.
> But no, I don't think we can catch all possible situations where a page
> is mapped by multiple tasks ...
>
> > If not, do we care? Let it hit the MCE again, as long as
> > we'll catch it eventually.
>
> ... and in that case we are going to have to let it hit again. Or is
> there a way to get to the tasklist of all the tasks mapping a page in
> atomic context, stop them from scheduling and run the notifier work in
> process context?
>
> Hmmm..
Surely not in atomic context, but you can use rmap to find all mappers
of a given page.
So: MCE uses irq_work_queue() -> wake up a realtime task -> process the
mce, unmap the page, go back to sleep.
--
error compiling committee.c: too many arguments to function
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists