[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <8e23f875-3c3c-957d-84db-530d2ece432e@intel.com>
Date: Thu, 8 Oct 2020 11:08:53 -0700
From: "Yu, Yu-cheng" <yu-cheng.yu@...el.com>
To: Andy Lutomirski <luto@...nel.org>,
Dave Hansen <dave.hansen@...el.com>
Cc: Sean Christopherson <sean.j.christopherson@...el.com>,
LKML <linux-kernel@...r.kernel.org>, X86 ML <x86@...nel.org>,
Rik van Riel <riel@...riel.com>,
Andrew Cooper <andrew.cooper3@...rix.com>
Subject: Re: How should we handle illegal task FPU state?
On 10/1/2020 3:04 PM, Andy Lutomirski wrote:
> On Thu, Oct 1, 2020 at 2:50 PM Dave Hansen <dave.hansen@...el.com> wrote:
>>
>> On 10/1/20 1:58 PM, Sean Christopherson wrote:
>>> One thought for a lowish effort approach to pave the way for CET would be to
>>> try XRSTORS multiple times in switch_fpu_return(). If the first try fails,
>>> then WARN, init non-supervisor state and try a second time, and if _that_ fails
>>> then kill the task. I.e. do the minimum effort to play nice with bad FPU
>>> state, but don't let anything "accidentally" turn off CET.
>>
>> I'm not sure we should ever keep running userspace after an XRSTOR*
>> failure. For MPX, this might have provided a nice, additional vector
>> for an attacker to turn off MPX. Same for pkeys if we didn't correctly
>> differentiate between the hardware init state versus the "software init"
>> state that we keep in init_task.
>>
>> What's the advantage of letting userspace keep running after we init its
>> state? That it _might_ be able to recover?
>
> I suppose we can kill userspace and change that behavior only if
> someone complains. I still think it would be polite to try to dump
> core, but that could be tricky with the current code structure. I'll
> try to whip up a patch. Maybe I'll add a debugfs file to trash MXCSR
> for testing.
>
One complication of letting XRSTORS fail is exit_to_user_mode_prepare()
will need to go back to exit_to_user_mode_loop() again (or repeat some
parts of it).
Currently, when exit_to_user_mode_loop() exits, xstates should have been
validated earlier and to be restored shortly. At this stage, XRSTORS
should not fault. If we need to kill the task, we should have done that
earlier.
There are only two sources of invalid xstates: PTRACE and sigreturn.
For user-mode data, both sources have been taken care of.
Supervisor-mode data can be checked/reviewed easily. There are only CET
and PASID supervisor states.
Yu-cheng
Powered by blists - more mailing lists