Message-ID: <ab1a3575ac66cf2f7cc05a21e5a20fbe415e834b.camel@intel.com>
Date: Wed, 29 Nov 2023 10:37:02 +0000
From: "Huang, Kai" <kai.huang@...el.com>
To: "kirill.shutemov@...ux.intel.com" <kirill.shutemov@...ux.intel.com>,
"jpiotrowski@...ux.microsoft.com" <jpiotrowski@...ux.microsoft.com>
CC: "tim.gardner@...onical.com" <tim.gardner@...onical.com>,
"cascardo@...onical.com" <cascardo@...onical.com>,
"dave.hansen@...ux.intel.com" <dave.hansen@...ux.intel.com>,
"thomas.lendacky@....com" <thomas.lendacky@....com>,
"roxana.nicolescu@...onical.com" <roxana.nicolescu@...onical.com>,
"haiyangz@...rosoft.com" <haiyangz@...rosoft.com>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
"mingo@...hat.com" <mingo@...hat.com>,
"stable@...r.kernel.org" <stable@...r.kernel.org>,
"tglx@...utronix.de" <tglx@...utronix.de>,
"stefan.bader@...onical.com" <stefan.bader@...onical.com>,
"Cui, Dexuan" <decui@...rosoft.com>,
"nik.borisov@...e.com" <nik.borisov@...e.com>,
"mhkelley58@...il.com" <mhkelley58@...il.com>,
"hpa@...or.com" <hpa@...or.com>,
"peterz@...radead.org" <peterz@...radead.org>,
"linux-hyperv@...r.kernel.org" <linux-hyperv@...r.kernel.org>,
"wei.liu@...nel.org" <wei.liu@...nel.org>,
"bp@...en8.de" <bp@...en8.de>,
"sashal@...nel.org" <sashal@...nel.org>,
"kys@...rosoft.com" <kys@...rosoft.com>,
"x86@...nel.org" <x86@...nel.org>
Subject: Re: [PATCH v1 2/3] x86/coco: Disable TDX module calls when TD
partitioning is active
On Fri, 2023-11-24 at 11:38 +0100, Jeremi Piotrowski wrote:
> On 23/11/2023 15:13, Kirill A. Shutemov wrote:
> > On Wed, Nov 22, 2023 at 06:01:05PM +0100, Jeremi Piotrowski wrote:
> > > Introduce CC_ATTR_TDX_MODULE_CALLS to allow code to check whether TDX module
> > > calls are available. When TD partitioning is enabled, a L1 TD VMM handles most
> > > TDX facilities and the kernel running as an L2 TD VM does not have access to
> > > TDX module calls. The kernel still has access to TDVMCALL(0) which is forwarded
> > > to the VMM for processing, which is the L1 TD VM in this case.
> >
>
> Correction: it turns out TDVMCALL(0) is handled by L0 VMM.
> >
Some thoughts after checking the spec more closely, to make sure we don't
misunderstand each other:
The TDX module will unconditionally exit to L1 for any TDCALL (except
TDVMCALL) from the L2. This is expected behaviour. Because the L2 isn't a true
TDX guest, L1 is expected to inject a #UD or #GP or whatever error the hardware
spec defines into L2, to make sure L2 sees correct architectural behaviour for
the TDCALL instruction.
I believe this is also the reason you mentioned "L2 TD VM does not have access
to TDX module calls".
However, the TDX module actually allows the L1 to control whether the L2 is
allowed to execute TDVMCALL, by controlling whether a TDVMCALL from L2 exits to
L0 or to L1.
I believe you said "TDVMCALL(0) is handled by L0 VMM" because the L1
hypervisor -- specifically, hyperv -- chooses to let the TDVMCALL from L2 exit
to L0?
But IMHO this is purely hyperv's implementation choice, i.e., KVM can choose
not to do so, and simply handle TDVMCALL in the same way as it handles a normal
TDCALL -- inject the architecturally defined error into L2.
Also AFAICT there's nothing architectural that L2 controls to let the L1 know
whether L2 expects to use TDVMCALL or not. In other words, whether to support
TDVMCALL is purely L1 hypervisor implementation specific.
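A rough way to picture the routing described above is the toy C model below.
It is only a sketch: the enum, struct, and function names are all made up for
illustration and are not kernel or TDX module interfaces, and it assumes the
only two outcomes are the ones discussed in this thread (exit to L0, or L1
injecting the architectural error).

```c
#include <stdbool.h>

/* Hypothetical model of an L2 TDCALL's fate; not real kernel/TDX code. */
enum l2_call_kind {
	L2_TDVMCALL,       /* TDVMCALL issued by the L2 */
	L2_OTHER_TDCALL,   /* any other TDCALL issued by the L2 */
};

enum l2_call_outcome {
	EXIT_TO_L0,        /* forwarded to the L0 VMM */
	L1_INJECTS_FAULT,  /* L1 injects #UD/#GP (or similar) into L2 */
};

/* Assumed knob: does this L1 let L2 TDVMCALLs exit straight to L0? */
struct l1_policy {
	bool route_tdvmcall_to_l0;
};

enum l2_call_outcome l2_tdcall_outcome(const struct l1_policy *policy,
				       enum l2_call_kind kind)
{
	/* Non-TDVMCALL TDCALLs always exit to L1, which injects an error. */
	if (kind != L2_TDVMCALL)
		return L1_INJECTS_FAULT;

	/* For TDVMCALL the result depends purely on the L1's choice. */
	return policy->route_tdvmcall_to_l0 ? EXIT_TO_L0 : L1_INJECTS_FAULT;
}
```

With route_tdvmcall_to_l0 set, the model behaves like the hyperv case discussed
here; with it clear, TDVMCALL is treated like any other TDCALL. The L2 has no
input into that field, which is the point being made.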
So to me this whole series is a hyperv-specific enlightenment for an L2 running
on a hyperv L1 that is itself a TDX guest. And because of that, perhaps a
better approach is:
1) The default L2 should just be a normal VM that any TDX guest L1 hypervisor
should be able to handle (guaranteed by the TDX partitioning architecture).
2) Each L2/L1 hypervisor pair can have its own enlightenments. We can even
have common enlightenments across different implementations of L1 hypervisors,
but that requires cross-hypervisor cooperation.
But IMHO it's not a good idea to say:
L2 is running on a TDX partitioning enabled environment, let us mark it
as a TDX guest but mark it as "TDX partitioning" to disable a couple of
TDX functionalities.
Instead, perhaps it's better to let L2 explicitly opt in to the TDX facilities
that the underlying hypervisor supports.
TDVMCALL can be the first facility to begin with.
Lastly, even TDVMCALL has a bunch of leaves, and the hypervisor can choose to
support them or not. Using a single "tdx_partitioning_active" to select which
TDX facilities are supported doesn't seem like a good idea.
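As a sketch of what per-facility opt-in could look like instead of a single
flag, here is a small C example. Everything in it is hypothetical: the struct,
the function names, and the facility and leaf enumerators are invented for
illustration, and the bit positions are arbitrary rather than taken from any
spec or from the patch series.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical facility list; bit positions are arbitrary. */
enum tdx_facility {
	TDX_FAC_TDVMCALL,      /* L2 may issue TDVMCALL at all */
	TDX_FAC_MODULE_CALLS,  /* L2 may issue other TDCALLs */
};

/* Hypothetical subset of TDVMCALL leaves, tracked one bit each. */
enum tdvmcall_leaf {
	TDVMCALL_LEAF_MAP_GPA,
	TDVMCALL_LEAF_GET_QUOTE,
	TDVMCALL_LEAF_REPORT_FATAL_ERROR,
};

/* What the L2 has opted in to; zero-initialised means "plain VM". */
struct l2_tdx_caps {
	uint32_t facilities;
	uint32_t tdvmcall_leaves;
};

void l2_opt_in_facility(struct l2_tdx_caps *caps, enum tdx_facility fac)
{
	caps->facilities |= UINT32_C(1) << fac;
}

/* Opting in to a leaf implies opting in to TDVMCALL itself. */
void l2_opt_in_tdvmcall_leaf(struct l2_tdx_caps *caps, enum tdvmcall_leaf leaf)
{
	l2_opt_in_facility(caps, TDX_FAC_TDVMCALL);
	caps->tdvmcall_leaves |= UINT32_C(1) << leaf;
}

bool l2_has_facility(const struct l2_tdx_caps *caps, enum tdx_facility fac)
{
	return (caps->facilities >> fac) & 1u;
}

bool l2_has_tdvmcall_leaf(const struct l2_tdx_caps *caps,
			  enum tdvmcall_leaf leaf)
{
	return l2_has_facility(caps, TDX_FAC_TDVMCALL) &&
	       ((caps->tdvmcall_leaves >> leaf) & 1u);
}
```

In this shape, a hyperv-specific enlightenment would call the opt-in helpers
only for what that L1 actually supports, and everything else stays off by
default, so an L2 under a different L1 hypervisor remains a normal VM.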
That's my 2 cents, w/o knowing the details of the hyperv enlightenments.