[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20170518142415.GA6425@leverpostej>
Date: Thu, 18 May 2017 15:24:15 +0100
From: Mark Rutland <mark.rutland@....com>
To: Leo Yan <leo.yan@...aro.org>
Cc: Will Deacon <will.deacon@....com>,
Suzuki K Poulose <Suzuki.Poulose@....com>,
Catalin Marinas <catalin.marinas@....com>,
Marc Zyngier <marc.zyngier@....com>,
Ard Biesheuvel <ard.biesheuvel@...aro.org>,
James Morse <james.morse@....com>,
linux-arm-kernel@...ts.infradead.org, linux-kernel@...r.kernel.org,
Guodong Xu <guodong.xu@...aro.org>,
Haojian Zhuang <haojian.zhuang@...aro.org>,
Douglas Raillard <douglas.raillard@....com>
Subject: Re: [PATCH] arm64: cpufeature: check translation granule size based
on kernel config
On Thu, May 18, 2017 at 09:02:50PM +0800, Leo Yan wrote:
> On Thu, May 18, 2017 at 01:41:15PM +0100, Mark Rutland wrote:
> > On Thu, May 18, 2017 at 01:38:41PM +0100, Will Deacon wrote:
> > > On Thu, May 18, 2017 at 07:36:27PM +0800, Leo Yan wrote:
> > > > On Thu, May 18, 2017 at 11:39:01AM +0100, Suzuki K Poulose wrote:
> > > > > On 18/05/17 11:21, Leo Yan wrote:
> > > > > >In the big.LITTLE system with two clusters, one is CA53 cluster and
> > > > > >another is CA73 cluster. CA53 doesn't support 16KB memory translation
> > > > > >granule size (4.3.21 AArch64 Memory Model Feature Register 0, EL1; ARM
> > > > > >DDI 0500F), but CA73 supports this feature (4.3.27 AArch64 Memory Model
> > > > > >Feature Register 0, EL1; ARM 100048_0002_04_en). As result, the kernel
> > > > > >reports log for "Unexpected variation" as below.
> > > > > >
> > > > > >[ 0.182113] CPU features: SANITY CHECK: Unexpected variation in SYS_ID_AA64MMFR0_EL1. Boot CPU: 0x00000000001122, CPU4: 0x00000000101122
> > > > >
> > > > > >
> > > > > >This patch is to change the checking CPU feature for memory translation
> > > > > >granule size based on kernel configuration. If kernel configuration has
> > > > > >selected to use one specific memory translation granule size, then we
> > > > > >will do strict sanity checking cross all CPUs. Otherwise we can skip to
> > > > > >check unused features for memory translation granule size if kernel
> > > > > >doesn't use it.
> > > > > >
> > > > >
> > > > > If we were to suppress the warning (more on that below), we could simply
> > > > > make this feature a NON_STRICT, since the unsupported CPUs won't boot
> > > > > with 16K to hit this sanity check.
> > > > >
> > > > > However, there is a problem with disabling this warning. If a VM starts
> > > > > using 16KB page size on a 4K/64K host, the VM could end up in unknown
> > > > > failures when it switches to an unsupported CPU (after it has booted).
> > > > > Of course the real fix lies in making the KVM exposing the safe value
> > > > > for granule support to the VCPUs (which is currently being worked on by
> > > > > Douglas in Cc). So, when we have that ready, we could make it NON_STRICT
> > > > > instead of this approach.
> > > >
> > > > Thanks for the info :)
> > > >
> > > > I will use below patch for production branch temporarily. You could
> > > > work out one formal patch for upstreaming when the dependency patches
> > > > are get ready:
> > >
> > > The other thing we could do is change the way we taint on mismatch so that
> > > we don't dump the scary (and pointless) backtrace.
> >
> > The backtrace is due to the WARN part of the WARN_TAINT_ONCE.
> >
> > We could chagne that as below, if we really want to get rid of the backtrace.
> >
> > Thanks,
> > Mark.
> >
> > ---->8----
> > diff --git a/arch/arm64/kernel/cpufeature.c b/arch/arm64/kernel/cpufeature.c
> > index 94b8f7f..1f53314 100644
> > --- a/arch/arm64/kernel/cpufeature.c
> > +++ b/arch/arm64/kernel/cpufeature.c
> > @@ -639,8 +639,8 @@ void update_cpu_features(int cpu,
> > * Mismatched CPU features are a recipe for disaster. Don't even
> > * pretend to support them.
> > */
> > - WARN_TAINT_ONCE(taint, TAINT_CPU_OUT_OF_SPEC,
> > - "Unsupported CPU feature variation.\n");
> > + pr_warn_once("Unsupported CPU feature variation detected.\n");
> > + add_taint(TAINT_CPU_OUT_OF_SPEC);
>
> Should be add_taint(TAINT_CPU_OUT_OF_SPEC, LOCKDEP_STILL_OK)?
Whoops; yes.
Thanks,
Mark.
Powered by blists - more mailing lists